|
 |
 |
| dtSearch Case Study IE Discovery |
|
| Text
Retrieval Engine is Crucial to A Web-based Document Repository |
dtSearchs
powerful search engine, rich feature sets,
and professional technical support staff allowed
IE Discovery to develop InfoDox in a fraction
of the time normally required for a project
of this scope.
|
|
The
evolving technologies of the Internet, along with
the demands of the litigation process, were the
driving factors behind the development of InfoDox,™
IE Discovery's Web-based document repository.
As IE Discovery developed, offering off-the-shelf
litigation and document management applications
and designing custom databases, the concept of
a comprehensive Web-based document repository
began to evolve. To meet the varied demands of
its legal clients, an efficient interface to the
Internet was crucial.
To insure that InfoDox met its design criteria,
IE Discovery needed a search capability that was
fast, secure and flexible enough to hold warehouse-sized
volumes of documents yet sufficiently cost-effective
to support smaller document projects. It was also
important that the system have powerful searching
capabilities, be continuously up-to-date, have
Internet accessibility, but not burden the user
with a tremendous amount of hardware and software
maintenance.

The
development team realized that the ability to
quickly sift through very large collections of
textual and fielded document data and deliver
document images over the Internet was fundamental.
IE Discoverys primary sources of documents
are litigation documents, or documents related
to legal issues. These "cases" tend
to evolve over time and a solution would need
to be flexible enough to adapt to changing circumstances.
For example, if a case is related to injuries
persons received due to a companys alleged
negligence with its chemical waste products,
it may be months or even years until all medical
records are obtained from all parties involved.
The document collection size could double or more
in size. In contrast, a client may want to have
all their case documents loaded into a database
in order to weed through and remove any documents
that are not important to the case. In this example,
the database size could reduce drastically from
where it began.
Because these document collections can be massive,
InfoDox also had to provide the ability to jump
directly to the specified text and image, providing
both a view of the documents page and taking
the viewer directly to the line of text searched
upon in the retrieved files. Document collections
are collected from numerous sources. Because these
documents are all received in different file formats,
some hard copy, others may be taped video depositions,
the system needed to receive files from a wide
variety of sources. Handling multiple file types
was essential, as was the need to convert these
files to formats that could be displayed and searched
from the Web. Through months of research, the
IE Discovery development team was able to meet
these product requirements as well as deliver
efficient image delivery over the Web.
Each InfoDox database consists of a document and
related "names mentioned" table. For
each new database loaded into InfoDox, the customer
is asked to answer the following questions concerning
their data needs:
1) What properties (fields) of the documents do
you want available to capture? Examples: Document
Date, Document Type, Bates Number, etc.
2) Which of the above fields should pick-lists
be made available for searching and entering data
to ensure consistency in the data?
3) What values should be available for each of
the pick-lists described in step 2?
|
The
dtSearch
Text Retrieval
Engine
provided
all of
InfoDox’s
search needs
including
highlighted
results in
full text
searches.
|
|
Once
the customers requirements have been determined,
a database administrator at IE Discovery will
design the appropriate database structure and
build the database as part of the InfoDox service.
IE Discovery takes data, or documents in this
case, and breaks them down into fielded data (data
entry either within InfoDox or loaded from a data
entry vendor), images (scanned hard copies of
the documents) and OCR (full text from the images).
All the converted data is then loaded into the
InfoDox application. All document processing,
imaging, coding, OCR, etc., is provided by IE
Discovery if needed. At this stage InfoDox utilizes
dtSearchs Text Retrieval Engine and a relational
database search engine resulting in delivery via
the users Web browser using standard HTML.
The user receives images that are the result of
combining a specialized high-compression scheme
with streaming, text which is converted to HTML
with hit highlighting and fielded data.
According to IE Discovery, The dtSearch
Text Retrieval Engine provided all of InfoDox's
search needs including highlighted results in
full text searches. By combining their search
engine and advanced Internet imaging technology,
the specified speed and performance requirements
were achieved. dtSearchs powerful search
engine, rich feature sets, and professional technical
support staff allowed IE Discovery to develop
InfoDox in a fraction of the time normally required
for a project of this scope. The final product
allows clients to search mountains of textual
data instantly while incorporating regular expression,
phrase, Boolean, stemming and thesaurical searching
capability. Users can even take advantage of the
advanced fuzzy searching capabilities of the engine
to find documents even if the terms they seek
are misspelled due to typographical or OCR interpretation
errors.
To ensure that InfoDox documents remain secure,
IE Discovery incorporated a protocol called Secure
Sockets Layer (SSL) to identify authorized users,
prohibit unauthorized access, and encrypt data
transmissions from the document repository to
the user's Web browser. This is the same high-end
technology used by large financial institutions
to transmit their extremely sensitive data and
the Internet standard for processing credit card
transactions.
With the introduction of InfoDox, IE Discovery
pioneered the use of the Application Service Provider
(ASP) model to the field of document imaging and
litigation support. While InfoDox's success is
due to several technologies that IE Discovery
combined to produce a powerful, yet cost-effective
solution for their clients: The performance
of the dtSearch Text Retrieval Engine was an important
factor. The product allows InfoDox users to search
millions of documents in seconds instead of minutes
or hours.
For
further information, please call IE Discovery
at 512-498-7400, email sales@iediscovery.com,
or visit IE Discovery online at www.iediscovery.com. |
| |
Return
to Case Study contents Page |
|
|
|