Forensically Sound Indexing and Conceptual Search
Many search engines available today miss relevant information because of their performance enhancing shortcuts that are designed to improve the response time and relevancy of information access requests from employees. These shortcuts include partial indexing—a technique whereby the search engine chooses not to index the entire content of the document, but only the first X pages based on assumptions. For example, if a document contains 500 pages of information, the search engine may only index the first ten pages. If information relevant to the case appears first on page 60, it will not have been indexed and the search engine may miss this document and others. Autonomy’s high-performing IDOL engine always indexes the entire content of the file, including all its metadata, so that no critical information is missed and the search is FRCP-compliant. Another shortcut is ‘jump out,’ which misses potentially relevant documents as it stops looking across an index for potentially relevant information once a certain number of documents have been retrieved. When these shortcut techniques are applied over even a modest number of files the result is an arbitrary and incomplete set of documents. In legal cases, where a single document has the potential to drastically change the direction of a case, the consequences of these search techniques can be disastrous. By integrating the IDOL enterprise search engine into the SharePoint interface, users are able to perform FRCP compliant search without sacrificing performance. Built on a unique pattern-recognition technology, IDOL’s conceptual query mechanism allows a seemingly simple query expression to be evaluated in complex ways; as well as the matching of the basic terms within documents, it is able to “read between the lines” and determine conceptual matches that legacy search engines would be unable to locate. This advanced search method is used in conjunction with semantic parsing and other legacy approaches to yield highly accurate results.
- Autonomy’s connector framework enables access to all enterprise content, including rich media, allowing IDOL to search every file in the enterprise
- IDOL extracts and indexes the entire content of a file
- IDOL searches the entire content of a file without premature cessation (jump-out)
- IDOL uses forensically sound indexing methods and preserves the complete integrity of the information itself and the metadata
|