Language:  English English | French French | German GermanArabic Arabic 
 

How to use dtSearch with IFilters

Last Reviewed: February 6, 2009

Article: DTS0205

Applies to: dtSearch 7.22 and later

dtSearch includes its own file parsers for a wide range file formats, which are listed here.  

IFilters provide a way to add file format support for additional formats.  IFilters are components that enable various Microsoft search products, such as Microsoft Index Server, to extract text from documents. For example, when you install Microsoft OneNote, an IFilter is installed to enable searching of *.one files.  

dtSearch Desktop

To tell dtSearch to use installed IFilters to process some of your files, set up a rule in Options > Preferences > File Types:

1.  In dtSearch Desktop, click Options > Preferences > File Types

2.  Under File type select IFilter

3.  Under Filename filters to identify files of this type enter one or more filename filters (like *.one) to identify files that should be indexed using IFilters.

dtSearch Engine API

To use IFilters in an application that uses the dtSearch Engine, first create one or more file type rules in dtSearch Desktop as described above.   These rules are stored in a filetype.xml file that will be in your dtSearch UserData folder.

Next, in your application use Options.FileTypeTableFile to specify the location of the filetype.xml file to use in your application.  

Finally, make sure dtSearch Engine is loading the dtv_ifilter.dll component so IFilter integration is enabled.  The dtv_ifilter.dll component is an external file parser and must be located in a "Viewers" subfolder under the dtSearch " Home" directory.  For more information on the Home directory, see the "Home and Private Directories" in the dtSearch Text Retrieval Engine Programmer's Reference.

Additional Information

dtSearch implements IFilter support using an external file parser DLL, dtv_ifilter.dll.  If this component is not installed, IFilter integration will not work.

The dtSearch IFilter adapter only works on systems with the Microsoft component query.dll installed. For information on products that include query.dll, see http://support.microsoft.com/dllhelp
For more information on IFilters, see http://www.ifilter.org/ or http://channel9.msdn.com/wiki/default.aspx/Channel9.DesktopSearchIFilters

The dtSearch product line can instantly search terabytes of text across a desktop, network, Internet or Intranet site.
dtSearch products also serve as tools for publishing, with instant text searching, large document collections to Web sites or CD/DVDs.
over two dozen indexed, unindexed, fielded and full-text search options
highlights hits in HTML, XML and PDF, while displaying embedded links, formatting and images
converts other file types — word processor, database, spreadsheet, email and full-text of email attachments, ZIP, Unicode, etc. — to HTML for display with highlighted hits
built-in Spider adds a third-party or other Web site (public, secure content, password accessible, etc.) to your searchable database
Spider supports Web-based content (HTML, PDF, XML, etc.) as well as dynamically-generated content (ASP.NET, MS CMS, SharePoint, etc.)
General supported file types
SQL and similar data sources