JObjects QuestAgent 3.0

Document Indexing

In order to create and prepare index file for use with the QuestAgent search applet you have to use QuestEngine Lite which consist of two separate command line tools:

Indexer parses all html and txt documents it can find in a specified directory tree. Indexer creates raw index file and prepares data for further processing with the Optimizer tool.

Optimizer does data sorting, ranking and prepares index files for use with the QuestAgent. This tool is separated from the Indexer to give a chance a user to change some parameters such is ignorance of too common words. Modification of these parameters can significantly change a size of the final index file. Optimizer is less time consuming tool then Indexer, so you can play with these parameters to get appropriate index size-search result ratio.

Both Indexer and Optimizer use the same configuration file which contains settings necessary for its functioning. In a directory $JOQUEST\questengine you can find a configuration file template which contains detailed parameter descriptions for these indexing tools.

Before you start indexing make sure that you've set the following parameters:

IndexDir - directory you want to index. Files with suffixes ".html", ".htm" and ".txt" in all subdirectories will be indexed. You can use FileExclusionList parameter to prevent indexing of some files or directories.

WorkingDir - raw data will be saved here

OutputDir - output directory

HtmlHandler.RemovePrefix - use to make a relative document path from an absolute
HtmlHandler.AddPrefix

Now, you are ready to start indexing. In you command line (MS-DOS or UNIX shell) type:

java com.jobjects.QuestIndexer -c qeconfig.txt

or

$JOQUEST\bin\QuestIndexer -c qeconfig.txt

(if you're using JRE bundled with QuestAgent.)

In a directory specified by the WorkingDir configuration parameter Indexer will create raw data files. Now it's time to run the Optimizer:

java com.jobjects.QuestOptimizer -c qeconfig.txt

or

$JOQUEST\bin\QuestOptimizer -c qeconfig.txt

In a directory specified by the OutputDir configuration parameter Optimizer will create index data files ready for use with the QuestAgent.

 


Copyright © 1997, 1998 JObjects - All right reserved.
contact@jobjects.com