The following test drive demonstrates a query form that queries in different languages and uses the stemming features of Index Server.
Note Do this test drive only if you have installed Index Server with all of its supported languages.
This topic takes you on a test drive through querying in another language and then summarizes the results.
The sample corpus contains some forms and scripts that are not installed by default during catalog server setup. To use these scripts you must first configure IIS.
To configure IIS for this test driveThe entry should have an alias of /Corpus.
The Read check box should already be selected. This setting tells IIS to execute scripts from within the Corpus subdirectory.
You have configured IIS to execute scripts from within the Corpus subdirectories. The steps so far have not involved Index Server. They have merely set up IIS to serve the proper query form.
To test your query formMake sure that the drop-down list box on the right side of the form is set to English – United States.
You should get eight results. This step ensures the form has been set up correctly.
Now you’re ready to query in another language.
To query in German
This query looks for all documents that contain the German word gehen, which means to go in English. The two asterisks instruct Index Server to stem the word. Stemming is a linguistic process that takes a given word and reduces it to its root linguistic form. For example, the English stem for swam is swim. After stemming is performed, Index Server inflects the stemmed form into all the grammatically correct variants. For English, stemming swam would generate the root form swim and all the other variants, such as swim, swimmer, swimmers, swam, swum, and so on. In this query, Index Server will stem gehen and inflect it to all its forms and post a query using the variants. Index Server knows to use German linguistics for stemming this word because you selected the German language in the drop-down list.
Executing the query may take some time because Index Server needs to load the German linguistics modules. Subsequent German queries will take much less time because the modules are already loaded.
Index Server returns four documents. The first is a stock sample to demonstrate the query results more clearly. The stock sample contains only the text you see in the abstract. Note that it does not contain the word gehen anywhere in the text. It does, however, contain the word gegangen, which is the past-tense form of gehen. Index Server stemmed gehen and inflected it out to its linguistic forms, in this case, including gegangen.
Note also that the numeric values and time and date-stamps in the references have been formatted to German conventions (that is, using a period instead of a comma for thousands separators, and so on).
Index Server can be configured to use a default locale and language so that the language need not be specified by every query and query form. This form also allows the user to override any default locale and language settings for the purposes of the exercise. For more information, see Support for Multiple Languages.