Main -> Documentation -> Database Builder – Newton -> Creating a New SiteSearch Database -> Database Description (.dsc) File: Structure and Syntax -> Stopwords Definition

Stopwords Definition

A "stopword" is a term you specify to exclude from your database because it occurs too frequently or because of its unimportance to the database content. Typical stopwords include "a," "and," "an," "the," "but," and so on. The stopwords definition in the database description (.dsc) file allows you to specify one or more stopwords to exclude from your database when it is built. You can specify that a stopword term be excluded from all keyword indexes in the database or from a list of specific keyword indexes.

Note:

Stopwords are inappropriate for phrase indexes.

Syntax

begin stopwords

stopword_list[/index_list]

end stopwords

Element

Description

begin stopwords

end stopwords

Denotes the beginning and ending of the stopwords definition in the .dsc file.

stopwords_list

Lists each of the terms to stopword on its own line.

index_list

Allows certain terms to be included as a stopword for specific indexes only. In the example below, the word "book" will not be indexed in index numbers 30 and 1. All indexes in the list must be numeric, and a comma and blank space must separate them

Example

In the following example, the first four stopwords are excluded from the entire database. The fifth stopword, "book," is excluded from index 30 and index 1.

begin stopwords
a
and
the
but
book/30, 1
end stopwords

See Also

Creating a Database Description (.dsc) File
Database Description (.dsc) File: Structure and Syntax
Database Description (.dsc) File Example


[Main][Documentation][Support][Technical Reference][Community][Glossary][Search]

Last Modified: