Pears-Newton Index Routine Comparison

Contents

Introduction
Document Conventions
Newton Index Routines Covered in This Document
Phrase Indexes
Keyword Indexes

Introduction

This document explains how to set up Pears index definitions within a database description configuration file to create indexes analogous to the most frequently used Newton index routines. It also shows how to set up index definitions in a WebZ database configuration file that use Pears index routines as query normalizers.

You can often use the Pears general-purpose index routines (ORG.oclc.pears.IndexRoutines.Phrase and ORG.oclc.pears.IndexRoutines.Words) with optional parameters to obtain results comparable to a Newton index routine. In other cases, you emulate a Newton index routine with a Pears index routine written for a specific purpose, such as ORG.oclc.pears.IndexRoutines.PublicationDate or ORG.oclc.pears.IndexRoutines.Numbers.

Besides showing how to implement Newton index routines in Pears, this document demonstrates the flexibility of Pears index routines. For Newton index routines not included here, the examples illustrate how to use the optional parameters of a Pears index routine to create a similar index in Pears.

Document Conventions

dbnamedesc.ini refers to a Pears database description configuration file. While you can use any name you wish for a database description configuration file, including "desc" in the file name is a convention that helps you differentiate this file from the database's WebZ database configuration file.
dbname.ini refers to a WebZ database configuration file for a Pears database.
The Pears index definition sections in this document include only the parameters required to replicate a specific Newton index routine. They do not contain the variables required for all index definitions:

index=uniqueID_number
tagpath*=BER_tag_path

They also do not include variables that a particular index routine may use in other circumstances.

The WebZ index definition sections in this document include only the variables related to using a Pears index routine to perform query normalization for a specific index. They do not contain other required or optional parameters in the [index_definition] section.
indxpkg refers to the fully qualified class package name for Pears index routines, ORG.oclc.pears.IndexRoutines. For example, indxpkg.Phrase stands for ORG.oclc.pears.IndexRoutines.Phrase. You would use the fully qualified class package name for the value of the routine parameter in a dbnamedesc.ini file and the filter parameter in a dbname.ini file.
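
As an illustration of this shorthand, an entry written in this document as routine=indxpkg.Phrase or filter=indxpkg.Phrase would be entered in the actual files with the full package name. The two lines below belong to different files (the first to dbnamedesc.ini, the second to dbname.ini) and are shown together only for comparison; the section headers that enclose them are omitted.

```ini
routine=ORG.oclc.pears.IndexRoutines.Phrase
filter=ORG.oclc.pears.IndexRoutines.Phrase
```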

Newton Index Routines Covered in This Document

Phrase Indexes

combad()
greekphrase()
marcla()
phrase2()
sgmlphrase()
uptoparen()

Keyword Indexes

adddelim()
cpunct()
ddc()
esdate()
govtdoc()
gxauthr()
isbn()
lcclass()
medlinewords()
nohypwd()
nsnumbr()
nssubst()
numrang()
padzero(param)
phrbhyp()
pubdate(1)
repnum()
substr(param)
substr1(param)
udc()
ugsbjcl()
wrddelim()

These links lead to a section for each index routine that:

briefly describes the index routine
indicates how to set up a Pears index definition in the dbnamedesc.ini configuration file
indicates how to set up the query normalization portion of the index's [index_definition] section in the database's dbname.ini configuration file
where applicable, provides notes or other information

Phrase Indexes

combad()

Description:

Identical to the Newton phrase2() routine or the Pears Phrase class, except that it operates only on subfields 'a' and 'd', which it combines into a single term of up to 72 characters.

Pears Index Definition (in dbnamedesc.ini):

routine=indxpkg.Phrase
subfield* = 1
subfield* = 4
maxlength = 72
joinFieldsWith = \u0020

WebZ Index Definition (in dbname.ini):

filter=indxpkg.Phrase
maxlength=72

Notes:

The Pears Phrase index class can create index terms greater than 72 characters long. The 72-character limit used here demonstrates how to replicate this limit in Pears.
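
Putting the pieces together, a combad()-style definition in dbnamedesc.ini might look like the sketch below. It spells out the full package name in place of indxpkg and adds the two variables that Document Conventions lists as required for all index definitions. The values uniqueID_number and BER_tag_path are the placeholders used in that section, not real values, and the ordering of the variables is an assumption; the enclosing section header is omitted because it is not shown in this document.

```ini
index=uniqueID_number
tagpath*=BER_tag_path
routine=ORG.oclc.pears.IndexRoutines.Phrase
subfield*=1
subfield*=4
maxlength=72
joinFieldsWith=\u0020
```

Here joinFieldsWith=\u0020 is the Unicode escape for a single space, so the two subfield values would be joined with a space before the 72-character limit is applied.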