|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.lucene.analysis.Analyzer
edu.mayo.informatics.indexer.lucene.analyzers.NormAnalyzer
public class NormAnalyzer
This is an analyzer that uses LVG to normalize each term before it is inserted into the index.
Field Summary | |
---|---|
static int |
LVG_CACHE_SIZE
|
static java.lang.String |
LVG_CONFIG_FILE_ABSOLUTE
|
Constructor Summary | |
---|---|
NormAnalyzer()
Create a new NormAnalyzer. |
|
NormAnalyzer(boolean keepOrigional)
|
|
NormAnalyzer(boolean keepOrigional,
java.lang.String[] stopWords,
char[] charsToRemove,
char[] charsToTreatAsWhiteSpace)
Create a norm analyzer. |
|
NormAnalyzer(java.lang.String lvgConfigFileLocation,
boolean keepOrigional)
The lvg config file location is required. |
|
NormAnalyzer(java.lang.String lvgConfigFileLocation,
boolean keepOrigional,
java.lang.String[] stopWords,
char[] charsToRemove,
char[] charsToTreatAsWhiteSpace)
The lvg config file location is required. |
Method Summary | |
---|---|
WhiteSpaceLowerCaseAnalyzer |
getWhiteSpaceLowerCaseAnalyzer()
This method should not be part of the public API - but design requirements require it to be public. |
void |
setWhiteSpaceLowerCaseAnalyzer(WhiteSpaceLowerCaseAnalyzer whiteSpaceLowerCaseAnalyzer)
This method should not be part of the public API - but design requirements require it to be public. |
org.apache.lucene.analysis.TokenStream |
tokenStream(java.lang.String fieldname,
java.io.Reader reader)
|
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static java.lang.String LVG_CONFIG_FILE_ABSOLUTE
public static int LVG_CACHE_SIZE
Constructor Detail |
---|
public NormAnalyzer()
public NormAnalyzer(java.lang.String lvgConfigFileLocation, boolean keepOrigional, java.lang.String[] stopWords, char[] charsToRemove, char[] charsToTreatAsWhiteSpace)
lvgConfigFileLocation
- keepOrigional
- stopWords
- - Stop words to use - not used if null or empty.charsToRemove
- - characters to remove from input (before norm) - not used if
null or empty.charsToTreatAsWhiteSpace
- - characters to treat as whitespace (split points) - defaults
to typical whitespace if null or empty.public NormAnalyzer(boolean keepOrigional, java.lang.String[] stopWords, char[] charsToRemove, char[] charsToTreatAsWhiteSpace)
keepOrigional
- stopWords
- - Stop words to use - not used if null or empty.charsToRemove
- - characters to remove from input (before norm) - not used if
null or empty.charsToTreatAsWhiteSpace
- - characters to treat as whitespace (split points) - defaults
to typical whitespace if null or empty.public NormAnalyzer(java.lang.String lvgConfigFileLocation, boolean keepOrigional)
lvgConfigFileLocation
- keepOrigional
- public NormAnalyzer(boolean keepOrigional)
Method Detail |
---|
public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldname, java.io.Reader reader)
tokenStream
in class org.apache.lucene.analysis.Analyzer
public WhiteSpaceLowerCaseAnalyzer getWhiteSpaceLowerCaseAnalyzer()
public void setWhiteSpaceLowerCaseAnalyzer(WhiteSpaceLowerCaseAnalyzer whiteSpaceLowerCaseAnalyzer)
|
Copyright: (c) 2004-2006 Mayo Foundation for Medical Education and Research (MFMER). All rights reserved. MAYO, MAYO CLINIC, and the triple-shield Mayo logo are trademarks and service marks of MFMER. | ||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |