Package | Description |
---|---|
org.apache.lucene.analysis.ar |
Analyzer for Arabic.
|
org.apache.lucene.analysis.bg |
Analyzer for Bulgarian.
|
org.apache.lucene.analysis.br |
Analyzer for Brazilian Portuguese.
|
org.apache.lucene.analysis.ca |
Analyzer for Catalan.
|
org.apache.lucene.analysis.cjk |
Analyzer for Chinese, Japanese, and Korean, which indexes bigrams.
|
org.apache.lucene.analysis.ckb |
Analyzer for Sorani Kurdish.
|
org.apache.lucene.analysis.cn.smart |
Analyzer for Simplified Chinese, which indexes words.
|
org.apache.lucene.analysis.commongrams |
Construct n-grams for frequently occurring terms and phrases.
|
org.apache.lucene.analysis.compound |
A filter that decomposes the compound words found in many Germanic
languages into their word parts.
|
org.apache.lucene.analysis.core |
Basic, general-purpose analysis components.
|
org.apache.lucene.analysis.cz |
Analyzer for Czech.
|
org.apache.lucene.analysis.da |
Analyzer for Danish.
|
org.apache.lucene.analysis.de |
Analyzer for German.
|
org.apache.lucene.analysis.el |
Analyzer for Greek.
|
org.apache.lucene.analysis.en |
Analyzer for English.
|
org.apache.lucene.analysis.es |
Analyzer for Spanish.
|
org.apache.lucene.analysis.eu |
Analyzer for Basque.
|
org.apache.lucene.analysis.fa |
Analyzer for Persian.
|
org.apache.lucene.analysis.fi |
Analyzer for Finnish.
|
org.apache.lucene.analysis.fr |
Analyzer for French.
|
org.apache.lucene.analysis.ga |
Analyzer for Irish.
|
org.apache.lucene.analysis.gl |
Analyzer for Galician.
|
org.apache.lucene.analysis.hi |
Analyzer for Hindi.
|
org.apache.lucene.analysis.hu |
Analyzer for Hungarian.
|
org.apache.lucene.analysis.hy |
Analyzer for Armenian.
|
org.apache.lucene.analysis.id |
Analyzer for Indonesian.
|
org.apache.lucene.analysis.in |
Analysis components for Indian languages.
|
org.apache.lucene.analysis.it |
Analyzer for Italian.
|
org.apache.lucene.analysis.lv |
Analyzer for Latvian.
|
org.apache.lucene.analysis.miscellaneous |
Miscellaneous TokenStreams.
|
org.apache.lucene.analysis.ngram |
Character n-gram tokenizers and filters.
|
org.apache.lucene.analysis.nl |
Analyzer for Dutch.
|
org.apache.lucene.analysis.no |
Analyzer for Norwegian.
|
org.apache.lucene.analysis.pl |
Analyzer for Polish.
|
org.apache.lucene.analysis.pt |
Analyzer for Portuguese.
|
org.apache.lucene.analysis.query |
Automatically filter high-frequency stopwords.
|
org.apache.lucene.analysis.reverse |
Filter to reverse token text.
|
org.apache.lucene.analysis.ro |
Analyzer for Romanian.
|
org.apache.lucene.analysis.ru |
Analyzer for Russian.
|
org.apache.lucene.analysis.shingle |
Word n-gram filters.
|
org.apache.lucene.analysis.snowball |
TokenFilter and Analyzer implementations that use Snowball
stemmers. |
org.apache.lucene.analysis.standard |
Fast, general-purpose grammar-based tokenizers.
|
org.apache.lucene.analysis.sv |
Analyzer for Swedish.
|
org.apache.lucene.analysis.th |
Analyzer for Thai.
|
org.apache.lucene.analysis.tr |
Analyzer for Turkish.
|
org.apache.lucene.analysis.util |
Utility functions for text analysis.
|
org.apache.lucene.collation |
Unicode collation support.
|
org.apache.lucene.index |
Miscellaneous index tools and index support.
|
org.apache.lucene.queryparser.analyzing |
QueryParser that passes Fuzzy-, Prefix-, Range-, and WildcardQuerys through the given analyzer.
|
org.apache.lucene.queryparser.classic |
A simple query parser implemented with JavaCC.
|
org.apache.lucene.queryparser.complexPhrase |
QueryParser that permits complex phrase query syntax, e.g. "(john jon jonathan~) peters*".
|
org.apache.lucene.queryparser.ext |
Extendable QueryParser that provides a simple and flexible extension mechanism by overloading query field names.
|
org.apache.lucene.search.suggest.analyzing |
Analyzer-based autosuggest.
|
org.apache.lucene.util |
Some utility classes.
|
Constructor and Description |
---|
ArabicAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ArabicAnalyzer.DEFAULT_STOPWORD_FILE . |
ArabicAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ArabicAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
ArabicLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Deprecated.
Construct a new ArabicLetterTokenizer using a given
AttributeSource.AttributeFactory . |
ArabicLetterTokenizer(Version matchVersion,
Reader in)
Deprecated.
Construct a new ArabicLetterTokenizer.
|
Constructor and Description |
---|
BulgarianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
BulgarianAnalyzer.DEFAULT_STOPWORD_FILE . |
BulgarianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BulgarianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and a stem exclusion set.
|
Constructor and Description |
---|
BrazilianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
BrazilianAnalyzer.getDefaultStopSet() ). |
BrazilianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BrazilianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stemming exclusion words.
|
Constructor and Description |
---|
CatalanAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
CatalanAnalyzer.DEFAULT_STOPWORD_FILE . |
CatalanAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CatalanAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
CJKAnalyzer(Version matchVersion)
Builds an analyzer which removes words in
CJKAnalyzer.getDefaultStopSet() . |
CJKAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
SoraniAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
SoraniAnalyzer.DEFAULT_STOPWORD_FILE . |
SoraniAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SoraniAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
SmartChineseAnalyzer(Version matchVersion)
Create a new SmartChineseAnalyzer, using the default stopword list.
|
SmartChineseAnalyzer(Version matchVersion,
boolean useDefaultStopWords)
Create a new SmartChineseAnalyzer, optionally using the default stopword list.
|
SmartChineseAnalyzer(Version matchVersion,
CharArraySet stopWords)
Create a new SmartChineseAnalyzer, using the provided
Set of stopwords. |
Constructor and Description |
---|
CommonGramsFilter(Version matchVersion,
TokenStream input,
CharArraySet commonWords)
Construct a token stream filtering the given input using a Set of common
words to create bigrams.
|
Modifier and Type | Field and Description |
---|---|
protected Version |
CompoundWordTokenFilterBase.matchVersion |
Constructor and Description |
---|
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
boolean onlyLongestMatch) |
CompoundWordTokenFilterBase(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch) |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
CharArraySet dictionary)
Creates a new
DictionaryCompoundWordTokenFilter |
DictionaryCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
DictionaryCompoundWordTokenFilter |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator)
Create a HyphenationCompoundWordTokenFilter with no dictionary.
|
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
CharArraySet dictionary,
int minWordSize,
int minSubwordSize,
int maxSubwordSize,
boolean onlyLongestMatch)
Creates a new
HyphenationCompoundWordTokenFilter instance. |
HyphenationCompoundWordTokenFilter(Version matchVersion,
TokenStream input,
HyphenationTree hyphenator,
int minWordSize,
int minSubwordSize,
int maxSubwordSize)
Create a HyphenationCompoundWordTokenFilter with no dictionary.
|
Modifier and Type | Method and Description |
---|---|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords)
Builds a Set from a list of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
List<?> stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword list.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
String... stopWords)
Builds a Set from an array of stop words,
appropriate for passing into the StopFilter constructor.
|
static CharArraySet |
StopFilter.makeStopSet(Version matchVersion,
String[] stopWords,
boolean ignoreCase)
Creates a stopword set from the given stopword array.
|
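
The `makeStopSet` variants above differ mainly in whether matching ignores case. As a rough, library-independent illustration, the sketch below uses plain `java.util` collections in place of `CharArraySet`. The class `StopSetSketch` and its methods are hypothetical and are not part of Lucene; the sketch only shows the idea of a case-insensitive stop set, not how `StopFilter` is implemented.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for illustration only; not Lucene's StopFilter.
class StopSetSketch {

    // Builds a stop set; when ignoreCase is true, entries are stored lower-cased.
    static Set<String> makeStopSet(List<String> stopWords, boolean ignoreCase) {
        Set<String> set = new HashSet<>();
        for (String w : stopWords) {
            set.add(ignoreCase ? w.toLowerCase(Locale.ROOT) : w);
        }
        return set;
    }

    // Returns the tokens that are not in the stop set, preserving their order.
    static List<String> removeStopWords(List<String> tokens, Set<String> stopSet, boolean ignoreCase) {
        List<String> kept = new ArrayList<>();
        for (String t : tokens) {
            String key = ignoreCase ? t.toLowerCase(Locale.ROOT) : t;
            if (!stopSet.contains(key)) {
                kept.add(t);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        Set<String> stops = makeStopSet(Arrays.asList("the", "a", "of"), true);
        System.out.println(removeStopWords(Arrays.asList("The", "history", "of", "Lucene"), stops, true));
    }
}
```
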
Constructor and Description |
---|
LetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new LetterTokenizer using a given
AttributeSource.AttributeFactory . |
LetterTokenizer(Version matchVersion,
Reader in)
Construct a new LetterTokenizer.
|
LowerCaseFilter(Version matchVersion,
TokenStream in)
Create a new LowerCaseFilter, that normalizes token text to lower case.
|
LowerCaseTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new LowerCaseTokenizer using a given
AttributeSource.AttributeFactory . |
LowerCaseTokenizer(Version matchVersion,
Reader in)
Construct a new LowerCaseTokenizer.
|
SimpleAnalyzer(Version matchVersion)
Creates a new
SimpleAnalyzer |
StopAnalyzer(Version matchVersion)
Builds an analyzer which removes words in
StopAnalyzer.ENGLISH_STOP_WORDS_SET . |
StopAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the stop words from the given set.
|
StopAnalyzer(Version matchVersion,
File stopwordsFile)
Builds an analyzer with the stop words from the given file.
|
StopAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
StopFilter(Version matchVersion,
TokenStream in,
CharArraySet stopWords)
Constructs a filter which removes words from the input TokenStream that are
named in the Set.
|
TypeTokenFilter(Version version,
boolean enablePositionIncrements,
TokenStream input,
Set<String> stopTypes)
Deprecated.
enablePositionIncrements=false is not supported anymore as of Lucene 4.4.
|
TypeTokenFilter(Version version,
boolean enablePositionIncrements,
TokenStream input,
Set<String> stopTypes,
boolean useWhiteList)
Deprecated.
enablePositionIncrements=false is not supported anymore as of Lucene 4.4.
|
TypeTokenFilter(Version version,
TokenStream input,
Set<String> stopTypes)
Create a new
TypeTokenFilter that filters tokens out
(useWhiteList=false). |
TypeTokenFilter(Version version,
TokenStream input,
Set<String> stopTypes,
boolean useWhiteList)
Create a new
TypeTokenFilter . |
UpperCaseFilter(Version matchVersion,
TokenStream in)
Create a new UpperCaseFilter, that normalizes token text to upper case.
|
WhitespaceAnalyzer(Version matchVersion)
Creates a new
WhitespaceAnalyzer |
WhitespaceTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Construct a new WhitespaceTokenizer using a given
AttributeSource.AttributeFactory . |
WhitespaceTokenizer(Version matchVersion,
Reader in)
Construct a new WhitespaceTokenizer.
|
Constructor and Description |
---|
CzechAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
CzechAnalyzer.getDefaultStopSet() ). |
CzechAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
CzechAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable)
Builds an analyzer with the given stop words and a set of words to be
excluded from the
CzechStemFilter . |
Constructor and Description |
---|
DanishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
DanishAnalyzer.DEFAULT_STOPWORD_FILE . |
DanishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
DanishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
GermanAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
GermanAnalyzer.getDefaultStopSet() . |
GermanAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GermanAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
GreekAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words.
|
GreekAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GreekLowerCaseFilter(Version matchVersion,
TokenStream in)
Create a GreekLowerCaseFilter that normalizes Greek token text.
|
Constructor and Description |
---|
EnglishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
EnglishAnalyzer.getDefaultStopSet() . |
EnglishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
EnglishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
EnglishPossessiveFilter(Version version,
TokenStream input) |
Constructor and Description |
---|
SpanishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
SpanishAnalyzer.DEFAULT_STOPWORD_FILE . |
SpanishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SpanishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
BasqueAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
BasqueAnalyzer.DEFAULT_STOPWORD_FILE . |
BasqueAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
BasqueAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
PersianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PersianAnalyzer.DEFAULT_STOPWORD_FILE . |
PersianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
Constructor and Description |
---|
FinnishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
FinnishAnalyzer.DEFAULT_STOPWORD_FILE . |
FinnishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
FinnishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
FrenchAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
FrenchAnalyzer.getDefaultStopSet() ). |
FrenchAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
FrenchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclutionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
IrishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
IrishAnalyzer.DEFAULT_STOPWORD_FILE . |
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IrishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
GalicianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
GalicianAnalyzer.DEFAULT_STOPWORD_FILE . |
GalicianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
GalicianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
HindiAnalyzer(Version version)
Builds an analyzer with the default stop words:
HindiAnalyzer.DEFAULT_STOPWORD_FILE . |
HindiAnalyzer(Version version,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
HindiAnalyzer(Version version,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
HungarianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
HungarianAnalyzer.DEFAULT_STOPWORD_FILE . |
HungarianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
HungarianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
ArmenianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ArmenianAnalyzer.DEFAULT_STOPWORD_FILE . |
ArmenianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ArmenianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
IndonesianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
IndonesianAnalyzer.DEFAULT_STOPWORD_FILE . |
IndonesianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
IndonesianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
IndicTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Deprecated.
|
IndicTokenizer(Version matchVersion,
Reader input)
Deprecated.
|
Constructor and Description |
---|
ItalianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
ItalianAnalyzer.DEFAULT_STOPWORD_FILE . |
ItalianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ItalianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
LatvianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
LatvianAnalyzer.DEFAULT_STOPWORD_FILE . |
LatvianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
LatvianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
CodepointCountFilter(Version version,
TokenStream in,
int min,
int max)
Create a new
CodepointCountFilter . |
KeepWordFilter(Version version,
boolean enablePositionIncrements,
TokenStream in,
CharArraySet words)
Deprecated.
enablePositionIncrements=false is not supported anymore as of Lucene 4.4.
|
KeepWordFilter(Version version,
TokenStream in,
CharArraySet words)
Create a new
KeepWordFilter . |
LengthFilter(Version version,
boolean enablePositionIncrements,
TokenStream in,
int min,
int max)
Deprecated.
enablePositionIncrements=false is not supported anymore as of Lucene 4.4.
|
LengthFilter(Version version,
TokenStream in,
int min,
int max)
Create a new
LengthFilter . |
PatternAnalyzer(Version matchVersion,
Pattern pattern,
boolean toLowerCase,
CharArraySet stopWords)
Deprecated.
Constructs a new instance with the given parameters.
|
TrimFilter(Version version,
TokenStream in)
Create a new
TrimFilter on top of in . |
TrimFilter(Version version,
TokenStream in,
boolean updateOffsets)
Deprecated.
Offset updates are not supported anymore as of Lucene 4.4.
|
WordDelimiterFilter(Version matchVersion,
TokenStream in,
byte[] charTypeTable,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterFilter.
|
WordDelimiterFilter(Version matchVersion,
TokenStream in,
int configurationFlags,
CharArraySet protWords)
Creates a new WordDelimiterFilter using
WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE
as its charTypeTable |
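
`LengthFilter` and `CodepointCountFilter` above take the same `min` and `max` arguments. The sketch below illustrates the distinction their names suggest: counting UTF-16 chars versus counting Unicode code points. It assumes inclusive bounds, which this listing does not state, and the class `TokenLengthSketch` is hypothetical plain Java rather than Lucene code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; these helpers are not Lucene classes.
class TokenLengthSketch {

    // Keeps tokens whose UTF-16 length (String.length()) lies in [min, max].
    static List<String> keepByCharLength(List<String> tokens, int min, int max) {
        List<String> kept = new ArrayList<>();
        for (String t : tokens) {
            int len = t.length();
            if (len >= min && len <= max) {
                kept.add(t);
            }
        }
        return kept;
    }

    // Keeps tokens whose Unicode code point count lies in [min, max].
    static List<String> keepByCodePointCount(List<String> tokens, int min, int max) {
        List<String> kept = new ArrayList<>();
        for (String t : tokens) {
            int count = t.codePointCount(0, t.length());
            if (count >= min && count <= max) {
                kept.add(t);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // The second token ends with U+1F600: one code point stored as two chars.
        List<String> tokens = Arrays.asList("a", "ab\uD83D\uDE00", "abcdef");
        System.out.println(keepByCharLength(tokens, 2, 3).size());
        System.out.println(keepByCodePointCount(tokens, 2, 3).size());
    }
}
```

The two counts differ only for tokens that contain supplementary characters, which is why the second token passes one check and not the other.
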
Constructor and Description |
---|
EdgeNGramTokenFilter(Version version,
TokenStream input,
EdgeNGramTokenFilter.Side side,
int minGram,
int maxGram)
Deprecated.
|
EdgeNGramTokenFilter(Version version,
TokenStream input,
int minGram,
int maxGram)
Creates an EdgeNGramTokenFilter that generates n-grams with sizes in the given range.
|
EdgeNGramTokenFilter(Version version,
TokenStream input,
String sideLabel,
int minGram,
int maxGram)
Deprecated.
|
EdgeNGramTokenizer(Version version,
AttributeSource.AttributeFactory factory,
Reader input,
int minGram,
int maxGram)
Creates an EdgeNGramTokenizer that generates n-grams with sizes in the given range.
|
EdgeNGramTokenizer(Version version,
Reader input,
int minGram,
int maxGram)
Creates an EdgeNGramTokenizer that generates n-grams with sizes in the given range.
|
Lucene43EdgeNGramTokenizer(Version version,
AttributeSource.AttributeFactory factory,
Reader input,
int minGram,
int maxGram)
Deprecated.
Creates an EdgeNGramTokenizer that generates n-grams with sizes in the given range.
|
Lucene43EdgeNGramTokenizer(Version version,
AttributeSource.AttributeFactory factory,
Reader input,
Lucene43EdgeNGramTokenizer.Side side,
int minGram,
int maxGram)
Deprecated.
|
Lucene43EdgeNGramTokenizer(Version version,
AttributeSource.AttributeFactory factory,
Reader input,
String sideLabel,
int minGram,
int maxGram)
Deprecated.
|
Lucene43EdgeNGramTokenizer(Version version,
Reader input,
int minGram,
int maxGram)
Deprecated.
Creates an EdgeNGramTokenizer that generates n-grams with sizes in the given range.
|
Lucene43EdgeNGramTokenizer(Version version,
Reader input,
Lucene43EdgeNGramTokenizer.Side side,
int minGram,
int maxGram)
Deprecated.
|
Lucene43EdgeNGramTokenizer(Version version,
Reader input,
String sideLabel,
int minGram,
int maxGram)
Deprecated.
|
NGramTokenFilter(Version version,
TokenStream input)
Creates NGramTokenFilter with default min and max n-grams.
|
NGramTokenFilter(Version version,
TokenStream input,
int minGram,
int maxGram)
Creates NGramTokenFilter with given min and max n-grams.
|
NGramTokenizer(Version version,
AttributeSource.AttributeFactory factory,
Reader input,
int minGram,
int maxGram)
Creates NGramTokenizer with given min and max n-grams.
|
NGramTokenizer(Version version,
Reader input)
Creates NGramTokenizer with default min and max n-grams.
|
NGramTokenizer(Version version,
Reader input,
int minGram,
int maxGram)
Creates NGramTokenizer with given min and max n-grams.
|
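
To make the `minGram` and `maxGram` parameters above concrete, here is a minimal plain-Java sketch of character n-grams and edge (prefix) n-grams. The class `NGramSketch` is hypothetical, and the output order shown (by start offset, then by length) is an assumption of this sketch rather than a statement about the order Lucene's tokenizers emit.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not Lucene's n-gram tokenizers.
class NGramSketch {

    // All substrings of length minGram..maxGram, ordered by start offset, then length.
    // Assumes 1 <= minGram <= maxGram.
    static List<String> ngrams(String text, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        for (int start = 0; start < text.length(); start++) {
            for (int n = minGram; n <= maxGram && start + n <= text.length(); n++) {
                grams.add(text.substring(start, start + n));
            }
        }
        return grams;
    }

    // Only the n-grams anchored at the start of the text (its prefixes).
    static List<String> edgeNGrams(String text, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        for (int n = minGram; n <= maxGram && n <= text.length(); n++) {
            grams.add(text.substring(0, n));
        }
        return grams;
    }

    public static void main(String[] args) {
        System.out.println(ngrams("abc", 1, 2));        // [a, ab, b, bc, c]
        System.out.println(edgeNGrams("lucene", 2, 4)); // [lu, luc, luce]
    }
}
```
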
Constructor and Description |
---|
DutchAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
DutchAnalyzer.getDefaultStopSet() )
and a few default entries for the stem exclusion table. |
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords) |
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable) |
DutchAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionTable,
CharArrayMap<String> stemOverrideDict) |
Constructor and Description |
---|
NorwegianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
NorwegianAnalyzer.DEFAULT_STOPWORD_FILE . |
NorwegianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
NorwegianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
PolishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PolishAnalyzer.DEFAULT_STOPWORD_FILE . |
PolishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
PolishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
PortugueseAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
PortugueseAnalyzer.DEFAULT_STOPWORD_FILE . |
PortugueseAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
PortugueseAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
QueryAutoStopWordAnalyzer.defaultMaxDocFreqPercent |
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
Collection<String> fields,
float maxPercentDocs)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency percentage
greater than the given maxPercentDocs.
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
Collection<String> fields,
int maxDocFreq)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for the
given selection of fields from terms with a document frequency greater than
the given maxDocFreq.
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
float maxPercentDocs)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency percentage greater than
the given maxPercentDocs.
|
QueryAutoStopWordAnalyzer(Version matchVersion,
Analyzer delegate,
IndexReader indexReader,
int maxDocFreq)
Creates a new QueryAutoStopWordAnalyzer with stopwords calculated for all
indexed fields from terms with a document frequency greater than the given
maxDocFreq.
|
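
The constructors above select stop words by document frequency, either as an absolute count (`maxDocFreq`) or as a share of all documents (`maxPercentDocs`). The sketch below shows that selection rule over a plain map of term to document frequency. It assumes `maxPercentDocs` is a fraction between 0.0 and 1.0 that is converted to an absolute threshold by multiplying with the document count. The class `AutoStopWordSketch` and its inputs are hypothetical and do not read a real index.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not Lucene's QueryAutoStopWordAnalyzer.
class AutoStopWordSketch {

    // Terms whose document frequency is strictly greater than maxDocFreq.
    static Set<String> stopWordsByDocFreq(Map<String, Integer> docFreqs, int maxDocFreq) {
        Set<String> stopWords = new TreeSet<>();
        for (Map.Entry<String, Integer> e : docFreqs.entrySet()) {
            if (e.getValue() > maxDocFreq) {
                stopWords.add(e.getKey());
            }
        }
        return stopWords;
    }

    // Assumption: maxPercentDocs is a fraction in [0.0, 1.0] of numDocs.
    static Set<String> stopWordsByPercent(Map<String, Integer> docFreqs, int numDocs, float maxPercentDocs) {
        int maxDocFreq = (int) (maxPercentDocs * numDocs);
        return stopWordsByDocFreq(docFreqs, maxDocFreq);
    }

    public static void main(String[] args) {
        Map<String, Integer> docFreqs = new HashMap<>();
        docFreqs.put("the", 90);
        docFreqs.put("of", 50);
        docFreqs.put("lucene", 10);
        System.out.println(stopWordsByPercent(docFreqs, 100, 0.4f));
    }
}
```
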
Modifier and Type | Method and Description |
---|---|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer)
Reverses the given input buffer in-place.
|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer,
int len)
Partially reverses the given input buffer in-place from offset 0
up to the given length.
|
static void |
ReverseStringFilter.reverse(Version matchVersion,
char[] buffer,
int start,
int len)
Partially reverses the given input buffer in-place from the given offset
up to the given length.
|
static String |
ReverseStringFilter.reverse(Version matchVersion,
String input)
Reverses the given input string.
|
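
The `reverse` overloads above differ only in which part of the buffer they touch. A minimal plain-Java sketch of the same offset and length semantics follows. The class `ReverseSketch` is hypothetical; it swaps individual `char` values and makes no attempt to keep surrogate pairs in order, so it should not be read as the behavior of `ReverseStringFilter`.

```java
// Hypothetical illustration only; not Lucene's ReverseStringFilter.
class ReverseSketch {

    // Reverses buffer[start .. start + len - 1] in place.
    static void reverse(char[] buffer, int start, int len) {
        for (int i = start, j = start + len - 1; i < j; i++, j--) {
            char tmp = buffer[i];
            buffer[i] = buffer[j];
            buffer[j] = tmp;
        }
    }

    // Reverses the first len chars, starting at offset 0.
    static void reverse(char[] buffer, int len) {
        reverse(buffer, 0, len);
    }

    // Reverses the whole buffer.
    static void reverse(char[] buffer) {
        reverse(buffer, 0, buffer.length);
    }

    // Returns a reversed copy of the input string.
    static String reverse(String input) {
        char[] chars = input.toCharArray();
        reverse(chars);
        return new String(chars);
    }

    public static void main(String[] args) {
        char[] buffer = "abcdef".toCharArray();
        reverse(buffer, 1, 3);
        System.out.println(new String(buffer)); // adcbef
        System.out.println(reverse("lucene"));  // enecul
    }
}
```
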
Constructor and Description |
---|
ReverseStringFilter(Version matchVersion,
TokenStream in)
Create a new ReverseStringFilter that reverses all tokens in the
supplied
TokenStream . |
ReverseStringFilter(Version matchVersion,
TokenStream in,
char marker)
Create a new ReverseStringFilter that reverses and marks all tokens in the
supplied
TokenStream . |
Constructor and Description |
---|
RomanianAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
RomanianAnalyzer.DEFAULT_STOPWORD_FILE . |
RomanianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
RomanianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
RussianAnalyzer(Version matchVersion) |
RussianAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
RussianAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
RussianLetterTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader in)
Deprecated.
Construct a new RussianLetterTokenizer using a given
AttributeSource.AttributeFactory . |
RussianLetterTokenizer(Version matchVersion,
Reader in)
Deprecated.
Construct a new RussianLetterTokenizer.
|
Constructor and Description |
---|
ShingleAnalyzerWrapper(Version matchVersion)
Wraps
StandardAnalyzer . |
ShingleAnalyzerWrapper(Version matchVersion,
int minShingleSize,
int maxShingleSize)
Wraps
StandardAnalyzer . |
Constructor and Description |
---|
SnowballAnalyzer(Version matchVersion,
String name)
Deprecated.
Builds the named analyzer with no stop words.
|
SnowballAnalyzer(Version matchVersion,
String name,
CharArraySet stopWords)
Deprecated.
Builds the named analyzer with the given stop words.
|
Constructor and Description |
---|
ClassicAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
ClassicAnalyzer.STOP_WORDS_SET ). |
ClassicAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
ClassicAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
ClassicTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new ClassicTokenizer with a given
AttributeSource.AttributeFactory |
ClassicTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the
ClassicTokenizer . |
StandardAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
StandardAnalyzer.STOP_WORDS_SET ). |
StandardAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
StandardAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
StandardFilter(Version matchVersion,
TokenStream in) |
StandardTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new StandardTokenizer with a given
AttributeSource.AttributeFactory |
StandardTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the
StandardTokenizer . |
UAX29URLEmailAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words (
UAX29URLEmailAnalyzer.STOP_WORDS_SET ). |
UAX29URLEmailAnalyzer(Version matchVersion,
CharArraySet stopWords)
Builds an analyzer with the given stop words.
|
UAX29URLEmailAnalyzer(Version matchVersion,
Reader stopwords)
Builds an analyzer with the stop words from the given reader.
|
UAX29URLEmailTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new UAX29URLEmailTokenizer with a given
AttributeSource.AttributeFactory |
UAX29URLEmailTokenizer(Version matchVersion,
Reader input)
Creates a new instance of the UAX29URLEmailTokenizer.
|
Constructor and Description |
---|
SwedishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
SwedishAnalyzer.DEFAULT_STOPWORD_FILE . |
SwedishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
SwedishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Constructor and Description |
---|
ThaiAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words.
|
ThaiAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
ThaiWordFilter(Version matchVersion,
TokenStream input)
Deprecated.
Creates a new ThaiWordFilter with the specified match version.
|
Constructor and Description |
---|
TurkishAnalyzer(Version matchVersion)
Builds an analyzer with the default stop words:
TurkishAnalyzer.DEFAULT_STOPWORD_FILE. |
TurkishAnalyzer(Version matchVersion,
CharArraySet stopwords)
Builds an analyzer with the given stop words.
|
TurkishAnalyzer(Version matchVersion,
CharArraySet stopwords,
CharArraySet stemExclusionSet)
Builds an analyzer with the given stop words and stem exclusion set.
|
Modifier and Type | Field and Description |
---|---|
protected Version |
AbstractAnalysisFactory.luceneMatchVersion
The Lucene match version supplied as a factory argument.
|
protected Version |
StopwordAnalyzerBase.matchVersion |
protected Version |
FilteringTokenFilter.version |
Modifier and Type | Method and Description |
---|---|
Version |
AbstractAnalysisFactory.getLuceneMatchVersion() |
Modifier and Type | Method and Description |
---|---|
static <V> CharArrayMap<V> |
CharArrayMap.copy(Version matchVersion,
Map<?,? extends V> map)
Returns a copy of the given map as a CharArrayMap. |
static CharArraySet |
CharArraySet.copy(Version matchVersion,
Set<?> set)
Returns a copy of the given set as a CharArraySet. |
static CharacterUtils |
CharacterUtils.getInstance(Version matchVersion)
Returns a CharacterUtils implementation according to the given
Version instance. |
static CharArraySet |
WordlistLoader.getSnowballWordSet(Reader reader,
Version matchVersion)
Reads stopwords from a stopword list in Snowball format.
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
String comment,
Version matchVersion)
Reads lines from a Reader and adds every non-comment line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
static CharArraySet |
WordlistLoader.getWordSet(Reader reader,
Version matchVersion)
Reads lines from a Reader and adds every line as an entry to a CharArraySet (omitting
leading and trailing whitespace).
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(File stopwords,
Version matchVersion)
Creates a CharArraySet from a file.
|
protected static CharArraySet |
StopwordAnalyzerBase.loadStopwordSet(Reader stopwords,
Version matchVersion)
Creates a CharArraySet from a Reader.
|
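The behaviour described for `WordlistLoader.getWordSet(Reader, String, Version)` (skip comment lines, trim each remaining line, collect the entries) can be illustrated with a minimal sketch in plain Java. This is not the Lucene implementation: the class `WordSetSketch` and the method `readWordSet` are hypothetical names, a `java.util.Set<String>` stands in for `CharArraySet`, and the `Version` argument is omitted.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.LinkedHashSet;
import java.util.Set;

public class WordSetSketch {

    // Hypothetical stand-in for WordlistLoader.getWordSet(Reader, String, Version).
    // Assumptions: a plain Set<String> replaces CharArraySet and no Version is taken.
    public static Set<String> readWordSet(Reader reader, String comment) {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                // Keep every line that does not start with the comment marker,
                // dropping leading and trailing whitespace.
                if (!line.startsWith(comment)) {
                    words.add(line.trim());
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return words;
    }

    public static void main(String[] args) {
        String list = "# sample stop words\n  the \nand\n# another comment\nof  \n";
        Set<String> words = readWordSet(new StringReader(list), "#");
        System.out.println(words); // prints [the, and, of]
    }
}
```

With the real API, the returned `CharArraySet` would typically be passed on to one of the analyzer constructors listed on this page that accept a stop word set.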
Constructor and Description |
---|
CharArrayMap(Version matchVersion,
int startSize,
boolean ignoreCase)
Creates a map with enough capacity to hold startSize terms.
|
CharArrayMap(Version matchVersion,
Map<?,? extends V> c,
boolean ignoreCase)
Creates a map from the mappings in another map.
|
CharArraySet(Version matchVersion,
Collection<?> c,
boolean ignoreCase)
Creates a set from a Collection of objects.
|
CharArraySet(Version matchVersion,
int startSize,
boolean ignoreCase)
Creates a set with enough capacity to hold startSize terms.
|
CharTokenizer(Version matchVersion,
AttributeSource.AttributeFactory factory,
Reader input)
Creates a new CharTokenizer instance. |
CharTokenizer(Version matchVersion,
Reader input)
Creates a new CharTokenizer instance. |
FilteringTokenFilter(Version version,
boolean enablePositionIncrements,
TokenStream input)
Deprecated.
enablePositionIncrements=false is no longer supported as of Lucene 4.4.
|
FilteringTokenFilter(Version version,
TokenStream in)
Creates a new FilteringTokenFilter. |
StopwordAnalyzerBase(Version version)
Creates a new Analyzer with an empty stopword set.
|
StopwordAnalyzerBase(Version version,
CharArraySet stopwords)
Creates a new instance initialized with the given stopword set.
|
Constructor and Description |
---|
CollationKeyAnalyzer(Version matchVersion,
Collator collator)
Create a new CollationKeyAnalyzer, using the specified collator.
|
Modifier and Type | Field and Description |
---|---|
protected Version |
LiveIndexWriterConfig.matchVersion
Version that IndexWriter should emulate. |
Modifier and Type | Method and Description |
---|---|
void |
MultiPassIndexSplitter.split(Version version,
IndexReader in,
Directory[] outputs,
boolean seq)
Split source index into multiple parts.
|
Constructor and Description |
---|
IndexUpgrader(Directory dir,
Version matchVersion)
Creates an index upgrader on the given directory, using an
IndexWriter configured with the given matchVersion. |
IndexUpgrader(Directory dir,
Version matchVersion,
PrintStream infoStream,
boolean deletePriorCommits)
Creates an index upgrader on the given directory, using an
IndexWriter configured with the given matchVersion. |
IndexWriterConfig(Version matchVersion,
Analyzer analyzer)
|
PKIndexSplitter(Version version,
Directory input,
Directory dir1,
Directory dir2,
Filter docsInFirstIndex)
Splits an index based on a Filter. |
PKIndexSplitter(Version version,
Directory input,
Directory dir1,
Directory dir2,
Term midTerm)
Splits an index based on a given primary key term
and a 'middle' term.
|
Constructor and Description |
---|
AnalyzingQueryParser(Version matchVersion,
String field,
Analyzer analyzer) |
Modifier and Type | Method and Description |
---|---|
void |
QueryParserBase.init(Version matchVersion,
String f,
Analyzer a)
Initializes a query parser.
|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String[] queries,
String[] fields,
Analyzer analyzer)
Parses a query which searches on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String[] queries,
String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
static Query |
MultiFieldQueryParser.parse(Version matchVersion,
String query,
String[] fields,
BooleanClause.Occur[] flags,
Analyzer analyzer)
Parses a query, searching on the fields specified.
|
Constructor and Description |
---|
MultiFieldQueryParser(Version matchVersion,
String[] fields,
Analyzer analyzer)
Creates a MultiFieldQueryParser.
|
MultiFieldQueryParser(Version matchVersion,
String[] fields,
Analyzer analyzer,
Map<String,Float> boosts)
Creates a MultiFieldQueryParser.
|
QueryParser(Version matchVersion,
String f,
Analyzer a)
Create a query parser.
|
Constructor and Description |
---|
ComplexPhraseQueryParser(Version matchVersion,
String f,
Analyzer a) |
Constructor and Description |
---|
ExtendableQueryParser(Version matchVersion,
String f,
Analyzer a)
Creates a new ExtendableQueryParser instance. |
ExtendableQueryParser(Version matchVersion,
String f,
Analyzer a,
Extensions ext)
Creates a new ExtendableQueryParser instance. |
Modifier and Type | Method and Description |
---|---|
protected IndexWriterConfig |
AnalyzingInfixSuggester.getIndexWriterConfig(Version matchVersion,
Analyzer indexAnalyzer,
IndexWriterConfig.OpenMode openMode)
Override this to customize index settings.
|
Constructor and Description |
---|
AnalyzingInfixSuggester(Version matchVersion,
Directory dir,
Analyzer analyzer)
Create a new instance, loading from a previously built
AnalyzingInfixSuggester directory, if it exists.
|
AnalyzingInfixSuggester(Version matchVersion,
Directory dir,
Analyzer indexAnalyzer,
Analyzer queryAnalyzer,
int minPrefixChars)
Create a new instance, loading from a previously built
AnalyzingInfixSuggester directory, if it exists.
|
BlendedInfixSuggester(Version matchVersion,
Directory dir,
Analyzer analyzer)
Create a new instance, loading from a previously built
directory, if it exists.
|
BlendedInfixSuggester(Version matchVersion,
Directory dir,
Analyzer indexAnalyzer,
Analyzer queryAnalyzer,
int minPrefixChars,
BlendedInfixSuggester.BlenderType blenderType,
int numFactor)
Create a new instance, loading from a previously built
directory, if it exists.
|
Modifier and Type | Method and Description |
---|---|
static Version |
Version.parseLeniently(String version) |
static Version |
Version.valueOf(String name)
Returns the enum constant of this type with the specified name.
|
static Version[] |
Version.values()
Returns an array containing the constants of this enum type, in
the order they are declared.
|
Modifier and Type | Method and Description |
---|---|
boolean |
Version.onOrAfter(Version other) |
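The tables above give no description for `Version.parseLeniently(String)` or `Version.onOrAfter(Version)`. The sketch below shows one plausible reading of the two method names using a stand-in enum in plain Java. Everything in it is an assumption made for illustration: the class `VersionSketch`, the enum `MatchVersion`, its three constants, the ordinal-based comparison, and the accepted input forms are not taken from this page.

```java
import java.util.Locale;

public class VersionSketch {

    // Hypothetical stand-in enum; it is not the Lucene Version enum.
    public enum MatchVersion {
        LUCENE_40, LUCENE_41, LUCENE_42;

        // Assumption: constants are declared oldest first, so enum order
        // doubles as release order.
        public boolean onOrAfter(MatchVersion other) {
            return compareTo(other) >= 0;
        }

        // Assumption: "lenient" means accepting "4.1" or "lucene_41"
        // in addition to the exact constant name "LUCENE_41".
        public static MatchVersion parseLeniently(String version) {
            String upper = version.toUpperCase(Locale.ROOT);
            return valueOf(upper.replaceFirst("^(\\d)\\.(\\d)$", "LUCENE_$1$2"));
        }
    }

    public static void main(String[] args) {
        MatchVersion v = MatchVersion.parseLeniently("4.1");
        System.out.println(v);                                   // LUCENE_41
        System.out.println(v.onOrAfter(MatchVersion.LUCENE_40)); // true
        System.out.println(v.onOrAfter(MatchVersion.LUCENE_42)); // false
    }
}
```

A check such as `onOrAfter` is the kind of test a component receiving a `matchVersion` argument could use to choose between old and new behaviour.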
Copyright © 2000-2016 The Apache Software Foundation. All Rights Reserved.