@Deprecated public class QGramExtended extends AbstractTokenizer
This class is immutable and thread-safe.
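The padded q-gram idea behind this tokenizer can be illustrated with a small, self-contained sketch. It is not the library's implementation: the class name `PaddedQGramSketch`, the helper `repeat`, the choice to repeat the padding `q - 1` times on each side, the `#` padding used in the demo, and the empty-list result for empty input are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration of padded q-gram tokenization.
// This is NOT the QGramExtended source; names and edge-case behavior are assumptions.
final class PaddedQGramSketch {

    private PaddedQGramSketch() {
    }

    // Repeats s the given number of times (written by hand so it compiles on Java 8).
    static String repeat(String s, int times) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < times; i++) {
            sb.append(s);
        }
        return sb.toString();
    }

    // Pads the input with (q - 1) copies of each padding string,
    // then slides a window of width q over the padded string.
    static List<String> tokenize(String input, int q, String startPadding, String endPadding) {
        if (q < 1) {
            throw new IllegalArgumentException("q must be at least 1");
        }
        if (input.isEmpty()) {
            // Assumption: empty input yields no tokens.
            return Collections.emptyList();
        }
        String padded = repeat(startPadding, q - 1) + input + repeat(endPadding, q - 1);
        List<String> tokens = new ArrayList<>();
        for (int i = 0; i + q <= padded.length(); i++) {
            tokens.add(padded.substring(i, i + q));
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("abc", 2, "#", "#")); // prints [#a, ab, bc, c#]
    }
}
```

With padding, the first and last characters of the input appear in as many tokens as the interior characters do, which is the usual motivation for extending q-grams past the ends of the string.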
Constructor and Description |
---|
QGramExtended(int q) Deprecated. Constructs a q-gram tokenizer with the given q and default padding. |
QGramExtended(int q, String startPadding, String endPadding) Deprecated. Constructs a q-gram tokenizer with the given q and padding. |
Modifier and Type | Method and Description |
---|---|
String | getEndPadding() Deprecated. Returns the end padding. |
String | getStartPadding() Deprecated. Returns the start padding. |
List<String> | tokenizeToList(String input) Deprecated. Returns the tokenized version of a string as a list of tokens. |
String | toString() Deprecated. |
Methods inherited from class AbstractTokenizer: tokenizeToSet
public QGramExtended(int q, String startPadding, String endPadding)
q - size of the tokens
startPadding - padding to apply at the start of short tokens
endPadding - padding to apply at the end of short tokens
public QGramExtended(int q)
q - size of the tokens
public String getStartPadding()
public String getEndPadding()
public List<String> tokenizeToList(String input)
Specified by: tokenizeToList in interface Tokenizer
input - input string to tokenize
Copyright © 2014–2018. All rights reserved.