
@Title(value="Term frequency parser") @Description(value="Parse a file containing term frequencies. The expected format is \'label term1term2 ...\'. Terms must not contain the separator character!") public class TermFrequencyParser extends NumberVectorLabelParser<SparseFloatVector>
| Modifier and Type | Class and Description |
|---|---|
static class |
TermFrequencyParser.Parameterizer
Parameterization class.
|
BundleStreamSource.Event| Modifier and Type | Field and Description |
|---|---|
(package private) HashMap<String,Integer> |
keymap
Map
|
private static Logging |
logger
Class logger
|
(package private) int |
maxdim
Maximum dimension used
|
(package private) boolean |
normalize
Normalize
|
columnnames, curlbl, curvec, dimensionality, DIMENSIONALITY_UNKNOWN, DIMENSIONALITY_VARIABLE, factory, LABEL_INDICES_ID, labelcolumns, labelIndices, lineNumber, meta, nextevent, VECTOR_TYPE_IDATTRIBUTE_CONCATENATION, COLUMN_SEPARATOR_ID, COMMENT, DEFAULT_SEPARATOR, NUMBER_PATTERN, QUOTE_CHAR, QUOTE_ID, quoteChar| Constructor and Description |
|---|
TermFrequencyParser(boolean normalize,
Pattern colSep,
char quoteChar,
BitSet labelIndices)
Constructor.
|
| Modifier and Type | Method and Description |
|---|---|
protected Logging |
getLogger()
Get the logger for this class.
|
protected SimpleTypeInformation<SparseFloatVector> |
getTypeInformation(int dimensionality)
Get a prototype object for the given dimensionality.
|
protected void |
parseLineInternal(String line)
Internal method for parsing a single line.
|
buildMeta, createDBObject, data, getMeta, initStream, nextEventparsetokenize, toStringprivate static final Logging logger
int maxdim
boolean normalize
protected void parseLineInternal(String line)
NumberVectorLabelParserparseLineInternal in class NumberVectorLabelParser<SparseFloatVector>line - Line to processprotected SimpleTypeInformation<SparseFloatVector> getTypeInformation(int dimensionality)
NumberVectorLabelParsergetTypeInformation in class NumberVectorLabelParser<SparseFloatVector>dimensionality - Dimensionalityprotected Logging getLogger()
AbstractParsergetLogger in class NumberVectorLabelParser<SparseFloatVector>