Constructor and Description |
---|
Tokenization(String[] tokens,
IntPair[] characterOffsets,
int[] sentenceEndTokenIndexes) |
Modifier and Type | Method and Description |
---|---|
IntPair[] |
getCharacterOffsets() |
int[] |
getSentenceEndTokenIndexes() |
List<Pair<String[],IntPair[]>> |
getTokenizedSentences()
get a list of pairs, each pair corresponding to a sentence's tokens and their *absolute*
character offsets.
|
String[] |
getTokens() |
public List<Pair<String[],IntPair[]>> getTokenizedSentences()
public String[] getTokens()
public int[] getSentenceEndTokenIndexes()
public IntPair[] getCharacterOffsets()
Copyright © 2017. All rights reserved.