Package LBJ2.nlp.seg

The segmentation of sequences of words into semantically meaningful groups is a common NLP paradigm; this package aims to support such tasks in a general way.


Class Summary
BIOTester: This class may be used to produce a detailed report of the segment-by-segment performance of a given classifier on given labeled testing data.
PlainToTokenParser: This parser takes the Words in the representation created by another Parser and creates a new representation consisting of Tokens.
POSBracketToToken: This parser expects labeled data as described in POSBracketToVector, and it returns Token objects representing that labeled data.
SegmentTagPlain: Use this command line program to produce textual segment annotations on an input text using a learned Word classifier.
Token: Simple extension of the Word class from LBJ's NLP library.
WordsToTokens: This parser calls another parser that returns LinkedVectors of Words, converts the Words to Tokens, and returns LinkedVectors of Tokens.
 

Package LBJ2.nlp.seg Description

The segmentation of sequences of words into semantically meaningful groups is a common NLP paradigm; this package aims to support such tasks in a general way. Two examples of such a task, a chunker and a named entity tagger, have been implemented using the tools in this package and the classifiers in the LBJ2.nlp package.
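
To make the idea of segment-by-segment evaluation concrete, the following is a minimal, self-contained sketch in plain Java. It does not use or reproduce any LBJ2 class. It assumes the common B-/I-/O labeling convention suggested by the name BIOTester (for example B-NP, I-NP, O), and the names BioSegments, Segment, extract, and f1 are hypothetical, chosen only for this illustration. The sketch groups word-level labels into segments and then scores predicted segments against gold segments, counting a prediction as correct only when its boundaries and type both match.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Objects;
import java.util.Set;

// Illustrative only: not part of LBJ2.nlp.seg. Assumes tags of the form "B-X", "I-X", or "O".
public class BioSegments {
    /** A typed span of words covering positions [start, end). */
    public static final class Segment {
        public final int start;
        public final int end;
        public final String type;

        public Segment(int start, int end, String type) {
            this.start = start;
            this.end = end;
            this.type = type;
        }

        @Override public boolean equals(Object o) {
            if (!(o instanceof Segment)) return false;
            Segment s = (Segment) o;
            return start == s.start && end == s.end && type.equals(s.type);
        }

        @Override public int hashCode() { return Objects.hash(start, end, type); }

        @Override public String toString() { return type + "[" + start + "," + end + ")"; }
    }

    /** Groups word-level BIO tags into segments. An I- tag with no open segment of its type starts a new one. */
    public static List<Segment> extract(String[] tags) {
        List<Segment> out = new ArrayList<>();
        int start = -1;
        String type = null; // type of the currently open segment, or null if none is open
        for (int i = 0; i <= tags.length; i++) {
            String tag = i < tags.length ? tags[i] : "O"; // sentinel closes a trailing segment
            boolean continues = type != null && tag.equals("I-" + type);
            if (type != null && !continues) {
                out.add(new Segment(start, i, type));
                type = null;
            }
            if (!continues && !tag.equals("O")) {
                start = i;
                type = tag.substring(2);
            }
        }
        return out;
    }

    /** Segment-level F1: a predicted segment is correct only if an identical gold segment exists. */
    public static double f1(String[] gold, String[] predicted) {
        Set<Segment> goldSet = new HashSet<>(extract(gold));
        List<Segment> pred = extract(predicted);
        int correct = 0;
        for (Segment s : pred) if (goldSet.contains(s)) correct++;
        if (correct == 0) return 0.0;
        double precision = (double) correct / pred.size();
        double recall = (double) correct / goldSet.size();
        return 2 * precision * recall / (precision + recall);
    }

    public static void main(String[] args) {
        String[] gold = {"B-NP", "I-NP", "O", "B-VP", "B-NP"};
        String[] pred = {"B-NP", "I-NP", "O", "B-VP", "O"};
        System.out.println("gold segments: " + extract(gold));
        System.out.println("pred segments: " + extract(pred));
        System.out.println("segment F1:    " + f1(gold, pred));
    }
}
```

Scoring whole segments rather than individual word labels is the design choice that matters here: a chunk whose boundary is off by one word counts as an error, even though most of its word-level labels are right.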