
This is a state of the art NER tagger that tags plain text with named entitites (people / organizations / locations / miscellaneous). It uses gazetteers extracted from Wikipedia, word class model derived from unlabeled text and expressive non-local features. The best performance is 90.8 F1 on the CoNLL03 shared task data. The tagger is robust and has been evaluated on a variety of datasets. For detailed results, design and modeling details, please read the paper:
If you're using this system, please cite:
L. Ratinov and D. Roth
Design Challenges and Misconceptions in Named Entity Recognition
CoNLL 2009
Using the tagger is simple - just download, unzip the package and look at sample usage in the README.txt file