Brown Corpus Data for Context Sensitive Spelling Correction

The files have one of three extensions:

Filenames with suffix 20 are test files, those with suffix 80 are the training files (correspond to 20%, 80% of the data, respectively).

Download the entire corpus here.