Corpora for a number of NLP tasks