NLP-Named-Entity-Recognition-CoNLL-2003
PublicCoNLL-2003 is a named entity recognition dataset. Consists of eight files covering two languages: English and German, although German wasn't used. For each of the languages there is a training file, a development file, a test file and a large file with unannotated news data, from August 1996 and August 1997.