Sport5 Corpus
Articles from Sport5 website, 2014-2015.
-
Plain Text
Use utf-8 encoding to properly view the files.
-
Tokenized Text in XML
The XML schema follows MILA's corpus standards.
-
Morphologically-Analyzed Text in XML
Tokenized text tagged with all possible morphological analyses.
The XML schema follows MILA's corpus standards.
View all corpora...
View corpus standards...