Multilingual historical corpora and annotated benchmarks

Summary
Describes the plain text corpora collected for each project language and covering different time spans. It describes also the benchmarks annotated for the evaluation of the text processing modules (T3.1)