D3.2.1 First Version of the Data Extraction Benchmark for Unstructured Data

Summary
This deliverable describes the first version of the benchmark for unstructured data streams. It also presents the baseline implementations provided by HOBBIT.