D3.2.2 Second Version of the Data Extraction Benchmark for unstructured data

Summary
This deliverable will describe the second version of the extraction benchmark for unstructured data streams. Moreover, improved baseline implementations provided by HOBBIT will be presented.