E2DATA | European Extreme Performing Big Data Stacks

Summary
Imagine a Big Data application with the following characteristics: (i) it has to process large amounts of complex streaming data,
(ii) the application logic that processes the incoming data must execute and complete within a strict time limit,
and (iii) there is a limited budget for infrastructure resources.

In today’s world, the data would be streamed from the local network or edge devices to a cloud provider which is rented by a customer to perform the data execution. The Big Data software stack, in an application and hardware agnostic manner, will split the execution stream into multiple tasks and send them for processing on the nodes the customer has paid for. If the outcome does not match the strict three second business requirement, then the customer has two options:
1) scale-up (by upgrading processors at node level),
2) scale-out (by adding nodes to their clusters), or 3) manually implement code optimizations specific to the underlying hardware.

E2Data proposes an end-to-end solution for Big Data deployments that will fully exploit and advance the state-of-the-art in infrastructure services by delivering a performance increase of up to 10x while utilizing up to 50% less cloud resources.
E2Data will provide a new Big Data paradigm, by combining state-of-the-art software components, in order to achieve maximum resource utilization for heterogeneous cloud deployments without affecting current programming norms (i.e. no code changes in the original source).
The E2Data innovations will be driven by the requirements of four resource demanding applications from the finance, health, green buildings, and security domains.
Finally, the evaluation will be conducted on both high-performing x86 and low-power ARM cluster architectures representing realistic execution scenarios of real-world deployments.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/780245
Start date: 01-01-2018
End date: 31-12-2020
Total budget - Public funding: 4 676 250,00 Euro - 4 676 250,00 Euro
Cordis data

Original description

Imagine a Big Data application with the following characteristics: (i) it has to process large amounts of complex streaming data,
(ii) the application logic that processes the incoming data must execute and complete within a strict time limit,
and (iii) there is a limited budget for infrastructure resources.

In today’s world, the data would be streamed from the local network or edge devices to a cloud provider which is rented by a customer to perform the data execution. The Big Data software stack, in an application and hardware agnostic manner, will split the execution stream into multiple tasks and send them for processing on the nodes the customer has paid for. If the outcome does not match the strict three second business requirement, then the customer has two options:
1) scale-up (by upgrading processors at node level),
2) scale-out (by adding nodes to their clusters), or 3) manually implement code optimizations specific to the underlying hardware.

E2Data proposes an end-to-end solution for Big Data deployments that will fully exploit and advance the state-of-the-art in infrastructure services by delivering a performance increase of up to 10x while utilizing up to 50% less cloud resources.
E2Data will provide a new Big Data paradigm, by combining state-of-the-art software components, in order to achieve maximum resource utilization for heterogeneous cloud deployments without affecting current programming norms (i.e. no code changes in the original source).
The E2Data innovations will be driven by the requirements of four resource demanding applications from the finance, health, green buildings, and security domains.
Finally, the evaluation will be conducted on both high-performing x86 and low-power ARM cluster architectures representing realistic execution scenarios of real-world deployments.

Status

CLOSED

Call topic

ICT-16-2017

Update Date

27-10-2022
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.2. INDUSTRIAL LEADERSHIP
H2020-EU.2.1. INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies
H2020-EU.2.1.1. INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)
H2020-EU.2.1.1.0. INDUSTRIAL LEADERSHIP - ICT - Cross-cutting calls
H2020-ICT-2017-1
ICT-16-2017 Big data PPP: research addressing main technology challenges of the data economy