DAMA | Extreme-Scale Data Management

Summary
This project is concerned with the I/O challenges that arise from the convergence between high performance computing (HPC) and big data, two very different paradigms. This convergence is an important topic for the scientific community today, and extreme-scale machines are expected to observe a heterogeneous workload composed of traditional scientific applications and data analytics tasks.
The goal of this action is to provide data management for extreme-scale computing environments for the convergence scenario, to benefit both types of workload. The methodology will be an experimental one, and the instrument will be the development of an I/O middleware, the data manager. The data manager will combine storage capacity available in the supercomputer, including NVRAM devices, transparently. Its activities will be optimized by minimizing data movement and applying coordination to avoid performance interference due to concurrency. The most important characteristic of this project among the state-of-the-art is the intelligence to learn and predict applications needs, so storage capacity and data can be available at a close location before the user needs them.
The action will benefit from the researcher's experience on parallel I/O for HPC, allied to the host laboratory expertise in in-situ processing, big data, and machine learning. Through this two-year fellowship, the researcher will have the opportunity to expand her knowledge while conducting highly innovative research, what will improve her perspectives for future employment.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/800144
Start date: 01-11-2018
End date: 31-10-2020
Total budget - Public funding: 185 076,00 Euro - 185 076,00 Euro
Cordis data

Original description

This project is concerned with the I/O challenges that arise from the convergence between high performance computing (HPC) and big data, two very different paradigms. This convergence is an important topic for the scientific community today, and extreme-scale machines are expected to observe a heterogeneous workload composed of traditional scientific applications and data analytics tasks.
The goal of this action is to provide data management for extreme-scale computing environments for the convergence scenario, to benefit both types of workload. The methodology will be an experimental one, and the instrument will be the development of an I/O middleware, the data manager. The data manager will combine storage capacity available in the supercomputer, including NVRAM devices, transparently. Its activities will be optimized by minimizing data movement and applying coordination to avoid performance interference due to concurrency. The most important characteristic of this project among the state-of-the-art is the intelligence to learn and predict applications needs, so storage capacity and data can be available at a close location before the user needs them.
The action will benefit from the researcher's experience on parallel I/O for HPC, allied to the host laboratory expertise in in-situ processing, big data, and machine learning. Through this two-year fellowship, the researcher will have the opportunity to expand her knowledge while conducting highly innovative research, what will improve her perspectives for future employment.

Status

CLOSED

Call topic

MSCA-IF-2017

Update Date

28-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.3. EXCELLENT SCIENCE - Marie Skłodowska-Curie Actions (MSCA)
H2020-EU.1.3.2. Nurturing excellence by means of cross-border and cross-sector mobility
H2020-MSCA-IF-2017
MSCA-IF-2017