Embed2Scale | Earth Observation & Weather Data Federation with AI Embeddings

Summary
The full potential of the Copernicus Programme unfolds when fused with additional geo-information such as weather models or GNSS measurements. However, no single platform can host all the hundreds of petabytes of geospatial data. Currently, service suppliers download data from different archives, and the sheer volume to be transferred render many applications economically not viable.
With Embed2Scale we strive to overcome these limitations enabling efficient exchange of data through AI-based data compression. We will explore the training of deep neural networks on HPC systems with self-supervised learning to transform raw geo-information into embeddings with up to 1000-fold compression. The main innovations will enable i) decentralized applications through substantial reduction of “data gravity”, ii) the portability of geospatial analytics by significantly lowering computational demand, iii) minimizing data labeling by few-shot learning, and iv) the near-real-time similarity search at petabyte scale of Earth observation and weather/climate data archives.
The objectives of Embed2Scale target i) the exploration of ground-breaking AI-compressors enabling data federation to proliferate a MLOps reference implementation for embeddings in data centers, ii) to demonstrate data federation on real-world use-cases for the Copernicus Programme, and iii) to enable the Earth observation community by open-sourcing and standardization. Within Embed2Scale, we will benchmark the use of embeddings in four applications: i) maritime awareness, ii) aboveground biomass estimation, iii) climate and air pollution prediction, and iv) crop stress & early yield detection. Overall, Embed2Scale will enable near-real time quantitative assessments of geo-information at continental scale - we respond to challenge 2 of the call: “new, enabling, scalable, operational solutions and technologies to improve capabilities of the Copernicus value chain and supporting infrastructure”.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101131841
Start date: 01-01-2024
End date: 31-12-2026
Total budget - Public funding: 2 618 047,50 Euro - 2 618 047,00 Euro
Cordis data

Original description

The full potential of the Copernicus Programme unfolds when fused with additional geo-information such as weather models or GNSS measurements. However, no single platform can host all the hundreds of petabytes of geospatial data. Currently, service suppliers download data from different archives, and the sheer volume to be transferred render many applications economically not viable.
With Embed2Scale we strive to overcome these limitations enabling efficient exchange of data through AI-based data compression. We will explore the training of deep neural networks on HPC systems with self-supervised learning to transform raw geo-information into embeddings with up to 1000-fold compression. The main innovations will enable i) decentralized applications through substantial reduction of “data gravity”, ii) the portability of geospatial analytics by significantly lowering computational demand, iii) minimizing data labeling by few-shot learning, and iv) the near-real-time similarity search at petabyte scale of Earth observation and weather/climate data archives.
The objectives of Embed2Scale target i) the exploration of ground-breaking AI-compressors enabling data federation to proliferate a MLOps reference implementation for embeddings in data centers, ii) to demonstrate data federation on real-world use-cases for the Copernicus Programme, and iii) to enable the Earth observation community by open-sourcing and standardization. Within Embed2Scale, we will benchmark the use of embeddings in four applications: i) maritime awareness, ii) aboveground biomass estimation, iii) climate and air pollution prediction, and iv) crop stress & early yield detection. Overall, Embed2Scale will enable near-real time quantitative assessments of geo-information at continental scale - we respond to challenge 2 of the call: “new, enabling, scalable, operational solutions and technologies to improve capabilities of the Copernicus value chain and supporting infrastructure”.

Status

SIGNED

Call topic

HORIZON-EUSPA-2022-SPACE-02-55

Update Date

12-03-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon Europe
HORIZON.2 Global Challenges and European Industrial Competitiveness
HORIZON.2.4 Digital, Industry and Space
HORIZON.2.4.0 Cross-cutting call topics
HORIZON-EUSPA-2022-SPACE
HORIZON-EUSPA-2022-SPACE-02-55 Large-scale Copernicus data uptake with AI and HPC
HORIZON.2.4.10 Space, including Earth Observation
HORIZON-EUSPA-2022-SPACE
HORIZON-EUSPA-2022-SPACE-02-55 Large-scale Copernicus data uptake with AI and HPC