FoundationDX | Foundation models for molecular diagnostics - machine learning with biological ‘common sense’

Summary
Molecular diagnostics is crucial in fulfilling the promise of personalized medicine. While we are amidst an AI revolution, current machine learning models (ML) struggle to effectively learn from molecular (‘omics’) patient profiles and fail to make robust predictions. Perhaps this is not a surprise. After all, molecular disease biology is immensely complex, and we ask ML models to predict such complicated things as patient prognosis, without them ‘knowing’ anything about molecular biology and based on limited training data.

To address this, I will create foundation models on top of the vast troves of available biomolecular data, such as multi-omics profiles in healthy and diseased tissues, high-resolution single-cell data and biological knowledge graphs. This unique approach is driven by self-supervised learning (SSL), an important driver of AI, which offers the opportunity to learn a comprehensive representation of the multimodal biology of the cell – without the need for well-annotated patient data.

Starting from this strong basis, the FoundationDX model can then reliably predict cancer subtype or prognosis as it no longer needs to start from scratch on too high-dimensional, too low sample-size datasets. Effectively, we give our systems biological ‘common sense’, foregoing the need for millions of labeled training samples. This uniquely enables us to address one of the most clinically relevant questions: which treatment is best for the patient?

The FoundationDX research program is designed to deliver key insights into how the SSL revolution can be used to drive progress in the field of molecular diagnostics. It contains a ‘clinical-grade’ benchmarking module and solves three urgent diagnostic challenges, including noninvasive subtyping of pediatric brain cancer. The time for powerful, robust and generalizable, knowledge-aware machine learning solutions to previously intractable molecular diagnostics problems has come. FoundationDX aims to deliver this.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101125814
Start date: 01-05-2024
End date: 30-04-2029
Total budget - Public funding: 2 000 000,00 Euro - 2 000 000,00 Euro
Cordis data

Original description

Molecular diagnostics is crucial in fulfilling the promise of personalized medicine. While we are amidst an AI revolution, current machine learning models (ML) struggle to effectively learn from molecular (‘omics’) patient profiles and fail to make robust predictions. Perhaps this is not a surprise. After all, molecular disease biology is immensely complex, and we ask ML models to predict such complicated things as patient prognosis, without them ‘knowing’ anything about molecular biology and based on limited training data.

To address this, I will create foundation models on top of the vast troves of available biomolecular data, such as multi-omics profiles in healthy and diseased tissues, high-resolution single-cell data and biological knowledge graphs. This unique approach is driven by self-supervised learning (SSL), an important driver of AI, which offers the opportunity to learn a comprehensive representation of the multimodal biology of the cell – without the need for well-annotated patient data.

Starting from this strong basis, the FoundationDX model can then reliably predict cancer subtype or prognosis as it no longer needs to start from scratch on too high-dimensional, too low sample-size datasets. Effectively, we give our systems biological ‘common sense’, foregoing the need for millions of labeled training samples. This uniquely enables us to address one of the most clinically relevant questions: which treatment is best for the patient?

The FoundationDX research program is designed to deliver key insights into how the SSL revolution can be used to drive progress in the field of molecular diagnostics. It contains a ‘clinical-grade’ benchmarking module and solves three urgent diagnostic challenges, including noninvasive subtyping of pediatric brain cancer. The time for powerful, robust and generalizable, knowledge-aware machine learning solutions to previously intractable molecular diagnostics problems has come. FoundationDX aims to deliver this.

Status

SIGNED

Call topic

ERC-2023-COG

Update Date

12-03-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon Europe
HORIZON.1 Excellent Science
HORIZON.1.1 European Research Council (ERC)
HORIZON.1.1.0 Cross-cutting call topics
ERC-2023-COG ERC CONSOLIDATOR GRANTS
HORIZON.1.1.1 Frontier science
ERC-2023-COG ERC CONSOLIDATOR GRANTS