SIMBAD | Statistical Inference from Multiscale Biological Data: theory, algorithms, applications

Summary
The last two decades have witnessed giant experimental breakthroughs in different areas of the life sciences, from genomics to epidemiology. Thanks to modern high-throughput techniques, biological systems across multiple scales –from single molecules up to entire populations– can now be probed quantitatively at high spatial and temporal resolutions. Besides enhancing our basic knowledge of a system’s constituents, these data potentially encode a plethora of information about the functional constraints that govern its evolution and the physical constraints that limit its performance, as well as about levels of organization, dynamical constraints or design principles that would be hard to identify from low-throughput data. Extracting this information is also crucial for applications ranging from the design of proteins with a desired functionality to the reconstruction of contacts during an epidemics. Inverse statistical mechanics attempts to do it by inferring generative models (Boltzmann distributions) from data using methods from the physics of disordered and random systems. Specific characteristics of biological data however, like strong undersampling and heterogeneity, limit the effectiveness of these tools. SIMBAD aims at developing a class of statistical inference techniques capable of overcoming these issues. In SIMBAD, theoretical work will supply concepts and methods to address four pressing problems (learning protein sequence landscapes, inverse modeling metabolic networks, inferring contact networks from epidemiological data, and improving survival analysis models), which in turn will guide the theory towards integration with the existing standards of each field. This effort promises to open new pathways for basic research to impact economic, technological and societal issues; the high- profile cross-disciplinary expertise represented in SIMBAD ensures instead for measurable and achievable objectives, placing SIMBAD in an ideal position to achieve its goals
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/101131463
Start date: 01-12-2023
End date: 30-11-2027
Total budget - Public funding: - 740 600,00 Euro
Cordis data

Original description

The last two decades have witnessed giant experimental breakthroughs in different areas of the life sciences, from genomics to epidemiology. Thanks to modern high-throughput techniques, biological systems across multiple scales –from single molecules up to entire populations– can now be probed quantitatively at high spatial and temporal resolutions. Besides enhancing our basic knowledge of a system’s constituents, these data potentially encode a plethora of information about the functional constraints that govern its evolution and the physical constraints that limit its performance, as well as about levels of organization, dynamical constraints or design principles that would be hard to identify from low-throughput data. Extracting this information is also crucial for applications ranging from the design of proteins with a desired functionality to the reconstruction of contacts during an epidemics. Inverse statistical mechanics attempts to do it by inferring generative models (Boltzmann distributions) from data using methods from the physics of disordered and random systems. Specific characteristics of biological data however, like strong undersampling and heterogeneity, limit the effectiveness of these tools. SIMBAD aims at developing a class of statistical inference techniques capable of overcoming these issues. In SIMBAD, theoretical work will supply concepts and methods to address four pressing problems (learning protein sequence landscapes, inverse modeling metabolic networks, inferring contact networks from epidemiological data, and improving survival analysis models), which in turn will guide the theory towards integration with the existing standards of each field. This effort promises to open new pathways for basic research to impact economic, technological and societal issues; the high- profile cross-disciplinary expertise represented in SIMBAD ensures instead for measurable and achievable objectives, placing SIMBAD in an ideal position to achieve its goals

Status

SIGNED

Call topic

HORIZON-MSCA-2022-SE-01-01

Update Date

12-03-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon Europe
HORIZON.1 Excellent Science
HORIZON.1.2 Marie Skłodowska-Curie Actions (MSCA)
HORIZON.1.2.0 Cross-cutting call topics
HORIZON-MSCA-2022-SE-01
HORIZON-MSCA-2022-SE-01-01 MSCA Staff Exchanges 2022