PANGAIA Pan-genome Graph Algorithms and Data Integration

Summary

Genomes are strings over the letters A,C,G,T, which represent nucleotides, the building blocks of DNA. In view of ultra-large amounts of genome sequence data emerging from ever more and technologically rapidly advancing genome sequencing devices—in the meantime, amounts of sequencing data accrued are reaching into the exabyte scale—the driving, urgent question is: how can we arrange and analyze these data masses in a formally rigorous, computationally efficient and biomedically rewarding manner?
Graph based data structures have been pointed out to have disruptive benefits over traditional sequence based structures when representing pan-genomes, sufficiently large, evolutionarily coherent collections of genomes. This idea has its immediate justification in the laws of genetics: evolutionarily closely related genomes vary only in relatively little amounts of letters, while sharing the majority of their sequence content. Graph-based pan-genome representations that allow to remove redundancies without having to discard individual differences, make utmost sense. In this project, we will put this shift of paradigms—from sequence to graph based representations of genomes—into full effect. As a result, we can expect a wealth of practically relevant advantages, among which arrangement, analysis, compression, integration and exploitation of genome data are the most fundamental points. In addition, we will also open up a significant source of inspiration for computer science itself.

Resources

Show all and search (95)

Unfold all

Fold all

More information & hyperlinks

Web resources:	https://cordis.europa.eu/project/id/872539
Start date:	01-01-2020
End date:	31-10-2025
Total budget - Public funding:	1 140 800,00 Euro - 1 140 800,00 Euro

Cordis data

Original description

Status

SIGNED

Url

https://cordis.europa.eu/project/id/872539

Call topic

MSCA-RISE-2019

Update Date

28-04-2024

Geographical location(s)

Structured mapping

Unfold all

Fold all

EU-Programme-Call

Organisations

Show all (15)

PANGAIA | Pan-genome Graph Algorithms and Data Integration

Original description

Status

Url

Call topic

Update Date