Summary
As a result of the rapid development of next generation sequencing, we now have access to hundreds and often many thousands of sequences which belong to the same family. Such a large amount of sequence data for a particular protein family, along with recent developments in computational statistics, enables an entirely new kind of evolutionary analysis to be performed on sequences, where for the first time we can compute statistically significant networks of correlated mutations. The proposal describes an integrated programme of work to fully explore the potential applications of the new amino acid covariation techniques in predicting aspects of protein structure and function. A particular emphasis in this proposal are proteins which are difficult to study by experimental techniques i.e. disordered proteins, transmembrane proteins and large complexes. The first objective will be to explore key developments in the underpinning algorithms, tackling both the issue of needing very large numbers of homologous sequences and also the downstream 3-D embedding to produce viable models. The second objective will involve experimental work with a collaborator where the idea that de novo protein design techniques might be exploited to artificially expand the set of available sequences for a given proto-family will be explored. The third objective will focus specifically on transmembrane protein modelling, where covariation-based approaches have proven to be highly effective. Here the goal will be to extend our existing FILM3 method to encompass both beta-barrel type TM proteins, but also to try to handle the issue of homomultimers, which is a critical aspect of TM protein modelling as so many families are known to adopt higher orders of structure than the fold level alone. Finally, applications of covariation analysis to probing multiple conformations of disordered proteins will be developed, with a specific focus on interactions of disordered proteins with DNA and RNA.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: | https://cordis.europa.eu/project/id/695558 |
Start date: | 01-11-2016 |
End date: | 30-04-2022 |
Total budget - Public funding: | 2 433 679,00 Euro - 2 433 679,00 Euro |
Cordis data
Original description
As a result of the rapid development of next generation sequencing, we now have access to hundreds and often many thousands of sequences which belong to the same family. Such a large amount of sequence data for a particular protein family, along with recent developments in computational statistics, enables an entirely new kind of evolutionary analysis to be performed on sequences, where for the first time we can compute statistically significant networks of correlated mutations. The proposal describes an integrated programme of work to fully explore the potential applications of the new amino acid covariation techniques in predicting aspects of protein structure and function. A particular emphasis in this proposal are proteins which are difficult to study by experimental techniques i.e. disordered proteins, transmembrane proteins and large complexes. The first objective will be to explore key developments in the underpinning algorithms, tackling both the issue of needing very large numbers of homologous sequences and also the downstream 3-D embedding to produce viable models. The second objective will involve experimental work with a collaborator where the idea that de novo protein design techniques might be exploited to artificially expand the set of available sequences for a given proto-family will be explored. The third objective will focus specifically on transmembrane protein modelling, where covariation-based approaches have proven to be highly effective. Here the goal will be to extend our existing FILM3 method to encompass both beta-barrel type TM proteins, but also to try to handle the issue of homomultimers, which is a critical aspect of TM protein modelling as so many families are known to adopt higher orders of structure than the fold level alone. Finally, applications of covariation analysis to probing multiple conformations of disordered proteins will be developed, with a specific focus on interactions of disordered proteins with DNA and RNA.Status
CLOSEDCall topic
ERC-ADG-2015Update Date
27-04-2024
Images
No images available.
Geographical location(s)