ML Potentials | Constructing Intermolecular Potentials by Combining Physics and Machine Learning

Summary
Statistical-learning approaches are emerging as powerful alternatives to direct approaches to solving the electronic Schrödinger equation for determining the energy and other properties of molecules. Despite the recent success of methods like deep neural networks, these methods are limited to relatively small molecules. The issue is that predicting long-range intermolecular interactions with machine learning requires sampling the vast diversity of chemical environments that occur on an extended length scale, leading to a combinatorial explosion in the amount of training data that is required. To solve this problem, the functional form of the long-range interactions is taken from physical models, but the parameters that enter those expressions (atomic charges/multipoles; induced charges/multipoles; van der Waals coefficients) are determined by combining physical insight with machine learning. In this model, machine learning is used only to predict short-range phenomena like the dependence of atomic charges/multipoles on the molecular structure and the dependence of induced atomic charges/multipoles on the local electric field. The resulting machine-learned physically-motivated atomistic intermolecular potentials are valid for molecules of any size, but only require training data from small- and medium-sized molecules.
This development will provide molecular energies with the accuracy of quantum methods, at the computational cost of classical molecular mechanics approaches. This not only allows one to compute interaction energies for large molecules (e.g., the binding energy between a drug and a receptor), but allows the computational screening of molecules based on computed interaction energies. In addition to its transformative computational utility, this pioneering strategy—using physical insight to build a model, then using machine learning methods for the parameters in the model—can be extended to many other problems in chemistry, physics, and materials science.
Unfold all
/
Fold all
More information & hyperlinks
Web resources: https://cordis.europa.eu/project/id/800130
Start date: 15-03-2018
End date: 14-03-2020
Total budget - Public funding: 160 800,00 Euro - 160 800,00 Euro
Cordis data

Original description

Statistical-learning approaches are emerging as powerful alternatives to direct approaches to solving the electronic Schrödinger equation for determining the energy and other properties of molecules. Despite the recent success of methods like deep neural networks, these methods are limited to relatively small molecules. The issue is that predicting long-range intermolecular interactions with machine learning requires sampling the vast diversity of chemical environments that occur on an extended length scale, leading to a combinatorial explosion in the amount of training data that is required. To solve this problem, the functional form of the long-range interactions is taken from physical models, but the parameters that enter those expressions (atomic charges/multipoles; induced charges/multipoles; van der Waals coefficients) are determined by combining physical insight with machine learning. In this model, machine learning is used only to predict short-range phenomena like the dependence of atomic charges/multipoles on the molecular structure and the dependence of induced atomic charges/multipoles on the local electric field. The resulting machine-learned physically-motivated atomistic intermolecular potentials are valid for molecules of any size, but only require training data from small- and medium-sized molecules.
This development will provide molecular energies with the accuracy of quantum methods, at the computational cost of classical molecular mechanics approaches. This not only allows one to compute interaction energies for large molecules (e.g., the binding energy between a drug and a receptor), but allows the computational screening of molecules based on computed interaction energies. In addition to its transformative computational utility, this pioneering strategy—using physical insight to build a model, then using machine learning methods for the parameters in the model—can be extended to many other problems in chemistry, physics, and materials science.

Status

CLOSED

Call topic

MSCA-IF-2017

Update Date

28-04-2024
Images
No images available.
Geographical location(s)
Structured mapping
Unfold all
/
Fold all
Horizon 2020
H2020-EU.1. EXCELLENT SCIENCE
H2020-EU.1.3. EXCELLENT SCIENCE - Marie Skłodowska-Curie Actions (MSCA)
H2020-EU.1.3.2. Nurturing excellence by means of cross-border and cross-sector mobility
H2020-MSCA-IF-2017
MSCA-IF-2017