Δ-Machine learning for quantum chemistry prediction of solution-phase molecular properties at the ground and excited states

Xu Chen; Pinyuan Li; Eugen Hruska; Fang Liu

doi:10.1039/D3CP00506B

Xu Chen,^a Pinyuan Li,^a Eugen Hruska^a and Fang Liu*^a

Author affiliations

* Corresponding authors

^a Department of Chemistry, Emory University, Atlanta, Georgia
E-mail: fang.liu@emory.edu

Abstract

Owing to the limitations of solvent models, quantum chemistry calculations of solution-phase molecular properties often deviate from experimental measurements. Recently, Δ-machine learning (Δ-ML) was shown to be a promising approach to correcting errors in the quantum chemistry calculation of solvated molecules. However, this approach's applicability to different molecular properties and its performance in various cases are still unknown. In this work, we tested the performance of Δ-ML in correcting redox potential and absorption energy calculations using four types of input descriptors and various ML methods. We sought to understand how Δ-ML performance depends on the property to predict, the quantum chemistry method, the data set distribution and size, the type of input feature, and the feature selection technique. We found that Δ-ML can effectively correct the errors in redox potentials calculated using density functional theory (DFT) and in absorption energies calculated using time-dependent DFT. For both properties, the Δ-ML-corrected results were less sensitive to the choice of DFT functional than the raw results. The optimal input descriptor depends on the property, regardless of the specific ML method used. The solvent–solute descriptor (SS) is the best for redox potential, whereas the combined molecular fingerprint (cFP) is the best for absorption energy. A detailed analysis of the feature space and the physical foundation of the different descriptors explains these observations. Feature selection did not further improve the Δ-ML performance. Finally, we analyzed the limitations of our Δ-ML solvent effect approach in data sets containing molecules with varying degrees of electronic structure error.
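The general Δ-ML idea described in the abstract can be illustrated with a minimal sketch: a model is trained on the difference between a reference value and the calculated value, and the predicted difference is then added back to the calculation. Everything in the block below is an assumption made for illustration. The single descriptor, the linear model, the train/test split and all numbers are synthetic placeholders, not the descriptors, ML methods or data used in the paper.

```python
# Toy Δ-ML sketch: learn the residual (reference - calculated) from a descriptor,
# then add the predicted residual back to the calculated value.
# All values are synthetic placeholders, not data from the paper.

def fit_line(xs, ys):
    """Closed-form ordinary least squares for y = a * x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = mean_y - a * mean_x
    return a, b

def mean_abs_error(pred, ref):
    """Mean absolute error between two equally long sequences."""
    return sum(abs(p - r) for p, r in zip(pred, ref)) / len(ref)

# Hypothetical one-dimensional descriptor per molecule.
descriptor = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
# Hypothetical calculated property values (arbitrary units).
calculated = [1.00, 1.20, 1.10, 1.50, 1.40, 1.80]
# Synthetic reference values: the calculation plus a systematic,
# descriptor-dependent error that the Δ model is meant to learn.
reference = [c + 0.30 - 0.05 * x for c, x in zip(calculated, descriptor)]

train_idx = [0, 1, 2, 3]
test_idx = [4, 5]

# Training target is the correction Δ = reference - calculated.
delta_train = [reference[i] - calculated[i] for i in train_idx]
a, b = fit_line([descriptor[i] for i in train_idx], delta_train)

# Corrected prediction = calculated value + predicted Δ.
corrected = [calculated[i] + a * descriptor[i] + b for i in test_idx]
reference_test = [reference[i] for i in test_idx]

raw_mae = mean_abs_error([calculated[i] for i in test_idx], reference_test)
corrected_mae = mean_abs_error(corrected, reference_test)
print("raw MAE:", raw_mae)
print("Δ-corrected MAE:", corrected_mae)
```

Because the synthetic error here is exactly linear in the descriptor, the toy model recovers it almost perfectly. Real solvation errors are not expected to behave this simply, which is why the choice of descriptor and ML method matters.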

This article is part of the themed collection: Insightful Machine Learning for Physical Chemistry

Article information

DOI: https://doi.org/10.1039/D3CP00506B
Article type: Paper
Submitted: 01 Feb 2023
Accepted: 20 Apr 2023
First published: 21 Apr 2023

Phys. Chem. Chem. Phys., 2023, 25, 13417-13428

X. Chen, P. Li, E. Hruska and F. Liu, Phys. Chem. Chem. Phys., 2023, 25, 13417 DOI: 10.1039/D3CP00506B


Physical Chemistry Chemical Physics
