Efficient calculation of protein–ligand binding free energy using GFN methods: the power of the cluster model

Yuan-qiang Chen; Yan-jing Sheng; Yu-qiang Ma; Hong-ming Ding

doi:10.1039/D2CP00161F

Efficient calculation of protein–ligand binding free energy using GFN methods: the power of the cluster model†

Yuan-qiang Chen,^a Yan-jing Sheng,^a Yu-qiang Ma

^b and Hong-ming Ding

*^a

Author affiliations

* Corresponding authors

^a Center for Soft Condensed Matter Physics and Interdisciplinary Research, School of Physical Science and Technology, Soochow University, Suzhou 215006, China
E-mail: dinghm@suda.edu.cn

^b National Laboratory of Solid State Microstructures and Department of Physics, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China

Abstract

Protein–ligand interactions are crucial in many biochemical processes and biomedical applications, yet accurately calculating the binding free energy of the interactions still remains challenging. In this work, we systematically investigate the performance of a generic force field GFN-FF and some semi-empirical quantum mechanical (SQM) methods (GFNn, n = 0, 1, 2) in terms of the accuracy of the calculated binding free energy. It is found that the performance of the GFN-FF method is quite good in a neutral-ligand system since the Pearson correlation coefficient (r_p) is 0.70 and the mean absolute error (MAE) is 5.49 kcal mol⁻¹. However, it may fail in a charged-ligand system (the MAE is 18.98 kcal mol⁻¹). Moreover, we also propose a cluster model (i.e., truncating the protein at a given cutoff) along with the SQM method in the GFN family. Importantly, the GFN2-xTB shows the best performance among the SQM methods (the MAE is 4.91 kcal mol⁻¹ and 10.25 kcal mol⁻¹ in the neutral-ligand and charged-ligand systems, respectively), much better than GFN-FF in the charged-ligand system. Notably, the computing cost of the GFN2-xTB in the appropriate cluster model is even lower than that of the GFN-FF (in the entire complex). The present study sheds some light on the potential power of the GFN family in the efficient calculation of the binding free energy in bio-systems.

Supplementary files

Article information

DOI: https://doi.org/10.1039/D2CP00161F
Article type: Paper
Submitted: 11 Jan 2022
Accepted: 16 May 2022
First published: 17 May 2022

Download Citation

Phys. Chem. Chem. Phys., 2022,24, 14339-14347

Permissions

Request permissions

Efficient calculation of protein–ligand binding free energy using GFN methods: the power of the cluster model

Y. Chen, Y. Sheng, Y. Ma and H. Ding, Phys. Chem. Chem. Phys., 2022, 24, 14339 DOI: 10.1039/D2CP00161F

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Physical Chemistry Chemical Physics

Efficient calculation of protein–ligand binding free energy using GFN methods: the power of the cluster model†

Abstract

Supplementary files

Article information

Download Citation

Permissions

Efficient calculation of protein–ligand binding free energy using GFN methods: the power of the cluster model

Social activity

Search articles by author

Spotlight

Advertisements