Are 2D fingerprints still valuable for drug discovery?

Kaifu Gao; Duc Duy Nguyen; Vishnu Sresht; Alan M. Mathiowetz; Meihua Tu; Guo-Wei Wei

doi:10.1039/D0CP00305K

Kaifu Gao,^a Duc Duy Nguyen,^a Vishnu Sresht,^b Alan M. Mathiowetz,^b Meihua Tu^b and Guo-Wei Wei*^acd

Author affiliations

* Corresponding authors

^a Department of Mathematics, Michigan State University, MI 48824, USA
E-mail: wei@math.msu.edu

^b Pfizer Medicine Design, 610 Main St, Cambridge, MA 02139, USA

^c Department of Electrical and Computer Engineering, Michigan State University, MI 48824, USA

^d Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA

Abstract

Recently, molecular fingerprints extracted from three-dimensional (3D) structures using advanced mathematics, such as algebraic topology, differential geometry, and graph theory, have been paired with efficient machine learning, especially deep learning algorithms, to outperform other methods in drug discovery applications and competitions. This raises the question of whether classical 2D fingerprints are still valuable in computer-aided drug discovery. This work considers 23 datasets associated with four typical problems, namely protein–ligand binding, toxicity, solubility and partition coefficient, to assess the performance of eight 2D fingerprints. Advanced machine learning algorithms, including random forest, gradient boosted decision tree, single-task deep neural network and multitask deep neural network, are employed to construct efficient 2D-fingerprint-based models. Additionally, appropriate consensus models are built to further enhance the performance of 2D-fingerprint-based methods. It is demonstrated that 2D-fingerprint-based models perform as well as the state-of-the-art 3D structure-based models for the predictions of toxicity, solubility, partition coefficient and protein–ligand binding affinity based on only ligand information. However, 3D structure-based models outperform 2D-fingerprint-based methods in complex-based protein–ligand binding affinity predictions.

This article is part of the themed collections: Emerging AI Approaches in Physical Chemistry, 2020 PCCP HOT Articles and PCCP Editor’s Choice, 2020

Article information

DOI: https://doi.org/10.1039/D0CP00305K
Article type: Paper
Submitted: 17 Jan 2020
Accepted: 18 Mar 2020
First published: 20 Mar 2020

Phys. Chem. Chem. Phys., 2020, 22, 8373–8390

K. Gao, D. D. Nguyen, V. Sresht, A. M. Mathiowetz, M. Tu and G. Wei, Phys. Chem. Chem. Phys., 2020, 22, 8373 DOI: 10.1039/D0CP00305K

Physical Chemistry Chemical Physics
