Deep learning for molecular design—a review of the state of the art

Daniel C. Elton; Zois Boukouvalas; Mark D. Fuge; Peter W. Chung

doi:10.1039/C9ME00039A

Daniel C. Elton,†*^a Zois Boukouvalas,^ab Mark D. Fuge^a and Peter W. Chung^a

Author affiliations

* Corresponding authors

^a Department of Mechanical Engineering, University of Maryland, College Park, Maryland, USA
E-mail: daniel.elton@nih.gov

^b Department of Mathematics and Statistics, American University, Washington, D.C., USA

Abstract

In the space of only a few years, deep generative modeling has revolutionized how we think of artificial creativity, yielding autonomous systems that produce original images, music, and text. Inspired by these successes, researchers are now applying deep generative modeling techniques to the generation and optimization of molecules; in our review we found 45 papers on the subject published in the past two years. These works point to a future where such systems will be used to generate lead molecules, greatly reducing resources spent downstream synthesizing and characterizing bad leads in the lab. In this review we survey the increasingly complex landscape of models and representation schemes that have been proposed. The four classes of techniques we describe are recurrent neural networks, autoencoders, generative adversarial networks, and reinforcement learning. After first discussing some of the mathematical fundamentals of each technique, we draw high-level connections and comparisons with other techniques and expose the pros and cons of each. Several important high-level themes emerge as a result of this work, including the shift away from the SMILES string representation of molecules towards more sophisticated representations such as graph grammars and 3D representations, the importance of reward function design, the need for better standards for benchmarking and testing, and the benefits of adversarial training and reinforcement learning over maximum-likelihood-based training.

Article information

DOI: https://doi.org/10.1039/C9ME00039A
Article type: Review Article
Submitted: 18 Mar 2019
Accepted: 22 May 2019
First published: 22 May 2019
Citation: Mol. Syst. Des. Eng., 2019, 4, 828-849

D. C. Elton, Z. Boukouvalas, M. D. Fuge and P. W. Chung, Mol. Syst. Des. Eng., 2019, 4, 828 DOI: 10.1039/C9ME00039A

Molecular Systems Design & Engineering
