A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification

Seongok Ryu; Yongchan Kwon; Woo Youn Kim

doi:10.1039/C9SC01992H

A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification†

Seongok Ryu,^a Yongchan Kwon

^b and Woo Youn Kim

*^ac

Author affiliations

* Corresponding authors

^a Department of Chemistry, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
E-mail: wooyoun@kaist.ac.kr

^b Department of Statistics, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, Republic of Korea

^c KI for Artificial Intelligence, KAIST, 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea

Abstract

Deep neural networks have been increasingly used in various chemical fields. In the nature of a data-driven approach, their performance strongly depends on data used in training. Therefore, models developed in data-deficient situations can cause highly uncertain predictions, leading to vulnerable decision making. Here, we show that Bayesian inference enables more reliable prediction with quantitative uncertainty analysis. Decomposition of the predictive uncertainty into model- and data-driven uncertainties allows us to elucidate the source of errors for further improvements. For molecular applications, we devised a Bayesian graph convolutional network (GCN) and evaluated its performance for molecular property predictions. Our study on the classification problem of bio-activity and toxicity shows that the confidence of prediction can be quantified in terms of the predictive uncertainty, leading to more accurate virtual screening of drug candidates than standard GCNs. The result of log P prediction illustrates that data noise affects the data-driven uncertainty more significantly than the model-driven one. Based on this finding, we could identify artefacts that arose from quantum mechanical calculations in the Harvard Clean Energy Project dataset. Consequently, the Bayesian GCN is critical for molecular applications under data-deficient conditions.

This article is part of the themed collection: Celebrating Chemical Science in Korea

Supplementary files

Article information

DOI: https://doi.org/10.1039/C9SC01992H
Article type: Edge Article
Submitted: 22 Apr 2019
Accepted: 21 Jul 2019
First published: 22 Jul 2019
This article is Open Access

All publication charges for this article have been paid for by the Royal Society of Chemistry

Download Citation

Chem. Sci., 2019,10, 8438-8446

Permissions

Request permissions

A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification

S. Ryu, Y. Kwon and W. Y. Kim, Chem. Sci., 2019, 10, 8438 DOI: 10.1039/C9SC01992H

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Chemical Science

A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification†

Abstract

Supplementary files

Article information

Download Citation

Permissions

A Bayesian graph convolutional network for reliable prediction of molecular properties with uncertainty quantification

Social activity

Search articles by author

Spotlight

Advertisements