Uncertainty quantification for molecular property predictions with graph neural architecture search

Shengli Jiang; Shiyi Qin; Reid C. Van Lehn; Prasanna Balaprakash; Victor M. Zavala

doi:10.1039/D4DD00088A

Shengli Jiang,*^a Shiyi Qin,^a Reid C. Van Lehn,^a Prasanna Balaprakash^b and Victor M. Zavala^ac

Author affiliations

* Corresponding authors

^a Department of Chemical and Biological Engineering, University of Wisconsin–Madison, 1415 Engineering Dr, Madison, WI 53706, USA
E-mail: sjiang87@wisc.edu

^b Computing and Computational Sciences Directorate, Oak Ridge National Laboratory, P.O. Box 2008, Oak Ridge, TN 37831, USA

^c Mathematics and Computer Science Division, Argonne National Laboratory, Lemont, IL 60439, USA

Abstract

Graph Neural Networks (GNNs) have emerged as a prominent class of data-driven methods for molecular property prediction. However, a key limitation of typical GNN models is their inability to quantify uncertainties in the predictions. This capability is crucial for ensuring the trustworthy use and deployment of models in downstream tasks. To that end, we introduce AutoGNNUQ, an automated uncertainty quantification (UQ) approach for molecular property prediction. AutoGNNUQ leverages architecture search to generate an ensemble of high-performing GNNs, enabling the estimation of predictive uncertainties. Our approach employs variance decomposition to separate data (aleatoric) and model (epistemic) uncertainties, providing valuable insights for reducing them. In our computational experiments, we demonstrate that AutoGNNUQ outperforms existing UQ methods in terms of both prediction accuracy and UQ performance on multiple benchmark datasets, and generalizes well to out-of-distribution datasets. Additionally, we utilize t-SNE visualization to explore correlations between molecular features and uncertainty, offering insight for dataset improvement. AutoGNNUQ has broad applicability in domains such as drug discovery and materials science, where accurate uncertainty quantification is crucial for decision-making.
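
The variance decomposition mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula, so the code below assumes the commonly used law-of-total-variance form for ensembles: each ensemble member predicts a mean and a variance per molecule, the aleatoric part is the average of the predicted variances, and the epistemic part is the variance of the predicted means across members. The function name `decompose_uncertainty` and the toy numbers are hypothetical and are not taken from AutoGNNUQ.

```python
import numpy as np

def decompose_uncertainty(means, variances):
    """Split ensemble predictive variance into aleatoric and epistemic parts.

    Assumption (not stated in the abstract): each of the K ensemble members
    outputs a predictive mean and variance for each of the N molecules.
    means, variances: array-like of shape (K, N).
    """
    means = np.asarray(means, dtype=float)
    variances = np.asarray(variances, dtype=float)

    prediction = means.mean(axis=0)      # ensemble mean prediction
    aleatoric = variances.mean(axis=0)   # average predicted data noise
    epistemic = means.var(axis=0)        # disagreement between members
    total = aleatoric + epistemic        # law of total variance
    return prediction, aleatoric, epistemic, total

# Hypothetical ensemble of 3 models evaluated on 2 molecules.
means = [[1.0, 2.0], [1.2, 2.0], [0.8, 2.0]]
variances = [[0.1, 0.3], [0.2, 0.3], [0.3, 0.3]]
pred, alea, epi, total = decompose_uncertainty(means, variances)
```

In this toy case the members agree exactly on the second molecule, so its epistemic term is zero and all of its uncertainty is attributed to the data.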

Article information

DOI: https://doi.org/10.1039/D4DD00088A
Article type: Paper
Submitted: 01 Apr 2024
Accepted: 07 Jun 2024
First published: 25 Jun 2024
This article is Open Access

Digital Discovery, 2024, 3, 1534-1553

S. Jiang, S. Qin, R. C. Van Lehn, P. Balaprakash and V. M. Zavala, Digital Discovery, 2024, 3, 1534 DOI: 10.1039/D4DD00088A

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.
