Self-learning entropic population annealing for interpretable materials design

Jiawen Li; Jinzhe Zhang; Ryo Tamura; Koji Tsuda

doi:10.1039/D1DD00043H

Self-learning entropic population annealing for interpretable materials design†

Jiawen Li,^a Jinzhe Zhang,^a Ryo Tamura*^abcd and Koji Tsuda

*^acd

Author affiliations

* Corresponding authors

^a Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 2778561, Japan
E-mail: tsuda@k.u-tokyo.ac.jp

^b International Center for Materials Nanoarchitectonics (WPI-MANA), National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
E-mail: tamura.ryo@nims.go.jp

^c Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan

^d RIKEN Center for Advanced Intelligence Project, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan

Abstract

In automatic materials design, samples obtained from black-box optimization offer an attractive opportunity for scientists to gain new knowledge. Statistical analyses of the samples are often conducted, e.g., to discover key descriptors. Since most black-box optimization algorithms are biased samplers, post hoc analyses may result in misleading conclusions. To cope with the problem, we propose a new method called self-learning entropic population annealing (SLEPA) that combines entropic sampling and a surrogate machine learning model. Samples of SLEPA come with weights to estimate the joint distribution of the target property and a descriptor of interest correctly. In short peptide design, SLEPA was compared with pure black-box optimization in estimating the residue distributions at multiple thresholds of the target property. While black-box optimization was better at the tail of the target property, SLEPA was better for a wide range of thresholds. Our result shows how to reconcile statistical consistency with efficient optimization in materials discovery.

Supplementary files

Article information

DOI: https://doi.org/10.1039/D1DD00043H
Article type: Paper
Submitted: 27 Nov 2021
Accepted: 04 Apr 2022
First published: 04 Apr 2022
This article is Open Access

Download Citation

Digital Discovery, 2022,1, 295-302

Permissions

Request permissions

Self-learning entropic population annealing for interpretable materials design

J. Li, J. Zhang, R. Tamura and K. Tsuda, Digital Discovery, 2022, 1, 295 DOI: 10.1039/D1DD00043H

This article is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported Licence. You can use material from this article in other publications, without requesting further permission from the RSC, provided that the correct acknowledgement is given and it is not used for commercial purposes.

To request permission to reproduce material from this article in a commercial publication, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party commercial publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Digital Discovery

Self-learning entropic population annealing for interpretable materials design†

Abstract

Supplementary files

Article information

Download Citation

Permissions

Self-learning entropic population annealing for interpretable materials design

Social activity

Search articles by author

Spotlight

Advertisements