Issue 4, 2024

Learning conditional policies for crystal design using offline reinforcement learning

Abstract

Navigating the exponentially large chemical space to find desirable materials is an extremely challenging task in materials discovery. Recent developments in generative and geometric deep learning have shown promising results in molecule and material discovery, but they often lack evaluation with high-accuracy computational methods. This work aims to design novel and stable crystalline materials conditioned on a desired band gap. To achieve conditional generation, we: (1) formulate crystal design as a sequential decision-making problem, create relevant trajectories based on high-quality materials data, and use conservative Q-learning to learn a conditional policy from these trajectories, with a reward function that incorporates constraints for energetic and electronic properties obtained directly from density functional theory (DFT) calculations; (2) evaluate the materials generated by the policy using DFT calculations for both energy and band gap; and (3) compare our results to relevant baselines, including behavioral cloning and unconditioned policy learning. Our experiments show that conditioned policies achieve targeted crystal design and demonstrate the capability to perform crystal discovery evaluated with accurate and computationally expensive DFT calculations.
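
As a rough illustration of the conservative Q-learning objective named in the abstract, the sketch below computes the standard discrete-action form of the loss on a small batch: a Bellman error on the actions seen in the offline data, plus a penalty that pushes down Q-values across all actions relative to the dataset action. This is a generic textbook formulation, not the authors' implementation. The function names `logsumexp` and `cql_loss`, the weight `alpha`, and the plain-list inputs are assumptions made for illustration; the abstract does not specify the state and action encoding, the network, the reward, or any hyperparameters. In a band-gap-conditioned setting, the target property would presumably enter through whatever model produces the Q-values, which is outside this sketch.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def cql_loss(q_values, actions, td_targets, alpha=1.0):
    """Generic discrete-action CQL loss (illustrative, hypothetical names).

    q_values:   list of rows, one row of Q(s, a) per transition in the batch
    actions:    index of the action taken in the offline data for each row
    td_targets: precomputed Bellman targets, e.g. r + gamma * max_a' Q_target(s', a')
    alpha:      weight of the conservative penalty (assumed hyperparameter)
    """
    conservative = 0.0
    bellman = 0.0
    for q_row, a, y in zip(q_values, actions, td_targets):
        q_data = q_row[a]
        # Penalize large Q-values overall, relative to the dataset action.
        conservative += logsumexp(q_row) - q_data
        # Standard squared Bellman error on the dataset action.
        bellman += 0.5 * (q_data - y) ** 2
    return (alpha * conservative + bellman) / len(q_values)
```

Setting `alpha=0.0` recovers an ordinary Q-learning regression loss, which makes the role of the conservative term easy to isolate when comparing against a non-conservative baseline.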

Article information

Article type: Paper
Submitted: 15 Jan 2024
Accepted: 27 Feb 2024
First published: 29 Feb 2024
This article is Open Access (Creative Commons BY license)

Digital Discovery, 2024, 3, 769-785

P. Govindarajan, S. Miret, J. Rector-Brooks, M. Phielipp, J. Rajendran and S. Chandar, Digital Discovery, 2024, 3, 769 DOI: 10.1039/D4DD00024B

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.
