Beyond Interpolation: Integration of Data and AI-Extracted Knowledge for High-Entropy Alloy Discovery

Abstract

Discovering novel high-entropy alloys (HEAs) with desirable properties is challenged by the vast compositional space and the complexity of phase formation mechanisms. Several inductive screening methods that excel at interpolation have been developed; however, they struggle with extrapolating to novel alloy systems. This study introduces a framework that addresses the extrapolation limitation by systematically integrating knowledge extracted from material datasets with expert knowledge derived from scientific literature using large language models (LLMs). Central to our framework is the elemental substitution principle, which identifies chemically similar elements that can be interchanged while preserving desired properties. To model and combine evidence from these multi-source knowledge, we employ the Dempster--Shafer theory, which provides a mathematical foundation for reasoning under uncertainty. Our framework consistently outperforms conventional phase selection models that rely on single-source knowledge across all experiments, showing notable advantages in predicting phase stability for compositions containing elements absent from training data. Importantly, the framework intends to effectively complement the strengths of the existing methods. Moreover, it provides interpretable reasoning that elucidates element substitutability patterns critical to alloy stability in HEAs formation. These results highlight the framework's potential for knowledge transfer and extrapolation, offering an efficient approach to exploring the vast compositional space of HEAs with enhanced generalizability and interpretability.

Supplementary files

Transparent peer review

To support increased transparency, we offer authors the option to publish the peer review history alongside their article.

View this article’s peer review history

Article information

Article type
Paper
Submitted
08 Sep 2025
Accepted
11 Dec 2025
First published
19 Dec 2025
This article is Open Access
Creative Commons BY license

Digital Discovery, 2025, Accepted Manuscript

Beyond Interpolation: Integration of Data and AI-Extracted Knowledge for High-Entropy Alloy Discovery

M. Q. Ha, D. Le, V. Nguyen, H. Kino, S. Curtarolo and H. Dam, Digital Discovery, 2025, Accepted Manuscript , DOI: 10.1039/D5DD00400D

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements