Bayesian active learning to accelerate high throughput phase diagram exploration

Mingzhou Fan; Yucheng Wang; Guillermo Vazquez; Ruida Zhou; Ibrahim Karaman; Raymundo Arróyave; Xiaoning Qian

doi:10.1039/D5DD00459D

View PDF Version

Open Access Article

This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

DOI: 10.1039/D5DD00459D (Paper) Digital Discovery, 2026, Advance Article

Bayesian active learning to accelerate high throughput phase diagram exploration

Mingzhou Fan† ^a, Yucheng Wang†^a, Guillermo Vazquez^b, Ruida Zhou^c, Ibrahim Karaman^b, Raymundo Arróyave*^bde and Xiaoning Qian*^afg
^aDepartment of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, USA. E-mail: xqian@ece.tamu.edu
^bDepartment of Materials Science and Engineering, Texas A&M University, College Station, TX 77843, USA. E-mail: raymundo.arroyave@tamu.edu
^cDepartment of Electrical and Computer Engineering, University of California, Los Angeles, CA 90095, USA
^dJ. Mike Walker ‘66 Department of Mechanical Engineering, Texas A&M University, College Station, TX 77843, USA
^eWm Michael Barnes ‘64 Department of Industrial and Systems Engineering, Texas A&M University, College Station, TX 77843, USA
^fDepartment of Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
^gComputing & Data Sciences Directorate, Brookhaven National Laboratory, Upton, NY 11973, USA

Received 10th October 2025 , Accepted 22nd May 2026

First published on 4th June 2026

Abstract

Phase diagrams are fundamental for understanding phase stability and guiding the synthesis of new materials. However, constructing high-dimensional phase diagrams through exhaustive CALPHAD (CALculation of PHAse Diagrams) computations remains costly. We introduce a Bayesian Active Learning for Phase Diagram Discovery (BALPI) framework that efficiently identifies phase stability regions by adaptively sampling the thermodynamic space using uncertainty-aware acquisition strategies. BALPI integrates Gaussian Process Classifiers and Regressors within two complementary formulations—classification and level-set estimation—and introduces non-myopic Bayesian acquisition functions, including the Soft Mean Objective Cost of Uncertainty (SMOCU) and an extended straddle (e-straddle) criterion. Using CALPHAD-based phase stability predictions as the ground-truth oracle, BALPI achieves accurate reconstruction of phase boundaries with significantly fewer queries than conventional label propagation and label spreading baselines. Results on SiO₂–Al₂O₃–MgO and Ni–Ti–Hf–Cu systems demonstrate that BALPI captures disconnected phase regions and achieves consistent reductions in Bayesian error and computational cost. More importantly, this work establishes BALPI as a general framework for uncertainty-guided phase diagram discovery and highlights the potential of Bayesian active learning to accelerate computational thermodynamics and materials design, through the efficient exploration of the phase stability landscape at much lower costs relative to competing strategies.

1. Introduction

Phase stability underlies the design and optimization of all materials systems. The equilibrium phase constitution of a system depends on external or internal thermodynamic conditions—temperature, pressure, and composition—and is summarized in phase diagrams. From a thermodynamic standpoint, the equilibrium state corresponds to the global constrained minimum of the Gibbs free energy over all phases that may participate in equilibrium. The phase boundaries that comprise the phase diagram separate regions of thermodynamic space associated with distinct equilibrium phase assemblages. These diagrams of stability, in turn, serve as blueprints for understanding microstructural evolution and selecting processing routes that yield desired material properties.^1,2

The CALPHAD (CALculation of PHAse Diagrams) framework has become the de facto standard for constructing phase diagrams of complex alloy systems.^3,4 By parameterizing the Gibbs free energies of all relevant phases and minimizing the total free energy under specified conditions, CALPHAD enables the rigorous prediction of equilibrium phase stability across a desired thermodynamic space. These free energy functions are constructed by combining experimental phase stability measurements, thermo-chemical data, and first-principles calculations.⁵ While powerful, this process remains computationally and experimentally intensive—particularly for multi-component systems where exhaustive sampling of the compositional and thermal space is infeasible. Moreover, traditional grid-based exploration strategies fail to leverage the fact that certain compositions are disproportionately informative for thermodynamic model calibration and phase boundary discovery.

1.1. From grid-based sampling to active learning

Active Learning (AL) offers a principled approach to efficiently explore design spaces by sequentially selecting the most informative data points to label.⁶ In materials discovery, AL has proven effective in accelerating experimental design and phase diagram mapping.^7–11 For instance, the Phase-Mapper project employed AL to enhance phase identification in combinatorial thin-film libraries through adaptive sampling guided by X-ray diffraction feedback.^12,13 More recent studies applied label propagation and label spreading models to automate phase-diagram reconstruction.¹¹ However, these methods rely on heuristic uncertainty measures or KNN-based surrogates whose performance is highly sensitive to hyperparameter choices. In contrast, Gaussian Processes (GPs) provide a Bayesian, non-parametric alternative that enables closed-form uncertainty quantification and empirical Bayes hyperparameter optimization.^14,15 Building on this foundation, recent work has demonstrated how embedding physics-informed priors into the mean function of GP classifiers can improve constraint-aware sampling and accelerate phase stability discovery under limited data conditions.¹⁶

1.2. Bayesian active learning for CALPHAD-guided phase discovery

Here, we introduce a unified framework—Bayesian Active Learning for Phase Identification (BALPI)—that couples uncertainty-aware sampling with CALPHAD-based phase computations. Rather than re-parameterizing thermodynamic models, BALPI treats CALPHAD (implemented via Thermo-Calc) as a ground-truth oracle that returns either discrete phase labels or continuous phase-fraction scores. The learner iteratively queries new compositions in thermodynamic space to minimize global predictive uncertainty in the phase stability landscape. The use of a CALPHAD oracle mimics an experimental workflow in which phase stability is assessed composition-by-composition via synthesis, followed by characterization of the resulting phase constitution using microscopy, spectroscopy, or diffraction techniques.

We formulate phase discovery under two complementary views:

1. Classification: learning the stability domains of distinct phases as a multi-class decision problem using a Gaussian Process Classifier (GPC);

2. Level-set estimation: modeling a continuous latent function corresponding to phase-fraction or stability scores using a Gaussian Process Regressor (GPR).

Unlike prior active learning approaches for phase diagram construction that rely primarily on discrete phase labels and myopic uncertainty sampling,^7–11,17 BALPI unifies classification and level-set estimation formulations within a single Bayesian perspective. In particular, BALPI supports queries of either binary phase indicators or continuous phase-fraction or stability scores, enabling the use of both GPC and GPR models coupled with non-myopic acquisition functions such as Soft Mean Objective Cost of Uncertainty (SMOCU)^18,19 and extended straddle. Importantly, BALPI is agnostic to the source of phase information: while CALPHAD equilibrium computations are used in this work as a reproducible oracle for benchmarking, the framework is directly applicable to experimental workflows in which phase stability is determined via synthesis and characterization.

For adaptive sampling, we evaluate a suite of Bayesian acquisition functions, including: (i) Maximum Entropy Search (MES), which selects points with maximal expected information gain;²⁰ (ii) Bayesian Active Learning by Disagreement (BALD), which targets epistemic uncertainty via disagreement in posterior predictive distributions;²¹ (iii) a soft Mean Objective Cost of Uncertainty (SMOCU) criterion for non-myopic, outcome-aware exploration;^18,19 and (iv) an extended straddle (e-straddle) function that balances phase-boundary refinement with global uncertainty reduction.²²

1.3. Contributions and impact

Our contributions are threefold:

• We propose a modular Bayesian active learning platform (BALPI) for uncertainty-aware exploration of CALPHAD-computed phase diagrams.

• We formulate phase discovery as either a classification or a level-set estimation problem, combining GPC/GPR models with principled Bayesian acquisition functions.

• We empirically demonstrate that BALPI outperforms conventional label propagation and label spreading baselines on representative ternary and quaternary systems, achieving faster convergence and resolving disconnected phase regions that are often missed by deterministic methods.

Importantly, and beyond CALPHAD-related applications, BALPI establishes a general framework for autonomous phase diagram discovery where querying phase information—experimentally or via simulation—is costly. The combination of Gaussian process surrogates and Bayesian acquisition functions offers a robust pathway toward interpretable, uncertainty-aware materials exploration.

2. Related work

Active learning (AL) has emerged as a powerful strategy for accelerating the construction of phase diagrams and phase mapping in both computational and experimental materials science. Early efforts such as the Phase-Mapper Project^12,13 demonstrated how machine learning can guide experimental synthesis and phase identification from high-throughput X-ray diffraction data. Building upon these ideas, Terayama et al.¹¹ introduced an uncertainty-sampling strategy for efficient phase diagram construction by integrating label propagation and label spreading models with CALPHAD-based thermodynamic calculations.

More recent studies have incorporated explicit Bayesian or thermodynamics-informed models to further improve sampling efficiency. Ament et al.⁷ proposed a hierarchical Bayesian active learning framework capable of autonomously discovering nonequilibrium phase diagrams from experimental data. Abranches et al.⁹ developed a thermodynamics-informed Gaussian process (GP) active learning approach to estimate activity coefficients and build phase diagrams, emphasizing the integration of domain-specific priors. Dai et al.⁸ formulated phase diagram exploration as an active learning problem with adaptive sampling driven by model uncertainty, while Deffrennes et al.¹⁰ combined supervised machine learning with CALPHAD computations to directly predict phase boundaries. In parallel, Lookman et al.¹⁷ demonstrated that Gaussian process regression and Bayesian inference provide a flexible and interpretable foundation for modeling phase diagrams and efficiently sampling phase boundaries.

Despite these advances, most existing frameworks rely on myopic acquisition strategies or a single modeling formulation (typically classification), limiting their ability to identify disconnected or multimodal phase-stability regions. Our proposed Bayesian Active Learning for Phase Identification (BALPI) framework extends this line of research by (i) unifying classification and level-set estimation formulations under a single Bayesian perspective, (ii) introducing non-myopic and differentiable acquisition functions such as the Soft Mean Objective Cost of Uncertainty (SMOCU) and extended straddle (e-straddle), and (iii) demonstrating superior convergence and robustness on both glass-ceramic and shape-memory alloy systems. BALPI thus generalizes prior AL paradigms into a modular, uncertainty-aware framework for the autonomous exploration of CALPHAD-derived thermodynamic spaces.

3. Methods

The proposed Bayesian Active Learning for Phase Diagram Discovery (BALPI) framework provides a unified workflow for automated phase diagram exploration by combining CALPHAD-based thermodynamic modeling with Gaussian Process (GP)–based surrogate learning and Bayesian decision-making. As illustrated in Fig. 1, BALPI operates as a closed-loop system consisting of three modules: Model, Utility, and Query. In each iteration, the GP surrogate (either a classifier or a regressor) is trained on existing CALPHAD data—serving as a proxy for a fully experimental or computational campaign. The Bayesian acquisition function is then used to compute the expected information gain across all candidate compositions, and the most informative sample is selected for evaluation—either by querying the CALPHAD oracle or by performing a phase-stability experiment. The resulting phase label or stability score is appended to the dataset, and the loop continues until convergence in uncertainty or exhaustion of the sampling budget. This design allows BALPI to systematically map phase stability regions with minimal computational cost, serving as a general-purpose platform for uncertainty-guided exploration of thermodynamic systems.


	Fig. 1 Schematic of the BALPI (Bayesian Active Learning for Phase Diagram Discovery) workflow. The framework integrates Model, Utility, and Query modules: a Gaussian Process Classifier or Regressor serves as the surrogate model; Bayesian utilities such as Straddle, Entropy, Variance, or SMOCU guide sampling; and CALPHAD simulations or experiments provide new phase observations to augment the dataset, iteratively refining the phase diagram.

From a technical perspective, the BALPI workflow can be interpreted as a Bayesian decision-making process built on Gaussian Process (GP) surrogates. Each module in Fig. 1 corresponds to a distinct mathematical component: the Model stage defines a GP prior over the latent thermodynamic function f(x), the Utility stage evaluates an acquisition function U(x) that quantifies the expected benefit — such as information gain, uncertainty reduction, or decision-quality improvement — from evaluating a new query, and the Query stage provides ground-truth phase data from CALPHAD or experiments to update the dataset. This modular decomposition allows the active learning process to be formalized within a unified probabilistic framework, where the objective is to minimize the epistemic uncertainty of the surrogate model while efficiently identifying phase boundaries. The following subsections provide the mathematical formulation of this framework, including (i) the problem setup, (ii) the Gaussian Process surrogate models for phase prediction, (iii) the Bayesian acquisition functions used to guide exploration, and (iv) the complete active learning loop that defines the BALPI algorithm.

3.1. Problem formulations

Table 1 summarizes the notations used throughout the BALPI framework. The goal of BALPI is to efficiently discover the phase stability landscape of a multi-component material system within a specified thermodynamic space. Let the system be parameterized by composition

temperature T, and pressure P. Each query to the thermodynamic model or experiment returns a phase indicator, which can be represented as either a discrete label

y ∈ {0,1},

where y = 1 indicates that the queried design x belongs to the phase of interest and y = 0 otherwise, or a continuous score

s ∈ [0,1],

such as the predicted phase fraction or stability probability obtained from CALPHAD-based simulations (e.g., Thermo-Calc). The dataset at iteration t is thus denoted as

depending on the availability of discrete or continuous outputs.

Table 1 Summary of notation used in the Bayesian Active Learning for Phase Diagram Discovery (BALPI) framework

Symbol	Description
	Composition vector in d-dimensional design space
T, P	Temperature and pressure defining thermodynamic conditions
y ∈ {0, 1}	Binary phase indicator (1: Phase of interest, 0: Otherwise)
s ∈ [0, 1]	Continuous stability score or phase fraction (e.g., CALPHAD output)
f(x)	Latent phase-stability function modeled by the GP surrogate
	Dataset at iteration
K(x, x′)	Covariance (kernel) function defining GP prior correlation
a, l	Kernel hyperparameters: Amplitude and length scale
µ(x), σ²(x)	GP posterior mean and variance at x
U(X)	Bayesian acquisition (utility) function guiding query selection
U_SMOCU(x)	Soft mean objective cost of uncertainty (decision-theoretic utility)
U_e-straddle(x)	Extended straddle utility (heuristic boundary-based utility)
T	Active learning iteration
g(·)	Transformation in e-straddle reward term (g(r) = r, r², or )
Β	Hyperparameter balancing exploration and exploitation (e-straddle)
K	Softness parameter in S-MOCU controlling LogSumExp smoothness
	Final dataset after T active learning iterations
	Predictive phase probability from GP surrogate
	Posterior variance of latent function (epistemic uncertainty)
x_t+1	Next query selected by maximizing U(x)
Τ	Decision threshold for phase boundary (typically t = 0.5)

The objective of BALPI is to construct a surrogate model that accurately approximates the phase stability function across while minimizing the number of expensive CALPHAD evaluations or experimental measurements. This objective can be formalized as minimizing the total model uncertainty over the domain:

where f(x) denotes the latent function representing the thermodynamic state (e.g., the phase indicator score).

At each iteration, BALPI actively selects the next query x_t+1 according to a Bayesian acquisition function U(X):

where U(x) quantifies the expected information gain or uncertainty reduction that would result from evaluating the phase at x. This active querying strategy enables targeted sampling near phase boundaries, where uncertainty in phase stability is typically the highest, thereby accelerating convergence of the phase diagram.

In this study, we focus on the ternary system SiO₂–Al₂O₃–MgO as a representative case for demonstrating the efficiency of the proposed framework. However, the formulation is general and applicable to arbitrary multicomponent systems and thermodynamic conditions.

3.1.1 Feature preprocessing and compositional constraints. In all experiments, the model inputs are composition fractions, so the feasible design space is a compositional simplex rather than an unconstrained hypercube. We therefore parameterize the surrogate directly in terms of independent composition coordinates and enforce the sum-to-one constraint analytically. For the ternary SiO₂–Al₂O₃–MgO case for example, the independent inputs include (x, y) and the third coordinate is recovered as z = 1 − x − y, with only feasible points satisfying x, y, z ∈ [0, 1] retained. For the quaternary NiTiHfCu case, compositions are parameterized as Cu_xHf_yNi_z/2Ti_z/2 with x + y + z = 1, so the independent variables again lie on a simplex that uniquely determines the full composition. Because these variables are already expressed as normalized mole fractions on [0,1], we do not introduce an additional z-score or min–max feature scaling step beyond enforcing the compositional feasibility constraint.

Due to differences in problem formulation, we employ Gaussian Process Classification (GPC) and Gaussian Process Regression (GPR) as predictive models for the classification and regression-based versions of BALPI, respectively. Each formulation is paired with its own set of Bayesian utility functions. The predictive models and associated utility functions are summarized in Fig. 2, and the full BALPI workflow is illustrated in Fig. 1. We describe both components in more detail below.


	Fig. 2 Predictive models and Bayesian utility functions used in the two phase identification formulations of the BALPI framework. The classification formulation employs Gaussian Process Classification with MES, BALD, MOCU, or SMOCU utilities, while the level-set formulation uses Gaussian Process Regression with the straddle utility.

3.2. Gaussian Process surrogates

In Bayesian Active Learning, the predictive surrogate plays a central role by providing both mean estimates and principled uncertainty quantification required to compute the acquisition utility. Gaussian Processes (GPs) are a natural choice for this purpose due to their analytical tractability, interpretability, and inherent ability to model predictive uncertainty. They have been widely adopted as surrogate models in Bayesian optimization and Bayesian experimental design,²³ particularly in problems with moderate input dimensionality such as the phase identification tasks considered in this work. We employ two GP variants: the Gaussian Process Classifier (GPC) for phase classification and the Gaussian Process Regressor (GPR) for level-set estimation. Both models share a common prior formulation, but differ in their likelihood models.

3.2.1 Gaussian Process Classifier (GPC). For classification-based phase identification, we define a data set

with labels y_i ∈ {0, 1} that indicate the presence or absence of a specific phase. The GPC assumes a latent function f(x) drawn from a Gaussian process prior:


	(1)

where K(·,·) is a positive-definite kernel function encoding the smoothness of f. Each label y_i follows a Bernoulli likelihood with a sigmoid link function:


	(2)

where σ(·) denotes the logistic sigmoid. Because this likelihood is non-Gaussian, the posterior

is analytically intractable. Approximate inference methods such as Laplace Approximation (LA)²⁴ and Expectation Propagation (EP)²⁵ are therefore employed. We adopt EP in this work for its superior accuracy in approximating the latent function posterior through iterative local Gaussian refinement. The resulting predictive distribution

provides class probabilities and corresponding predictive uncertainty, which serve as the foundation for evaluating uncertainty-aware acquisition functions such as SMOCU.

3.2.2 Gaussian Process Regressor (GPR). For level-set estimation and continuous phase-score modeling, we employ a Gaussian Process Regressor defined on the dataset

where each observation follows

with σ²_n representing Gaussian observation noise. Assuming the same prior as in eqn (1), the posterior predictive distribution at any new input x_* is Gaussian:


	(3)

with closed-form expressions for the mean and variance:


	(4)


	(5)

where X = [x₁, …, x_N], y = [y₁^c,…,y_N^c]^⊤, and the kernel matrix K = [k₁, …, k_N], k_j = [K(x₁,x_j),…,K(x_N,x_j)]^⊤, k_∗ = [K(x₁,x_∗),…,K(x_N,x_∗)]^⊤. The predictive variance σ²(x_∗) naturally captures the epistemic uncertainty of the model and provides the quantitative basis for active sampling in level-set estimation tasks.

3.2.3 Kernel function and hyperparameter estimation. The covariance kernel K(x, x′) specifies the correlation structure of the GP prior and governs the smoothness of the learned phase landscape. Among standard choices such as the Radial Basis Function (RBF), Matérn, and linear kernels, we employ the RBF kernel for its interpretability and smooth generalization:


	(6)

where a and l denote the kernel amplitude and length-scale hyperparameters, respectively. These hyperparameters are estimated using empirical Bayes by maximizing the marginal likelihood:


	(7)

where K_a,l is the kernel matrix constructed from K_a,l(x, x′). The resulting GP surrogate thus provides both mean predictions and calibrated uncertainty estimates, forming the probabilistic foundation for the acquisition functions described in the following section.

3.3. Bayesian acquisition functions

The acquisition function determines the sampling policy in Bayesian Active Learning (BAL) by quantifying the expected information gain or uncertainty reduction from evaluating a new point. In this work, we focus on two complementary acquisition strategies: a decision-theoretic utility based on the Soft Mean Objective Cost of Uncertainty (S-MOCU) and a heuristic, uncertainty-driven criterion termed extended straddle (e-straddle). Both approaches rely on the predictive uncertainty provided by the Gaussian Process surrogates described in Section 3.2.

3.3.1 Soft Mean Objective Cost of Uncertainty (S-MOCU). The Mean Objective Cost of Uncertainty (MOCU) framework^18,19 quantifies the expected degradation in performance caused by model uncertainty. In the context of phase identification, MOCU measures the expected loss in decision quality that arises from uncertainty in the phase stability classifier across the design space. To minimize this loss, the MOCU-based acquisition policy selects the next query that maximally reduces global uncertainty in a one-step look-ahead manner.

We adopt a Soft MOCU (S-MOCU) formulation that relaxes the original non-differentiable MOCU objective by replacing the max operator with a smooth LogSumExp approximation. This results in a strictly concave and differentiable utility function, enabling efficient gradient-based optimization over the design space.

The S-MOCU utility is defined as


	(8)

where x is the candidate composition to query, x′ spans all points in the design space, and

denotes the predictive distribution after a hypothetical observation y at x. The parameter k > 0 controls the smoothness of the LogSumExp relaxation: as k → ∞, the expression converges to the original MOCU objective, while smaller k yields smoother gradients. By softening the discontinuous maximum, S-MOCU preserves the decision-theoretic interpretation of MOCU while offering improved numerical stability and more efficient optimization. Its one-step look-ahead nature ensures globally informed exploration that adaptively focuses sampling in regions of high phase uncertainty.

3.3.2 Extended straddle (e-straddle). While S-MOCU provides a principled decision-theoretic approach, the straddle criterion²⁶ offers a computationally efficient, heuristic alternative particularly suited for level-set estimation tasks. The classical straddle utility,


U_straddle(x) = −\|µ(x)\| + 1.96σ(x),	(9)

can be interpreted as an upper-confidence-bound (UCB) rule that balances proximity to the decision boundary (exploitation) and prediction uncertainty (exploration).

We generalize this formulation and propose the extended straddle (e-straddle) utility:


U_e-straddle(x) = −g(\|µ(x) − τ\|) + βσ(x),	(10)

where τ denotes the decision threshold (typically τ = 0.5), β is a tunable coefficient controlling the exploration–exploitation balance, and g(·) is a transformation function applied to the reward term. The transformation modulates the sampling preference with respect to the distance from the phase boundary. Specifically:

• g(r) = r recovers the standard straddle criterion;

• g(r) = r² increases the penalty for points farther from the boundary, encouraging broader exploration;

• reduces this penalty, refining sampling near the boundary.

By adjusting g(·), the e-straddle framework flexibly interpolates between exploratory and exploitative behaviors, allowing efficient and interpretable control of the active sampling dynamics. While heuristic in nature, e-straddle remains computationally lightweight and performs competitively against more complex Bayesian criteria such as S-MOCU.

3.3.3 Comparison and complementarity. The two acquisition functions are conceptually complementary: S-MOCU explicitly minimizes the expected global cost of uncertainty and thus offers a theoretically grounded approach to phase-space exploration, whereas e-straddle provides a simpler yet effective heuristic for rapidly refining phase boundaries. Together, these two strategies enable both globally aware and locally adaptive sampling within the proposed BAL framework.

3.4. Active learning workflow implementation

The proposed Bayesian Active Learning for Phase Identification (BALPI) framework integrates Gaussian Process surrogates with Bayesian acquisition strategies to adaptively construct phase diagrams with minimal thermodynamic evaluations. Fig. 1 provides an overview of the iterative workflow, and Algorithm 1 summarizes the full procedure.

3.4.1 Workflow overview. The BALPI workflow consists of four main stages:

1. Initialization. A small set of initial sampling points {x_i}^N₀_i=1 is selected using a low-discrepancy design (e.g., Latin hypercube sampling) or uniformly over the design space . Phase labels or scores {y_i} are obtained via CALPHAD calculations or experiments, forming the initial dataset .

2. Surrogate model training. A Gaussian Process surrogate (GPC or GPR, depending on whether the task involves classification or level-set estimation) is trained on . The surrogate provides predictive mean µ(x) and uncertainty σ²(x) for all candidate compositions.

3. Acquisition and query selection. The acquisition function U(X) is evaluated across the design space using either the S-MOCU utility for global uncertainty reduction or the e-straddle utility for efficient boundary refinement. The next query point is chosen as

The new phase observation y_t+1 is then obtained and appended to the dataset:

4. Update and convergence. The surrogate is retrained with the updated dataset , and the process repeats until a predefined stopping criterion is met, such as (i) the maximum predictive uncertainty falls below a tolerance, or (ii) the number of allowed evaluations T is reached.

This iterative loop efficiently allocates computational or experimental resources to the most informative regions of the thermodynamic space—typically near multi-phase boundaries—resulting in a rapidly convergent and uncertainty-aware phase mapping.

Algorithm 1 Bayesian Active Learning for Phase Identification (BALPI)

Require: design space

, initial dataset

, surrogate model (GPC or GPR), acquisition function U(x), maximum iterations T

1: For t = 0 to T − 1 do

2: Train GP surrogate on

3: Evaluate acquisition function U(x) across

4: Select next query

5: Obtain observation y_t+1 from CALPHAD or experiment

6: Update dataset

7: End for

8: Return final surrogate model and predicted phase map

3.4.2 Implementation and integration with CALPHAD. In this work, the BALPI framework interfaces directly with the Thermo-Calc software package to obtain CALPHAD-based phase stability data. The surrogate and acquisition functions are implemented in Python using NumPy, allowing efficient training, batched evaluation, and uncertainty propagation. The modular design of the platform enables future extensions to higher-dimensional or multi-fidelity settings, as well as integration with experimental automation systems.

4. Results and discussion

We evaluate the proposed Bayesian Active Learning for Phase Identification (BALPI) framework on two representative material systems: (1) the SiO₂–Al₂O₃–MgO glass-ceramic glazes system, and (2) the NiTiHfCu shape memory alloy system. These systems highlight different aspects of the framework: the former involves the discovery of well-separated phase stability regions (Spinel and Mullite phases) within a ternary diagram, whereas the latter features multiple disconnected stability domains for the BCC–B2 phase within a more complex composition space. Across both studies, we compare BALPI with prior non-Bayesian approaches based on Label Propagation (LP) and Label Spreading (LS),¹¹ as well as with other Bayesian active learning utilities including Maximum Entropy Search (MES),²⁰ Bayesian Active Learning by Disagreement (BALD),²¹ and Mean Objective Cost of Uncertainty (MOCU).^18,19 Performance is evaluated using the Bayesian error metric, which measures the expected phase identification uncertainty over the design space.

4.1. Evaluation metric

To compare methods consistently, we report the Bayesian error, which measures the expected misclassification probability under the surrogate's predictive distribution:


	(11)

In practice, we evaluate (11) as an empirical average over a dense composition grid.

4.1.1 Level-set evaluation. For the level-set formulation with GPR, the surrogate models a latent continuous field

and a phase boundary threshold τ. We convert the regression output to phase probabilities via


	(12)


	(13)

These probabilities are then used in (11) for a fair comparison with classification-based methods.

4.2. Phase identification for the SiO₂–Al₂O₃–MgO glass-ceramic system

We first test the performance of BALPI on the SiO₂–Al₂O₃–MgO ternary system, where the objective is to identify the regions of the Spinel and Mullite phases. Both phases are of technological importance due to their high-temperature stability and chemical resistance.^27,28 The composition space is parameterized as (SiO₂)_x–(Al₂O₃)_y–(MgO)_z with the compositional constraint x + y + z = 1. Each query returns a binary phase label, where y = 1 corresponds to the phase of interest (Spinel or Mullite) and y = 0 otherwise. The ground-truth phase diagram used for validation is shown in Fig. 3a.


	Fig. 3 (a) Ground-truth phase diagram of (SiO₂)_x–(Al₂O₃)_y–(MgO)_z system.²⁸ (b and c) Bayesian error over active learning iterations for Spinel and Mullite phase identification. The proposed BALPI methods achieve significantly lower Bayesian errors than the benchmark LP and LS approaches.

Fig. 3b and c report the Bayesian error of the competing methods for Spinel and Mullite identification, respectively. In both cases, the proposed BALPI methods achieve substantially lower Bayesian error than LP and LS throughout the active learning iterations. Among all acquisition strategies, the SMOCU-guided BAL exhibits the fastest error decay, consistently outperforming MES and other utilities. This advantage stems from SMOCU's one-step-look-ahead formulation, which explicitly accounts for the global uncertainty reduction effect of each query. Although the e-straddle utilities do not always achieve the best performance, they maintain a steady improvement trend and perform markedly better than the non-Bayesian LP and LS baselines. These results demonstrate that the BALPI framework effectively balances exploration and exploitation, even in relatively simple ternary systems.

4.3. BCC–B2 phase identification for the NiTiHfCu shape memory alloy system

We next evaluate BALPI on the quaternary NiTiHfCu shape memory alloy (SMA) system, a material of increasing interest for high temperature actuation applications.^29,30 Recent work has shown that NiTiHfCu alloys offer a promising route to achieve high transformation temperatures (T_trans > 250 °C) while reducing the large thermal hysteresis that historically has limited the cyclic stability and actuator efficiency of conventional NiTiHf systems.³¹ In particular, Cu additions of 5–15 at% have been shown to promote the formation of B19 orthorhombic martensite in place of B19′, enhancing crystallographic compatibility and dramatically reducing hysteresis. However, this benefit is counterbalanced by the strong sensitivity of transformation temperatures to both Cu and Ni content, as well as the potential for brittle precipitate formation in Hf-rich regimes. As demonstrated in our recent constrained Bayesian optimization campaign,³¹ the thermodynamic design space of NiTiHfCu is extremely sparse, with only a narrow and disconnected manifold yielding compositions that simultaneously meet target martensite start temperatures (250–350 °C) and hysteresis constraints (<50 °C). The BCC–B2 austenite phase is of particular importance, as it is the only parent phase from which a reversible martensitic transformation can occur. Accordingly, the goal is to localize and characterize regions of high BCC–B2 phase fraction at 800 K with minimal queries to CALPHAD (or, more critically, experimental) oracles. This presents a stringent test for BALPI, which must balance global exploration and local refinement to recover small, topologically fragmented stability domains embedded in a high-dimensional composition-processing space.

The composition space in this system is parameterized as Cu_xHf_yNi_z/2Ti_z/2, subject to x + y + z = 1. As mentioned above, this system is particularly challenging because the BCC–B2 phase domains are disjoint and separated by multi-phase regions.^32–34 In this work, we perform two sets of experiments: (i) a binary phase classification, where compositions with BCC–B2 fraction >0.8 are labeled as y = 1, and (ii) a continuous level-set estimation, where the phase stability score itself is directly queried. The corresponding ground-truth phase maps are shown in Fig. 4a and b.


	Fig. 4 (a and b) Target BCC–B2 phase maps for the NiTiHfCu system at 800 K. (c) Bayesian error over active learning iterations showing that all BALPI-based methods outperform LP and LS baselines. The continuous-query e-straddle(c) achieves the fastest convergence.

As illustrated in Fig. 4c, all methods under the BALPI framework achieve lower Bayesian errors than LS and LP for the same query budget. Notably, the e-straddle utilities outperform SMOCU and MES in this task, especially in the early iterations where phase boundary localization is critical. Among the two e-straddle formulations, the continuous-query version (e-straddle(c)) achieves faster convergence than its binary counterpart (e-straddle(b)).

Fig. 5 visualizes the queried compositions after 80 iterations for representative methods. Benchmark approaches such as label propagation (LP) tend to oversample large regions of low uncertainty, often missing smaller—and practically critical—pockets of BCC–B2 stability. In contrast, BALPI variants guided by SMOCU and e-straddle successfully identify multiple disconnected phase regions within the sparse stability landscape. This improved sampling efficiency arises from: (i) SMOCU's global, non-myopic utility formulation, which accounts for the expected uncertainty reduction impact of each query; and (ii) e-straddle's adaptive balancing of exploration and exploitation, modulated by the transformation function g(·).


	Fig. 5 Final queried compositions and predicted BCC–B2 phase distributions after 80 iterations. Benchmark methods such as LP and MES tend to oversample low-uncertainty regions, missing smaller, disconnected BCC–B2 stability domains. In contrast, SMOCU- and e-straddle–guided BALPI efficiently identify multiple sparse, non-contiguous BCC–B2 regions, highlighting their ability to navigate complex phase landscapes with higher sampling efficiency.

4.3.1 Quantitative comparison. Table 2 reports the final Bayesian error after 80 iterations for different transformation functions g(·) and exploration weights β in the e-straddle utility. For the continuous-query formulation, all transformations outperform the LP and LS baselines, with the best performance achieved by

when β = 2. This result highlights the benefit of flexible reward shaping in modulating exploration–exploitation behavior. Table 3 summarizes the final Bayesian error for different initial sample sizes. In all configurations, the BALPI variants, particularly SMOCU and e-straddle(c), consistently achieve the lowest errors, demonstrating the robustness of the framework with respect to initialization. Importantly, Table 3 also provides a direct quantitative comparison against the representative Bayesian baselines BALD and MES. Across all three initialization settings, SMOCU and e-straddle(c) achieve lower final Bayesian error than both BALD and MES, showing that the gains of BALPI are not limited to comparisons with LP/LS alone.

Table 2 Final Bayesian error of the e-straddle utility with different transformation functions g(·) and hyperparameters β. Bold values denote the best performance for binary and continuous query functions, respectively

	e-straddle(b)			e-straddle(c)
Transformation	g(r) = r	g(r) = r²		g(r) = r	g(r) = r²
β = 0.5	0.0492 ± 0.0218	0.0493 ± 0.0143	0.0794 ± 0.0419	0.0479 ± 0.0224	0.0442 ± 0.0106	0.0510 ± 0.0157
β = 1	0.0403 ± 0.0067	0.0542 ± 0.0218	0.0660 ± 0.0408	0.0452 ± 0.0134	0.0526 ± 0.0167	0.0396 ± 0.0134
β = 2	0.0482 ± 0.0100	0.0524 ± 0.0073	0.0423 ± 0.0096	0.0561 ± 0.0137	0.0515 ± 0.0134	0.0388 ± 0.0094

Table 3 Final Bayesian error after 80 iterations with varying numbers of randomly selected initial samples. SMOCU and e-straddle(c) achieve the lowest errors across all configurations

	LP	LS	BALD	MES	SMOCU	e-straddle(b)	e-straddle(c)
4 samples	0.107 ± 0.008	0.108 ± 0.010	0.107 ± 0.006	0.060 ± 0.009	0.058 ± 0.022	0.050 ± 0.010	0.045 ± 0.014
9 samples	0.097 ± 0.010	0.096 ± 0.009	0.103 ± 0.009	0.060 ± 0.013	0.048 ± 0.007	0.056 ± 0.010	0.041 ± 0.004
16 samples	0.098 ± 0.013	0.098 ± 0.005	0.103 ± 0.013	0.045 ± 0.007	0.041 ± 0.003	0.046 ± 0.015	0.038 ± 0.010

4.3.2 Summary of findings. Across both material systems used as demonstration problems, the BALPI framework consistently achieves faster convergence and lower Bayesian error than existing active learning and non-Bayesian phase identification methods. SMOCU provides globally informed sampling and robust recovery of sparse phase regions, while e-straddle offers an interpretable and computationally efficient alternative that performs especially well near complex phase boundaries. Together, these results demonstrate that BALPI constitutes a unified, uncertainty-aware, and computationally efficient approach for adaptive phase mapping in both discrete and continuous formulations.

4.4. Discussion

4.4.1 Interpretation of the Bayesian error metric. In this work, we adopt the Bayesian error as a unified performance metric for both classification and level-set formulations. Unlike standard accuracy or mean squared error, which quantify deviation from ground truth labels in a deterministic sense, the Bayesian error reflects expected probability of misclassification under the predictive posterior of the model. Formally, it is defined as the integral over the input space of the uncertainty-weighted misprediction of the model, as shown in eqn (11). This probabilistic interpretation is particularly well suited for Bayesian active learning, where the objective is to minimize predictive uncertainty as efficiently as possible—rather than to simply maximize accuracy on a fixed test set. A lower Bayesian error thus indicates that the model achieves not only greater predictive accuracy but also higher calibrated confidence with respect to the ground-truth phase distribution. Importantly, the Bayesian error can be computed directly from the model's predictive distributions without requiring a separate validation set, making it an interpretable, uncertainty-aware, and model-agnostic evaluation criterion.

4.4.2 Practical benefits of Bayesian active learning for CALPHAD-guided discovery. A central motivation of this work is to demonstrate that uncertainty-aware sampling can dramatically reduce the computational effort required to map phase stability regions from CALPHAD simulations. Exhaustive CALPHAD computations over dense ternary or quaternary grids are typically infeasible due to the combinatorial growth of sampling points and the time cost of each equilibrium calculation. For instance, generating a full pseudo-ternary grid with 100 × 100 points may require several hours to days of computation, even when parallelized. By contrast, our BALPI framework converges to a comparable or superior phase boundary description using only tens to a few hundred adaptively selected samples—when considering an experimental campaign, the sample efficiency of BALPI relative to competing strategies is much more significant. BALPI's sample efficiency arises from the ability of the Bayesian utility functions—such as SMOCU and e-straddle—to automatically focus new queries near high-uncertainty regions, effectively balancing exploration of poorly sampled areas with exploitation near phase boundaries. The resulting acceleration makes BALPI well-suited not only for CALPHAD-based studies but also for integration into experimental design loops where querying the phase stability domain of target materials involves significant time or material cost.

4.4.3 Strengths and limitations. The results on the glass-ceramic (SiO₂–Al₂O₃–MgO) and shape-memory alloy (Ni–Ti–Hf–Cu) systems demonstrate that BALPI can capture disconnected phase stability regions and avoid overconfidence in sparsely sampled domains. The modularity of the framework enables flexible combinations of predictive models (GPC or GPR) and acquisition functions (MES, BALD, SMOCU, e-straddle), making it extensible to a wide range of materials systems. However, several limitations should be noted. First, Gaussian processes are most computationally efficient in low-dimensional spaces; direct scaling to systems beyond quaternary composition spaces may require sparse GP or deep kernel learning variants. Second, the current implementation assumes that CALPHAD serves as a reliable oracle and does not explicitly handle uncertainties in the thermodynamic database itself. Extending BALPI to jointly account for CALPHAD parameter uncertainty or experimental measurement noise would provide a more comprehensive uncertainty model. Finally, while SMOCU and e-straddle balance exploration and exploitation effectively, their optimization relies on empirical hyperparameters (k, β), and adaptive tuning of these parameters could further improve robustness across different systems.

4.4.4 Outlook. Despite these limitations, the present study establishes BALPI as a general and interpretable Bayesian active learning framework for phase diagram discovery. By unifying classification and level-set formulations under a probabilistic surrogate model, BALPI provides a natural interface between physics-based CALPHAD models and data-driven exploration strategies.

Looking ahead, the methodology presented here can be seamlessly extended to experimental workflows, enabling closed-loop autonomous materials design in which each new experiment is selected based on quantified uncertainty. In such settings, experimentally determined phase constitution states could be used not only for exploration, but also to refine existing CALPHAD models,³⁵ including those constructed from computational approaches such as machine-learned interatomic potentials.^36,37 Future work will focus on coupling BALPI with high-throughput synthesis and characterization platforms to enable such closed-loop integration.

While the present study focuses on binary phase identification to efficiently localize the stability region of a target phase, the BALPI framework is not intrinsically limited to two-phase settings. Extension to multi-phase phase-diagram construction is straightforward. For example, multi-class Gaussian process classification can be employed to directly model multiple phase labels, or, alternatively, parallel level-set estimation of individual phase-fraction fields can be used to track several phase boundaries simultaneously. In either formulation, BALPI retains its core objective: efficient localization of decision boundaries in thermodynamic state space.

Beyond identifying complete phase fields, one may instead target the zero phase fraction (ZPF) hyperplane³⁸ for phase i, which separates regions of state space where phase i is present from those where it is absent.² This formulation reduces the problem to a one-versus-all classification task, allowing BALPI to operate exactly as implemented here, without modifying the original algorithm in any way.

Importantly, the collection of ZPF hyperplane projections for all phases constitutes the two-dimensional projection of a phase diagram in a system of arbitrary dimensionality.^38,39 Consequently, systematic identification of ZPF manifolds provides a scalable pathway toward high-dimensional phase-diagram reconstruction.

From a scalability perspective, extending BALPI to higher-dimensional composition or processing spaces will require surrogate models that can efficiently accommodate larger query budgets. While the present work employs standard Gaussian processes, future extensions will explore sparse and inducing-point GP formulations, batch acquisition strategies, and multi-fidelity active learning to balance computational tractability with uncertainty-aware exploration. These developments will enable BALPI to remain effective in quaternary and higher-order systems, where phase stability regions are often sparse, disconnected, and embedded in high-dimensional thermodynamic spaces.

5. Conclusion

In this work, we have introduced BALPI, a Bayesian active learning framework for adaptive phase diagram discovery grounded in CALPHAD thermodynamics. By unifying Gaussian Process Classification and Regression under a common probabilistic surrogate model, BALPI enables uncertainty-aware exploration through multiple acquisition strategies, including non-myopic formulations such as SMOCU and boundary-aware alternatives such as extended straddle. This dual capability—of supporting both discrete and continuous phase representations—makes BALPI a flexible and interpretable tool for phase mapping across a wide range of materials systems.

Demonstrated on two distinct thermodynamic systems—a ternary glass-ceramic and a quaternary shape memory alloy—BALPI consistently outperforms conventional sampling strategies, achieving faster convergence and lower Bayesian error while using a fraction of the CALPHAD query budget. Crucially, the framework succeeds even in scenarios with sparse and topologically fragmented stability regions, such as disconnected BCC-B2 pockets critical for the design and optimization of novel multi-component shape memory alloys. These results constitute strong evidence for the power of active, uncertainty-guided sampling grounded in Bayesian principles—not merely as a tool for efficient model training, but as an engine for discovery in systems where the ground truth occupies sparse, fragmented, or otherwise non-intuitive regions of chemical space.

Looking ahead, BALPI opens several promising research directions. First, its integration with high-throughput experimental workflows could enable closed-loop, autonomous discovery of phase boundaries, where surrogate uncertainty guides physical synthesis and characterization. Second, BALPI provides a natural interface for model refinement: experimentally observed phase constitutions could be used not just for exploration, but to iteratively update CALPHAD assessments—particularly those constructed from high-throughput DFT or MLIP data.³⁷ Third, embedding thermodynamic priors into the GP kernel—such as convexity constraints, symmetry relations, known limiting behaviors or topological rules of thermodynamic spaces—could improve sample efficiency and enhance generalization to unobserved regions of composition-temperature space.

Beyond phase diagrams, the BALPI paradigm is extensible to broader materials-informatics tasks involving phase selection, stability prediction, and structural motif classification. In systems governed by sharp transitions or highly constrained free energy landscapes, the combination of interpretable surrogate models and targeted information acquisition offers a principled foundation for self-driving materials platforms. As phase mapping increasingly becomes a bottleneck in alloy design and microstructure engineering, frameworks like BALPI will be central to accelerating both understanding and deployment of next-generation materials.

Author contributions

Fan: conceptualization, investigation, formal analysis, methodology, writing - original draft. Wang: investigation, formal analysis, writing - review & editing. Vazquez: investigation, writing - original draft. Zhou: investigation, formal analysis, methodology. Karaman: conceptualization, funding acquisition. Arróyave: conceptualization, supervision, writing - review & editing, funding acquisition. Qian: conceptualization, supervision, writing - review & editing, funding acquisition.

Conflicts of interest

The authors declare no competing interests.

Data availability

The code developed in this paper is publicly available at the following GitHub repository: https://github.com/MingzhouFan97/BALPI. The associated dataset is publicly archived on Zenodo at https://doi.org/10.5281/zenodo.19896908.

Acknowledgements

The authors would like to acknowledge support from National Science Foundation (NSF) Grants No. DMREF-2119103, DMREF-1905325, CCF-1553281, SHF-2215573, and IIS-2212419. Research was also partly sponsored by the Army Research Laboratory and was accomplished under Cooperative Agreement Number W911NF-22-2-0106 (HTMDEC-BIRDSHOT program). Computing resources at Texas A&M University were used for the calculations.

References

R. DeHoff, Thermodynamics in materials science, CRC Press, 2006 Search PubMed.
R. Arróyave, Phase stability through machine learning, J. Phase Equilib. Diffus., 2022, 43(6), 606–628 Search PubMed.
L. Kaufman and H. Bernstein, Computer Calculation of Phase Diagrams with Special Reference to Refractory Metals, Academic Press, New York, 1970 Search PubMed.
U. R. Kattner, The calphad method and its role in material and process development, Tecnol. Metal., Mater. Min., 2016, 13(1), 3 Search PubMed.
P. E. Turchi, I. A. Abrikosov, B. Burton, S. G. Fries, G. Grimvall, L. Kaufman, P. Korzhavyi, V. R. Manga, M. Ohno and A. Pisch, et al., Interface between quantum-mechanical-based approaches, experiments, and calphad methodology, Calphad, 2007, 31(1), 4–27 CrossRef CAS.
P. Ren, Y. Xiao, X. Chang, P.-Y. Huang, Z. Li, B. B. Gupta, X. Chen and X. Wang, A survey of deep active learning, ACM Comput. Surv., 2021, 54(9), 1–40 Search PubMed.
S. Ament, M. Amsler, D. R. Sutherland, M.-C. Chang, D. Guevarra, A. B. Connolly, J. M. Gregoire, M. O. Thompson, C. P. Gomes and R. B. Van Dover, Autonomous materials synthesis via hierarchical active learning of nonequilibrium phase diagrams, Sci. Adv., 2021, 7(51), eabg4930 Search PubMed.
C. Dai and S. C. Glotzer, Efficient phase diagram sampling by active learning, J. Phys. Chem. B, 2020, 124(7), 1275–1284 Search PubMed.
D. O. Abranches, E. J. Maginn and Y. J. Colón, Activity coefficient acquisition with thermodynamics-informed active learning for phase diagram construction, AIChE J., 2023, 69(8), e18141 Search PubMed.
G. Deffrennes, K. Terayama, T. Abe and R. Tamura, A machine learning–based classification approach for phase diagram prediction, Mater. Des., 2022, 215, 110497 Search PubMed.
K. Terayama, R. Tamura, Y. Nose, H. Hiramatsu, H. Hosono, Y. Okuno and K. Tsuda, Efficient construction method for phase diagrams using uncertainty sampling, Phys. Rev. Mater., 2019, 3(3), 033802 Search PubMed.
Y. Xue, J. Bai, R. Le Bras, R. Bernstein, J. Bjorck, L. L. L. Longpre, S. Suram, R. Van Dover, J. Gregoire, C. Gomes, Phase-mapper: an AI platform to accelerate high throughput materials discovery, in Proceedings of the AAAI Conference on Artificial Intelligence, 2017, vol. 31, pp. 4635–4642 Search PubMed.
J. Bai, Y. Xue, J. Bjorck, R. Le Bras, B. Rappazzo, R. Bernstein, S. K. Suram, R. B. Van Dover, J. M. Gregoire and C. P. Gomes, Phase mapper: Accelerating materials discovery with ai, Ai Magazine, 2018, 39(1), 15–26 Search PubMed.
C. E. Rasmussen and H. Nickisch, Gaussian processes for machine learning (gpml) toolbox, J. Mach. Learn. Res., 2010, 11, 3011–3015 Search PubMed.
J. Deely and D. Lindley, Bayes empirical bayes, J. Am. Stat. Assoc., 1981, 76(376), 833–841 Search PubMed.
C. Hardcastle, R. O’Mullan, R. Arroyave and B. Vela, Physics-informed gaussian process classification for constraint-aware alloy design, Digital Discovery, 2025, 4, 1884–1900 Search PubMed.
T. Lookman, P. V. Balachandran, D. Xue and R. Yuan, Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design, npj Comput. Mater., 2019, 5(1), 21 Search PubMed.
G. Zhao, E. Dougherty, B.-J. Yoon, F. J. Alexander and X. Qian, Bayesian active learning by soft mean objective cost of uncertainty, in: International Conference on Artificial Intelligence and Statistics, PMLR, 2021, pp. 3970–3978 Search PubMed.
G. Zhao, E. Dougherty, B.-J. Yoon, F. Alexander and X. Qian, Efficient active learning for gaussian process classification by error reduction, Adv. Neural Inf. Process. Syst., 2021, 34, 9734–9746 Search PubMed.
R. Jiang, H. Zhou, H. Wang and S. S. Ge, Maximum entropy searching, CAAI Trans. Intell. Technol, 2019, 4(1), 1–8 Search PubMed.
Y. Gal, R. Islam and Z. Ghahramani, Deep Bayesian active learning with image data, in International conference on machine learning, PMLR, 2017, pp. 1183–1192 Search PubMed.
A. Gotovos, Active learning for level set estimation, Master's thesis, Eidgenössische Technische Hochschule Zürich, Department of Computer Science, 2013 Search PubMed.
P. I. Frazier, A tutorial on bayesian optimization, arXiv, arXiv:1807.02811 Search PubMed.
K. Friston, J. Mattout, N. Trujillo-Barreto, J. Ashburner and W. Penny, Variational free energy and the laplace approximation, Neuroimage, 2007, 34(1), 220–234 Search PubMed.
H.-C. Kim and Z. Ghahramani, Bayesian gaussian process classification with the em-ep algorithm, IEEE Trans. Pattern Anal. Mach. Intell., 2006, 28(12), 1948–1959 Search PubMed.
B. Bryan, R. C. Nichol, C. R. Genovese, J. G. Schneider, C. J. Miller and L. A. Wasserman, Active learning for identifying function threshold boundaries, Advances in Neural Information Processing Systems 18 (NIPS 2005), 2005, pp. 163–170 Search PubMed.
S. A. Ansar, S. Bhattacharya, S. Dutta, S. Ghosh and S. Mukhopadhyay, Development of mullite and spinel coatings on graphite for improved water-wettability and oxidation resistance, Ceram. Int., 2010, 36(6), 1837–1844 Search PubMed.
M. Leśniak, J. Partyka, K. Pasiut and M. Sitarz, Microstructure study of opaque glazes from SiO2–Al2O3–MgO–K2O–Na2O system by variable molar ratio of SiO2/Al2O3 by FTIR and Raman spectroscopy, J. Mol. Struct., 2016, 1126, 240–250 Search PubMed.
H. Karaca, E. Acar, H. Tobe and S. Saghaian, NiTiHf-based shape memory alloys, Mater. Sci. Technol., 2014, 30(13), 1530–1544 Search PubMed.
H. E. Karaca, E. Acar, G. Ded, S. Saghaian, B. Basaran, H. Tobe, M. Kok, H. Maier, R. Noebe and Y. Chumlyakov, Microstructure and transformation related behaviors of a Ni45.3Ti29.7Hf20Cu5 high temperature shape memory alloy, Mater. Sci. Eng. A, 2015, 627, 82–94 Search PubMed.
J. Broucek, D. Khatamsaz, C. Cakirhan, S. H. Zadeh, M. Fan, G. Vazquez, K. Atli, X. Qian, R. Arroyave and I. Karaman, Design of high-temperature niticuhf shape memory alloys with minimum thermal hysteresis using Bayesian optimization, Acta Mater., 2025, 286, 120651 Search PubMed.
J. Liu, L. Zhu, X. Huang, G. Cai and Z. Jin, Investigation of the phase equilibria in ti-ni-hf system using diffusion triples and equilibrated alloys, Calphad, 2017, 58, 160–168 Search PubMed.
L. Lv, Z. Deng, H. Zhang, L. Liu and L. Zhang, Thermodynamic assessment of the ternary Ni–Ti–Hf system, CALPHAD, 2023, 81, 102546 Search PubMed.
K. Li and W. Zheng, Thermodynamic modeling of the Ni–Ti–Hf system, CALPHAD, 2023, 81, 102558 Search PubMed.
C. Kunselman, S. Zhu, D. Sariturk and R. Arroyave, Construction and tuning of calphad models using machine-learned interatomic potentials and experimental data: A case study of the pt-w system, arXiv, arXiv:2508.01028 Search PubMed.
S. Zhu, D. Sarıtürk and R. Arróyave, Accelerating calphad-based phase diagram predictions in complex alloys using universal machine learning potentials: Opportunities and challenges, Acta Mater., 2025, 120747 Search PubMed.
S. Zhu, D. Sariturk and R. Arroyave, Machine learning potentials for alloys: A detailed workflow to predict phase diagrams and benchmark accuracy, arXiv, arXiv:2506.16771 Search PubMed.
H. Gupta, J. E. Morral and H. Nowotny, Constructing multicomponent phase diagrams by overlapping ZPF lines, Scr. Metall., 1986, 20, 889–894 Search PubMed.
J. Morral and H. Gupta, A figure of merit for predicted phase diagrams, J. Phase Equilib., 1992, 13(4), 373–376 Search PubMed.

Footnote

† These authors contributed equally as first authors.

Click here to see how this site uses Cookies. View our privacy policy here.

Bayesian active learning to accelerate high throughput phase diagram exploration

Abstract

1. Introduction

1.1. From grid-based sampling to active learning

1.2. Bayesian active learning for CALPHAD-guided phase discovery

1.3. Contributions and impact

2. Related work

3. Methods

3.1. Problem formulations

3.2. Gaussian Process surrogates

3.3. Bayesian acquisition functions

3.4. Active learning workflow implementation

4. Results and discussion

4.1. Evaluation metric

4.2. Phase identification for the SiO2–Al2O3–MgO glass-ceramic system

4.3. BCC–B2 phase identification for the NiTiHfCu shape memory alloy system

4.4. Discussion

5. Conclusion

Author contributions

Conflicts of interest

Data availability

Acknowledgements

References

Footnote

4.2. Phase identification for the SiO₂–Al₂O₃–MgO glass-ceramic system