Benius
Dunn
a,
Javier
Meza-Arroyo
a,
Armi
Tiihonen
b,
Mark
Lee
c and
Julia W. P.
Hsu
*ac
aDepartment of Materials Science and Engineering, The University of Texas at Dallas, Richardson, Texas 75080, USA. E-mail: jwhsu@utdallas.edu
bDepartment of Applied Physics, Aalto University, Espoo, Finland
cDepartment of Physics, The University of Texas at Dallas, Richardson, Texas 75080, USA
First published on 7th January 2026
Neuromorphic computing hardware enables edge computing and can be implemented in flexible electronics for novel applications. Metal oxide materials are promising candidates for fabricating flexible neuromorphic electronics, but suffer from processing constraints due to the incompatibilities between oxides and polymer substrates. In this work, we use photonic curing to fabricate flexible metal–insulator–metal capacitors with solution-processible aluminum oxide dielectric tailored for neuromorphic applications. Because photonic curing outcomes depend on many input parameters, identifying an optimal processing condition through a traditional grid-search approach is unfeasible. Here, we apply multi-objective Bayesian optimization (MOBO) to determine photonic curing conditions that optimize the trade-off between desired electrical properties of large capacitance–frequency dispersion and low leakage current. Furthermore, we develop a human-in-the-loop (HITL) framework for incorporating failed experiments into the MOBO machine learning workflow, demonstrating that this framework accelerates optimization by reducing the number of experimental rounds required. Once optimization is concluded, we analyze different Pareto-optimal conditions to tune the dielectric's properties and provide insight into the importance of different inputs through Shapley Additive exPlanations analysis. The demonstrated framework of combining MOBO with HITL feedback can be adapted to a wide range of multi-objective experimental problems that have interconnected inputs and high experimental failure rates to generate usable results for machine learning models.
For neuromorphic devices to be used in temporal signal processing, they must exhibit short-term synaptic plasticity (STP) behavior, with a decreasing current response over time or ‘fading memory’ behavior. The STP characteristics have been widely seen in metal oxide memristors2–5 and thin-film transistors.6–8 Devices exhibiting STP often show hysteretic behavior, attributed to different time scales associated with defects at the semiconductor/dielectric interface or with mobile species in the dielectric, most often in the form of oxygen vacancies.5–7 Migration of mobile oxygen vacancies under electrical bias leads to a time-dependent buildup of a positive charge gradient within the dielectric, followed by a time-dependent relaxation as mobile vacancies diffuse back to their equilibrium distribution upon removing the electrical bias, resulting in hysteresis in neuromorphic transistor devices.7 Literature has also shown that the hysteretic transistor behavior necessary for STP is often accompanied by frequency dependence of capacitance, i.e., capacitance–frequency (C–f) dispersion, in metal–insulator–metal (MIM) structures with the dielectric.8–10
Metal oxides have shown promise for building flexible neuromorphic devices. However, there are challenges associated with using solution-processed metal oxides on flexible polymer substrates. These include high processing temperatures and a mismatch between the coefficients of thermal expansion of metal oxides and substrates, leading to processing restrictions to prevent mechanical failures. One approach to alleviate these problems is to convert the metal oxide films using photonic curing, which uses high-intensity broadband (200–1500 nm) light from a xenon flash lamp to anneal a film in a millisecond timescale (<20 ms) while leaving underlying substrates close to ambient temperature.11,12 Several interconnected input parameters control the photonic curing process, and optimizing them requires navigating a complex multi-dimensional input space, which is impractical with traditional experimental grid-search methods.
Bayesian optimization (BO) is a popular machine learning technique for optimizing black-box functions and has been used in materials science problems.13–18 By mapping input–output relationships via a surrogate model and sequential active learning that balances exploiting known relationships with exploring the areas of uncertainty, BO is an ideal method for optimizing processing problems with multiple input variables. While the original intent of BO is to find the global optimum of a single objective function over a pre-defined input space, this method can also be extended to optimize multiple objective functions simultaneously, known as multi-objective Bayesian optimization (MOBO).19–21 In most multi-objective problems, different objectives compete, so the global optimal values of each cannot be achieved simultaneously. The general goal of MOBO is thus to find a Pareto Frontier of optimal conditions, representing the set of conditions where one objective cannot be improved without degrading another.22
Based on the literature of hysteretic transistors,8–10 we use large C–f dispersion, the ratio of low to high-frequency capacitance, in a MIM capacitor as a metric for a desirable electrical property for the metal oxide dielectric. Additionally, minimal leakage current in a dielectric is necessary for practical transistor function, thus giving us a two-objective optimization problem. These two objectives are inherently competing, as minimizing leakage current requires a denser, more stoichiometric dielectric with fewer structural defects, whereas large C–f dispersion relies on defect states. This trade-off highlights the need for a multi-objective optimization approach. In this work, we apply MOBO to convert sol–gel Al2O3 dielectric using photonic curing for MIM structures on flexible polyethylene terephthalate (PET) substrates.
Unlike synthetic data, real experiments can produce results that fail, which cannot be used to train a machine learning surrogate model. To remedy this issue, we implement a human-in-the-loop (HITL) to incorporate unsuccessful photonic curing results into the MOBO workflow. Observations by human scientists provide fast and valuable information that is difficult for a machine to replicate, which would require defining success versus failure and adding another layer of characterization. Previous work has demonstrated the merits and methodology of incorporating subjective human inputs into BO.23,24 After incorporating HITL, we successfully inform the machine learning model about the input space with a higher probability of producing working devices while continuing to optimize electrical characteristics. Through simultaneously mitigating leakage current and maximizing C–f dispersion, we obtain a set of Pareto-optimal conditions that enable us to tune MIM characteristics for neuromorphic computing applications. We also apply SHapley Additive exPlanations (SHAP) analysis to understand the contributions of individual input parameters on each objective, enabling better insight into the physical processes affecting photonic curing. This optimization approach, which incorporates HITL knowledge, is applicable to a broad range of complex experimental and processing problems that are hindered by failed results unusable for training machine learning surrogate models, or that can benefit from incorporating domain expertise from humans.
:
5
:
1 volume ratio of methanol
:
DI water
:
glacial acetic acid for 5 seconds to remove the unexposed parts, followed by rinsing with DI water and drying with a N2 stream.
An initial set of 30 photonic curing conditions (Table S2, SI) is generated using the pseudorandom sampling method of Latin hypercube sampling (LHS) as the first step of the MOBO workflow (Fig. 1(a)). This initial sampling across the entire input space (distribution of LHS conditions is detailed in Fig. S1, SI) is used to convert Al2O3 dielectric films for MIM capacitors (Fig. 1(b)). Full MIM devices are then characterized by C–f and I–V measurements (Fig. 1(c)). The electrical results are used to train two separate Gaussian process regression (GPR) models as surrogates for the C–f dispersion and leakage current (Fig. 1(d)). To perform MOBO and fit GPR models, a quantitative numeric metric needs to be defined for both of these target properties. First, we define a proxy variable for C–f dispersion using the ratio of the areal capacitance value at 102 Hz to that at 106 Hz, C100
Hz/C1
MHz, of MIM capacitors. This should be maximized. Second, we define a proxy variable for gate leakage current, Ileakage, as the mean areal leakage current at ±5 V bias. Ileakage varies over several orders of magnitude; thus, we convert it to logarithmic values to ensure better performance with the GPR model. We use the absolute value, |log(Ileakage)|, to transform the optimization into a maximization problem for both target variables.
After building the initial surrogate models (posterior mean and their variance shown in Fig. S2, SI), an acquisition function is used in the subsequent rounds to generate the next set of photonic curing conditions to test, as depicted in Fig. 1(e). These conditions are used to fabricate new devices in each subsequent round, the electrical results of which are fed into the surrogate models. This active learning process is iterated, with a new Pareto Frontier of the model and measured data being generated each round. Optimization is deemed completed when all new electrical results fall within ±1 standard deviation of the model-predicted values. Rounds 1 through 5 of MOBO batch picking were conducted using the parallel expected hypervolume improvement (qEHVI) acquisition function.
In order to obtain a more diverse array of model predictions along the Pareto Frontier, later rounds implement a new MOBO batch picking strategy we call Pareto-UCB. We adapt and modify the method from Lukovic et al.26 by first evaluating the acquisition function for each GPR model, then calculating the Pareto Frontier of these acquisition values. We then select the next batch of conditions to test by choosing points along this newly calculated Pareto Frontier of acquisition function values. While Lukovic et al. use the mean posterior function of the GPR as the acquisition function, acqmean = μ, we employ upper confidence bounds (UCB) as our acquisition function, acqUCB = μ + β1/2σ, to increase exploration of unknown areas of the parameter space. µ is the GPR mean posterior model values, σ is the GPR posterior uncertainty, and β is the hyperparameter that controls exploration vs. exploitation. We choose a β value of 2 in this work to encourage exploration of the parameter space. We calculate the Pareto Frontier of non-dominated UCB values and select the next batch of conditions to test by greedily maximizing the hypervolume enclosed by this Frontier.
The GPR models, optimization with qEHVI acquisition function, and hypervolume calculation for Pareto-UCB batch picking are implemented in Python using the BoTorch package.27 GPR models use the ARD Matern 5/2 kernel, with kernel length scales fitted independently for each input variable in each model, hence allowing each input variable to impact each objective (C100
Hz/C1
MHz, |log(Ileakage)|) to a different extent.
Hz/C1
MHz or |log(Ileakage)| data. When only the results from functional devices are fed into the surrogate regression models for choosing the next test conditions, the BO algorithm is prone to continuously suggest processing conditions in regions that produce failed experiments, which it sees as unexplored areas. In cases where certain regions of the domain clearly outperform others, one possible solution is to shrink the domain space to focus on promising regions.28 However, in our experiment, this is infeasible due to the highly interconnected nature of the photonic curing input parameters. A set constant value of one input parameter can produce films ranging from unconverted to burned by altering the other four input parameters. While it has been shown that constraining input space as a function of multiple input parameters is possible,29 doing so requires knowledge of which input combinations will produce failed outcomes. As such, we need a way to inform the model which areas of the input space can successfully produce a film without prior knowledge, based on human-observed conversion outcome (Fig. 1(f)).
One approach is to constrain the acquisition function for next-batch BO picks using a binary classification model that predicts whether a given input condition will produce a successful (model value of 1) or failed (model value of 0) outcome, which can be based on either human observation or automated characterization.30–32 However, this approach is limited in that it does not give a physical model of different modes of failure, i.e., burning versus under-conversion in our experiment. Additionally, the methodology of binary classification cannot incorporate knowledge about conditions that give outcomes closer to success, such as partially burned and partially under-converted conditions. Following this concept, in order to constrain our MOBO in a way that accurately captures the different failure mechanisms and distinguishes between clear failures and partial failures close to success, we assign a numerical score to each photonic curing conversion outcome based on human observation depicted in Fig. 2(a). We assign unconverted = −1.0, partially converted = −0.5, converted = 0, partially burned = +0.5, and burned = +1.0. These values are then used to train a separate GPR model to represent film conversion outcome across the input space (Fig. 2(b)). This film conversion GPR is then used to calculate a probability distribution of yielding a functional film for a given photonic curing condition through a nonlinear transformation, similar to the methodology from Tiihonen et al.23 Specifically, we use a Gaussian function,
The UCB acquisition function of each objective is then weighted by the probability of successful film conversion, acqconstrained = acqUCB·Pconstraint, prior to each round of MOBO batch picking to favor the search space that is more likely to produce measurable devices and avoid the search space likely to produce process failures. Fig. 2(d) shows this process for a single input pair, radiant energy vs. number of pulses, and a single objective, C100
Hz/C1
MHz, using the GPR model of C–f dispersion. When the UCB acquisition function, Fig. 2(e), is constrained by Pconstraint, the new HITL-constrained acquisition function, Fig. 2(f), is suppressed in areas that are less likely to produce a functional film (high radiant energy in this example), and the next batch of conditions generated by the model is shifted accordingly. Fig. 2 depicts these for the single pair of photonic curing inputs. Representations for all pairs of photonic curing inputs are included in Fig. S4 (SI).
Hz/C1
MHz and |log(Ileakage)| values are used to build the first surrogate model. For the next two rounds of active learning, 15 photonic curing conditions are generated with MOBO and tested. But the failure rate of these selections is very high, with only 5 conditions producing measurable devices. Thus, we implement the HITL methodology discussed in Section 3.2 in subsequent rounds. Five additional active learning rounds are conducted. Electrical data from samples made in the fifth round fall within ± 1 standard deviation of the model-predicted Pareto Frontier. We therefore conclude that the MOBO with HITL framework has successfully identified the Pareto Frontier to within experimental uncertainty.
Fig. 3(a) shows the Pareto Frontier of the final GPR models of C100
Hz/C1
MHz and |log(Ileakage)| (blue open squares), as well as the data points for all photonic curing conditions that lead to functional devices (green diamonds) and the devices fabricated using Pareto optimal conditions (red diamonds with error bars showing standard deviation). Experimental values and error are determined from measuring 5 different MIM devices on a given sample. Based on prior experience, |log(Ileakage)| ≥ 4 is needed to obtain functional thin-film transistors on top of a gate dielectric. Fig. 3(b) shows a subset of the Pareto Frontier in this leakage current range of the photonically cured Al2O3 dielectric (standard deviation of the GPR model represented by the shaded region).
Also shown in Fig. 3(b) is a section of the model Pareto Frontier generated from the initial LHS conditions (black open squares), demonstrating that MOBO simultaneously improves both objectives, with the most significant improvement seen in |log(Ileakage)|. The evolution of the dominated hypervolume of the Pareto optimal conditions as function of active learning is shown in Fig. S5 (SI). The accuracy of the final GPR models for both objectives is discussed in the SI section ‘dominated hypervolume evolution & final GPR models’ accuracy’ and Fig. S6.
For Pareto optimal conditions with |log(Ileakage)| ≥ 4, there are several experimental photonic curing conditions with distinctly different dielectric characteristics. The two extreme electrical characteristics of MIMs made by Pareto-optimal processing conditions are MOBO conditions #40 and #66, which are highlighted in Fig. 3(c) and (d). Table 1 shows the photonic curing input parameters for these two conditions along with their respective objective values. Condition #40 shows the lowest leakage current (highest |log(Ileakage)|) and smallest C100
Hz/C1
MHz value, while condition #66 shows the highest leakage current (lowest |log(Ileakage)|) and largest C100
Hz/C1
MHz value. More in-depth analysis on the physical and chemical properties of samples made with these two conditions is presented in the SI section ‘Physical and chemical property comparison of dielectrics from MOBO conditions #40 and #66’ and Fig. S7 and S8.
| Input Values | Objective Values | ||||||
|---|---|---|---|---|---|---|---|
| Condition # | Radiant energy (J cm−2) | Pulse count | Pulse length (ms) | Micropulse count | Duty cycle (%) |
C
100 Hz/C1 MHz |
|log(Ileakage)| |
| 40 | 4.5 | 16 | 7 | 23 | 70 | 1.10 ± 0.04 | 6.05 ± 0.24 |
| 66 | 4.8 | 15 | 16 | 20 | 70 | 1.89 ± 0.46 | 4.26 ± 0.57 |
To demonstrate the effectiveness of our HITL methodology in improving device yield and thereby accelerating MOBO, we revisit our initial 30 LHS conditions, of which only 10 led to measurable devices, as depicted in Fig. 4(a). The initial two rounds, based solely on device results, suggest 15 conditions, of which only five successfully produce functional devices (Fig. 4(b)). To test HITL methodology, we restart MOBO using the 30 LHS results, now incorporating the unsuccessful conditions via the HITL method for two rounds of optimization, to make a direct comparison with results based solely on electrical characterization. For the two rounds of HITL + MOBO, the device yield is 9 out of 10 conditions, which is a marked improvement (Fig. 4(c)). This demonstrates that HITL incorporation can substantially increase the number of data points successfully generated from each round, even from the initial sampling stage. The final results of this comparison are depicted by the yield shown in Fig. 4(d). Thus, incorporating HITL will lead to faster convergence of MOBO, requiring fewer rounds of experiments and saving time and experimental resources. Summary of the methodology used in each round of MOBO is presented in Table S4 (SI), while photonic curing input conditions and the processing outcomes with and without HITL are compared in Table S5 (SI). While the HITL framework does hold the drawback of human scores being subjective, it is quick to implement and uses the domain expertise of the researchers, and can be tailored to match other experimental frameworks. If human scoring is less confident, or if multiple researchers are involved and scoring consistency is a concern, the HITL can be relaxed relative to the other metrics by adjusting the hyperparameter τ in the equation for Pconstraint described in Section 3.2.
![]() | ||
| Fig. 4 (a) Device yield outcomes for initial LHS. Two rounds of MOBO using (b) unconstrained acquisition function without HITL versus (c) HITL-constrained acquisition function described in Fig. 2. (d) Comparison of outcomes for both methodologies. Green represents successfully converted PC conditions that produce functional MIM devices. Blue represents unsuccessful PC conditions that cannot be used for device fabrication. (PC: photonic curing). | ||
To better understand the relationship between photonic curing variables and dielectric behavior, SHAP analysis is conducted on the GPR models for C100
Hz/C1
MHz and |log(Ileakage)|. SHAP is a unified approach based on game theory to interpret machine learning model outcomes.33 SHAP values provide a comparative measure of the magnitude of change each input variable has on the model output, as well as whether the relation is positive or negative. For example, if an input has high SHAP values for high input values and low SHAP values for low input values, that means that the model output has a strong positive correlation with that particular input. While SHAP values alone do not represent a direct correlation coefficient between model inputs and output, they are useful for demonstrating the magnitude and direction of how each model input contributes to the output.
SHAP value trends in Fig. 5 illustrate the competing nature of C100
Hz/C1
MHz and |log(Ileakage)| objectives. Fig. 5(a) shows that modeled C100
Hz/C1
MHz increases with lower radiant energy, lower pulse count, and longer pulse length. Conversely, Fig. 5(b) shows that modeled |log(Ileakage)| increases with higher radiant energy, higher pulse count, and shorter pulse length. Both objectives exhibit a small but positive relation with micropulse count. While |log(Ileakage)| has a positive relation with duty cycle, C100
Hz/C1
MHz is insensitive to duty cycle. SHAP value dependence on individual photonic curing parameters is further detailed in SI section ‘SHAP Analysis Scatter Plots for Individual Input Parameters’ and Fig. S9 and S10. The results of SHAP analysis on the GPR models reveal how different photonic curing parameters can have conflicting effects on the two objectives, as well as the magnitude of their respective effects.
These trends can be interpreted in terms of how photonic curing parameters affect the dielectric properties, with a high C–f dispersion being characteristic of a less converted, defect-rich oxide, while a low leakage current indicates a more converted, defect-free oxide layer. For example, increasing the number of pulses and the radiant energy delivered per pulse will promote further conversion from the nitrate precursor to a metal oxide film and subsequent densification.11 Similarly, decreasing pulse length for a given value of radiant energy will result in a higher intensity of light delivered during the pulse, reaching a higher peak temperature in the film34 that can also promote better conversion of the oxide. The photonic curing parameters and MIM characteristics of our two extreme MOBO conditions, #40 and #66, further confirm the SHAP analysis. The most significant difference in the input parameters for these two conditions (Table 1) is pulse length, with 7 ms for #40, which has a low leakage and small C–f dispersion, and 16 ms for #66, which has a high leakage and large C–f dispersion.
| This journal is © The Royal Society of Chemistry 2026 |