Open Access Article
Akihiro
Fujii
*a,
Anh Khoa Augustin
Lu
ab,
Koji
Shimizu
c and
Satoshi
Watanabe
a
aThe University of Tokyo, Department of Materials Engineering, Faculty of Engineering, Bldg. IV, 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-8656, Japan. E-mail: akihiro.fujii@cello.t.u-tokyo.ac.jp
bNational Institute for Materials Science (NIMS), 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
cNational Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Umezono, Tsukuba, Ibaraki 305-8568, Japan
First published on 29th October 2025
Materials design aims to discover novel compounds with desired properties. However, prevailing strategies face critical trade-offs. Conventional element-substitution approaches readily and adaptively incorporate various domain knowledge but remain confined to a narrow search space. In contrast, deep generative models efficiently explore vast compositional landscapes, yet they struggle to flexibly integrate domain knowledge. To address these trade-offs, we propose a gradient-based material design framework that combines these strengths, offering both efficiency and adaptability. In our method, chemical compositions are optimised to achieve target properties by using property prediction models and their gradients. In order to seamlessly enforce diverse constraints—including those reflecting domain insights such as oxidation states, discretised compositional ratios, types of elements, and their abundance, we apply masks and employ a special loss function, namely the integer loss. Furthermore, we initialise the optimisation using promising candidates from existing datasets, effectively guiding the search away from unfavourable regions and thus helping to avoid poor solutions. Our approach demonstrates a more efficient exploration of superconductor candidates, uncovering candidate materials with higher critical temperature than conventional element-substitution and generative models. Importantly, it could propose new compositions beyond those found in existing databases, including new hydride superconductors absent from the training dataset but which share compositional similarities with materials found in the literature. This synergy of domain knowledge and machine-learning-based scalability provides a robust foundation for rapid, adaptive, and comprehensive materials design for superconductors and beyond.
Exploiting physical insights—such as selection of elements based on their oxidation states and the fact that materials with too many elements are impractical—can narrow this search, making materials design more efficient. A traditional technique in materials design is elemental substitution (i.e., doping).3–6 In this approach, one starts with a promising host material and partially substitutes certain elements to tune the properties. Substituted elements are typically chosen based on physical insights—such as oxidation states—to ensure charge neutrality and other key constraints.
Machine learning (ML) has become a widely used approach for materials discovery, offering faster property predictions than conventional Density Functional Theory (DFT) calculations and thus enabling high-throughput screening. In the context of HTS development, some studies7–15 have focused on training superconducting transition temperature (Tc) prediction models using the SuperCon dataset,16 which comprises a large set of known superconductors. Some studies17–19 combine ML-based Tc prediction with experimental tests and report the discovery of novel superconducting materials.
Recently, deep generative models have gained prominence in materials design,20–23 including the quest to discover novel superconductors.24,25 These models propose new compounds by learning the statistical distribution of existing data, thus enabling the exploration of a vast chemical space. Several studies26,27 employ diffusion models28—a deep generative model widely used in the computer vision field29—to generate superconductor candidates. SuperDiff,27 a diffusion model for superconductors, generates candidate superconductors by gradually removing noise from a noisy composition. Moreover, SuperDiff can generate conditioned outputs based on reference compounds using Iterative Latent Variable Refinement (ILVR).30 While conventional generative methods only explore materials within existing databases, SuperDiff can generate new materials based on promising reference compounds.
Moreover, there are strategies that guide deep generative models toward desired properties, such as label-based conditional generation,31 Universal Guidance32 (UG), Classifier Guidance (CG)33 and Classifier-Free Guidance (CFG).34 While CG and UG use a separate property predictor to steer the generation process, CFG does not require such a predictor. Although the label-based conditional generation and CFG have both been extensively validated in image generation, their reliance on labels within the dataset limits their flexibility in materials design. By contrast, CG can be conditioned on labels not present in the target dataset using models trained on other datasets. Xie et al.20 employ a strategy similar to CG, combining a diffusion model with a formation energy prediction model. Applying CG to Tc prediction models and superconducting material generation models such as SuperDiff has the potential to enable HTS design.
A gradient-based method35–38 that uses prediction models and their gradients to optimise inputs has recently attracted attention. This method is similar to CG and UG but simpler, as it does not require training a generative model. Moreover, this method allows for more flexible and adaptive conditional optimisation.38 While there is no study of applying this technique to composition optimisation, it could be a promising approach for materials design.
Despite these advances, significant trade-offs remain. Elemental substitution can incorporate physical knowledge but may limit exploration to a relatively narrow search space. Deep generative models can explore a broader chemical space efficiently, yet they struggle to flexibly integrate physical knowledge—such as atomic valence constraints or converting compositional ratios to integers—in an adaptive manner. On the other hand, the gradient-based method has a risk of falling into poor solutions, though this method has the potential to introduce various physical knowledge in an adaptive manner.
In this paper, to address these issues, we adopt a gradient-based method and propose a straightforward materials-design method called Knowledge-Integrated Adaptive Gradient-based Optimisation (KIAGO). This framework combines the adaptive application of domain knowledge with computational efficiency to directly optimise chemical compositions (Fig. 1). KIAGO does not require training a deep generative model, making it more straightforward to implement. Specifically, we adopt two property prediction models—one for Tc and another for formation energy—to maximise Tc while enhancing stability, thereby proposing realistic materials. Unlike CFG and label-based conditional generation, KIAGO can optimise formation energy (which is not included in the SuperCon dataset) by using a separately trained formation energy prediction model. Additionally, by conducting an intensive search around promising materials, KIAGO mitigates the risk of being trapped at poor results. Moreover, by applying masks and a specialised loss function to enforce integer values, we can effectively embed physical insights (e.g., ensuring the retention of specific elements, oxidation states, the number of elements, or integer compositional ratios), thus providing a versatile framework that accommodates diverse constraints in adaptive manners.
To validate the effectiveness of KIAGO, we performed experiments to propose promising HTS. Our approach significantly outperformed both generative models (SuperDiff and SuperDiff with CG) and conventional elemental substitution techniques in proposing high-Tc candidates efficiently. In particular, we found that SuperDiff with CG tended to generate materials with lower Tc values because the Tc distribution in the original data constrained them. In contrast, our method proposed high-Tc candidates without being limited by the original distribution. Additionally, KIAGO could keep some part of the composition fixed while optimising others, relevantly replace elements according to their oxidation states, and maintain charge neutrality perfectly. Additionally, KIAGO proposed candidate compositions that shared the same elements as hydride superconductors reported in other literature despite their absence from the SuperCon dataset. These results highlight its potential for discovering novel materials.
L = −fTc( ) + αfEf( ) | (1) |
![]() | (2) |
∈ [0, 1]Nelem is a compositional vector spanning Nelem elements. Minimising L aims to increase Tc while lowering the formation energy. However, simple minimisation poses several issues: (1) it may converge to poor solutions, (2) it lacks control over the number and type of elements, and (3) it does not ensure integer ratios in the final composition.
into a fixed portion xconst and an optimisable portion
opt:![]() | (3) |
![]() | (4) |
![]() | (5) |
Because xconst remains unchanged, its specified composition remains fixed during optimisation. We further introduce a mask Melem(Melem ∈ {0, 1}Nelem) to select the allowable elements. Concretely, we set
opt = base*Melem, | (6) |
is a trainable parameter, and the asterisk (*) denotes element-wise multiplication. This mask enforces strict control over which elements can be used, thus guiding the optimisation toward compositions that meet specified domain constraints.
| ∑Mmaxelem = nmaxelem | (7) |
![]() | (8) |
The integer loss measures how far each compositional ratio of element i(
i) is from its nearest value in {cnNunit}.
{cnNunit} = {n/Nunit}n=0,1,…,Nunit i ∈ { H, He, Li, …, Nelem} | (9) |
![]() | (10) |
As an example, Fig. 2 shows the result of applying Lint4 to Ca0.23Sr0.27O0.50. We assume Nunit = 4 and guide the compositions toward the nearest values in {0.00, 0.25, 0.5, 0.75, 1.00}. Because it is difficult to fix Nunit in advance, we evaluate multiple candidates for Nunit and choose the one that yields the smallest loss. Concretely, we define Linteger as follows:
![]() | (11) |
This flexible approach selects a suitable integer grid even when the optimal cell size is unknown.
base and control which elements appear while iteratively minimising the following loss L1st. This process yields
1st*.![]() | (12) |
L1st = −fTc( ) + αfEf( ) | (13) |
![]() | (14) |
Next, we introduce the conversion to integers and a maximum-atom constraint. We use
1st* to build the mask Mmaxelem, then iteratively minimise the loss L2nd.
![]() | (15) |
L2nd = −fTc( ′) + αfEf( ′) + βLinteger,{Nunit}( ′) | (16) |
![]() | (17) |
′(
2nd*) as the final solution.
To predict the superconducting transition temperature (Tc), we employ a ResNet18 model41 trained on normalised compositions of SuperCon and Crystallography Open Database (COD).42–51 Each composition is represented by a periodic table-based feature map, which has four channels corresponding to the s, p, d, and f orbitals.12 SuperCon comprises more than 26
000 composition–Tc pairs and is widely used for Tc prediction. Although SuperCon lacks explicit structural information, it is used to propose novel superconductor candidates, some of which are later verified experimentally.17–19 We also use COD as a source of non-superconductors to regularize training and reduce false positives.
For each element, we set flags at the positions of its row and column on the periodic table, as well as at its relevant orbital channels. We then multiply these element-level feature maps by the respective compositional ratios to create the final input representation. Further details are available in Section S.1. For formation energy, we used ElemNet,52 which is originally implemented in TensorFlow 1.x53 and we re-implemented it in PyTorch. Since ElemNet only covers elements up to atomic number 86, we apply a mask to exclude elements beyond that range. Additional technical specifics are given in Section S.2.
We compared KIAGO against two baselines: a conventional elemental-substitution (C-ES) approach and SuperDiff, a diffusion-based generative model. The C-ES method randomly replaces some elements with others of identical oxidation states. For SuperDiff, we used the official implementation and trained on the same data for our Tc prediction model, but without normalising compositions. Following the official code, we conducted 1000 diffusion steps. According to the original SuperDiff, we conditioned generations of compositions on existing superconductors using Iterative Latent Variable Refinement (ILVR). We applied scale factors of 1, 2, 4, and 6 to yield a total of 4096 samples. We also incorporated Classifier Guidance (CG) using Tc prediction model and ElemNet into SuperDiff to compare it directly with KIAGO. Normally, the classifier used for CG must be trained on data with noise, which would make Universal Guidance (UG) the better choice for off-the-shelf models. However, we found that UG did not work well and CG still improved Tc using models without noise-augmented training. Therefore, we decided to use CG. Although neither model is strictly a classifier, we refer to this approach as CG for convenience. Note that this is not proposed in the original paper.27 At each inference step, we used eqn (13) with α = 4 for gradient guidance, and we tuned the guidance weight from 1 × 10−7 to 1.0, ultimately selecting 1 × 10−3. Additional details of SuperDiff are provided in Section S.3.
After generating candidate compositions, we applied a multi-step screening procedure to ensure realistic materials. First, we used SMACT54 to filter compositions with charge neutrality and electronegativity balance, following Yuan et al.27 Next, we selected only those with formation energies (predicted by ElemNet) less than zero. We also removed compositions containing ten or more elements since the preprocessed SuperCon data have at most nine. Finally, we evaluated Tc values using the same ResNet18 predictor used in KIAGO and SuperDiff with CG.
To assess thermodynamic stability of the proposed materials, we use the energy above the convex hull per atom (ΔEhull). Our method proposes compositions rather than crystal structures, so validating ΔEhull with DFT total energies is not feasible. Instead, we estimate ΔEhull from formation energies predicted by ElemNet. Concretely, we predict formation energies for compositions from the Materials Project55 and the Alexandria Materials Database,56–59 and compute ΔEhull by building phase diagrams with pymatgen.60
Fig. 3 shows the optimisation results. Our initialisation scheme yields higher Tc values than random initialisation. Although our method can still become trapped in local optima, it proposes more promising solutions than the random approach. Hence, our approach partially mitigates the inherent challenge of local minima in gradient-based methods. See Section S.4 for details on the variability across different random seeds.
![]() | ||
| Fig. 3 Comparison of optimisation results under different initialisation methods. Both approaches employ Adam optimiser40 with a learning rate of 0.001. (Left) Initialisation by adding noise to an existing superconductor (LaNiAsO). (Right) Random initialisation, in which seven elements are chosen arbitrarily and assigned random compositional values. | ||
Table 1 shows the results comparing the rule-based approach and Linteger,{Nunit}. Because it remains closer to the pre-conversion to integers composition, rounding with a larger total number of atoms is generally advantageous. Although the rule-based method fully exploits this by always rounding at the maximum atom count, Linteger,{Nunit} does not always do so, yet it still performs better.
440 samples derived from 15 different superconducting materials
| Max. num. atoms | L integer,{Nunit} (K) | Rule-based (K) |
|---|---|---|
| 15 | −3.59 | −7.65 |
| 20 | −0.81 | −4.30 |
| 25 | −0.93 | −2.23 |
| 50 | −0.43 | −1.38 |
| 100 | −0.13 | −0.31 |
Table 2 presents the differences in predicted Tc between the generated superconductors and their base materials. KIAGO achieves the most efficient exploration of higher Tc values compared to other methods. By contrast, there are experiments where SuperDiff does not yield any valid materials passing all screenings. This limitation may stem from the fact that many entries in SuperCon do not pass charge-neutrality and electronegativity checks; hence, the model struggles to generate valid compositions. The C-ES method also fails to propose sufficiently high Tc compounds, likely because its rule-based approach cannot fully explore the vast compositional space. In contrast, KIAGO proposes many materials showing substantial Tc increases. For completeness, Section S.6 includes all screening-pass rates.
| Base materials from SuperCon | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|
| Top-30 | Top-30 | Top-30 | Top-30 | |
| ΔTc (K) | ΔTc (K) | ΔTc (K) | ΔTc (K) | |
| LaNiAsO | 104.39 | −1.42 | −0.47 | 13.54 |
| SrFe1.88Ni0.12As2 | 97.91 | 26.76 | 4.53 | 21.88 |
| Sr4V2Fe2As2O6 | 97.49 | −13.77 | −13.66 | −5.45 |
| LaPt2B2C | 86.73 | −5.01 | −4.66 | 6.90 |
| HgBa2Ca2Cu3O8 | 17.07 | −29.43 | −1.18 | −12.11 |
| CeBiS2O | 92.12 | N/A | −0.31 | 2.04 |
| Bi2Sr2CuO6 | 129.63 | 18.34 | 26.76 | 43.78 |
| TlSr2CaCu2O7 | 76.32 | 5.21 | 12.42 | 18.60 |
Table 3 describes example compositions. Both KIAGO and C-ES yield integer-total compositions, allowing straightforward induction of possible crystal structures. However, SuperDiff and SuperDiff with CG frequently produce non-integer totals, making immediate structural analysis more challenging. For several candidates proposed by KIAGO, we computed the convex-hull distance ΔEhull using ElemNet. Ca5Cu4Sr5O11 (127.16 K) and MgCa4Cu4Ba3TlO10 (142.90 K) showed ΔEhull < 0.06 (eV per atom), suggesting possible thermodynamic stability.
| Method | Samples |
|---|---|
| KIAGO | Ca4Co3Sr3W3F2As8 (115.65 K) |
| Ca5Cu4Sr5O11 (127.16 K) | |
| MgCa4Cu4Ba3TlO10 (142.90 K) | |
| Cu6Sr3Pt3B5O7 (87.52 K) | |
| SD | Ni0.99LaO1.01As0.99 (3.65 K) |
| Ni0.82Ge0.25La0.97C1.59As0.44Se0.14 (1.23 K) | |
| Ca0.28Cu1.39Sr2.04Pb0.91Bi1.15O7.03 (76.03 K) | |
| Ca1.84Sc0.17Cu2.91Ba2.05HgO8.05 (127.46 K) | |
| SD w/ CG | CaCu2.02Sr1.69Y0.41Tl1.04O6.95 (87.78 K) |
| Co0.3Ni0.69La0.83Ce0.16O0.94As0.97 (4.85 K) | |
| La0.6Ce0.43Nd0.15Bi1.03O0.98S2.01 (2.54 K) | |
| Ni0.9Ge0.11La0.96C0.29O0.87As0.8 (5.96 K) | |
| C-ES | La4Bi4O4S8 (4.61 K) |
| CaCu2BaTlCO7 (86.78 K) | |
| V2Sr4YHfO6As2 (36.65 K) | |
| La2Hf2Ir2B4C2 (18.46 K) |
Interestingly, SuperDiff with CG does not necessarily generate higher-Tc compounds than SuperDiff alone. Table 4 shows how Tc changes in guidance and denoising under different weights for guidance. During generation, high guidance weights raise Tc in the guidance step but then revert it in the ILVR and denoising step. We attribute this to the Tc distribution in SuperCon, where low- or moderate-Tc compounds dominate (median: 12.5 K). As a result, extremely high Tc values are treated as noise, prompting the model to restore them to more typical levels. Furthermore, larger weight for guidance cause a stronger mismatch with the training distribution, reducing the fraction of generated compositions that contain fewer than ten elements. This interplay of denoising and guidance likely hampers SuperDiff with CG's ability to reach stable, high-Tc solutions. For the results without ILVR, please refer to Section S.3.
| Guide weight w | Denoise ΔTc (K) | ILVR ΔTc (K) | Guide ΔTc (K) | Sum (K) | Screening ratio |
|---|---|---|---|---|---|
| — | 84.8 | 37.4 | — | 122.2 | 0.13 |
| 1.0 × 10−5 | 80.2 | 43.0 | 0.0 | 123.2 | 0.09 |
| 1.0 × 10−4 | 75.9 | 46.9 | 0.4 | 123.1 | 0.10 |
| 1.0 × 10−3 | 79.2 | 39.8 | 3.2 | 122.3 | 0.08 |
| 1.0 × 10−2 | 77.1 | 13.2 | 33.8 | 124.1 | 0.11 |
| 1.0 × 10−1 | 42.5 | −197.1 | 279.3 | 124.7 | 0.12 |
| 1.0 | −63.5 | −1409.6 | 1611.0 | 137.9 | 0.00 |
| 1.0 × 101 | −101.7 | −3945.1 | 4190.5 | 143.7 | 0.00 |
| 1.0 × 102 | −42.2 | −3746.6 | 3925.2 | 136.4 | 0.00 |
We implemented this approach in KIAGO by treating the preserved composition as a fixed vector xconst. We then randomly initialise the substituting element and apply a mask based on its oxidation state. For example, when substituting Y3+, we only allow elements having a +3 oxidation state, such as gallium or aluminum. To achieve this, we used a mask that has one value on elements having a +3 oxidation state and set all others to zero. To simplify evaluation, we excluded the preserved elements from the mask. We use Adam optimiser with learning rate of 0.03 for KIAGO. Note that we did not perform conversion to integers on the substituting element, so we set β = 0. By contrast, SuperDiff does not explicitly support elemental substitution, so we approximated it by conditioning the generation process with ILVR.
In addition to the screening described in Section 3.1, we assessed whether the intended elemental substitution was correctly carried out. First, we checked whether the preserved composition remains within 1% of its original ratio. Second, we checked that the total composition of the newly substituted element (or elements) stays within 1% of the original substituted metal's ratio. To simplify evaluation, we excluded the preserved elements from the substituted element candidates. We then evaluated the Tc of compositions that pass both this substitution check and the previous screening.
Tables 5–7 summarize the probability of correct element evaluation, the charge-neutrality evaluation, and the resulting Tc values, respectively. Notably, KIAGO and C-ES achieve 100% correct substitutions (Table 5), indicating that these methods incorporate domain knowledge effectively. Consequently, as shown in Table 6, their proposed materials always satisfy charge neutrality. Moreover, KIAGO demonstrates high search efficiency, yielding the best results in most experiments (Table 7). SuperDiff, however, cannot reliably perform elemental substitution, indicating that generative models like SuperDiff are not well suited when strict domain knowledge must be enforced.
| Base materials from SuperCon | Substitute target | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|---|
| CeFeAsF0.2O0.8 | Ce3+ | 1.00 | 0.00 | 0.00 | 1.00 |
| LaFeAsO | La3+ | 1.00 | 0.01 | 0.00 | 1.00 |
| SrFe2As2 | Sr2+ | 1.00 | 0.01 | 0.00 | 1.00 |
| Bi2CaSr2Cu2O8 | Bi3+ | 1.00 | 0.23 | 0.20 | 1.00 |
| CeNiC2 | Ce4+ | 1.00 | 0.00 | 0.00 | 1.00 |
| LaNiC2 | La3+ | 1.00 | 0.01 | 0.01 | 1.00 |
| MgCoNi3 | Co2+ | 1.00 | 0.00 | 0.00 | 1.00 |
| RuSr2GdCu2O8 | Sr2+ | 1.00 | 0.03 | 0.03 | 1.00 |
| RuSr2YCu2O8 | Y3+ | 1.00 | 0.03 | 0.03 | 1.00 |
| Y2Fe3Si5 | Y3+ | 1.00 | 0.00 | 0.00 | 1.00 |
| YIrSi | Y3+ | 1.00 | 0.00 | 0.00 | 1.00 |
| Base materials from SuperCon | Substitute target | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|---|
| CeFeAsF0.2O0.8 | Ce3+ | 1.00 | 0.03 | 0.02 | 1.00 |
| LaFeAsO | La3+ | 1.00 | 0.02 | 0.02 | 1.00 |
| SrFe2As2 | Sr2+ | 1.00 | 0.03 | 0.05 | 1.00 |
| Bi2CaSr2Cu2O8 | Bi3+ | 1.00 | 0.01 | 0.01 | 1.00 |
| CeNiC2 | Ce4+ | 1.00 | 0.07 | 0.06 | 1.00 |
| LaNiC2 | La3+ | 1.00 | 0.03 | 0.04 | 1.00 |
| MgCoNi3 | Co2+ | 1.00 | 0.71 | 0.63 | 1.00 |
| RuSr2GdCu2O8 | Sr2+ | 1.00 | 0.05 | 0.06 | 1.00 |
| RuSr2YCu2O8 | Y3+ | 1.00 | 0.08 | 0.07 | 1.00 |
| Y2Fe3Si5 | Y3+ | 1.00 | 0.18 | 0.26 | 1.00 |
| YIrSi | Y3+ | 1.00 | 0.28 | 0.48 | 1.00 |
| Base materials from SuperCon | Substitute target | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|---|
| Top-30 | Top-30 | Top-30 | Top-30 | ||
| ΔTc (K) | ΔTc (K) | ΔTc (K) | ΔTc (K) | ||
| CeFeAsF0.2O0.8 | Ce3+ | 14.17 | N/A | N/A | 9.96 |
| LaFeAsO | La3+ | 33.88 | 1.24 | N/A | 31.10 |
| SrFe2As2 | Sr2+ | 31.54 | N/A | N/A | 19.25 |
| Bi2CaSr2Cu2O8 | Bi3+ | 18.72 | 0.15 | −1.11 | 6.10 |
| CeNiC2 | Ce4+ | 13.10 | N/A | −0.02 | 4.33 |
| LaNiC2 | La3+ | 10.53 | N/A | N/A | 4.90 |
| MgCoNi3 | Co2+ | 31.66 | N/A | 0.13 | 7.23 |
| RuSr2GdCu2O8 | Sr2+ | −3.38 | −0.23 | −0.16 | 5.64 |
| RuSr2YCu2O8 | Y3+ | 45.96 | −0.89 | 1.53 | 37.74 |
| Y2Fe3Si5 | Y3+ | 5.57 | N/A | N/A | 1.50 |
| YIrSi | Y3+ | 6.09 | 3.42 | N/A | 3.86 |
Next, we compare the highest-Tc compounds in the SuperCon dataset that have undergone the same elemental substitution with the compounds proposed by KIAGO. Table 8 shows that, in most element-substitution experiments, KIAGO proposes materials with higher Tc than any element-substituted materials in the SuperCon dataset. This result highlights the potential of our approach to surpass known substitution strategies and discover more promising superconductors.
For several candidates proposed by KIAGO, we computed the convex-hull distance ΔEhull using ElemNet. Element-substituted derivatives of Y2Fe3Si5 and CeFeAsF0.2O0.8—namely Pr0.3995Gd0.3859Dy0.3634Ho0.361Hf0.2428Sm0.2474Fe3Si5 (Tc = 5.3 K) and Ag0.1752Sm0.6868Tb0.138FeAsF0.2O0.8 (Tc = 51.66 K)—exhibit convex-hull distances of ΔEhull = 0.02 and 0.00 eV per atom, respectively, indicating potential thermodynamic stability.
In this section, we limit our discussion to single-element substitution. However, our method can also support multi-element substitution. For example, in Ti2O4, two Ti4+ atoms and one O2− atom contribute a total charge of +6. This can be replaced by two X3+ atoms, resulting in a composition like X2O3. Here, X denotes any element with a +3 oxidation state. Such substitutions are feasible as long as the total charge is preserved, and our oxidation-state-based masking mechanism can accommodate them.
For initialisation for KIAGO, we began with HSC from the SuperCon dataset. With a probability of 0.29, we replaced elements of its composition with different elements chosen according to their occurrence frequencies in SuperCon. We then selected random elements with random compositional ratios (from 0.0 to 0.03) for those elements, normalised the resulting composition, and used it as the initialisation. We also set {Nunit} = {1, 2, …, 15}.
In Table 9, we present the average Tc of the top five proposed hydride superconductors. Compared with other methods, KIAGO efficiently generates hydride superconductors. Table 10 shows the probability of proposing materials that meet specific criteria—namely, having at least 40% hydrogen content, three or fewer elements, and a total atom count of 15 or below. These results indicate that KIAGO not only explores the search space efficiently but also strictly adheres to the specified constraints.
| Base materials from SuperCon | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|
| Top-5 | Top-5 | Top-5 | Top-5 | |
| T c (K) | T c (K) | T c (K) | T c (K) | |
| PdH | 4.06 | 0.00 | 0.46 | 2.82 |
| PtH | 2.73 | 0.00 | 0.44 | 2.82 |
| LaH10 | 3.84 | 0.00 | 0.00 | 0.00 |
| H2S | 2.97 | 0.14 | 0.58 | 0.00 |
| H4Si | 3.11 | 0.00 | 0.00 | 0.00 |
| Base materials from SuperCon | KIAGO | SD | SD w/ CG | C-ES |
|---|---|---|---|---|
| PdH | 1.00 | 0.03 | 0.04 | 1.00 |
| PtH | 1.00 | 0.03 | 0.05 | 1.00 |
| LaH10 | 1.00 | 0.02 | 0.02 | 1.00 |
| H2S | 1.00 | 0.04 | 0.05 | 1.00 |
| H4Si | 1.00 | 0.03 | 0.02 | 1.00 |
Table 11 lists HSC proposed by KIAGO. Notably, KIAGO also proposed materials made of the same elements as those in known compounds from the SuperCon dataset. In addition, it suggested materials that are not in the SuperCon dataset but have been reported in other literature.
Improving the prediction model's accuracy must involve ensemble methods,14 better model architectures, or enhanced datasets. Importantly, our method does not depend on any specific model architecture. Given the rapid pace of machine-learning advances, more accurate models will likely become available soon, and substituting them into our framework should alleviate current limitations. Additionally, datasets are also improving at a fast rate, offering further opportunities for refinement. Incorporating crystal structure prediction (CSP)65,66 from composition may mitigate the limitation of missing structural information, while multi-modal learning7,67 at the fine-tuning stage may enable models to consider essential factors such as pressure.
| This journal is © The Royal Society of Chemistry 2025 |