Open Access Article
Panyalak Detrattanawichai
,
Zhenzhu Li,
Hyunsoo Park,
Kinga O. Mastej
and
Aron Walsh
*
Department of Materials, Imperial College London, London SW7 2AZ, UK. E-mail: a.walsh@imperial.ac.uk
First published on 16th April 2026
The discovery of materials has long been a fundamental building block for technological advancement, yet traditional trial-and-error methods are slow and costly to meet the growing demand for novel functionality, particularly in green energy technologies, energy storage, and electronics. In response to this challenge, high-throughput screening and data-driven workflows that combine computational simulations, machine learning models, and materials databases have emerged as powerful tools for accelerating materials discovery. Spinels (AB2X4) stand out as a versatile class of materials with applications ranging from energy storage to catalysis. However, their full compositional space remains largely unexplored. In this work, we present a data-driven framework to identify potentially synthesisable spinel compounds composed of the first 83 elements in the periodic table with oxygen (O2−) and three chalcogen anions (S2−, Se2−, Te2−). Over 30
000 charge-balanced and chemically plausible candidates, including inverse spinels, were sequentially filtered based on the stability, structural feasibility, and electronic properties criteria. Our workflow integrates materials databases, empirical heuristic rules, and machine learning predictions to efficiently reduce the candidate pool. As a result, 2303 novel spinel candidates were identified from this workflow, offering a diverse subset of target compounds for further investigation.
Among many classes of materials, spinels, a family of AB2X4 compounds that adopt the crystal structure of the naturally occurring mineral MgAl2O4, stand out due to their structural versatility and wide-ranging properties. In the regular spinel structure, the larger X anions form a tightly packed cubic structure, with the A- and B-site cations filling the tetrahedral and octahedral sites, respectively, arranged in compliance with the space group Fd
m.7–12 A wide range of elemental substitutions at A and B sites are possible, leading to exceptional structural flexibility and a large number of naturally occurring and synthetic spinels.13–16
Barth and Posnjak17 found that the A- and B-site cations can vary their distributions between the tetrahedral and octahedral voids, creating cation disordering. Spinels with A-site cations occupying tetrahedral sites and B-site cations occupying octahedral sites, as shown in Fig. 1A, are known as “normal” spinels (e.g. MgAl2O4). In comparison, spinels with half of the B-site cations occupying the tetrahedral sites, while the A-site cations and the other half of the B-site cations occupying the octahedral sites, as shown in Fig. 1B, are known as “inverse” spinels (e.g. SnZn2O4). These represent two extremes, and a parameter that describes the distribution of cations between A- and B-sites, which quantifies the likelihood of a spinel structure being “inverse”, is known as the cation inversion parameter (or degree of inversion) x, ranging from 0 to 1. Therefore, the formula for AB2X4 spinel can be rewritten as (A1−xBx)[AxB2−x]X4, where x = 0 refers to normal spinel and x = 1 refers to inverse spinel.13,14 This distribution of cations can significantly manipulate the properties of the structure.
Disorder in the cation sublattice can impact the chemical and physical properties of spinels. This can be observed from their applications as superconductors (e.g. LiTi2O4),18–20 catalysts (e.g. NiAl2O4),21–24 magnetic cores (e.g. Fe3O4),25 and gas sensors (e.g. ZnFe2O4).24 With such a wide range of applications, spinel has become one of the interesting classes of compounds that could potentially be utilised in green energy technologies, such as electrochemical energy storage.26–28
Most spinel studies have primarily focused on naturally occurring compositions or on optimising specific properties, leaving a large number of valuable compounds potentially undiscovered.9–12,29–32 Given the vast number of potential elemental combinations across A-, B-, and X-sites, exploring the entire compositional space of spinels through a conventional trial-and-error experimental or computational approach is impractical and unrealistic due to chemical complexity and resource demands involved. Bhattacharya and Wolverton used high-throughput first-principles calculations to explore quaternary spinels such as LiTiFeO4 for Li-battery cathodes.33 Quintero et al. synthesised a series of novel quaternary thiospinels, including Li2MgSn3S8.34 Such studies highlight the potential for realising new materials in this space.
Here, we present a data-driven workflow integrating computational simulations, machine learning models, and materials databases to efficiently explore the full compositional space of ternary spinels, focusing on compounds formed from the first 83 elements in the periodic table with four anions O2−, S2−, Se2−, and Te2−. Our workflow enables a systematic approach to narrow down the search space from over 30
000 possible compositions to 2303. These data-driven methods allow us to rapidly screen for candidates among thousands of compounds, substantially minimising the experimental and computational cost.
m space group. For ICSD, a manual search is carried out for entries with space group Fd
m and structural type similar to MgAl2O4. The retrieved entries are then filtered to retain only compositions matching the AB2X4 or A3X4 formula. To ensure consistency, compositions with non-integer stoichiometric coefficients are rounded to the nearest integers.
Combining both datasets, we identify 234 known spinels in this space. Among them, there are 124 oxides, 72 sulfides, 34 selenides, and 4 tellurides. These known compositions are highlighted in Fig. 2 using red triangle markers, where different triangle orientations indicate different anion types.
Spinels are heteropolar compounds and can be classified based on how the charges are balanced. For the general formula AB2X4, charge neutrality requires that the sum of cationic charge satisfies qA + 2qB = −4qX. For oxides and chalcogenides, where qX = −2, this leads to three common charge combinations: (qA = 2, qB = 3), (qA = 4, qB = 2), and (qA = 6, qB = 1), with known examples being MgAl2O4, TiMg2O4, and WLi2O4, respectively.
To note here, the oxidation states of elements used during the SMACT screening include every possible oxidation state that are found in ICSD data, without considering the imbalanced distribution of commonly or uncommonly seen oxidation states. In response to this, three oxidation state thresholds-low, medium, and high-to filter for chemically valid compositions are implemented. The low threshold includes all reported oxidation states that appear at least 3 times; the medium threshold excludes those that appear less than 10% of total appearance for a given element; and the high threshold retains only the most common oxidation state for each element.
The resulting chemically valid compositions are represented in Fig. 2, where different triangle orientations indicate different anion type (same as the known ones), and the intensity of the blue represents the three levels of oxidation state thresholds. Using SMACT, we obtain 16
175, 5745, and 1907 valid compositions for the low-, medium-, and high-threshold filters. “Recover” is used here to refer to the fraction of experimentally known spinels that are successfully preserved after applying the screening filter. In detail, the low threshold recovers 100% of the known AB2X4 spinels. However, the number of recovered known spinels drops by 9% at the medium threshold and by 40% at the high threshold. This highlights the importance of oxidation state selection when identifying potential candidates. To balance chemical plausibility with compositional flexibility-while ignoring rare oxidation states-the medium threshold is adopted for the remainder of our investigation.
000 structures to optimise. It is important to note that the normal and inverse structures for A3X4 spinels are the same since they only possess one type of cation. To efficiently screen the large number of candidate structures, a machine learning force-field model pre-trained with inorganic materials, the Multiple Atomic Cluster Expansion (MACE),42–44 is employed to accelerate the optimisation of crystal structures. The foundation model MACE-MP-0a is used to perform structural relaxation via the Atomic Simulation Environment (ASE) using the FIRE structural optimisation algorithm. Both atomic positions and cell are allowed to relax freely through the Frechet cell filter with a loose force convergence criterion fmax = 0.05 eV Å−1. To avoid excessive calculations on structures that fail to converge, the maximum number of optimisation steps is set to 600. In the calculations, any structure that fails to converge within this limit or possesses excessively large forces is discarded from further analysis.
To verify the accuracy of the MACE-MP-0a for the spinels family, comparative calculations on a representative set of the spinel chemical space (10 oxides, 10 sulfides, 10 selenides, and 4 tellurides) have been carried out using MACE-MP-0a and conventional Density Functional Theory (DFT)45,46 with Perdew–Burke–Ernzerhof (PBE)47 functional within Vienna Ab initio Simulation Package (VASP).48,49 The resulting parity and difference plots for lattice parameters and total energy per unit cell are provided in the SI as Fig. S1 and S2 with the computational details. The results show that while MACE-MP-0a is powerful for oxides, an increase in uncertainty of prediction is observed for the chalcogenides, especially tellurides, which likely reflects the decreasing abundance of relevant training data for these chemistries in the MACE-MP-0a model.
Beyond pure thermodynamic considerations, data-driven measures have also been developed to assess how likely it is that a new compound can be synthesised.51–56 To further refine the candidate screening, a data-driven machine learning model, namely, synthesisability-stoi-CGNF, based on Positive Unlabelled Learning (PUL) proposed by Jang et al.53 is also adopted to investigate the crystal likelihood from a different perspective. This model is trained using stoichiometric crystal data, where known compositions are labelled as 1 (positive) and unknown as 0. The output of this model is a metric, namely, crystal-likeness score (CLscore), ranging from 0 to 1, where higher values indicate an increased likelihood of compound stability. According to Jang's study, compositions with CLscore ≥ 0.5 are considered potentially stable. However, since this model only takes stoichiometry (or compositions) as input, it cannot distinguish the difference between normal and inverse spinels. Nevertheless, this method offers us a complementary perspective on the crystal likelihood of spinels based on existing materials data. The distribution of CLscore is illustrated in Fig. S4 in the SI.
![]() | (1) |
![]() | (2) |
![]() | (3) |
From the thermodynamic and data-driven evaluations, we combine the results by choosing only structures that satisfy both criteria (Ehull ≤ 0.2 eV per atom and CLscore ≥ 0.5). Each metric is first normalised using eqn (1) and (2), where α and β refer to a metastability window of Ehull and stability threshold of CLscore, respectively. For example, α = 0.2 eV per atom and β = 0.5 in Fig. 3. The normalised values are then averaged to define a new single unified metric called super score (Sscore) as shown in eqn (3). This score integrates both Ehull and CLscore to quantify the joint likelihood of each composition on a scale from 0 (unlikely) to 1 (likely). The chemical space of oxide spinels with their Sscore is depicted in Fig. 3A, with the distribution shown in Fig. S5 in the SI. This metric allows us to rank and prioritise spinel candidates. The chemical space of other types of spinel (X = S, Se, and Te) with their Sscore are shown in Fig. S6–S8 in the SI.
The recovered percentage of the experimentally known oxide spinels under the medium oxidation state threshold at different Ehull, CLscore, and Sscore can be illustrated as Fig. 4 (The same illustrations for sulfides, selenides, and tellurides are provided in the SI Fig. S9). The initial recovered percentage starts at 91.9%, reflecting that 8.1% of known compositions are excluded by the initial chemical filter (SMACT) or failed to converge during structural optimisation using the MACE force field. From this baseline, stricter screening filter naturally reduces the recovery rate further.
![]() | ||
| Fig. 4 Fraction of recovered experimentally known oxide spinels with medium oxidation state threshold for the initial chemical filter. | ||
For the thermodynamic filter (Ehull), which is represented by the red solid line, the recovery rate remains constant above Ehull = 0.4 eV per atom but decreases to 83.1% at the metastability window of Ehull = 0.2 eV per atom. This indicates that the thermodynamic filter successfully captures the majority of the known oxide spinels with 83.1% true positive rate. The data-driven filter (CLscore) represented by the blue dotted line, on the other hand, is more selective, recovering 75% of the known oxide spinels at the stability criterion of CLscore = 0.5. This shows slightly weaker performance than the thermodynamic approach, yet it still provides a reasonable prediction of the crystal likelihood based on existing data. These results support our chosen criteria for identifying crystal likelihood as they preserve a large fraction of the known oxide spinel compositions. With both filters combined into Sscore, which is represented by the purple dashed line, the recovery rate becomes 66.9% at Sscore = 0. It is important to note that this 66.9% recovery rate does not represent a failure to identify valid materials, but rather the strictness of our combined screening filters that prioritises high-confidence predictions. The excluded 33.1% largely consist of rare elements where the underlying training data for the ML model is scarce. This suggests that Sscore offers a continuous and balanced metric for evaluating crystal likelihood.
From Fig. 2 and 3A, it can be seen that a large portion of the chemical space has been filtered out during the screening process, retaining only compositions that pass through chemical filter and MACE structural optimisations, while satisfying both the thermodynamic and data-driven stability criteria. This significant reduction reflects the strictness of the combined filters and indicates that many compositions are unlikely to be synthesisable. However, it is important to note that this may not always reflect true instability since some valid but rare compositions may be excluded due to limitations in the force field or machine learning models used in the workflow. This point is supported by the fact that 66.9% of the known oxide spinels pass through the thermodynamic and data-driven filters, where some of the known oxide spinels that are chemically valid but contain rare elements, such as TcCo2O4, NiRh2O4, and MnSm2O4, are filtered out. This shows that data scarcity of compositions containing such elements can impact model predictions and influence the screening.
Moreover, it is shown in Fig. 3A that compositions with high Sscore are spatially clustered in the chemical space. This clustering suggests that compositions with similar chemical structure tend to share similar crystal likelihood profiles. These patterns can be used to guide exploration regions in the chemical space that may be particularly promising for further studies.
![]() | (4) |
One driving force for cation inversion is configurational entropy. The normal spinel corresponds to a single configuration of atoms in a unit cell, while the inverse spinel features a range of possible distributions, which can lower the free energy of the compound. The role of entropy becomes more important at higher temperatures. Wei and Zhang14 proposed a thermodynamic model based on the earlier work of Navrotsky and Kleppa63 to estimate the equilibrium cation inversion parameter (xeq) at a specific temperature using the energy difference between the normal and inverse configuration, as expressed in eqn (4), where c = eΔE/kT, ΔE = Einverse − Enormal, k is Boltzmann constant, and T is temperature. Using this model, they successfully predicted the equilibrium cation inversion parameter for several oxide spinels. We adopt their approach to cation inversion at room temperature (298.15 K) across the spinel chemical space using MACE-predicted energies for both normal and inverse structures.
The results for oxide spinels are illustrated in Fig. 3B, and the results for the rest are provided as SI. This approach cannot describe inversion of A3X4 spinels, as the current force fields don't distinguish between different species (charges) of the same element. Therefore, A3X4 spinels are excluded from Fig. 3B. Interestingly, consistent with what was observed using Sscore in Fig. 3A, the equilibrium cation inversion parameter shows clear clustering of normal and inverse configurations across the chemical space. Compositions with similar chemical features exhibit similar cation disordering behaviour.
To determine the preferred configuration, we compare the Ehull of the normal and inverse configurations and select the one with lower Ehull for each composition. In cases where both configurations possess the same Ehull, configuration with lower total energy is chosen. We further validated these preferences by mapping them onto orbital radii maps64,65 and comparing them against preferences determined by xeq, which are rounded to 0 and 1. The resulting maps as shown in Fig. S10 and S11 show identical distributions, demonstrating that the energetically more stable structure, which is indicated by the lower Ehull, aligns with the calculated xeq. Therefore, we adopt the lower Ehull configuration as the representative structure for each composition.
| Type of anions | Known | Possible | Chemical filter | Ehull ≤ 0.2 eV per atom | CLscore ≥ 0.5 | Ehull ≤ 0.2 eV per atom and CLscore ≥ 0.5 |
|---|---|---|---|---|---|---|
| Oxide | 124 | 6889 | 1679 | 885 | 1096 | 704 |
| Sulfide | 72 | 6889 | 1668 | 736 | 968 | 675 |
| Selenide | 34 | 6889 | 1529 | 671 | 1033 | 634 |
| Telluride | 4 | 6889 | 869 | 370 | 394 | 290 |
Based on Sscore, top candidates for each type of spinel are identified by ranking all compositions. This represents the raw output of the models that has not been human-filtered. Table 2 shows the top 10 spinel candidates for oxides, sulfides, selenides, and tellurides, excluding compositions that are presented on MP and ICSD, and it is noteworthy to mention that all 40 candidates favor normal configuration based on the Ehull value. Each candidate was checked by a manual literature search to confirm that they have not yet been experimentally reported.
| Type of anions | Compositions | ||||
|---|---|---|---|---|---|
| Oxide | n-ReK2O4 | n-OsK2O4 | n-ReNa2O4 | n-SAg2O4 | n-ReCs2O4 |
| n-ReRb2O4 | n-ReHg2O4 | n-CSb2O4 | n-BaNd2O4 | n-WK2O4 | |
| Sulfide | n-CTl2S4 | n-Pd3S4 | n-PbGa2S4 | n-ReRb2S4 | n-CSn2S4 |
| n-PbTl2S4 | n-OsTb2S4 | n-EuBi2S4 | n-WK2S4 | n-CoTb2S4 | |
| Selenide | n-TeCs2Se4 | n-SrB2Se4 | n-CeTl2Se4 | n-Pd3Se4 | n-EuB2Se4 |
| n-CaSb2Se4 | n-CaB2Se4 | n-CaTl2Se4 | n-Rh3Se4 | n-CsRh2Se4 | |
| Telluride | n-MgNd2Te4 | n-ZrBa2Te4 | n-MgEr2Te4 | n-MgTb2Te4 | n-MgSm2Te4 |
| n-MgGd2Te4 | n-MgLu2Te4 | n-MgY2Te4 | n-CeBa2Te4 | n-CdGd2Te4 | |
The ranking procedure produced some unexpected candidates, such as SAg2O4. This is stoichiometrically equivalent to silver sulfate (Ag2SO4), whose known ground-state structure is orthorhombic, not the predicted cubic spinel configuration. To further evaluate the dynamic stability of the surviving candidates, phonon dispersion calculations were carried out through the MatCalc66 package using the MACE-MP-0a model. The results are embedded in Table 2, where compositions shown in bold possess no imaginary phonon modes, indicating dynamic stability with respect to small atomic displacements. Full phonon dispersion plots for each candidate that possess no imaginary modes are provided in the SI, Fig. S12–S23.
Six of the oxide candidates were found to be dynamically stable. Among these, the alkali rhenates (ReNa2O4, ReK2O4, ReRb2O4) and BaNd2O4 are unusual due to their large B-site. The large B-site channels suggest potential utility in electrochemical applications, where such open frameworks often facilitate rapid ionic diffusion for battery electrolytes. Their predicted stability here indicates they are valid local minima on the potential energy surface. One of the most plausible candidates is WK2O4, a finding supported by the fact that its analogue, WNa2O4, is known to adopt a high-pressure spinel phase.67 Beyond structural feasibility, the WNa2O4 spinel possesses a wide band gap of 4.4 eV,68 making it suitable for applications in electrocatalysis or as optoelectronic materials. We therefore anticipate WK2O4 will exhibit a similarly wide band gap.
In contrast, no dynamically stable sulfide spinels were identified. This may be a false negative resulting from the under-representation of ternary chalcogenides within the model's training data relative to oxides. For the selenides, Pd3Se4 is predicted to be stable. This composition implies a mixed-valence state of [Pd2+][Pd3+]2Se4. Its primary competing phase in the Pd–Se system would be the mineral verbeekite (PdSe2), which contains Pd2+ cations charge-balanced by diselenide dimers, (Se2)2−. Finally, five telluride compositions show no imaginary modes, all of which conform to the Mg(RE)2Te4 formula (RE = Nd, Gd, Lu, Er, Tb). The incorporation of these magnetic rare-earth elements makes these compounds interesting targets for novel electronic and magnetic applications.
The dynamical stabilities of selected MACE force field predicted stable structures were further tested using DFT PBE functional on a 2 × 2 × 2 supercell. The analysis of phonon dispersion was carried out using Phonopy package69,70 comparing against MACE-predicted results, the comparisons are illustrated as Fig. S24–S26 in the SI with the computational details. The comparison demonstrates good agreement between DFT-PBE and MACE-MP-0a, confirming the reliability of the ML model. However, for the case of WK2O4, even though MACE predicted full stability, the DFT reveals a small imaginary mode at the L-point. This does not confirm that WK2O4 is unstable. These imaginary modes at zone boundaries often indicate a subtle structural distortion that may occur due to the structure falling to the local minima or a slight symmetry breaking that is energetically close to the ideal cubic phase. Such modes are often stabilised at finite temperatures, suggesting that the ideal cubic phase could still be accessible under specific growth conditions.
112 possible compositions, including inverse spinels, are narrowed down to just 2303 compositions. We introduce a unified metric combining thermodynamics (Ehull) with the statistical CLscore and are able to quantify the crystal likelihood of compositions. The screening recovers over two-thirds of the known oxide spinels using a metastability window of Ehull ≤ 0.2 eV per atom and potentially stable criteria of CLscore ≥ 0.5, while also uncovering hundreds of previously unknown oxide compositions. Furthermore, our investigation into the equilibrium cation inversion parameter reveals trends in the chemical space, where similar chemical structures possess similar cation disordering behaviour. The final top candidates for each anion type highlight a set of promising novel candidates. Among the top candidates, six promising candidates without imaginary phonon modes are found for oxides, none for sulfides, one for selenides, and five for tellurides through phonon dispersion calculations using MACE-MP-0a. Overall, this framework serves as a scalable workflow for materials exploration across vast chemical spaces, offering a foundation for further modelling and future experimental validations.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d6dd00064a.
| This journal is © The Royal Society of Chemistry 2026 |