Open Access Article
This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

Data-driven exploration of AB2X4 (X = O, S, Se, Te) spinel chemical space

Panyalak Detrattanawichai, Zhenzhu Li, Hyunsoo Park, Kinga O. Mastej and Aron Walsh*
Department of Materials, Imperial College London, London SW7 2AZ, UK. E-mail: a.walsh@imperial.ac.uk

Received 5th February 2026 , Accepted 8th April 2026

First published on 16th April 2026


Abstract

The discovery of materials has long been a fundamental building block for technological advancement, yet traditional trial-and-error methods are slow and costly to meet the growing demand for novel functionality, particularly in green energy technologies, energy storage, and electronics. In response to this challenge, high-throughput screening and data-driven workflows that combine computational simulations, machine learning models, and materials databases have emerged as powerful tools for accelerating materials discovery. Spinels (AB2X4) stand out as a versatile class of materials with applications ranging from energy storage to catalysis. However, their full compositional space remains largely unexplored. In this work, we present a data-driven framework to identify potentially synthesisable spinel compounds composed of the first 83 elements in the periodic table with oxygen (O2−) and three chalcogen anions (S2−, Se2−, Te2−). Over 30[thin space (1/6-em)]000 charge-balanced and chemically plausible candidates, including inverse spinels, were sequentially filtered based on the stability, structural feasibility, and electronic properties criteria. Our workflow integrates materials databases, empirical heuristic rules, and machine learning predictions to efficiently reduce the candidate pool. As a result, 2303 novel spinel candidates were identified from this workflow, offering a diverse subset of target compounds for further investigation.


1 Introduction

There is a transition toward clean energy solutions to tackle global challenges such as resource scarcity and climate change, both of which demand urgent and innovative solutions. As a result, the need for novel materials for green energy technology is more crucial than ever.1–6 The discovery of materials with the necessary functionality will not only drive technological innovation, but can also support global sustainability goals.

Among many classes of materials, spinels, a family of AB2X4 compounds that adopt the crystal structure of the naturally occurring mineral MgAl2O4, stand out due to their structural versatility and wide-ranging properties. In the regular spinel structure, the larger X anions form a tightly packed cubic structure, with the A- and B-site cations filling the tetrahedral and octahedral sites, respectively, arranged in compliance with the space group Fd[3 with combining macron]m.7–12 A wide range of elemental substitutions at A and B sites are possible, leading to exceptional structural flexibility and a large number of naturally occurring and synthetic spinels.13–16

Barth and Posnjak17 found that the A- and B-site cations can vary their distributions between the tetrahedral and octahedral voids, creating cation disordering. Spinels with A-site cations occupying tetrahedral sites and B-site cations occupying octahedral sites, as shown in Fig. 1A, are known as “normal” spinels (e.g. MgAl2O4). In comparison, spinels with half of the B-site cations occupying the tetrahedral sites, while the A-site cations and the other half of the B-site cations occupying the octahedral sites, as shown in Fig. 1B, are known as “inverse” spinels (e.g. SnZn2O4). These represent two extremes, and a parameter that describes the distribution of cations between A- and B-sites, which quantifies the likelihood of a spinel structure being “inverse”, is known as the cation inversion parameter (or degree of inversion) x, ranging from 0 to 1. Therefore, the formula for AB2X4 spinel can be rewritten as (A1−xBx)[AxB2−x]X4, where x = 0 refers to normal spinel and x = 1 refers to inverse spinel.13,14 This distribution of cations can significantly manipulate the properties of the structure.


image file: d6dd00064a-f1.tif
Fig. 1 (A) Normal spinel structure of MgAl2O4, where the tetrahedral and octahedral sites are occupied by Mg2+ and Al3+, respectively. (B) Inverse spinel structure of MgAl2O4 with cation inversion parameter x = 1, where the tetrahedral sites are occupied by half of the Al3+, while the octahedral sites are occupied by Mg2+ and the remaining half of the Al3+.

Disorder in the cation sublattice can impact the chemical and physical properties of spinels. This can be observed from their applications as superconductors (e.g. LiTi2O4),18–20 catalysts (e.g. NiAl2O4),21–24 magnetic cores (e.g. Fe3O4),25 and gas sensors (e.g. ZnFe2O4).24 With such a wide range of applications, spinel has become one of the interesting classes of compounds that could potentially be utilised in green energy technologies, such as electrochemical energy storage.26–28

Most spinel studies have primarily focused on naturally occurring compositions or on optimising specific properties, leaving a large number of valuable compounds potentially undiscovered.9–12,29–32 Given the vast number of potential elemental combinations across A-, B-, and X-sites, exploring the entire compositional space of spinels through a conventional trial-and-error experimental or computational approach is impractical and unrealistic due to chemical complexity and resource demands involved. Bhattacharya and Wolverton used high-throughput first-principles calculations to explore quaternary spinels such as LiTiFeO4 for Li-battery cathodes.33 Quintero et al. synthesised a series of novel quaternary thiospinels, including Li2MgSn3S8.34 Such studies highlight the potential for realising new materials in this space.

Here, we present a data-driven workflow integrating computational simulations, machine learning models, and materials databases to efficiently explore the full compositional space of ternary spinels, focusing on compounds formed from the first 83 elements in the periodic table with four anions O2−, S2−, Se2−, and Te2−. Our workflow enables a systematic approach to narrow down the search space from over 30[thin space (1/6-em)]000 possible compositions to 2303. These data-driven methods allow us to rapidly screen for candidates among thousands of compounds, substantially minimising the experimental and computational cost.

2 Results and discussion

2.1 Data mining

We begin by collecting experimentally reported AB2X4 spinel materials from the Materials Project (MP)35 and the Inorganic Crystal Structure Database (ICSD)36,37 to examine how known spinels are distributed over the chemical space. For the MP, the MP API is employed to retrieve entries with the formulas AB2X4 and A3X4 in the Fd[3 with combining macron]m space group. For ICSD, a manual search is carried out for entries with space group Fd[3 with combining macron]m and structural type similar to MgAl2O4. The retrieved entries are then filtered to retain only compositions matching the AB2X4 or A3X4 formula. To ensure consistency, compositions with non-integer stoichiometric coefficients are rounded to the nearest integers.

Combining both datasets, we identify 234 known spinels in this space. Among them, there are 124 oxides, 72 sulfides, 34 selenides, and 4 tellurides. These known compositions are highlighted in Fig. 2 using red triangle markers, where different triangle orientations indicate different anion types.


image file: d6dd00064a-f2.tif
Fig. 2 Space of chemically valid AB2X4 spinels, generated using SMACT with three oxidation state thresholds. The intensity of blue indicates the level of oxidation state threshold. The triangle orientations represent different anion types. The red triangle markers denote the experimentally known spinel compositions. The x- and y-axes correspond to A-site and B-site cations, respectively, in the order of periodic table group (from group IA to the lanthanides). The diagonal line denote the A3X4 spinels where the same cation occupies both A and B sites.

2.2 Generating spinel compositions

We start by applying a series of chemical rules-including charge neutrality and electronegativity balance-to efficiently screen for chemically plausible compositions composed of the first 83 elements, excluding noble gases, in the periodic table with oxygen and three chalcogen anions. These rules are implemented in the Semiconducting Materials by Analogy and Chemical Theory (SMACT)38,39 package.

Spinels are heteropolar compounds and can be classified based on how the charges are balanced. For the general formula AB2X4, charge neutrality requires that the sum of cationic charge satisfies qA + 2qB = −4qX. For oxides and chalcogenides, where qX = −2, this leads to three common charge combinations: (qA = 2, qB = 3), (qA = 4, qB = 2), and (qA = 6, qB = 1), with known examples being MgAl2O4, TiMg2O4, and WLi2O4, respectively.

To note here, the oxidation states of elements used during the SMACT screening include every possible oxidation state that are found in ICSD data, without considering the imbalanced distribution of commonly or uncommonly seen oxidation states. In response to this, three oxidation state thresholds-low, medium, and high-to filter for chemically valid compositions are implemented. The low threshold includes all reported oxidation states that appear at least 3 times; the medium threshold excludes those that appear less than 10% of total appearance for a given element; and the high threshold retains only the most common oxidation state for each element.

The resulting chemically valid compositions are represented in Fig. 2, where different triangle orientations indicate different anion type (same as the known ones), and the intensity of the blue represents the three levels of oxidation state thresholds. Using SMACT, we obtain 16[thin space (1/6-em)]175, 5745, and 1907 valid compositions for the low-, medium-, and high-threshold filters. “Recover” is used here to refer to the fraction of experimentally known spinels that are successfully preserved after applying the screening filter. In detail, the low threshold recovers 100% of the known AB2X4 spinels. However, the number of recovered known spinels drops by 9% at the medium threshold and by 40% at the high threshold. This highlights the importance of oxidation state selection when identifying potential candidates. To balance chemical plausibility with compositional flexibility-while ignoring rare oxidation states-the medium threshold is adopted for the remainder of our investigation.

2.3 Structural optimisation

To proceed with further calculations, we first generate two prototype structures with lattice parameter a = 8 Å: one representing the normal spinel configuration, resembling that of MgAl2O4, and another representing the inverse configuration, generated using the Special Quasirandom Structure (SQS)40,41 approach based on the normal prototype. These prototype structures serve as initial structures for all compositions generated using SMACT. Therefore, we have more than 10[thin space (1/6-em)]000 structures to optimise. It is important to note that the normal and inverse structures for A3X4 spinels are the same since they only possess one type of cation. To efficiently screen the large number of candidate structures, a machine learning force-field model pre-trained with inorganic materials, the Multiple Atomic Cluster Expansion (MACE),42–44 is employed to accelerate the optimisation of crystal structures. The foundation model MACE-MP-0a is used to perform structural relaxation via the Atomic Simulation Environment (ASE) using the FIRE structural optimisation algorithm. Both atomic positions and cell are allowed to relax freely through the Frechet cell filter with a loose force convergence criterion fmax = 0.05 eV Å−1. To avoid excessive calculations on structures that fail to converge, the maximum number of optimisation steps is set to 600. In the calculations, any structure that fails to converge within this limit or possesses excessively large forces is discarded from further analysis.

To verify the accuracy of the MACE-MP-0a for the spinels family, comparative calculations on a representative set of the spinel chemical space (10 oxides, 10 sulfides, 10 selenides, and 4 tellurides) have been carried out using MACE-MP-0a and conventional Density Functional Theory (DFT)45,46 with Perdew–Burke–Ernzerhof (PBE)47 functional within Vienna Ab initio Simulation Package (VASP).48,49 The resulting parity and difference plots for lattice parameters and total energy per unit cell are provided in the SI as Fig. S1 and S2 with the computational details. The results show that while MACE-MP-0a is powerful for oxides, an increase in uncertainty of prediction is observed for the chalcogenides, especially tellurides, which likely reflects the decreasing abundance of relevant training data for these chemistries in the MACE-MP-0a model.

2.4 Crystal likelihood

After obtaining the optimised structures and their total energies, we calculate the energy above the convex hull (Ehull) for each structure based on the MACE-predicted energy using functions from Python Materials Genomics (Pymatgen).50 Notably, Pymatgen phase diagram module could not calculate Ehull of Yb due to data deprecated on MP. Therefore, compositions with Yb are excluded from the screening. For compositions where both the normal and inverse configurations successfully converged, the structures with the lower Ehull are selected for further analysis. This method allows for evaluating the crystal likelihood using thermodynamic approach for each structure and visualise the distribution of Ehull across different anion types, as shown in Fig. S3 in the SI. While Ehull = 0 corresponds to the prediction of a stable compound on the convex hull from the MP database, a metastability window exists where materials have been experimentally realised. In this work, we define a metastability window of Ehull ≤ 0.2 eV per atom.

Beyond pure thermodynamic considerations, data-driven measures have also been developed to assess how likely it is that a new compound can be synthesised.51–56 To further refine the candidate screening, a data-driven machine learning model, namely, synthesisability-stoi-CGNF, based on Positive Unlabelled Learning (PUL) proposed by Jang et al.53 is also adopted to investigate the crystal likelihood from a different perspective. This model is trained using stoichiometric crystal data, where known compositions are labelled as 1 (positive) and unknown as 0. The output of this model is a metric, namely, crystal-likeness score (CLscore), ranging from 0 to 1, where higher values indicate an increased likelihood of compound stability. According to Jang's study, compositions with CLscore ≥ 0.5 are considered potentially stable. However, since this model only takes stoichiometry (or compositions) as input, it cannot distinguish the difference between normal and inverse spinels. Nevertheless, this method offers us a complementary perspective on the crystal likelihood of spinels based on existing materials data. The distribution of CLscore is illustrated in Fig. S4 in the SI.

 
image file: d6dd00064a-t1.tif(1)
 
image file: d6dd00064a-t2.tif(2)
 
image file: d6dd00064a-t3.tif(3)

From the thermodynamic and data-driven evaluations, we combine the results by choosing only structures that satisfy both criteria (Ehull ≤ 0.2 eV per atom and CLscore ≥ 0.5). Each metric is first normalised using eqn (1) and (2), where α and β refer to a metastability window of Ehull and stability threshold of CLscore, respectively. For example, α = 0.2 eV per atom and β = 0.5 in Fig. 3. The normalised values are then averaged to define a new single unified metric called super score (Sscore) as shown in eqn (3). This score integrates both Ehull and CLscore to quantify the joint likelihood of each composition on a scale from 0 (unlikely) to 1 (likely). The chemical space of oxide spinels with their Sscore is depicted in Fig. 3A, with the distribution shown in Fig. S5 in the SI. This metric allows us to rank and prioritise spinel candidates. The chemical space of other types of spinel (X = S, Se, and Te) with their Sscore are shown in Fig. S6–S8 in the SI.


image file: d6dd00064a-f3.tif
Fig. 3 Chemical space of oxide spinels that satisfy both Ehull ≤ 0.2 eV per atom and CLscore ≥ 0.5. (A) The blue gradient indicates Sscore, where lighter and darker shades designate lower and higher crystal likelihood, respectively. (B) The colourmap represents the equilibrium cation inversion parameter (xeq), where lighter orange designates the normal configuration, and darker purple represents the inverse configuration. The red square markers denote the experimentally known spinel compositions.

The recovered percentage of the experimentally known oxide spinels under the medium oxidation state threshold at different Ehull, CLscore, and Sscore can be illustrated as Fig. 4 (The same illustrations for sulfides, selenides, and tellurides are provided in the SI Fig. S9). The initial recovered percentage starts at 91.9%, reflecting that 8.1% of known compositions are excluded by the initial chemical filter (SMACT) or failed to converge during structural optimisation using the MACE force field. From this baseline, stricter screening filter naturally reduces the recovery rate further.


image file: d6dd00064a-f4.tif
Fig. 4 Fraction of recovered experimentally known oxide spinels with medium oxidation state threshold for the initial chemical filter.

For the thermodynamic filter (Ehull), which is represented by the red solid line, the recovery rate remains constant above Ehull = 0.4 eV per atom but decreases to 83.1% at the metastability window of Ehull = 0.2 eV per atom. This indicates that the thermodynamic filter successfully captures the majority of the known oxide spinels with 83.1% true positive rate. The data-driven filter (CLscore) represented by the blue dotted line, on the other hand, is more selective, recovering 75% of the known oxide spinels at the stability criterion of CLscore = 0.5. This shows slightly weaker performance than the thermodynamic approach, yet it still provides a reasonable prediction of the crystal likelihood based on existing data. These results support our chosen criteria for identifying crystal likelihood as they preserve a large fraction of the known oxide spinel compositions. With both filters combined into Sscore, which is represented by the purple dashed line, the recovery rate becomes 66.9% at Sscore = 0. It is important to note that this 66.9% recovery rate does not represent a failure to identify valid materials, but rather the strictness of our combined screening filters that prioritises high-confidence predictions. The excluded 33.1% largely consist of rare elements where the underlying training data for the ML model is scarce. This suggests that Sscore offers a continuous and balanced metric for evaluating crystal likelihood.

From Fig. 2 and 3A, it can be seen that a large portion of the chemical space has been filtered out during the screening process, retaining only compositions that pass through chemical filter and MACE structural optimisations, while satisfying both the thermodynamic and data-driven stability criteria. This significant reduction reflects the strictness of the combined filters and indicates that many compositions are unlikely to be synthesisable. However, it is important to note that this may not always reflect true instability since some valid but rare compositions may be excluded due to limitations in the force field or machine learning models used in the workflow. This point is supported by the fact that 66.9% of the known oxide spinels pass through the thermodynamic and data-driven filters, where some of the known oxide spinels that are chemically valid but contain rare elements, such as TcCo2O4, NiRh2O4, and MnSm2O4, are filtered out. This shows that data scarcity of compositions containing such elements can impact model predictions and influence the screening.

Moreover, it is shown in Fig. 3A that compositions with high Sscore are spatially clustered in the chemical space. This clustering suggests that compositions with similar chemical structure tend to share similar crystal likelihood profiles. These patterns can be used to guide exploration regions in the chemical space that may be particularly promising for further studies.

2.5 Cation inversion

We have not yet distinguished between normal and inverse spinels. MgAl2O4 is a normal spinel if naturally grown (x ≈ 0). However, when exposed to a high temperature (ca. 900 °C), the cations exhibit disordering with x ≈ 0.3.57 Furthermore, it is found that synthetic MgAl2O4 usually exhibit cation disordering with x ∼ 0.1 to 0.6.13 The disorder has been analysed and modelled using methods such as X-ray diffraction,58 neutron diffraction,9,59 Mössbauer spectroscopy,9,60 classical electrostatics,61 and ligand field theory.62
 
image file: d6dd00064a-t4.tif(4)

One driving force for cation inversion is configurational entropy. The normal spinel corresponds to a single configuration of atoms in a unit cell, while the inverse spinel features a range of possible distributions, which can lower the free energy of the compound. The role of entropy becomes more important at higher temperatures. Wei and Zhang14 proposed a thermodynamic model based on the earlier work of Navrotsky and Kleppa63 to estimate the equilibrium cation inversion parameter (xeq) at a specific temperature using the energy difference between the normal and inverse configuration, as expressed in eqn (4), where c = eΔE/kT, ΔE = EinverseEnormal, k is Boltzmann constant, and T is temperature. Using this model, they successfully predicted the equilibrium cation inversion parameter for several oxide spinels. We adopt their approach to cation inversion at room temperature (298.15 K) across the spinel chemical space using MACE-predicted energies for both normal and inverse structures.

The results for oxide spinels are illustrated in Fig. 3B, and the results for the rest are provided as SI. This approach cannot describe inversion of A3X4 spinels, as the current force fields don't distinguish between different species (charges) of the same element. Therefore, A3X4 spinels are excluded from Fig. 3B. Interestingly, consistent with what was observed using Sscore in Fig. 3A, the equilibrium cation inversion parameter shows clear clustering of normal and inverse configurations across the chemical space. Compositions with similar chemical features exhibit similar cation disordering behaviour.

To determine the preferred configuration, we compare the Ehull of the normal and inverse configurations and select the one with lower Ehull for each composition. In cases where both configurations possess the same Ehull, configuration with lower total energy is chosen. We further validated these preferences by mapping them onto orbital radii maps64,65 and comparing them against preferences determined by xeq, which are rounded to 0 and 1. The resulting maps as shown in Fig. S10 and S11 show identical distributions, demonstrating that the energetically more stable structure, which is indicated by the lower Ehull, aligns with the calculated xeq. Therefore, we adopt the lower Ehull configuration as the representative structure for each composition.

2.6 Candidate spinels

The number of candidate compositions remaining after each step of the screening process are summarised in Table 1. The pool of possible compositions for each anion is 832 = 6889. By applying the chemical filter using SMACT with the medium oxidation state threshold, the number is reduced by over 75% for oxide, sulfide, and selenide spinels, and even more by 87.3% for telluride spinel. This shows the critical role of basic chemical rules in materials screening. By applying thermodynamic filter (Ehull ≤ 0.2 eV per atom), the pool is further reduced by an average of 11.15% of the possible compositions. On the other hand, the data-driven filter (CLscore ≥ 0.5) is slightly less selective by narrowing down an average of 8.2%. However, with the two filters combined, the pool is effectively reduced to an average of 8.4% of the possible compositions. This highlights the utility of unified metrics that combine thermodynamic and data-driven statistical information.
Table 1 Number of candidate AB2X4 compositions after each step of screening starting from a pool of 83 elements for the A and B sites
Type of anions Known Possible Chemical filter Ehull ≤ 0.2 eV per atom CLscore ≥ 0.5 Ehull ≤ 0.2 eV per atom and CLscore ≥ 0.5
Oxide 124 6889 1679 885 1096 704
Sulfide 72 6889 1668 736 968 675
Selenide 34 6889 1529 671 1033 634
Telluride 4 6889 869 370 394 290


Based on Sscore, top candidates for each type of spinel are identified by ranking all compositions. This represents the raw output of the models that has not been human-filtered. Table 2 shows the top 10 spinel candidates for oxides, sulfides, selenides, and tellurides, excluding compositions that are presented on MP and ICSD, and it is noteworthy to mention that all 40 candidates favor normal configuration based on the Ehull value. Each candidate was checked by a manual literature search to confirm that they have not yet been experimentally reported.

Table 2 Top 10 hypothetical spinel candidates from the combined thermodynamic and statistical Sscore metric. Compositions shown in bold were found to possess no imaginary mode in their phonon dispersions, indicating they are stable local minima in their respective potential energy surfaces. n- and i-AB2X4 refer to normal and inverse configurations, respectively
Type of anions Compositions
Oxide n-ReK2O4 n-OsK2O4 n-ReNa2O4 n-SAg2O4 n-ReCs2O4
n-ReRb2O4 n-ReHg2O4 n-CSb2O4 n-BaNd2O4 n-WK2O4
Sulfide n-CTl2S4 n-Pd3S4 n-PbGa2S4 n-ReRb2S4 n-CSn2S4
n-PbTl2S4 n-OsTb2S4 n-EuBi2S4 n-WK2S4 n-CoTb2S4
Selenide n-TeCs2Se4 n-SrB2Se4 n-CeTl2Se4 n-Pd3Se4 n-EuB2Se4
n-CaSb2Se4 n-CaB2Se4 n-CaTl2Se4 n-Rh3Se4 n-CsRh2Se4
Telluride n-MgNd2Te4 n-ZrBa2Te4 n-MgEr2Te4 n-MgTb2Te4 n-MgSm2Te4
n-MgGd2Te4 n-MgLu2Te4 n-MgY2Te4 n-CeBa2Te4 n-CdGd2Te4


The ranking procedure produced some unexpected candidates, such as SAg2O4. This is stoichiometrically equivalent to silver sulfate (Ag2SO4), whose known ground-state structure is orthorhombic, not the predicted cubic spinel configuration. To further evaluate the dynamic stability of the surviving candidates, phonon dispersion calculations were carried out through the MatCalc66 package using the MACE-MP-0a model. The results are embedded in Table 2, where compositions shown in bold possess no imaginary phonon modes, indicating dynamic stability with respect to small atomic displacements. Full phonon dispersion plots for each candidate that possess no imaginary modes are provided in the SI, Fig. S12–S23.

Six of the oxide candidates were found to be dynamically stable. Among these, the alkali rhenates (ReNa2O4, ReK2O4, ReRb2O4) and BaNd2O4 are unusual due to their large B-site. The large B-site channels suggest potential utility in electrochemical applications, where such open frameworks often facilitate rapid ionic diffusion for battery electrolytes. Their predicted stability here indicates they are valid local minima on the potential energy surface. One of the most plausible candidates is WK2O4, a finding supported by the fact that its analogue, WNa2O4, is known to adopt a high-pressure spinel phase.67 Beyond structural feasibility, the WNa2O4 spinel possesses a wide band gap of 4.4 eV,68 making it suitable for applications in electrocatalysis or as optoelectronic materials. We therefore anticipate WK2O4 will exhibit a similarly wide band gap.

In contrast, no dynamically stable sulfide spinels were identified. This may be a false negative resulting from the under-representation of ternary chalcogenides within the model's training data relative to oxides. For the selenides, Pd3Se4 is predicted to be stable. This composition implies a mixed-valence state of [Pd2+][Pd3+]2Se4. Its primary competing phase in the Pd–Se system would be the mineral verbeekite (PdSe2), which contains Pd2+ cations charge-balanced by diselenide dimers, (Se2)2−. Finally, five telluride compositions show no imaginary modes, all of which conform to the Mg(RE)2Te4 formula (RE = Nd, Gd, Lu, Er, Tb). The incorporation of these magnetic rare-earth elements makes these compounds interesting targets for novel electronic and magnetic applications.

The dynamical stabilities of selected MACE force field predicted stable structures were further tested using DFT PBE functional on a 2 × 2 × 2 supercell. The analysis of phonon dispersion was carried out using Phonopy package69,70 comparing against MACE-predicted results, the comparisons are illustrated as Fig. S24–S26 in the SI with the computational details. The comparison demonstrates good agreement between DFT-PBE and MACE-MP-0a, confirming the reliability of the ML model. However, for the case of WK2O4, even though MACE predicted full stability, the DFT reveals a small imaginary mode at the L-point. This does not confirm that WK2O4 is unstable. These imaginary modes at zone boundaries often indicate a subtle structural distortion that may occur due to the structure falling to the local minima or a slight symmetry breaking that is energetically close to the ideal cubic phase. Such modes are often stabilised at finite temperatures, suggesting that the ideal cubic phase could still be accessible under specific growth conditions.

3 Conclusion

We have showcased a data-driven workflow to identify potentially synthesisable AB2X4 spinel oxides and chalcogenides across the vast chemical space composed of the first 83 elements. By integrating chemical plausibility checks, efficient structural optimisation, and stability assessments through both thermodynamic filtering and data-driven approaches, 55[thin space (1/6-em)]112 possible compositions, including inverse spinels, are narrowed down to just 2303 compositions. We introduce a unified metric combining thermodynamics (Ehull) with the statistical CLscore and are able to quantify the crystal likelihood of compositions. The screening recovers over two-thirds of the known oxide spinels using a metastability window of Ehull ≤ 0.2 eV per atom and potentially stable criteria of CLscore ≥ 0.5, while also uncovering hundreds of previously unknown oxide compositions. Furthermore, our investigation into the equilibrium cation inversion parameter reveals trends in the chemical space, where similar chemical structures possess similar cation disordering behaviour. The final top candidates for each anion type highlight a set of promising novel candidates. Among the top candidates, six promising candidates without imaginary phonon modes are found for oxides, none for sulfides, one for selenides, and five for tellurides through phonon dispersion calculations using MACE-MP-0a. Overall, this framework serves as a scalable workflow for materials exploration across vast chemical spaces, offering a foundation for further modelling and future experimental validations.

Author contributions

The author contributions have been defined following the CRediT system. P. D.: conceptualisation, investigation, formal analysis, methodology, visualisation, writing – original draft. Z. L.: methodology, supervision, writing – review & editing. H. P.: methodology, supervision, writing – review & editing. K. O. M.: methodology, software, writing – review & editing. A. W.: conceptualisation, methodology, supervision, writing – review & editing.

Conflicts of interest

There are no conflicts to declare.

Data availability

The workflows employ a range of open-source software packages that are cited in the main text. The original workflows and data will be made available in a GitHub repository on https://github.com/WMD-group/Spinel-chemical-space.

Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d6dd00064a.

Acknowledgements

We thank Anthony Onwuli for useful discussions and guidance on SMACT. This work was supported by the Development and Promotion of Science and Technology Talents Project (Thai Government Scholarship). Via our membership of the UK's HEC Materials Chemistry Consortium, which is funded by EPSRC (EP/X035859/1), this work used the ARCHER2 UK National Supercomputing Service (https://www.archer2.ac.uk). We are grateful to the UK Materials and Molecular Modelling Hub for computational resources, which is partially funded by EPSRC (EP/T022213/1, EP/W032260/1 and EP/P020194/1).

Notes and references

  1. A. R. Oganov, C. J. Pickard, Q. Zhu and R. J. Needs, Nat. Rev. Mater., 2019, 4, 331–348 CrossRef.
  2. J. N. Cawse, Experimental design for combinatorial and high throughput materials development, Citeseer, 2003 Search PubMed.
  3. A. Valero, A. Valero, G. Calvo and A. Ortego, Renew. Sustain. Energy Rev., 2018, 93, 178–200 CrossRef.
  4. Y. Liu, T. Zhao, W. Ju and S. Shi, J. Materiomics, 2017, 3, 159–177 CrossRef.
  5. J. E. Saal, A. O. Oliynyk and B. Meredig, Annu. Rev. Mater. Res., 2020, 50, 49–69 CrossRef CAS.
  6. P. Raccuglia, K. C. Elbert, P. D. Adler, C. Falk, M. B. Wenny, A. Mollo, M. Zeller, S. A. Friedler, J. Schrier and A. J. Norquist, Nature, 2016, 533, 73–76 CrossRef CAS PubMed.
  7. W. H. Bragg, London, Edinburgh Dublin Philos. Mag. J. Sci., 1915, 30, 305–315 CrossRef CAS.
  8. S. Nishikawa, Proc. Tokyo Math.-Phys. Soc., 1915, 8, 199–209 CAS.
  9. K. R. Sanchez-Lievanos, J. L. Stair and K. E. Knowles, Inorg. Chem., 2021, 60, 4291–4305 CrossRef CAS PubMed.
  10. B. Xu and S. Meng, J. Power Sources, 2010, 195, 4971–4976 CrossRef CAS.
  11. Y. Zhou, S. Sun, C. Wei, Y. Sun, P. Xi, Z. Feng and Z. J. Xu, Adv. Mater., 2019, 31, 1902509 CrossRef CAS PubMed.
  12. M. Nakayama, H. Taki, T. Nakamura, S. Tokuda, R. Jalem and T. Kasuga, J. Phys. Chem. C, 2014, 118, 27245–27251 CrossRef CAS.
  13. K. E. Sickafus and R. Hughes, J. Am. Ceram. Soc., 1999, 82, 3277–3278 CrossRef.
  14. S.-H. Wei and S. B. Zhang, Phys. Rev. B:Condens. Matter Mater. Phys., 2001, 63, 045112 CrossRef.
  15. R. J. Hill, J. R. Craig and G. Gibbs, Phys. Chem. Miner., 1979, 4, 317–339 CrossRef CAS.
  16. C. Biagioni and M. Pasero, Am. Mineral., 2014, 99, 1254–1264 CrossRef.
  17. T. F. Barth and E. Posnjak, Z. Kristallogr. - Cryst. Mater., 1932, 82, 325–341 CrossRef CAS.
  18. D. Johnston, H. Prakash, W. Zachariasen and R. Viswanathan, Mater. Res. Bull., 1973, 8, 777–784 CrossRef CAS.
  19. D. Johnston, J. Low Temp. Phys., 1976, 25, 145–175 CrossRef CAS.
  20. R. McCallum, D. Johnston, C. Luengo and M. Maple, J. Low Temp. Phys., 1976, 25, 177–193 CrossRef CAS.
  21. N. Grimes, Phys. Technol., 1975, 6, 22 CrossRef CAS.
  22. H. Qin, Y. He, P. Xu, D. Huang, Z. Wang, H. Wang, Z. Wang, Y. Zhao, Q. Tian and C. Wang, Adv. Colloid Interface Sci., 2021, 294, 102486 CrossRef CAS PubMed.
  23. J. H. Advani, G. S. More and R. Srivastava, Green Chem., 2022, 24, 3574–3604 RSC.
  24. S. J. Salih and W. M. Mahmood, Heliyon, 2023, 9, year Search PubMed.
  25. M. Kurian and S. Thankachan, Open Ceram., 2021, 8, 100179 CrossRef CAS.
  26. D. Liu, W. Zhu, J. Trottier, C. Gagnon, F. Barray, A. Guerfi, A. Mauger, H. Groult, C. Julien and J. Goodenough, et al., RSC Adv., 2014, 4, 154–167 RSC.
  27. J. Bhattacharya and C. Wolverton, Phys. Chem. Chem. Phys., 2013, 15, 6486–6498 RSC.
  28. T. Nissinen, T. Valo, M. Gasik, J. Rantanen and M. Lampinen, J. Power Sources, 2002, 106, 109–115 CrossRef CAS.
  29. I. Ganesh, Int. Mater. Rev., 2013, 58, 63–112 CrossRef CAS.
  30. M. Liu, Z. Rong, R. Malik, P. Canepa, A. Jain, G. Ceder and K. A. Persson, Energy Environ. Sci., 2015, 8, 964–974 RSC.
  31. V. Kocevski, G. Pilania and B. P. Uberuaga, J. Mater. Chem. A, 2020, 8, 25756–25767 RSC.
  32. F. Gao, X. Tang, H. Yi, S. Zhao, W. Zhu and Y. Shi, J. Environ. Sci., 2020, 89, 145–155 CrossRef CAS PubMed.
  33. J. Bhattacharya and C. Wolverton, J. Electrochem. Soc., 2014, 161, A1440 CrossRef CAS.
  34. M. A. Quintero, S. Hao, S. V. Patel, J.-K. Bao, X. Zhou, Y.-Y. Hu, C. Wolverton and M. G. Kanatzidis, Chem. Mater., 2021, 33, 2080–2089 CrossRef CAS.
  35. A. Jain, S. P. Ong, G. Hautier, W. Chen, W. D. Richards, S. Dacek, S. Cholia, D. Gunter, D. Skinner, G. Ceder and K. A. Persson, APL Mater., 2013, 1, 011002 CrossRef.
  36. G. Bergerhoff, I. Brown, F. Allen, et al., International Union of Crystallography, Chester, 1987, vol. 360, pp. 77–95 Search PubMed.
  37. D. Zagorac, H. Müller, S. Ruehl, J. Zagorac and S. Rehme, J. Appl. Crystallogr., 2019, 52, 918–925 CrossRef CAS PubMed.
  38. D. W. Davies, K. T. Butler, A. J. Jackson, A. Morris, J. M. Frost, J. M. Skelton and A. Walsh, Chem, 2016, 1, 617–627 CAS.
  39. D. W. Davies, K. T. Butler, A. J. Jackson, J. M. Skelton, K. Morita and A. Walsh, J. Open Source Softw., 2019, 4, 1361 CrossRef.
  40. A. Zunger, S.-H. Wei, L. G. Ferreira and J. E. Bernard, Phys. Rev. Lett., 1990, 65, 353 CrossRef CAS PubMed.
  41. A. van de Walle, P. Tiwary, M. de Jong, D. L. Olmsted, M. Asta, A. Dick, D. Shin, Y. Wang, L.-Q. Chen and Z.-K. Liu, Calphad, 2013, 42, 13–18 CrossRef CAS.
  42. I. Batatia, D. P. Kovacs, G. N. C. Simm, C. Ortner and G. Csanyi, Adv. Neural Inf. Process. Syst., 2022, 11423–11436 Search PubMed.
  43. I. Batatia, P. Benner, Y. Chiang, A. M. Elena, D. P. Kovács, J. Riebesell, X. R. Advincula, M. Asta, M. Avaylon, W. J. Baldwin, F. Berger, N. Bernstein, A. Bhowmik, S. M. Blau, V. Cărare, J. P. Darby, S. De, F. D. Pia, V. L. Deringer, R. Elijošius, Z. El-Machachi, F. Falcioni, E. Fako, A. C. Ferrari, A. Genreith-Schriever, J. George, R. E. A. Goodall, C. P. Grey, P. Grigorev, S. Han, W. Handley, H. H. Heenen, K. Hermansson, C. Holm, J. Jaafar, S. Hofmann, K. S. Jakob, H. Jung, V. Kapil, A. D. Kaplan, N. Karimitari, J. R. Kermode, N. Kroupa, J. Kullgren, M. C. Kuner, D. Kuryla, G. Liepuoniute, J. T. Margraf, I.-B. Magdău, A. Michaelides, J. H. Moore, A. A. Naik, S. P. Niblett, S. W. Norwood, N. O'Neill, C. Ortner, K. A. Persson, K. Reuter, A. S. Rosen, L. L. Schaaf, C. Schran, B. X. Shi, E. Sivonxay, T. K. Stenczel, V. Svahn, C. Sutton, T. D. Swinburne, J. Tilly, C. van der Oord, E. Varga-Umbrich, T. Vegge, M. Vondrák, Y. Wang, W. C. Witt, F. Zills and G. Csányi, J. Chem. Phys., 2025, 163, 184110 CrossRef CAS PubMed.
  44. I. Batatia, S. Batzner, D. P. Kovács, A. Musaelian, G. N. Simm, R. Drautz, C. Ortner, B. Kozinsky and G. Csányi, Nat. Mach. Intell., 2025, 7, 56–67 CrossRef PubMed.
  45. P. Hohenberg and W. Kohn, Phys. Rev., 1964, 136, B864 CrossRef.
  46. W. Kohn and L. J. Sham, Phys. Rev., 1965, 140, A1133 CrossRef.
  47. J. P. Perdew, K. Burke and M. Ernzerhof, Phys. Rev. Lett., 1996, 77, 3865 CrossRef CAS PubMed.
  48. G. Kresse and J. Furthmüller, Comput. Mater. Sci., 1996, 6, 15–50 CrossRef CAS.
  49. G. Kresse and J. Furthmüller, Phys. Rev. B:Condens. Matter Mater. Phys., 1996, 54, 11169–11186 CrossRef CAS PubMed.
  50. S. P. Ong, W. D. Richards, A. Jain, G. Hautier, M. Kocher, S. Cholia, D. Gunter, V. L. Chevrier, K. A. Persson and G. Ceder, Comput. Mater. Sci., 2013, 68, 314–319 CrossRef CAS.
  51. N. C. Frey, J. Wang, G. I. Vega Bellido, B. Anasori, Y. Gogotsi and V. B. Shenoy, ACS Nano, 2019, 13, 3031–3041 CrossRef CAS PubMed.
  52. G. H. Gu, J. Jang, J. Noh, A. Walsh and Y. Jung, npj Comput. Mater., 2022, 8, 71 CrossRef.
  53. J. Jang, J. Noh, L. Zhou, G. H. Gu, J. M. Gregoire and Y. Jung, Matter, 2024, 7, 2294–2312 CrossRef CAS.
  54. S. Amariamir, J. George and P. Benner, Digital Discovery, 2025, 4, 1437–1448 RSC.
  55. S. Kim, Y. Jung and J. Schrier, J. Am. Chem. Soc., 2024, 146, 19654–19659 CrossRef CAS PubMed.
  56. V. Chung, A. Walsh and D. J. Payne, Digital Discovery, 2025, 4, 2439–2453 RSC.
  57. U. Schmocker, H. Boesch and F. Waldner, Phys. Lett. A, 1972, 40, 237–238 CrossRef CAS.
  58. H. Furuhashi, M. Inagaki and S. Naka, J. Inorg. Nucl. Chem., 1973, 35, 3009–3014 CrossRef CAS.
  59. E. Gorter, Adv. Phys., 1957, 6, 336–361 CrossRef.
  60. S. Carbonin, U. Russo and A. Della Giusta, Mineral. Mag., 1996, 60, 355–368 CrossRef CAS.
  61. H. S. C. O'Neill and A. Navrotsky, Am. Mineral., 1983, 68, 181–194 Search PubMed.
  62. D. S. McClure, J. Phys. Chem. Solids, 1957, 3, 311–317 CrossRef CAS.
  63. A. Navrotsky and O. Kleppa, J. Inorg. Nucl. Chem., 1967, 29, 2701–2714 CrossRef CAS.
  64. A. Zunger, Phys. Rev. B:Condens. Matter Mater. Phys., 1980, 22, 5839 CrossRef CAS.
  65. X. Zhang and A. Zunger, Adv. Funct. Mater., 2010, 20, 1944–1952 CrossRef CAS.
  66. R. Liu, E. Liu, J. Riebesell, J. Qi, S. P. Ong and T. W. Ko, MatCalc, 2024, https://github.com/materialsvirtuallab/matcalc Search PubMed.
  67. K. Okada, H. Morikawa, F. Marumo and S. Iwai, Acta Crystallogr., Sect. B, 1974, 30, 1872–1873 CrossRef.
  68. S. A. Abbas, I. Mahmood, M. Sajjad, N. Noor, Q. Mahmood, M. Naeem, A. Mahmood and S. M. Ramay, Chem. Phys., 2020, 538, 110902 CrossRef CAS.
  69. A. Togo, L. Chaput, T. Tadano and I. Tanaka, J. Phys.:Condens. Matter, 2023, 35, 353001 CrossRef CAS PubMed.
  70. A. Togo, J. Phys. Soc. Jpn., 2023, 92, 012001 CrossRef.

This journal is © The Royal Society of Chemistry 2026
Click here to see how this site uses Cookies. View our privacy policy here.