Open Access Article
Tri Minh Nguyen
*,
Sherif Abdulkader Tawfik
,
Truyen Tran,
Sunil Gupta,
Santu Rana and
Svetha Venkatesh
Applied Artificial Intelligence Institute, Deakin University, 75 Pigdons Rd, Waurn Ponds, VIC, Australia. E-mail: tri.nguyen1@deakin.edu.au; s.abbas@deakin.edu.au; truyen.tran@deakin.edu.au; sunil.gupta@deakin.edu.au; santu.rana@deakin.edu.au; svetha.venkatesh@deakin.edu.au
First published on 2nd October 2025
Discovering new solid-state materials requires rapidly exploring the vast space of crystal structures and locating stable regions. Generating stable materials with desired properties and compositions is extremely difficult as we search for very small isolated pockets in the exponentially many possibilities, considering elements from the periodic table and their 3D arrangements in crystal lattices. Materials discovery necessitates both optimized solution structures and diversity in the generated material structures. Existing methods struggle to explore large material spaces, outside the training space, and generate diverse samples with desired properties and requirements. We propose the Symmetry-aware Hierarchical Architecture for Flow-based Traversal (SHAFT), a novel generative model employing a hierarchical exploration strategy to efficiently exploit the symmetry of the materials space to generate crystal structures with desired properties. In particular, our model decomposes the exponentially large materials space into a hierarchy of subspaces consisting of symmetric space groups, lattice parameters, and atoms. To benchmark our approach, we first develop a novel, non-hierarchical GFlowNet for complete crystal structure generation. We then introduce SHAFT, a more expressive hierarchical model that leverages its architecture and increased model capacity to more efficiently explore the materials space. We demonstrate that SHAFT significantly outperforms the flat GFlowNet baseline and other state-of-the-art methods like CDVAE and DiffCSP in generating valid, stable, and diverse crystal structures.
We address the complexity associated with the large search space by proposing a new generative model termed Symmetry-aware Hierarchical Architecture for Flow-based Traversal (SHAFT). SHAFT explores the vast search space in an efficient way. The key insight for solving the large state space problem is breaking space exploration into more meaningful hierarchical sub-tasks. Here the higher-level tasks explore actions that are closely related to the reward function, while lower-level tasks handle the configuration adjustment corresponding to the actions taken at higher-level tasks. Since the exploration starts at a highly general concept level, it can learn a more meaningful policy that corresponds to the target reward function. With more meaningful actions taken at a higher level, the policy networks can focus on searching the actions in a significantly smaller sub-space that corresponds to the actions of high-level tasks instead of exploring the whole space.
Three key concepts can help generative models search the material space effectively. First, the crystal structure class imposes symmetry operations and geometrical characteristics on atom positions and lattice parameters, as seen in Fig. 1, effectively reducing the material search space. Second, searching for stable structures on the non-smooth energy landscape defined by quantum mechanics requires generative models to explore and generate diverse sample sets. This approach helps avoid getting stuck in a single mode or local minima. Third, we apply bond constraints on atom pairs in the generated crystals, ensuring that atoms are not closer to each other than specific bond thresholds. These thresholds are obtained using all materials in the Materials Project database. Given the diversity of materials in this database, it's reasonable to assume that minimum distances derived from it are threshold distances for the ground state structure of the respective atoms in any new crystal structure. By incorporating these concepts, generative models can more effectively navigate the complex material space and produce stable, diverse, and physically realistic crystal structures.
We model the entire material space in a hierarchical structure. The highest level is the crystal structure class, which is related to relevant material properties. Exploring this class can efficiently lead to more optimal properties of the generated structure. Specifically, we use the space group of the crystal structure to effectively reduce the 3D atom space exploration to a more meaningful high-level exploration and generate a high-symmetry crystal structure. The next level searches for the unit cell lattice parameters and atom configurations, given the space group. The space group imposes constraints on these parameters and atom positions, thus reducing the search space. Choosing positions one by one creates a long-horizon trajectory, making it difficult to learn a policy that generates high-symmetry structures proportional to the reward function. By leveraging the space group's symmetry operations, we can immediately replicate atoms over the unit cell. This approach reduces the trajectory length, making it easier for the policy network to learn how to generate a high-symmetry crystal structure.
While deep generative models have shown promise, the immense complexity and size of the material search space remain a formidable barrier. Prior methods using Generative Flow Networks (GFlowNets), such as Crystal-GFN,10 have successfully generated crystal parameters like space groups and lattice constants, but stop short of generating the complete crystal structure, omitting the explicit placement of atoms. Our work addresses this complete generation problem.
Our work first addresses this complete generation problem by developing a novel, non-hierarchical or “flat”, GFlowNet that generates the full crystal—space group, lattice, and atomic coordinates—within a single, monolithic policy framework. This in itself is a significant advance for creating full atomic structures with GFlowNets. However, we hypothesize that such a flat approach struggles to efficiently navigate the intricate, long-horizon dependencies inherent in crystal design. To overcome this, we then introduce SHAFT, a hierarchical model that decomposes the generation problem to improve learning efficiency and exploration. The comparison between SHAFT and our flat GFlowNet baseline is therefore designed specifically to isolate and quantify the benefits of this hierarchical decomposition.
Our main contributions are:
(1) We propose a generative model that can search effectively in a large search space, even outside the observed training dataset space, by modeling the state space in a hierarchical structure.
(2) We incorporate the physical knowledge curated from the large material databases into our generative model to generate more stable structures.
(3) We validate the hierarchical structure state space and physics priors of our proposed generative model in the crystal structure generation task to show the efficiency in material space exploration as well as the stability of the generated structures.
Generative Flow Networks (GFlowNets) are generative models designed to sample candidates proportional to their target rewards. The framework has been successfully applied to many fields such as molecular discovery,27 protein sequence discovery,28 causality,29 and continuous control.30 Jiangyan et al.31 integrated symmetries into GFlowNets with equivalent actions. LS-GFN32 explores the local neighborhood to avoid the over-exploration in GFlowNet. Our SHAFT is a generalization of GFlowNets for more efficient sampling by modelling the hierarchy of the search space. When applied to crystal structure generation, SHAFT also exploits the group symmetry unique to this domain.
describing 3 lengths and 3 angles of the unit cell, respectively. The atoms list A = (a1, …, aN) describes the elements. The atoms' coordinates
describe the positions of the atoms within the unit cell, which can be Cartesian or fractional.
The space group of a crystal structure consists of a list of symmetry transformations applied to the atoms within the unit cell. In crystallography, there are 230 space groups.33 Each space group has geometrical characteristics defined in lattice angles and lengths that can be used as constraints to limit the parameter search space. The list of geometrical characteristics is provided in SI Appendix Section A.1. Given the space group Gs, the elements of the space group g ∈ Gs are a set of symmetry operations. A crystallographic orbit of an atom o = (xo, ao) with coordinate xo and element ao is defined as OGs(xo) = {g·xo∣g ∈ Gs}, where g·xo denotes the application of the symmetry operation g on the atom o within the unit cell. From a reference atom o, we can obtain a set OGs of equivalent points.
. The sampling process starts with the initial state vertex s0 ∈ S with no incoming edge and stops at the sink state vertex sn ∈ S, where n is the sampling trajectory length, with no outgoing edge. GFlowNet learns a policy function π that can sample the object x with a probability proportional to the non-negative reward function. GFlowNets construct the object step by step, from the initial state s0 to the sink state sn, forming a trajectory τ = s0, …, sn, τ ∈ τ where τ is the trajectory set. For any state s in the trajectory, we can define the flow of the state as
and the flow of the edge s → s′ as
.27 The forward policy that maps the probability of transition from the current state s to the next state s′ is given by
. The backward policy mapping probability transition to the previous state s given the current state s′ is given by
. The training objective of the GFlowNets is flow-matching consistency where the incoming flow is equal to the outgoing flow,
, for all states. Ref. 34 proposes trajectory balance to deal with the long trajectory credit assignment problem. Given the trajectory τ = (s0 → s1 → … → sn), the forward probability of a trajectory is defined as
. The trajectory balance constraint is defined as
![]() | (1) |
and Z = F(s0) represents the initial state flow. Then the trajectory balance objective is defined as:
![]() | (2) |
To address this, SHAFT introduces a hierarchical policy structure that decomposes the generation process into two distinct, more manageable sub-tasks, as illustrated in Fig. 2. This design is motivated by the natural hierarchy present in crystal structures themselves. SHAFT generalises GFlowNets to accommodate a policy hierarchy of two levels: the high-level decision-making policy operating on the space groups, and the low-level execution policy operating on the atom–lattices (see Fig. 2). The space group policy chooses the space group and applies corresponding constraints on the atom–lattice policy actions. The corresponding hierarchical state space is decomposed as s = (ssg, sal), where ssg is the space group state and sal is the atom–lattice state. The latter consists of lattice parameters slp, atom coordinate sac, and atom type sat states.
![]() | ||
Fig. 2 Hierarchical policy for the crystal structure state. First, the crystal structure graph state s is decomposed into space group state ssg and atom-lattice state sal. Then the transition probability in eqn (4) is given by the space group policy network θsg. The transition probability in eqn (4) is given by the atom-lattice policy network θal. | ||
Then we define the probability of transitions as:
![]() | (3) |
![]() | (4) |
Then given the trajectory τ = (s0 → s1 → … → sn), we have the trajectory balance constraint (eqn (1)) with the decomposed state as:
![]() | (5) |
![]() | (6) |
The space group forward and backward transition probabilities PF(stsg|st−1) and PB(st−1sg|st) are parameterized by the multinoulli distribution defined by the logits output of the space group policy networks. The lattice parameter state slp transition probability is parameterized by the Gaussian distribution defined by the mean µ and variance σ2. The atom fractional coordinate state sac transition probability is parameterized by the multivariate Gaussian distribution defined by the mean µ and covariance matrix Σ. The atom type state sat transition probability is parameterized by the multinoulli distribution.
The sampling process is described in Algorithm 1. The lattice sampling process starts with the state s0 which has lattice parameters a = b = c = 4 and α = β = γ = 90°, space group 1, and an empty crystal graph at state
. The state at any point, s = (ssg, sal), represents the crystal structure accumulated up to that step. Initially, at s0, the crystal is empty. At each subsequent step t, the algorithm performs space group proposal and incremental construction. In the space group proposal, the high-level policy, θsg, proposes a new space group,
. This proposed space group governs the symmetry rules for the current step. It is used by the low-level policy, θal, to apply symmetry operations to the new reference atom being added. The choice of
can also impose new constraints on the crystal's lattice parameters, refining the overall structure. In the incremental construction, the low-level policy adds a new reference atom to the existing structure. The symmetry operations of the currently proposed space group,
, are applied to this new atom and all previously placed reference atoms in the structure. The resulting state at the end of the step,
, is an augmentation of the previous state, st−1. It incorporates the newly added atom (and its symmetric equivalents) and reflects the space group that is now governing the structure's symmetry. This step-by-step refinement ensures the DAG property of the generation process (more details in SI Appendix Section A.2).
A key aspect of this sampling process is the implicit handling of Wyckoff positions. SHAFT does not require the explicit selection of a Wyckoff position. Instead, the low-level policy learns to place reference atoms at specific fractional coordinates. If these coordinates correspond to a special Wyckoff position for the given space group, the application of symmetry operations generates a set of symmetrically equivalent atoms, some of which will overlap. These overlapping sites are then merged using a standard distance tolerance (0.01 Å via the pymatgen). If the coordinates correspond to a general Wyckoff position, a larger set of symmetrically equivalent atoms will be generated.
The policy is guided by a reward function that favors the generation of stable, low-energy structures. The policy learns to propose reference atom positions that result in favorable crystal structures, which may indirectly lead it to utilize Wyckoff positions commonly found in stable materials. This is an emergent behavior rather than an explicit design constraint.
The consistency between the space group, atom positions, and composition is maintained throughout this process. The atom positions are, by definition, consistent with the chosen space group because they are generated by its symmetry operations. The overall composition is managed incrementally. At each step, the model chooses an atom type, and a composition-related reward (Rcomp(x)) guides the policy towards chemically valid structures. Furthermore, as shown in Algorithm 1, the choice of elements can be restricted using a validity mask based on compositional constraints, ensuring that the generated structures adhere to desired chemical formula.
The formation energy term dictates that a stable crystal structure will have a negative formation energy, defined as Re(x) = e−E(x)/T where E(x) is the predicted formation energy per atom given by the prediction model (SI Appendix Section A.4.7), and T is the normalization term.
The bond length preference term Rb encourages the bond lengths of the input crystal structure to agree with the empirical lengths retrieved from the database. First, for each atom ai in the crystal structure x, we search all its neighbors nei(ai) within a radius cutoff of 4.0 Å. The neighbors can also include atoms from adjacent unit cells. We retrieve the minimum and average bond distances of all bond types existing in x from the Materials Project database (e.g. the minimum bond distance of Li–O is 1.63 and the average bond distance of Li–O is 3.02). Then the difference between the bond length d(ai, aj), aj ∈ nei(ai) with minimum dmin(ai, aj) and the average bond distance davg(ai, aj) of the (ai, aj) bond type is given by
![]() | (7) |
The density term RP encourages the optimal structure density. Because the bond distance preference term applies a strict penalty for structures violating the minimum distance constraint, the generative model tends to generate a structure with long distances between atom pairs with only a few neighboring atoms. This leads the model to generate gas-like structures with low density rather than solid-state density. On the other hand, structures with very high density (i.e. larger than 10) are unlikely to be realistic as the unit cells are crowded with atoms causing a very high formation energy. We measure the generated structure density P(x) (SI Appendix Section A.4.5) and define the density term as a Gaussian function of structure density:
.
The composition validity term is defined as Rcomp(x) = 1 for valid composition and Rcomp(x) = 0 otherwise. We follow composition validity as determined by the charge neutrality check.35 Finally, the physics-informed reward function is composed as:
| R(x) = weRe(x) + wpRP(x) + wbRbond(x) + wcRcomp(x) | (8) |
with node feature matrix
and edge matrix
. The node features include the atom's atomic number and its fractional coordinates within the unit cell. The edges are determined using k-nearest neighbors with the maximum number of neighbors being 12 and the radius cut-off is 8.0 Å. The graph representation of the crystal structure is learned using a MatErials Graph Network (MEGNet) model36 (details in SI Appendix Section A.4.2) to satisfy the periodic invariance and E(3) invariance of the crystal structure.
Lattice parameters are encoded using a multi-layer perceptron as:
![]() | (9) |
. The encoded crystal structure state hM is later used as the representation of the hierarchical state space sM.
Baselines: we compare our SHAFT with SOTA techniques including CDVAE,15 DiffCSP20 and GFlowNet, which is a flat version of our method. CDVAE is a diffusion model that uses SE(3) equivariant GNNs adapted for periodicity. We train CDVAE and DiffCSP on the MP-battery dataset for the battery material discovery task (based on requirements in Section 5.1, more details in SI Appendix Section A.5) and the MP-20 dataset6 for general crystal generation (Section 5.3). We report the top-K (K = 100) results ranked by eqn (8). For CDVAE and DiffCSP, we sample 100 samples three times and report the mean and standard deviation. We follow recent work on continuous GFlowNet37 to operate in the continuous space of the atoms' coordinates and lattice parameters. The model has a single-level policy network that outputs the space group, lattice parameters, atom coordinates, and atom types. Note that this is the first time GFlowNet has been applied successfully to complete crystal generation.
Motivated by the search for light-weight, transition-metal free cation materials, we explore the space of possible materials that can be made from the light elements Be, B, C, N, O, Si, P, S, and Cl, and one of the three alkali metals Li, Na and K. This space of materials constitutes materials that can be utilised in lithium-ion, sodium-ion, and potassium-ion battery materials, respectively. We discuss further the motivation for the experiments in SI Appendix Section A.3.
| Method | Structure ↑ | Composition ↑ |
|---|---|---|
| CDVAE | 0.66 ± 0.03 | 0.73 ± 0.090 |
| DiffCSP | 0.86 ± 0.03 | 0.59 ± 0.005 |
| GFlowNet | 0.98 ± 0.03 | 1.00 ± 0.000 |
| SHAFT | 0.99 ± 0.01 | 1.00 ± 0.000 |
| Method | Crystal family ↑ | Composition ↑ | Structure ↑ |
|---|---|---|---|
| CDVAE | 0.15 ± 0.09 | 1859.07 ± 199.28 | 0.43 ± 0.004 |
| GFlowNet | 1.97 ± 0.15 | 165.13 ± 32.55 | 0.84 ± 0.03 |
| DiffCSP | 1.23 ± 0.09 | 2360.97 ± 110.37 | 0.57 ± 0.06 |
| SHAFT | 2.19 ± 0.06 | 147.80 ± 15.81 | 0.89 ± 0.05 |
SHAFT demonstrates superior performance in generating a broad range of crystal families (Shannon–Wiener index of 2.19) and structurally distinct materials (structure diversity of 0.89), outperforming all baselines.
Conversely, for compositional diversity, we observe that the scores for GFlowNet (165.13) and SHAFT (147.80) are lower than those for CDVAE (1859.07) and DiffCSP (2360.97). This difference is an important and expected consequence of their fundamentally different generative approaches. CDVAE and DiffCSP are trained to replicate the distribution of the training data, and their high compositional diversity reflects the breadth of compositions present in the source dataset. In contrast, SHAFT and our GFlowNet baseline are reward-driven agents designed to actively seek candidates that maximize a physics-informed reward function. This reward-guided exploration inherently focuses the search on compositional regions that yield high-reward structures—those with high stability and validity—within the constrained chemical space of the battery discovery task.
This represents a critical trade-off: while diffusion models may explore a wider range of compositions, this often comes at the cost of lower stability and validity, as shown by our results in Tables 1 and 3. SHAFT, by design, prioritizes finding high-quality candidates within promising compositional niches, leading to a more targeted but less compositionally diverse set of outputs in this specific metric. This focus on high-reward regions is a key feature of our approach for efficient materials discovery, trading exhaustive compositional breadth for a higher yield of stable and valid material candidates.
| Method | Energy ↓ | % Energy < 0 ↑ | Ehull ↓ |
|---|---|---|---|
| CDVAE | −0.318 ± 0.080 | 74.6 ± 8.6 | 1.04 ± 0.24 |
| DiffCSP | 0.120 ± 0.108 | 45.5 ± 6.3 | 0.55 ± 0.04 |
| GFlowNet | −1.014 ± 0.003 | 98.6 ± 1.6 | 0.32 ± 0.02 |
| SHAFT | −1.241 ± 0.097 | 99.5 ± 0.8 | 0.29 ± 0.03 |
As illustrated in Fig. 4a, SHAFT demonstrates significantly faster mode discovery than the baseline flat GFlowNet. This enhanced efficiency is a direct consequence of its hierarchical policy structure. The high-level policy, θsg, learns to strategically prune the vast search space by first proposing promising crystallographic space groups that are historically more likely to yield stable materials. This high-level decision provides a strong structural constraint. Consequently, the low-level policy, θal, is tasked with a more focused and tractable problem: arranging atoms and defining lattice parameters within the pre-selected, promising symmetric framework. By contrast, a flat GFlowNet must inefficiently explore all parameters simultaneously, making it slower to escape regions of the search space that lead to invalid or unstable structures. The discovery of nearly 30 unique modes after evaluating 105 candidates underscores the immense and sparse nature of the de novo materials design space. The vast majority of potential atomic combinations are unstable or physically invalid. The critical insight from this experiment is that SHAFT's hierarchical decomposition of the problem allows it to navigate this complex landscape more efficiently, identifying the rare, high-reward modes with substantially fewer samples than its non-hierarchical counterpart.
Diffusion models like CDVAE and DiffCSP are trained to be expert replicators, learning the distribution of known, stable materials from a database. Their ability to generate stable structures is largely inherited from these high-quality training data. In contrast, RL-based models like SHAFT are trained as systematic explorers, designed to search the entire chemical space to find novel, high-reward structures. This process necessarily involves learning from a large number of low-reward samples to map out the vast, unstable regions of the search space, thus again highlighting the immense and sparse nature of the de novo materials design space.
The average reward per epoch in Fig. 4b shows that SHAFT not only speeds up the mode discovery but also discovers higher reward structures. The formation energy, bond length preferences, density, and composition validity terms improving over epochs (see SI Fig. 9) show that the model can learn to embed the physical constraints into the policy network. SI Fig. 10 showing the shift of the distribution of reward function terms from the random initialization of an untrained SHAFT crystal structure to trained SHAFT further proves this point.
![]() | ||
| Fig. 3 Examples of two configurations ((a) and (b)) of (LiBr)x generated by SHAFT and the corresponding structure optimized by the M3GNet framework38 and DFT. | ||
![]() | ||
| Fig. 6 Distributions of the sampled crystal structures from the observed crystal structures (MP-battery dataset) and SHAFT in terms of energy above hull, plotted using t-SNE. | ||
Match rate: we follow the previous work in ref. 39 to evaluate the match rate of crystal structure relaxation. A structure m and its optimized structure m′ are considered matched if their atoms' translations and angles are within tolerance thresholds, indicating that the generated structure is close to the optimum, and thus is more stable. We use the matching algorithm provided by the pymatgen library43 in StructureMacher with 10° angle tolerance, 1.0 fractional length tolerance, and 1.0 site tolerance. The match rate is the fraction of matched structures over the total number of generated structures.
The results reported in Table 4 show that our proposed method can produce more structures (at a rate of 0.82) that are nearly optimal in terms of total energy compared to GFlowNets and CDVAE.
| Methods | Match rate ↑ | RMS displacement ↓ |
|---|---|---|
| CDVAE | 0.46 ± 0.03 | 0.287 ± 0.0133 |
| DiffCSP | 0.33 ± 0.04 | 0.335 ± 0.041 |
| GFlowNet | 0.80 ± 0.01 | 0.174 ± 0.019 |
| SHAFT | 0.82 ± 0.02 | 0.161 ± 0.020 |
We also report the match rate of structures generated by SHAFT with their DFT-optimized structures (reported in Section 5.1.7) instead of M3GNet-optimized structures. In stark contrast, when comparing the as-generated structures to their DFT-relaxed ground truths, the match rate drops to 0.042 and the RMS displacement increases to 0.297 Å.
This discrepancy is expected and highlights a key characteristic of surrogate-model-guided generation. The high match rate with M3GNet-relaxed structures validates that SHAFT effectively optimizes its outputs for its given reward function—the M3GNet potential energy surface (PES). The model learns to produce structures that are already near a local minimum on this surrogate PES.
However, the M3GNet potential, while highly accurate, is an approximation of the true quantum mechanical landscape described by DFT. The lower match rate with DFT-relaxed structures quantifies the inherent gap between the surrogate model and the ground truth. The larger structural rearrangement required during DFT optimization is a necessary step to move the structure from the surrogate's predicted minimum to the true ground-state configuration. This also illustrates a common challenge known as “reward hacking”: the model becomes highly proficient at generating structures that minimize the energy of the surrogate M3GNet potential, which is an accurate but imperfect proxy for the true quantum mechanical landscape described by DFT.
Ultimately, this analysis reinforces the power of the hierarchical generation and validation workflow. SHAFT, guided by the M3GNet surrogate, is exceptionally effective at identifying promising regions of the vast chemical space that contain verifiably stable materials. The results confirm that while a final DFT refinement is essential for achieving ground-truth accuracy, SHAFT's approach is highly successful for the critical initial discovery phase.
![]() | (10) |
| R(x) = weRe(x) + wbgRbg(x) + wpRP(x) + wbRbond(x) + wcRcomp(x) | (11) |
We set weights as we = wbg = 0.1, wp = 0.1, wb = 0.5, and wc = 0.1. To demonstrate that the generated structures are stable and have near zero band gap, we present the average formation energy and band gap in Table 5.
| Method | Energy ↓ | % Energy < 0 ↑ | Band gap ↓ |
|---|---|---|---|
| CDVAE | −0.82 | 83.33 | 0.45 |
| DiffCSP | −0.96 | 86.74 | 0.48 |
| GFlowNet | −0.63 | 90.91 | 0.08 |
| SHAFT | −0.84 | 100.00 | 0.04 |
| Methods | Crystal family diversity ↑ | Composition diversity ↑ | Structure diversity ↑ |
|---|---|---|---|
| CDVAE | 0.95 | 1857.31 | 0.74 |
| DiffCSP | 2.60 | 1812.25 | 0.93 |
| GFlowNet | 2.37 | 1399.59 | 0.99 |
| SHAFT | 2.32 | 1471.59 | 1.01 |
| Methods | Comp. val. | Struct. val. | Avg. form. E | % E < 0 | Match rate ↑ | RMS dis. ↓ |
|---|---|---|---|---|---|---|
| CDVAE | 0.86 | 1.00 | −0.84 | 58 | 0.71 | 0.14 |
| DiffCSP | 0.83 | 1.00 | −0.85 | 90 | 0.94 | 0.06 |
| GFlowNet | 1.00 | 1.00 | −0.89 | 72 | 0.66 | 0.18 |
| SHAFT | 1.00 | 0.97 | −1.14 | 87 | 0.71 | 0.19 |
While this work has primarily focused on generating stable crystal structures with desirable formation energies and targeted band gaps, the SHAFT framework offers inherent flexibility for broader applications in materials discovery. In principle, additional material properties, such as mechanical characteristics (e.g., hardness and elastic moduli) or specific optical responses (e.g., refractive index and absorption spectra), could be seamlessly integrated into the generative process. This would typically involve incorporating corresponding terms into the reward function, guided by accurate and computationally efficient surrogate models capable of predicting these properties from structural inputs. However, expanding to multiple target properties introduces further considerations. The primary challenge lies in navigating the complexities of multi-objective optimization, particularly when objectives are conflicting, which may necessitate sophisticated strategies beyond simple weighted-sum approaches to effectively explore the Pareto front.44 Furthermore, the computational overhead of evaluating an increasing number of property predictors must be carefully managed to maintain the efficiency of the generation process. Future work could explore advanced multi-objective SHAFT algorithms and the development of robust surrogate models for a wider range of material characteristics to fully exploit SHAFT's potential in designing novel, multi-functional materials.
CCDC 2409676–2409705 contain the supplementary crystallographic data for this paper.45a–ad
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d4dd00392f.
| This journal is © The Royal Society of Chemistry 2026 |