Jakob
Baumsteiger
*ab,
Lorenzo
Celiberti
b,
Patrick
Rinke
cdef,
Milica
Todorović
g and
Cesare
Franchini
ab
aDepartment of Physics and Astronomy “Augusto Righi”, Alma Mater Studiorum – Univeristà di Bologna, 40127 Bologna, Italy. E-mail: jakob.baumsteiger2@unibo.it
bFaculty of Physics and Center for Computational Materials Science, University of Vienna, 1090 Vienna, Austria
cPhysics Department, TUM School of Natural Sciences, Technical University of Munich, Garching, Germany
dDepartment of Applied Physics, Aalto University, FI-00076 Aalto, Finland
eAtomistic Modelling Center, Munich Data Science Institute, Technical University of Munich, Garching, Germany
fMunich Center for Machine Learning (MCML), Germany
gDepartment of Mechanical and Materials Engineering, University of Turku, FI-20014 Turku, Finland
First published on 24th May 2025
The investigation of magnetic energy landscapes and the search for ground states of magnetic materials using ab initio methods like density functional theory (DFT) is a challenging task. Complex interactions, such as superexchange and spin–orbit coupling, make these calculations computationally expensive and often lead to non-trivial energy landscapes. Consequently, a comprehensive and systematic investigation of large magnetic configuration spaces is often impractical. We approach this problem by utilizing Bayesian optimization, an active machine learning scheme that has proven to be efficient in modeling unknown functions and finding global minima. Using this approach we can obtain the magnetic contribution to the energy as a function of one or more spin canting angles with relatively small numbers of DFT calculations. To assess the capabilities and the efficiency of the approach we investigate the noncollinear magnetic energy landscapes of selected materials containing 3d, 5d and 5f magnetic ions: Ba3MnNb2O9, LaMn2Si2, β-MnO2, Sr2IrO4, UO2, Ba2NaOsO6 and kagome RhMn3. By comparing our results to previous ab initio studies that followed more conventional approaches, we observe significant improvements in efficiency.
In recent years, solutions to these problems have been attempted. Payne et al. used the meta heuristic firefly algorithm12 and Zheng and Zhang genetic evolution13 to identify noncollinear ground states. Although successful, both attempts still require hundreds of DFT calculations per material to converge to the final result. An approach by Huebsch et al. aims to find magnetic ground states with the help of a basis set of magnetic structures determined by a cluster-multipole expansion, a theoretical approach based on magnetic point group analysis.14 However, magnetic ground states are often unknown linear combinations of these basis configurations.
Although the aforementioned methods concentrate on identifying magnetic ground states, they overlook additional details about the magnetic energy landscape. Consequently, there is a need for an efficient approach to explore noncollinear magnetic energy landscapes to determine the ground state and other magnetic properties of complex magnetic materials. This work proposes a solution using a data-driven Bayesian optimization (BO) technique.
BO is a versatile machine learning tool that has found widespread applications across diverse domains.15–24 It excels in scenarios where evaluating the objective function is resource-demanding and proves particularly valuable for modeling complex, potentially multi-dimensional black box functions. Here we use DFT-informed BO to explore noncollinear magnetic energy landscapes. Thereby we gain important insight into magnetic properties while uncovering metastable states and magnetic ground states at the same time.
We test our approach using the benchmark materials Ba3MnNb2O9, LaMn2Si2, β-MnO2, Sr2IrO4, UO2, Ba2NaOsO6 and RhMn3. These materials represent a minimal dataset of materials characterized by different structural motifs, different magnetic orbitals, and different magnetic structures, as summarized in Table 1.
Material | Structure | Magnetic orbital | Magnetic ordering | k-mesh | Energy cutoff [eV] | Hubbard correction [eV] |
---|---|---|---|---|---|---|
Ba3MnNb2O9 | Triple perovskite | 3d | 120° Néel state | 5 × 5 × 6 | 500 | U eff = 3 (ref. 7) |
LaMn2Si2 | Tetragonal | 3d | Canted FM | 8 × 8 × 8 | 500 | — |
β-MnO2 | Rutile | 3d | Helical | 8 × 8 × 5 | 600 | U = 6.7, J = 1.2 (ref. 25) |
Sr2IrO4 | Layered perovskite | 5d | Canted AFM | 7 × 7 × 3 | 800 | U eff = 1.6 (ref. 26) |
UO2 | Cubic | 5f | Canted AFM | 6 × 6 × 6 | 700 | U eff = 3.46 (ref. 10) |
Ba2NaOsO6 | Double perovskite | 5d | Canted AFM | 6 × 6 × 6 | 600 | U eff = 3.4 (ref. 11) |
RhMn3 | fcc 3D kagome | 3d | 120° Néel state | 10 × 10 × 10 | 450 | — |
At each iteration step t, the BO algorithm constructs a surrogate model of the magnetic energy landscape Et(Φ) and determines the corresponding statistical Bayesian uncertainty σt(Φ), utilizing the available data. The uncertainty σt(Φ) indicates the lack of confidence in the current model of the magnetic energy landscape that arises from the limited amount of available data and the noise of the DFT energies. Typically, Gaussian process regression (GPR) is used for the generation of the surrogate model.21,27 The initial data set must include at least two data points to start the optimization procedure. We used two initial data points for the models involving one degree of freedom and four initial data points for all other models.
The configuration, which will be investigated in the next DFT calculation is strategically determined by utilizing an acquisition function , which usually depends on the current surrogate model and the Bayesian uncertainty. More precisely, the next DFT energy is determined for the magnetic configuration that minimizes the acquisition function. Different acquisition functions have been proposed in the literature and acquisition function design is an active field of research. In this work we use the pure exploration (EXP) acquisition function, which, in our case, is expressed as
![]() | (1) |
Once the BO algorithm terminates after completing Niter iteration steps, it is possible to evaluate whether the model has converged and, if so, determine the number of iterations required for convergence Nconviter and the number of DFT calculations required for convergence NconvDFT. If convergence is not achieved after Niter iterations, the optimization can be resumed starting from the last iteration. Naturally, the total number of performed iterations must be larger than the number of iterations required for convergence (Nconviter).
To ensure a high accuracy of the final models, we monitor the convergence of each surrogate model using a validation set S, which consist of random magnetic configurations, parametrized by vectors containing all relevant canting angles
and the corresponding DFT energies Ei. We use the largest difference between any total energy within the validation set and the energy prediction of the surrogate model (ΔE)max as a convergence measure:
![]() | (2) |
The concept and the mathematical foundation of BO are extensively described in the literature.21,27,28 In this work we used the Bayesian Optimization Structure Search (BOSS)17 python package (version 1.10.1).
For the creation of the two- and three-dimensional models of the Ba3MnNb2O9 magnetic energy landscape and the two-dimensional model of RhMn3 magnetic energy landscape, we exploit a symmetry in the crystal structures to reduce the number of required calculations using a feature which is incorporated in BOSS. This feature allows us to avoid redundant computations by adding not only the directly calculated DFT data point but also a second, symmetry-equivalent point to the data set. We used the radial basis function kernel for the generation of almost all of our models. The only exception is our model of the magnetic energy landscape of UO2 that takes into account four individual degrees of freedom. In this case we used the Matérn kernel with ν = 3/2, since it outperformed the radial basis function kernel for this specific model in a kernel test not shown here.
A key objective of this work is to assess the accuracy of the BO approach by comparing the resulting magnetic energy landscapes and ground state configurations with those reported in related previous studies. To enable a meaningful comparison, we recreated the respective DFT setups used in the earlier studies. Consequently, the calculated DFT energies can be considered accurate within the context of the specific setups up to the numerical precision of our DFT calculations, which is 10−6 eV. Since our aim is to recreate the features of the magnetic energy landscape within the framework determined by the previous studies, we set the GPR calculations in such a way that it can fit energy values with a noise characterized by a standard deviation of 10−6 eV.
![]() | (3) |
In the following, we showcase the capabilities of our approach using LaMn2Si2, β-MnO2, Ba2NaOsO6, Sr2IrO4, UO2, Ba3MnNb2O9 and RhMn3 as benchmark materials and monitor the computational efficiency. We apply BO to generate up to four-dimensional models of the magnetic energy landscape. Since the computational resources required to run the BO algorithm are negligible compared to the computational costs of DFT calculations, one can assume that the overall computational costs scale with the number of DFT calculations required for the convergence of the model. The computation time for a single DFT calculation is heavily influenced by the computational system in use and the specific material being analyzed. Usually, materials of greater interest require more computation time since they are often more complex. In our case, the DFT calculations required between 30 minutes and four hours.
To determine an one-dimensional energy profile that connects the 120° state and the uud state, we defined an out-of-plane canting angle Φ and quantify the energy gain as a function of Φ in the range between 0° and 90° using BO. Within this range, the one-dimensional magnetic energy surface lacks any symmetry that could be inferred from the crystal symmetry. Therefore we could not reduce the number of required DFT calculations by exploiting a given symmetry as described in Section 2.1. The convergence criterion introduced in Section 2.1 is reached after performing four DFT calculations. The model and the four data points that were used for the generation of the model are displayed in Fig. 2a. A similar energy profile, based on ten DFT calculations and qualitatively matching our results was previously determined by Lee et al.7 To further investigate the phase space for potential stable or metastable states, including those with lower symmetry, we created higher-dimensional models of the magnetic energy landscape by introducing two and three independent canting angles, respectively. For two independent angles, we coupled Φ2 and Φ3 as defined in the inset of Fig. 2a and added Φ1 as a second independent canting angle. For three independent angles, we used all three canting angles as independent degrees of freedom. To account for both ferromagnetic and ferrimagnetic states, we expanded the search bounds of all angles to cover the range between −90° and 90°. As a consequence, the resulting models show a point symmetry that can be defined as , where
is a vector that contains the two or three canting angles, respectively. This symmetry follows from a mirror symmetry in the crystal structure. The comparison of all three models is presented in Fig. 2a. As shown in the figure, all three models are in good agreement. Fig. 2b depicts a two-dimensional slice of the full three-dimensional model, which is defined by the constraint Φ2 = Φ3. The three-dimensional model indicates that the 120° state is the only stable or metastable configuration in the explored phase space.
![]() | ||
Fig. 2 The definition of the canting angles used for the triangular lattice Ba3MnNb2O9 is shown in the inset of (a). In (b), a slice of the magnetic energy landscape of Ba3MnNb2O9 that was generated by defining one independent canting angle per magnetic site, is shown. The slice is defined by the constraint Φ2 = Φ3. The comparison between a one-dimensional, a two-dimensional and a three-dimensional model of the magnetic energy landscape is shown in (a). The crystal structure of LaMn2Si2 and the used definition of the canting angles Φ1 and Φ2 are shown in (c). The magnetic energy landscapes that is spanned by the two canting angles, is depicted in (d). In (e), the energy profile corresponding to a canting angle Φ which is defined by the constraint Φ = Φ1 = Φ2, is shown. The results of the DFT calculations used for the creation of the model are depicted as blue points. The model is compared to the model from (d). The line that correspond to the definition of Φ is marked in (d) as an orange line. The reduced crystal structure of β-MnO2 is depicted in (f). Shown are also the magnetic moments of Manganese as proposed by Yoshimori.41 The body-centered Manganese atoms are displayed in solid colors. The effect on the total energy of rotating the canting angles of the body-centered Manganese atoms in plane by up to 180°, determined by BO is shown in (g). |
Using BO we obtained the energy difference as a function of the canting angle Φ defined by the constraint Φ = Φ1 = Φ2 in the range between 0° and 90°. This range encompasses all possible magnetic configurations achievable by varying Φ due to the crystal's symmetry. The surrogate model is fully converged after the execution of seven DFT calculations within the BO framework and exhibits a global minimum that corresponds to a canting angle of 55.8° (see Fig. 2e). This finding agrees well with previous DFT studies, which estimated the canting angle to be between 53° and 57.7°.42,47
All Mn atoms lie within two distinct lattice planes of the magnetic unit cell. To take into account additional magnetic configurations, particularly those that are antiferromagnetic along the [001] direction, we expand the magnetic energy landscape using BO by decoupling the canting angles of both planes and varying Φ2 between 0° and 180°. The resulting model is shown in Fig. 2d. It is converged after 32 DFT calculations. In addition to the global minimum, which coincides with the known magnetic ground state of LaMn2Si2 (see Fig. 2d and e), a local minimum emerges at Φ1 = 59° and Φ2 = 121°, which indicates a metastable magnetic state. To the best of our knowledge, this state has not been reported yet as a stable state of LaMn2Si2. The energy difference between both minima is 27.1 meV/f.u. and the difference between the global minimum and the saddle point between both minima amounts to 40.7 meV/f.u.
As Yoshimori found out, the corner manganese atoms and the body-center cations form two distinct sublattices in the helical phase. The magnetic moments of both sublattices screw along the c-direction with a wavelength of 7c/2, so that the pattern repeats after seven unit cells (see Fig. 2f). All magnetic moments lie within the (001) plane. Later a magnetic X-ray diffraction experiment52 and neutron diffraction experiments53,54 found that the wavelength of the helical structure is about 4% smaller than 7c/2, which makes it incommensurate. According to Yoshimori's model, the helical phase in β-MnO2 is stable only if the interaction between each corner atom and its neighboring body-centered atom is antiferromagnetic, eventually resulting in a 128.6° tilt between each manganese atom and the two closest atoms of the other sublattice.
In 1966, W. Osmond argued that the interaction between corner atoms and body-centered atoms should be ferromagnetic according to the Goodenough-Kanamori rules for superexchange.55 Based on this assumption he proposed an alternative pattern for the spin helix. This pattern differs from Yoshimori's model by inversion of the three-dimensional magnetic moment vectors of all body-centered atoms. This inversion can alternatively be described as a revolution of the three dimensional magnetic moment vectors of all body-centered atoms by 180° around the c-axis.
To directly compare the energies of the spin configuration proposed by Osmond with the one predicted by Yoshimori, we used BO to determine the total energy of β-MnO2 as a function of the revolution angle Φ in the range between 0° and 180°. Thereby Φ = 0° corresponds to the model of Yoshimori and Φ = 180° corresponds to the model of Osmond. The surrogate model function converges after four DFT calculations (see Fig. 2g). It clearly shows that the spin structure predicted by Yoshimori is lower in energy than the one predicted by Osmond. This result is in line with magnetic X-ray diffraction data acquired later.52
The canting angle Φ is defined by the condition Φ = Φ1 = Φ2, where Φ1 and Φ2 are illustrated in Fig. 3a. Therefore, Φ describes the common tilting of all magnetic moments from the crystallographic axis. This result is in good agreement with experimental measurements (Φ = 12.2° (ref. 60)). Using BO, we recreated the one-dimensional magnetic energy landscape within the interval from Φ = 0° to Φ = 50° (see Fig. 3b). The surrogate model converged after five DFT calculations. The energy minimum of our model is reached at Φ = 11.2° which agrees well with the previous findings of Liu et al. Furthermore, we expanded the model of the magnetic energy landscape by decoupling the canting angles Φ1 and Φ2, considering magnetic configurations with different magnetic moment canting angles in neighboring planes. The resulting model, which converged after 17 DFT calculations, is shown in Fig. 3b. This two-dimensional model is in good agreement with the data from Liu et al. (see Fig. 3c). The model has two minima, located at (Φ1 = 0.009°, Φ2 = 32.448°) and (Φ1 = 36.067°, Φ2 = 0.004°), respectively. Of those two minima, the first is lower in energy. Its location in the two-dimensional model is marked by a red cross in Fig. 3b. The two minima do not coincide with the magnetic ground state previously reported by Liu et al. To test the validity of our prediction, we performed two additional DFT calculations for the magnetic configuration that corresponds to the lowest minimum of the model and the magnetic ground state determined by Liu et al., respectively. These calculations indicate that the minimum of the model lies 0.009 meV/f.u. lower in energy than the state reported by Liu et al., a difference that approaches the limit of computational accuracy and highlights the complex multi-minima landscape characteristic of spin–orbit entangled oxide systems.
![]() | ||
Fig. 3 In (a), the crystal structures of subsequent lattice planes of Sr2IrO4 and the used definition of the canting angles Φ1 and Φ2 are given. The magnetic energy landscape spanned by the canting angles Φ1 and Φ2 is shown in (b). In (c), the two-dimensional model is compared to a one-dimensional model and to data from Liu et al.9 The crystal structure of UO2 as well as the used definition of the canting angles Φ1 to Φ4 are given in (d). (e) Shows a slice of the four-dimensional model of the magnetic energy landscape obtained by using BO and all four canting angles independently. The slice is defined by the conditions Φ1 = Φ3 and Φ2 = Φ4. In (f) the four-dimensional model and a model obtained by a varying the four canting angles simultaneously are compared to data from Dudarev et al.10 The reduced crystal structure of Ba2NaOsO6 is shown in (g). The definition of the canting angle Φ is given in the inset of the figure. The converged model of the BO energy profile is compared to data from Fiore Mosca et al.11 in (h). |
We could reproduce Dudarev's model with BO (see Fig. 3f) by generating an one-dimensional model with BO. The surrogate model converged after five DFT calculations. By using all four canting angles as independent input parameters for BO that range between 0° and 90°, we received a full four-dimensional model of the magnetic energy landscape of uranium dioxide, which converged after 360 DFT calculations. A two-dimensional slice defined by Φ1 = Φ3 and Φ2 = Φ4 is shown in Fig. 3e. Fig. 3f demonstrates that our model is in good agreement with the data of Dudarev et al. However, our model indicates that the 3k state is not a global energetic ground state, but rather a saddle point in the magnetic energy landscape. The global minimum of our model is located at Φ1 = Φ3 = 14° and Φ2 = Φ4 = 90°. The energy difference between the global energy minimum and the 3k state amounts to 0.38 meV/f.u. This result calls for further theoretical and experimental analysis for elucidating the magnetic ground state of UO2.
The model converged after five DFT calculations, yielding the energy profile shown in Fig. 4b. It confirms the expectation that T1 is the 120° in-plane ground state of RhMn3. In order to expand the model to also account for small collective out-of-plane cantings, we introduced a second canting angle Θ, which we defined as the angle between each magnetic moment and the (111) plane. With this, we used our BO approach to construct a two-dimensional model of the magnetic energy surface of RhMn3 in the magnetic configuration space defined by 0° ≤ Φ ≤ 90° and −2° ≤ Θ ≤ 2°. To enhance efficiency of the model generation, we exploited the symmetry E(Φ, Θ) = E(Φ, – Θ) as described in Section 2.2. The model converged after 20 DFT calculations. As depicted in Fig. 4b, it agrees closely with the one-dimensional model. The full two-dimensional model is presented in Fig. 4c. The global minimum of the model is located at (Φ = 90.0°, Θ = 0.2°), closely matching the T1 spin structure. Notably, the two-dimensional model shows that even minor out-of-plane canting results in significant magnetic energy contributions.
Material | Dim. | Search bounds | N convDFT this work | (NconvDFT)1/d | N DFT related work | New findings |
---|---|---|---|---|---|---|
Ba3MnNb2O9 | 1D | [−90°, 90°] | 4 | 4 | 10 (ref. 7) | — |
2D | [−90°, 90°] × [−90°, 90°] | 14 | 3.74 | — | — | |
3D | [−90°, 90°] × [−90°, 90°] × [−90°, 90°] | 25 | 2.92 | — | — | |
LaMn2Si2 | 1D | [0°, 90°] | 6 | 6 | — | — |
2D | [0°, 90°] × [0°, 180°] | 33 | 5.74 | — | Potential metastable state | |
β-MnO2 | 1D | [0°, 180°] | 4 | 4 | — | — |
Sr2IrO4 | 1D | [0°, 50°] | 5 | 5 | 9 (ref. 9) | — |
2D | [0°, 50°] × [0°, 50°] | 17 | 4.12 | — | — | |
UO2 | 1D | [0°, 90°] | 5 | 5 | 46 (ref. 10) | — |
4D | [0°, 90°] × [0°, 90°] × [0°, 90°] × [0°, 90°] | 360 | 4.35 | — | Potential ground state | |
Ba2NaOsO6 | 1D | [0°, 90°] | 10 | 10 | 36 (ref. 11) | — |
RhMn3 | 1D | [0°, 90°] | 5 | 5 | — | — |
2D | [−2°, 2°] × [0°, 90°] | 20 | 4.47 | — | — |
Naturally, the number of DFT calculations needed for convergence NconvDFT is most significantly influenced by the dimensionality of the model. The value of NconvDFT of the one-dimensional models ranges between four (Ba3MnNb2O9, β-MnO2) and ten (Ba2NaOsO6). Compared to the number of DFT calculations used for in the respective related study, the BO approach required between 44% (Sr2IrO4) and 89% (UO2) fewer DFT calculations. The two-dimensional models converged earliest after 14 DFT calculations (Ba3MnNb2O9) and latest after 33 DFT calculations (LaMn2Si2). The only three-dimensional model presented in this work (Ba3MnNb2O9) converged after 25 DFT calculations and the only four-dimensional model (UO2) converged after 360 calculations.
Performance differences between models of the same dimensionality appear to be caused primarily by variations in their complexity. For instance, the only one-dimensional model with two local minima (Ba2NaOsO6) required ten DFT calculations to converge, compared to the four to six calculations needed in the other cases. A similar pattern can be found for the two-dimensional models. The model for LaMn2Si2, which features two local minima, required 33 calculations to converge. In contrast, Ba3MnNb2O9, Sr2IrO4 and RhMn3, each with a single local minimum within the search bounds, required 14,17 and 20 calculations, respectively. In both instances, NconvDFT roughly doubles when the model contains two minima.
When NconvDFT DFT calculations are required to converge a one-dimensional model, one could expect that (NconvDFT)d calculations are required to converge a d-dimensional model of the same material. However, in all cases discussed in this work, the performance of the multi-dimensional BO models is better than this approximation, as one can see from the (NconvDFT)1/d values in Table 2.
As stated in Section 2.1, we have used validation sets to track convergence of the surrogate models. This enabled us to ensure the convergence of the models and to meaningfully assess the performance of BO. However, this approach might not be reasonable for the use of BO in most practical applications, since it requires additional DFT calculations. Therefore we recommend using the maximal change of the surrogate model per iteration step as a convergence measure. This measure led to similar results for the benchmark cases presented in this work.
The two-dimensional model of the magnetic energy landscape of LaMn2Si2 revealed a local metastable state defined by Φ1 = 59° and Φ2 = 121° following the definition of Φ1 and Φ2 given in Fig. 2c. Due to the high energy difference between the local minimum and the energy barrier, the existence of the local minimum can not be explained as an artifact of an imperfect choice of k-mesh or energy cutoff of the plane-wave expansion. To the best of our knowledge, this metastable state has not been reported yet.
For Sr2IrO4, the 1D model accurately reproduces previous DFT results and agrees well with experimental observations. For the 2D model, BO identifies a broad region of the magnetic configuration space where the total energy does not exceed the global minimum by more than 0.05 meV/f.u., highlighting the challenges of exploring the energy landscape of complex spin–orbit entangled systems. Within this 2D model, BO also found an additional minimum lying 0.009 meV/f.u. below the experimentally reported structure.60,72 However, due to the inherent approximations of the employed DFT + U approach, this small energy difference lies beyond our computational precision. As such, our results remain consistent with the experimental observations.
Also in the case of UO2 there is a discrepancy between the postulated magnetic ground state and the minimum of the BO magnetic energy landscape. Although our models match the literature DFT data perfectly, they show that the energy is further reduced for several magnetic configurations with reduced symmetry by up to 0.38 meV/f.u. The lower energy values of these configurations are not artifacts of the BO approach, since they are established by explicit DFT calculations, that are computed with an accuracy threshold of 10−6 eV for a supercell containing four formula units UO2. However, as mentioned in Section 2.1, we adapted the DFT setup of Dudarev et al.10 in order to make the results comparable. This setup, in particular the choice of k-mesh and energy cutoff, might affect the observed magnetic ground state, especially considering the relatively small energy differences at play. Additionally, the choice of exchange-correlation functional might lead to unphysical results. It has been reported that the choice of exchange-correlation functional affects the total energy of UO2 obtained by DFT calculations in such a manner that it can result in different predicted ground states.66 However, although most studies indicate that the 3k state is the physical ground state of UO2, it remains a complicated matter and still poses a subject of debate.66,73 If the minima of our models are a result of the used exchange-correlation functional, the choice of the Ueff parameter or if they have a real physical root will have to be clarified in further studies.
Despite the mentioned open questions concerning interpretation, the benchmark cases LaMn2Si2 and UO2 highlight a strength of BO. They show how the capability of generating large magnetic energy landscapes with several degrees of freedom can help to spot interesting physics and potential problems of theory that would otherwise be overlooked.
BO-guided explorations are easy to implement and can easily be adapted to new problems. Although not explicitly discussed in this work, BO can also be applied to 2D magnets and materials with defects, where approaches based on symmetry analysis fail due to the breaking of symmetry by the defect. In principle, BO can be applied to explore any magnetic configuration that is compatible with the chosen computational unit cell.
Just like the methods used in previous approaches by Payne et al.12 and Zheng and Zhang,13 BO is an active machine learning scheme. An important advantage of using active machine learning schemes is that they do not require large training datasets that would be computationally expensive to produce.
Apart from the three methods mentioned in Section 1,12–14 magnetic ground states and model magnetic energy surfaces can also be found with other methods. A more conventional approach, for example, is to derive spin models from ab initio methods.74–76 The main idea of this approach is to map DFT or other first principles calculations onto a lattice spin Hamiltonian. The resulting spin model can then describe the interaction of the spins according to the chosen Hamiltonian and its properties can be determined using analytical, numerical or spin dynamics methods. This approach is highly efficient since often only a small number of DFT calculations is needed to determine the free parameters of a given spin Hamiltonian. The choice of a Hamiltonian corresponds to the chosen material and the interactions deemed essential for understanding specific properties. However, determining in advance which Hamiltonian best meets the specific requirements often is a delicate task and sometimes impossible. In contrast, BO does not require the prior specification of a Hamiltonian. This lack of a required predefined Hamiltonian is a key strength of BO, enabling it to serve as a tool for verifying or challenging prior assumptions about spin interactions in a given material. While doubtful or even wrong assumptions about spin interactions could be reflected in an improper choice of Hamiltonian and eventually lead to misleading results, BO always converges towards the true magnetic energy landscape within a given DFT + U framework after a sufficient number of DFT calculations.
Magnetic ground states can also be obtained from ab initio spin dynamics calculations as proposed by Antropov et al.77 in 1995. Starting from an arbitrary spin configuration, one follows equations of motion for the spin degrees of freedom to end up in the magnetic ground state. Although the approach produces accurate results, it is computationally relatively expensive, as the procedure should be repeated multiple times with different random initial configurations to reduce the risk of getting trapped in a local minimum. This approach has been adopted for investigating relatively simple systems, such as Fe and Co chains,78 while its application to multi-element compounds is considerably more challenging. In contrast to ab initio spin dynamics, BO provides a more comprehensive information output and can be easily adapted to more complex systems, as demonstrated in this work.
A recent approach by Ponet et al. focuses on generating an extensive list of stable magnetic states.79 By employing a constraining technique, they can reliably initialize the system's electronic state. The core idea of their approach is to generate a diverse set of initial electronic states and relax each one toward its nearest energy minimum. While the strength of this approach lies in its ability to reliably identify stable magnetic states in magnetic materials, our BO-based method offers additional insights into the magnetic energy landscape, including magnetocrystalline anisotropies and energy barriers between stable states. As such, the two are complementary.
Recently, Rinaldi et al. presented a noncollinear magnetic atomic cluster expansion (ACE) parametrization for Fe.80 They showed that the parametrization is capable of correctly reproducing a wide range of properties, including the magnetic ground state, magnetic phases and magnetic energy profiles. This achievement, however, required a substantial training dataset of 70000 structures with different atomic and magnetic structures. Magnetic ACE is therefore at the opposite end of the data volume range from BO. While BO offers efficient information gain about the magnetic energy landscape within a predefined configuration space at relatively low computational cost, ACE potentially offers more refined insight, albeit at the expense of a significantly larger training dataset.
A key strength of the ACE approach by Rinaldi et al. lies in its ability to capture the interplay between spin structure and crystal structure. For materials where a strong coupling exists between these two degrees of freedom, exploring the magnetic energy landscape of a rigid crystal structure – as done in this work – may not suffice to identify the true magnetic ground state. However, this limitation of our BO-based approach could potentially be addressed by performing a structural relaxation inside the BO loop prior to each DFT total energy calculation. Such an approach has already been successfully applied to investigate octahedral tilting using BO.24 While this would incur higher computational costs, it may yield a more physically accurate description of the ground state in such strongly coupled systems.
In general our results show that BO can be used as a tool to efficiently explore noncollinear magnetic energy landscapes and find magnetic ground states within large spin configuration spaces. Thereby it can help to obtain exhaustive insights into the intricate interactions of complex magnetic materials.
This journal is © The Royal Society of Chemistry 2025 |