Shuping Guo*a, Ryan Morrow a, Jeroen van den Brink ab and Oleg Janson*a
a Leibniz Institute for Solid State and Materials Research IFW Dresden, Helmholtzstraße 20, Dresden 01069, Germany. E-mail: shuping.guo@ifw-dresden.de; olegjanson@gmail.com
b Department of Physics, Technical University Dresden, Dresden 01069, Germany
First published on 1st February 2024
Double perovskites are a growing class of compounds with prospects for realization of novel magnetic behaviors. The rich chemistry of double perovskites calls for high-throughput computational screening that can be followed by or combined with machine-learning techniques. Yet, most approaches neglect the bulk of microscopic information implicitly provided by first-principles calculations, severely reducing the predictive power. In this work, we remedy this drawback by including onsite energies and transfer integrals between the d states of magnetic atoms. These quantities were computed by Wannierization of the relevant energy bands. By combining them with the experimental information on the magnetism of studied materials and applying machine learning, we constructed a model capable of predicting the magnetic properties of the remaining materials whose magnetism has not been addressed experimentally. Our approach combines classification learning to distinguish between double perovskites with dominant ferromagnetic or antiferromagnetic interactions and regression employed to estimate magnetic transition temperatures. In this way, we identified one antiferromagnet and three ferromagnets with a high transition temperature. Another 28 antiferromagnetic candidates were identified as magnetically frustrated compounds. Among them, cubic Ba2LaReO6 shows the highest frustration parameter, which is further validated by a direct first-principles calculation. Our methodology holds promise for eliminating the need for resource-demanding calculations.
The magnetic properties of double perovskites are very diverse. Following the discovery of large magnetoresistance in Sr2FeMoO6,4 several promising candidates with high magnetic transition temperature (Tc) were suggested for spintronic applications, such as Sr2FeReO6 (Tc = 401 K), Sr2CrWO6 (Tc = 450 K), Sr2CrReO6 (Tc = 625 K) and Sr2CrOsO6 (Tc = 725 K).5 On the other hand, long-range magnetic ordering in double perovskites can be suppressed by magnetic frustration, giving rise to the valence bond glass state in Ba2YMoO6.6
This remarkable disparity is driven not only by chemistry, but also by structural degrees of freedom. Alternating BO6 and B′O6 octahedra form a three-dimensional framework; A cations occupy the voids of this framework. In a regular octahedral environment, the d orbitals of B and B′ split into threefold degenerate t2g (dxy, dyz, dxz) and twofold degenerate eg (dx2−y2, dz2) states. While the strict degeneracy is often lifted by distortions of octahedra, the crystal field splitting remains significant and largely shapes the electronic properties of double perovskites. In addition to distortions, different sizes of A and B/B′ ions may give rise to octahedral tilts and rotations. As a result, various structural types can be formed, such as cubic Fm3̄m, tetragonal I4/m, monoclinic P21/n and I2/m, and rhombohedral R3̄ and R3. Both distortions and tilts play a major role in the physical properties, as they are strongly intertwined with the charge, orbital, and spin degrees of freedom.7
The magnetic exchange in double perovskites strongly depends on the atomic arrangement and the valence of B and B′ cations. Exchange interactions are contributed by different virtual electron transfer processes that can couple the t2g or eg states of B with the t2g or eg states of B′, giving rise to multiple – and often competing – processes. Two such processes are illustrated in Fig. 1, where the t2g states of B′ are coupled to the t2g states of the B cation on the right, and the eg states of B′ are coupled to the eg states of the B cation on the left. In this case, B and B′ ions in double perovskites interact strongly with each other, and the magnetism is driven predominantly by the B–B′ interaction. Electron transfer processes within each sublattice, B or B′, are also possible. If B′ has an empty d shell, the interaction within the B sublattice determines the magnetic order, despite the large separation between the nearest neighbors.
Transfer integrals are quantum-mechanical amplitudes describing the virtual electron transfer between a certain pair of orbitals. In the literature,8 the respective terms are called “onsite” if the pertinent orbitals belong to the same atom, and “transfer integrals” or “transfers” otherwise. Unfortunately, the knowledge of all transfer integrals between the magnetically active orbitals is not sufficient to evaluate the respective magnetic exchange: strong electronic correlations in the d shell lead to a many-body quantum problem whose generic solution is unknown. For specific electronic configurations, such as heavy d5 metals in an edge-sharing octahedral environment,9 perturbative treatments or solving the problem on a small cluster can deliver parameterized solutions, provided that interaction parameters and the strength of SOC are known. In all other situations, insights are limited to empirical assessments, such as the Goodenough-Kanamori rules, that are at best qualitative. Therefore, predicting the magnetic ground state is difficult even on a case-by-case basis, let alone in high-throughput calculations.
An alternative, more direct way of obtaining exchange integrals is performing density functional theory (DFT) calculations of different spin configurations. If only collinear arrangements are considered, the DFT total energies can be mapped onto a Heisenberg Hamiltonian and its classical ground state can be addressed by energy minimization. The treatment of noncollinear configurations is even more complicated: the inclusion of a full spin-density matrix and the necessary symmetry reduction leads to a considerably longer computational time.10 Therefore, screening the total energy of various configurations is almost impossible in a high-throughput fashion, especially for systems with metastable solutions that are sensitive to initially set magnetic moments and U values.11
Because no method can be universally applied to derive magnetic interactions, magnetic double perovskites have rarely been studied by high-throughput calculations. Machine learning12–14 assisted high-throughput computations show excellent potential for exploring physical properties, particularly when the understanding of the underlying mechanisms is elusive. Although most such studies lack a large amount of training data, a machine learning model supplied with experimental inputs or more in-depth features is capable of accurate predictions even for a small dataset.11,14–17 For example, a regression model with 157 experimentally known data points proposed 26 new two-dimensional ferromagnetic materials with a high Curie temperature, with a testing root-mean-square error (RMSE) of 174 K.11 Here, we explore representative magnetic features that can further improve the predictive power and compensate for the limited experimental sample size.
Presently, there are more than 400 ordered double perovskites known experimentally (including doped, high-pressure and high-temperature synthesized phases),18 and this number is sufficient for several machine-learning techniques. But at present, machine learning studies are either based on simple inputs (e.g. radius, valence difference, atomic mass, tolerance factor and so on)13 or total energies obtained for collinear configurations,10 limiting their predictive power. For more accurate assessments, it is crucial to resort to physically relevant inputs and accurate targets which determine the predictive precision of the machine learning.
In this work, we demonstrate that using the microscopic information contained in the transfer integrals in combination with the experimental information can drastically enhance the accuracy, leading to accurate predictions of the magnetically ordered state and the critical temperature. As shown in Fig. 1, by using 113 experimentally known double perovskites as the training dataset, we were able to predict the magnetic properties of 68 further double perovskites. This is done in two steps. First, a classification machine-learning model with atomic features was used to distinguish between antiferromagnets and ferromagnets. In the second step, we constructed a regression model which comprises the onsite energies and the transfer integrals between the first few neighbors; leading transfer integrals in the three channels – t2g–eg, t2g–t2g, and eg–eg – are included separately. Note that these microscopic terms underlie different exchange mechanisms, including direct exchange, superexchange and double exchange. All these terms were calculated by Wannierization of the nonmagnetic band structures of the respective materials. In this way, we obtained a model which predicts the ordering temperature with a RMSE of 18 and 61 K, for antiferromagnets and ferromagnets, respectively. One antiferromagnet and three ferromagnets with a high ordering temperature were identified. We also found 28 antiferromagnets combining a low transition temperature (≤50 K) with sizable transfer integrals (≥100 meV); these materials are likely magnetically frustrated and therefore may harbor exotic magnetic ground states. Since frustration is often assessed as the ratio of the Weiss temperature and the magnetic ordering temperature, we constructed an additional regression model to predict the Weiss temperature (RMSE = 76 K) and identify systems with a large frustration parameter. 
We obtained the largest ratio of 12 for cubic Ba2LaReO6; the sizable frustration was subsequently confirmed by a direct DFT + U + SOC calculation. This demonstrates that transfer integrals can be efficiently used in machine-learning models that aim at describing magnetic properties.
![]() | (1) |
Then, using the largest t2g–eg, t2g–t2g and eg–eg transfer integrals of the short-range connections together with the onsite energies as features, an AdaBoost regression model was fitted to the magnetic transition temperature (Tc) and the Curie–Weiss temperature (Θ). The dataset was randomly split into training and testing sets in an 80/20 ratio for validation.
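As a rough illustration of how such a feature vector could be assembled, the sketch below picks the largest-magnitude transfer integral in each of the three channels from a 5×5 d-orbital transfer matrix and appends the onsite energies. The orbital ordering, the function names and all numerical values are assumptions made for this example; they are not taken from the paper.

```python
# Assumed ordering of the five d orbitals in each 5x5 transfer matrix
# (a hypothetical convention chosen for this example).
T2G = (0, 1, 2)   # dxy, dyz, dxz
EG = (3, 4)       # dz2, dx2-y2


def leading_term(matrix, rows, cols):
    """Largest-magnitude element of a sub-block, returned with its sign."""
    return max((matrix[i][j] for i in rows for j in cols), key=abs)


def channel_features(matrix):
    """Leading t2g-eg, t2g-t2g and eg-eg transfer integrals of one bond (meV)."""
    mixed = max(leading_term(matrix, T2G, EG),
                leading_term(matrix, EG, T2G), key=abs)
    return [mixed, leading_term(matrix, T2G, T2G), leading_term(matrix, EG, EG)]


def feature_vector(bond_matrices, onsite_energies):
    """Concatenate the channel features of all bonds with the onsite energies."""
    features = []
    for matrix in bond_matrices:
        features.extend(channel_features(matrix))
    features.extend(onsite_energies)
    return features


# Made-up transfer matrix (meV) for a single B-B' bond, for illustration only.
example_bond = [
    [-225.0, 0.0, 0.0, 10.0, 0.0],
    [0.0, -220.0, 0.0, 0.0, 5.0],
    [0.0, 0.0, 30.0, 0.0, 0.0],
    [12.0, 0.0, 0.0, -40.0, 0.0],
    [0.0, -8.0, 0.0, 0.0, 60.0],
]
# Made-up onsite energies (meV) of the B and B' d shells.
example_onsite = [-1200.0, 350.0]

vector = feature_vector([example_bond], example_onsite)
print(vector)
```

Keeping the sign of the leading term, rather than only its magnitude, is a design choice of this sketch; whether the signed or absolute values serve better as features would have to be checked against the data.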
Fig. 2 Crystal structures of A2BB′O6 double perovskites crystallizing in various space groups: (a) Fm3̄m …
Our first task is to assess the ground states of 68 double perovskites whose magnetism remains unknown. Following earlier studies that addressed thermodynamic stability,31 we consider geometric atomic features (such as tolerance factor, ionic radius, and atomic number). These are supplemented by electronic atomic features such as the number of electrons in the outermost shell and the oxidation states of B and B′ atoms. To surmount the relatively small size of the data set, leave-one-out validation was used to evaluate the performance. We applied three extensively used multi-class classifiers: MLP, XGBoost and AdaBoost that yielded an accuracy of 63%, 73%, and 78%, respectively. Unfortunately, using microscopic parameters such as transfer integrals (which we do use for predictions of the ordering temperatures, see the next section) as features does not improve the performance of the classification models. We attribute this to the excessive dimensions of such features and the small size of the dataset. The AdaBoost model with the highest accuracy was further used to predict the magnetic order of the 68 remaining double perovskites. In this way, we identified 45 AFM, 7 FM, and 16 NM compounds depicted in Fig. 3 with red, blue, and gray squares.
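Leave-one-out validation itself can be sketched in a few lines. In the example below, a one-nearest-neighbour rule stands in for the classifiers named above purely to keep the code dependency-free; it is not the model used in this work, and the feature values and labels are invented.

```python
def nearest_neighbour_predict(train_x, train_y, query):
    """Label of the training sample closest to the query (squared Euclidean distance)."""
    best = min(
        range(len(train_x)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
    )
    return train_y[best]


def leave_one_out_accuracy(features, labels, predict):
    """Hold out each sample once, train on the rest, and count correct predictions."""
    hits = 0
    for i in range(len(features)):
        train_x = features[:i] + features[i + 1:]
        train_y = labels[:i] + labels[i + 1:]
        if predict(train_x, train_y, features[i]) == labels[i]:
            hits += 1
    return hits / len(features)


# Invented two-component features (e.g. a tolerance factor and a d-electron count).
toy_features = [
    [0.98, 1.0], [0.97, 1.0],   # labelled AFM
    [0.99, 5.0], [1.00, 5.0],   # labelled FM
    [0.93, 0.0], [0.92, 0.0],   # labelled NM
    [0.96, 4.0],                # labelled AFM, but closest to the FM samples
]
toy_labels = ["AFM", "AFM", "FM", "FM", "NM", "NM", "AFM"]

accuracy = leave_one_out_accuracy(toy_features, toy_labels, nearest_neighbour_predict)
print(f"leave-one-out accuracy: {accuracy:.2f}")
```

Because every sample is used for testing exactly once, the estimate makes full use of a small dataset, at the cost of refitting the model once per sample.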
To illustrate how such balance is realized in practice, we consider three Co2+-containing double perovskites: the cubic Ba2CoReO6, the tetragonal Sr2CoReO6, and the monoclinic La2CoIrO6. Since Re6+ (Ir4+) has one (five) electron(s) in the 5d shell, exchanges between B and B′ as well as within each sublattice have to be considered. Possible relevant exchanges in these three structure types are schematically illustrated in the left panel of Fig. 4, where the purple, green and orange solid lines represent B–B, B′–B′, and B–B′ exchanges, respectively. The respective directions in the crystal lattice are denoted by superscripts, which distinguish, for example, the exchange between B and B along the a axis from the exchange between B′ and B′ along the body diagonal. In the right panel of Fig. 4, we show the strongest transfer integrals for a given interatomic separation in each orbital sector; the color map spans the range between −300 (blue) and 300 (red) meV. For the cubic Ba2CoReO6, the three shortest Co–Re nearest-neighbor bonds are identical, with three large t2g–t2g transfer integrals (three blue squares in the third row) around −225 meV. Besides, two eg–eg Co–Co transfer integrals and two t2g–t2g Re–Re terms are around −100 meV.
For Sr2CoReO6 crystallizing in the I4/m space group, leading transfer integrals are three t2g–t2g (one blue and two red squares in the third row) of −167 and 193 meV and two t2g–eg (two red squares in the third row) around 203 meV of the three shortest Co–Re bonds, and
For the monoclinic La2CoIrO6, the largest components (100 to 200 meV) also pertain to the short-range B–B′ connections. At the same time, both t2g–eg and t2g–t2g channels are active, as evidenced by five blue and one red squares in the third row of Fig. 4 (right). In addition, there are four eg (t2g) transfer integrals operating within the Co (Ir) sublattice. The structure of these matrices determines the sign and the strength of the respective magnetic exchanges.
The leading transfer integrals are complemented by the onsite energies of B and B′ d-orbitals. The resulting model is trained on a dataset of experimental transition temperatures (Tc), which is separated into 21 ferro- and 66 antiferromagnets. To assess the predictive power, we randomly divided each set following the 80/20 rule for the training/testing. As shown in Fig. 5, the training data set of antiferromagnets yields very high accuracy, which allows us to achieve the root mean square error (RMSE) of 18 K for the testing data set. In contrast, RMSE is sizable (around 61 K) for ferromagnets, which can be traced back to the small size of the dataset and the broad distribution of ordering temperatures.
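The 80/20 protocol and the error metric quoted above can be sketched as follows. The helper names and the fixed random seed are choices made for this example; only the set sizes (66 antiferromagnets, 21 ferromagnets) come from the text.

```python
import math
import random


def split_80_20(samples, test_fraction=0.2, seed=0):
    """Shuffle the samples and return (training, testing) lists."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def rmse(predicted, observed):
    """Root-mean-square error between predicted and observed values."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed must have the same length")
    squared = sum((p - o) ** 2 for p, o in zip(predicted, observed))
    return math.sqrt(squared / len(observed))


# Sample indices stand in for the 66 antiferromagnets and 21 ferromagnets.
afm_train, afm_test = split_80_20(range(66))
fm_train, fm_test = split_80_20(range(21))
print(len(afm_train), len(afm_test), len(fm_train), len(fm_test))
```

With only a handful of ferromagnets in the testing set, a single poorly predicted compound can dominate the RMSE, which is in line with the larger error reported for that class.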
These two pre-trained models are further used to predict the transition temperatures of the 45 prospective antiferromagnets and 7 prospective ferromagnets. Among them, we find antiferromagnetic Ba2TbReO6 and three ferromagnets with high predicted critical temperatures: −51 K for Ba2TbReO6 (Fm3̄m), 172 K for Bi2NiMnO6 (P21/n), 209 K for Ba2FeMoO6 (Fm3̄m) and 99 K for Ca2TiMnO6 (I4/m). While the predicted ordering temperature (209 K) of Ba2FeMoO6 is noticeably lower than in the recent experiment (345 K32), our model correctly identifies this material as a high-temperature magnet. In view of this deviation, we tested whether the regression models are at risk of overfitting by conducting different validations based on resampling. As shown in Fig. S1,† the robust behavior of the model for antiferromagnets against overfitting gives us hope that the deviations in the model for ferromagnets can be overcome if more samples are added to the training dataset.
To identify relevant candidates, we collected the experimentally reported Curie–Weiss temperatures (Θ) of 63 known antiferromagnetic double perovskites and used the transfer integrals to train the regression ML model (leaving, as before, 20% for testing). In this way, we obtain a RMSE of 76 K for the testing data set (Fig. 6). Next, we calculate Θ for each of the 45 prospective antiferromagnets. As shown in Fig. 7, most monoclinic double perovskites, such as Ba2LaRuO6 (Tc = −30 K and Θ = −127 K), lie near the f = 5 isoline. At the same time, many cubic double perovskites lie in a more frustrated regime. In particular, three Re-based compounds are predicted to have f > 10: Ba2LuReO6 (Tc = −43 K and Θ = −485 K), Ba2EuReO6 (Tc = −50 K and Θ = −572 K) and Ba2LaReO6 (Tc = −42 K and Θ = −521 K). We note that the predicted temperatures for Ba2LuReO6 are consistent with recent experimental work33 reporting Tc = −31 K and Θ = −678 K.
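The frustration parameter can be recomputed directly from the predicted temperatures quoted above. The sketch assumes the common definition f = |Θ|/|Tc|; the function name is hypothetical, and absolute values are taken because both temperatures are quoted with a negative sign.

```python
def frustration_parameter(theta_k, t_order_k):
    """Frustration parameter f = |Theta| / |Tc| (assumed definition)."""
    if t_order_k == 0:
        raise ValueError("ordering temperature must be non-zero")
    return abs(theta_k) / abs(t_order_k)


# Predicted (Tc, Theta) pairs in kelvin, as quoted in the text.
predicted = {
    "Ba2LaRuO6": (-30.0, -127.0),
    "Ba2LuReO6": (-43.0, -485.0),
    "Ba2EuReO6": (-50.0, -572.0),
    "Ba2LaReO6": (-42.0, -521.0),
}

f_values = {name: frustration_parameter(theta, tc)
            for name, (tc, theta) in predicted.items()}
for name, f in sorted(f_values.items(), key=lambda item: item[1]):
    print(f"{name}: f = {f:.1f}")
```

With these inputs, the three Re-based compounds come out above f = 10 and Ba2LaReO6 has the largest ratio, while monoclinic Ba2LaRuO6 stays below the f = 5 isoline.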
Fig. 6 Regression ML model predicted as well as experimentally reported Weiss temperature Θ of AFM candidates. The training and testing dataset are green and blue dots, respectively.
For an independent assessment of the accuracy, we consider the cubic Ba2LaReO6 and perform DFT + U + SOC calculations for the AFM configuration commonly observed in experiment34 as well as for the ferromagnetic configuration. The energy of the antiferromagnetic configuration is lower by 151.7 meV/f.u., indicating that the relevant Re–Re exchanges are sizable (110 K assuming S = 1) and antiferromagnetic, which is in line with the large (−154 meV, see Fig. 8) transfer integrals in the t2g channel. We note that in the absence of magnetic moments on B atoms, B′ moments (localized on Re) form a face-centered cubic lattice, which is geometrically frustrated.34
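To see how an energy difference of this size translates into an exchange of roughly 110 K, the sketch below maps the ferromagnetic and antiferromagnetic energies onto a classical Heisenberg model. Every modelling choice here is an assumption made for illustration: a single nearest-neighbour exchange J on the fcc lattice, type-I antiferromagnetic order (4 parallel and 8 antiparallel neighbours), S = 1, and a Hamiltonian in which each bond is counted twice. The formulae actually used in this work are given in the ESI and may differ.

```python
K_PER_MEV = 11.604518  # 1 meV divided by the Boltzmann constant, in kelvin


def fcc_type1_exchange(delta_e_mev, spin=1.0, bond_count=2):
    """Nearest-neighbour J (in K) from E_FM - E_AFM per magnetic site (in meV).

    bond_count=2 means that each bond enters the Hamiltonian twice
    (sum over i != j); bond_count=1 counts each bond once.
    """
    z_parallel, z_antiparallel = 4, 8            # type-I order on the fcc lattice
    e_fm = (z_parallel + z_antiparallel) / 2     # per site, in units of J*S^2
    e_afm = (z_parallel - z_antiparallel) / 2    # per site, in units of J*S^2
    coefficient = bond_count * (e_fm - e_afm) * spin ** 2
    return delta_e_mev * K_PER_MEV / coefficient


j_kelvin = fcc_type1_exchange(151.7)
print(f"J = {j_kelvin:.0f} K")
```

Under these assumptions the energy difference of 151.7 meV per formula unit (one Re per formula unit) reproduces the quoted scale of about 110 K; counting each bond only once would double the estimate, which shows how strongly the number depends on the Hamiltonian convention.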
Fig. 8 Transfer paths and transfer integrals of Ba2LaReO6 and Ba2LaRuO6. To be consistent with the visual representation of Fig. 4, we show B–B and B–B′ parts despite the absence of a partly filled electronic shell in La3+. Parameters whose absolute value is around 100 meV or larger are labeled; other terms are denoted by color.
For comparison, we perform a similar DFT + U + SOC calculation for the monoclinic Ba2LaRuO6. Since the low symmetry generates several independent terms, more supercells with different magnetic configurations (ferromagnetic and six antiferromagnetic) are needed. Total energies for all magnetic configurations are listed in the ESI.† By mapping these energies onto a Heisenberg model, we estimate the short-range exchange integrals Ja, Jb, Jd and Jd′. For simplicity, the difference between Jd and Jd′ is neglected; this is justified by the similarity of crystalline environments pertaining to both exchange pathways.35 The resulting Ja, Jb, and Jd = Jd′ are 20, 11 and 11 K, respectively, with an accuracy of 0.5 K. These antiferromagnetic exchanges mainly stem from electron transfer in the t2g channel; due to the smaller spatial extent of 4d orbitals compared to 5d, these transfer integrals are much weaker than in Ba2LaReO6.
We identify Ba2TbReO6 as an antiferromagnet with a high magnetic ordering temperature of about 50 K, and three prospective ferromagnets (Bi2NiMnO6, Ba2FeMoO6, and Ca2TiMnO6) with ordering temperatures of 172, 209, and 99 K, respectively. We additionally identified 28 prospective frustrated antiferromagnets that combine sizable transfer integrals (100 meV and larger) with a low ordering temperature (below 50 K). Detailed total-energy calculations of magnetic supercells support this conjecture for Ba2LaReO6.
In addition to providing new candidate double perovskites with a high magnetic transition temperature or sizable magnetic frustration, our work offers new insights for machine learning assisted high-throughput calculations. Estimation of transfer integrals is becoming a routine task that can be done in a high-throughput fashion. These microscopic parameters underlie the electronic and magnetic properties, but quantitative information on the magnetic properties can be obtained from analytical or perturbative expressions only in several specific cases.9,36,37 Machine learning methods can be an appealing alternative to such analytical approaches because of their potential to capture the elusive, yet inherent link between the transfer integrals and physical observables.
Footnote
† Electronic supplementary information (ESI) available: Full lists of experimental and machine learning predicted magnetic transition temperature, Weiss temperature and frustration parameters; detailed formulae of DFT + U + SOC calculations. See DOI: https://doi.org/10.1039/d3ta05679a
This journal is © The Royal Society of Chemistry 2024