Ralf
Wanzenböck
a,
Esther
Heid
a,
Michele
Riva
b,
Giada
Franceschi
b,
Alexander M.
Imre
b,
Jesús
Carrete
c,
Ulrike
Diebold
b and
Georg K. H.
Madsen
*a
aInstitute of Materials Chemistry, TU Wien, 1060 Vienna, Austria. E-mail: georg.madsen@tuwien.ac.at
bInstitute of Applied Physics, TU Wien, 1040 Vienna, Austria
cInstituto de Nanociencia y Materiales de Aragón, CSIC-Universidad de Zaragoza, 50009 Zaragoza, Spain
First published on 16th September 2024
The investigation of inhomogeneous surfaces, where various local structures coexist, is crucial for understanding interfaces of technological interest, yet it presents significant challenges. Here, we study the atomic configurations of the (2 × m) Ti-rich surfaces at (110)-oriented SrTiO3 by bringing together scanning tunneling microscopy and transferable neural-network force fields combined with evolutionary exploration. We leverage an active learning methodology to iteratively extend the training data as needed for different configurations. Training on only small well-known reconstructions, we are able to extrapolate to the complicated and diverse overlayers encountered in different regions of the inhomogeneous SrTiO3(110)-(2 × m) surface. Our machine-learning-backed approach generates several new candidate structures, in good agreement with experiment and verified using density functional theory. The approach could be extended to other complex metal oxides featuring large coexisting surface reconstructions.
Machine-learned force fields (MLFFs) are an increasingly popular tool that can be used to explore surface structures, including those of metal oxides. Method development and their applications have been advancing in parallel, with innovative and powerful models synergizing with established and proven methods. For example, moving from neural-network force fields that utilize precomputed invariant descriptors1–3 to adopting equivariant message passing networks4–6 has enabled more data-efficient and transferable MLFFs. Modern applications include foundation models trained on a wide range of materials,7 transferable water potentials,8 and condensed phase chemistry.9
Here, we utilize MLFFs to explore the surface reconstructions of strontium titanate (SrTiO3), a perovskite oxide used as a model system for the development of many technological applications, including optoelectronics, catalysis, memory devices, and photovoltaics.10–13 SrTiO3 is also exciting from a fundamental standpoint: It exemplifies the richness of bulk, surface, and interface properties that can be accessed within a single perovskite material: donor doping by chemical impurities,14,15 oxygen vacancies,14,16,17 or field effects15,18 can turn it into an insulator, a metal, a superconductor or even induce confined metallic behaviour in the form of 2D electron gases. The diversity extends to the atomistic details of the surface,19 where a variety of composition-related, polarity-compensating reconstructions have been found for the (001), (110), and (111) orientations.20–27 Pinpointing the regions of stability of such reconstructions and gaining a deep understanding of the atomic-scale details of their surfaces is essential for designing SrTiO3-based systems with tailored functionalities.
Many studies exist that explore the atomic details of the surface of single-crystalline SrTiO3 samples under ultra-high vacuum (UHV) conditions. It is known that specific surface reconstructions are difficult to reproduce and can depend on sample history and preparation conditions.19 Notably, scanning tunneling microscopy (STM) studies of SrTiO3 surfaces frequently reveal the coexistence and even intermixing of multiple surface structures,21,26,28 a feature common to the surfaces of other complex perovskites such as BaTiO3 and (La, Sr)MnO3 (ref. 29 and 30). The variety of coexisting surface reconstructions and their dependence on sample history underlines the necessity of changing the framing from identifying a single specific reconstruction to mapping out the range of possible reconstructions. This diversity can serve as an ideal showcase of the power of MLFF-supported stochastic searches, which, in turn, could be applied to the exploration of surface structures of further, technologically relevant materials.
In particular, the enhanced accuracy and reliability of MLFFs facilitate the application of stochastic algorithms for structural exploration of materials.31–36 Stochastic approaches require a substantial volume of calculations and are impracticable with ab initio methods such as density functional theory (DFT) as the backend. This holds especially true for large and complex systems, such as surface reconstructions of multi-element compounds. Such systems usually feature a complex energy surface with too many degrees of freedom to explore exhaustively, as well as many local minima, where local searches for an optimal structure largely depend on the initial geometry of the search. Given that stochastic searches produce more diverse structures than, e.g., molecular dynamics, a transferable, robust, and generalizable force field trained on a diverse dataset is key to their success.
An accurate MLFF is, however, just one part of the toolbox necessary to build a robust and efficient workflow for structure searches. The design of an MLFF can enhance or restrict its transferability and any MLFF has the potential to emit infinitely diverse mispredictions. This is especially significant for stochastic searches, which, by design, tend to move into regions that the model was not trained on. Extrapolation happens almost surely in high-dimensional models,37 and therefore is not, by itself, an indicator of poor performance. Research has hence focused on estimates of uncertainty as proxies for the error incurred by using a given MLFF.38–42 When that error is suspected to exceed tolerable margins, retraining with an expanded training set can help extend the applicability of the MLFF. In such scenarios, an efficient algorithm must aim toward issuing only as many ab initio calculations as required, while preventing waste of resources on redundant or irrelevant configurations.43,44 Thus, the need for an uncertainty metric to evaluate the quality of the results dovetails with the usefulness of such metrics for identifying or even generating optimally informative new configurations.33 This makes stochastic structure searches naturally part of the domain of application of active learning (AL) workflows, where an informed data selection is achieved through uncertainty estimation.45
This work focuses on the less well-understood Ti-rich surface reconstructions at the (110) orientation of SrTiO3. Literature describes numerous composition-related SrTiO3(110) surface reconstructions that can be broadly grouped into two families, characterized by Ti-poor (n × 1) and Ti-rich (2 × m) overlayers on an otherwise unchanged bulk.26 Here, n and m denote the number of (1 × 1) bulk unit cells covered in the [001] and [10] directions, respectively. Fig. 1 shows examples of both variations. Ti-rich overlayers pose the particular challenge that, even when the Ti-to-Sr ratio is controlled,26 numerous reconstructions lacking corresponding DFT models can coexist. Here, we again observe pronounced inhomogeneity in new STM measurements and are able to identify varied reconstructions (see Fig. S1 of the ESI†). We mitigate the lack of suitable atomistic models of such inhomogeneous surfaces by combining an evolutionary search algorithm with a transferable MLFF to identify valid candidate structures. Transferability, in particular, is an important prerequisite for minimizing the computational cost associated with generating training data. We demonstrate that by utilizing small, well-known reconstructions and implementing a careful data selection routine built on structural and spatially-resolved local uncertainties, we can arrive at an MLFF capable of extrapolating to larger, more complex structures. In the following, we first discuss the active learning approach. We then proceed to show how structural models reproducing the STM images can be systematically obtained for all the coexisting surface structures.
![]() | ||
Fig. 1 Top view of SrTiO3(110)-(n × m) surface reconstructions that are used for MLFF training: previously explored (4 × 1),22 (2 × 2) and (2 × 3)b,25 and newly identified (2 × 3)c and c(2 × 6) from this work. O atoms are shown in red, Sr in green and Ti in blue. The blue polyhedra depict TiOx polyhedra in all panels. Colored lines indicate the borders of unit cells. For (e) only half the unit cell is outlined. See Fig. S10 of the ESI† for the full unit cell and additional side views. (n × 1) overlayers exclusively contain tetrahedrally coordinated TiO4 units, while SrTiO3(110)-(2 × m) surface reconstructions are predominantly composed of octahedrally coordinated TiO6 units and in experimental observations include at least one Sr atom per unit cell.19,25 |
We started from the basis of our previous results on a Ti-poor reconstruction of SrTiO3(110), namely the (4 × 1) (see Fig. 1(a)).35 There, overlayers were symmetrically set up on opposite sides along the surface normal, attached to bulk-like layers in between. All structures in this study are constructed in the same manner as introduced in ref. 35. We constructed a database of 495 structures, re-evaluated using VASP47 with the r2SCAN functional.48 This initial DFT database is illustrated in Fig. 2 together with a 2D projection of the spherical Bessel descriptors of the local environments of Sr atoms obtained using the uniform manifold approximation and projection (UMAP) method for dimension reduction.49
Using these data we trained a ten-member committee based on the descriptor-based NeuralIL architecture.3,39 In committees the uncertainty is approximated by training a set of models that vary by initialization seed, hyperparameters, architecture, or training data, and monitoring their disagreement on a prediction to obtain the model variance. The majority of the computational cost incurred when training a descriptor-based model can be attributed to the calculation of the descriptors and the associated vector-Jacobian product operator. In this study, we vary the initial seed to enable NeuralIL's particularly efficient committee implementation that reuses descriptor encodings for all members, so that committees needed for uncertainty estimation can be trained with a negligible performance penalty.39
We then generated CMA-ES trajectories for the (2 × 2), (2 × 3)a and (2 × 3)b reconstructions, see Fig. 1. The CMA-ES46 samples a population of λ individuals, x(g)k, k = 1, …, λ, for every generation g from the multivariate normal distribution
![]() | (1) |
We then iteratively added structures from the MLFF backed CMA-ES trajectories using the committee uncertainty estimate aggregated structure-wise39,42
![]() | (2) |
The structures added to the training data are illustrated in Fig. 2 (top row) using the 2D UMAP projection. The background shows the distribution of all SrTiO3(110) data from the final dataset through hexagonal binning. The foreground of subplots (a)–(c) depicts the local Sr environments of the individual (n × 1), (2 × 2) and (2 × 3) data sets, with the colors indicating the distance of each Sr atom from the center of the surface slab. It can be seen how the data from each reconstruction used for the initial dataset contributes their own distinct environments. Not surprisingly, the Sr in the (2 × 3) overlayer reconstructions, depicted as green points in the top left of Fig. 2(c), is particularly distinctive, since it did not occur in the (n × 1) or (2 × 2) training data. In the bottom row of Fig. 2, the workflows used for generating the data depicted directly above them are illustrated, with (b) and (c) generated by different iterations of the same workflow.
We then performed exploratory CMA-ES searches on the (2 × 3) surface with population size λ = 100, and varying the initial step size in the range of σ(0) ∈ [0.1, 0.5]. To expand the search space, we developed a more generic founder structure (pictured in Fig. S7 of the ESI†), rather than relying on published findings. From these searches, we identified the new SrTiO3(110)-(2 × 3)c reconstruction, Fig. 1(c). A key feature of this structure is the alignment of the overlayer Sr atoms relative to the topmost TiOx rows. Geometry optimizing the (2 × 3)b and (2 × 3)c structures using VASP, reveals that the new (2 × 3)c has a lower energy of ΔE = 160 meV per (1 × 1) bulk unit cell.
While two of the 35 initial CMA-ES searches produced the new stable configuration, a majority of these evolutionary searches were found to be prone to instability, specifically to the expulsion of an Sr atom (see inset structure in Fig. 3). Although problematic structures could be identified manually, an active learning procedure needs to be able to identify and incorporate such structures into the training data based on computed quantities such as model uncertainties. Interestingly this behaviour was not reflected in the aggregated structure uncertainty s, eqn (2), as shown with the red line in Fig. 3. We therefore calculated spatially resolved atomic uncertainties by aggregating over neighboring atoms within a cutoff radius42 (in the following set to 4 Å) instead of over the entire structure
![]() | (3) |
The local uncertainties, eqn (3), clearly identified the misinterpretation of unphysical local structures which led to escalating errors during the evolution and are thus a reliable indicator for atoms being expelled from the surface. A visual representation of the evolution of local and global uncertainty within a CMA-ES run is given in Fig. 3. The solid black line tracks the local uncertainty associated with the single Sr atom in the overlayer, slocalSr. Notably, slocalSr begins to increase after generation 70 and exceeds three times the standard deviation of the local uncertainties (3σs) after generation 90. This behavior is further illustrated by the atomic structure shown in the inset: at generation 160 even the configuration with the lowest energy features a bright yellow sphere, indicating the high local uncertainty in the force estimate for the Sr atom. The rise in local uncertainty corresponds to environments that the model is increasingly uncertain about but which are localized enough for them to have a low weight in the globally aggregated uncertainty, eqn (2). Once an atom is separated from the rest by more than the cutoff radius of the underlying MLFF model, its contribution to the energy and forces, and thus also to the local and global uncertainty, becomes zero.
In an additional AL step we then identified trajectories where the local uncertainty associated with at least one atom j exceeded three standard deviations of all local uncertainties (sjlocal > 3σs). We randomly sampled structures from 40 CMA-ES evolution trajectories, with 32 of these structures exhibiting similar local uncertainty behaviour as shown in Fig. 3. The sampling was performed uniformly but was restricted to intact surface slabs, meaning that generations following Sr expulsion were excluded. The 2D UMAP of the local Sr descriptors within these structures is shown in Fig. 2(d), highlighting the added diversity that was achieved. With these additional 392 structures, the full training set consisted of 2618 configurations. This was used to train the final MACE model which was utilized for all further CMA-ES runs. The complete database and trained model are made available on Zenodo.50
Surface slabs were set up as illustrated in Fig. S8.† For all system sizes an anchor region of fixed atom positions was defined at the center of each slab, consisting of bulk-like layers that remained unchanged. The Cartesian coordinates of the atoms in the over- and attachment-layers are the degrees of freedom, i.e., the individuals x(g)k, sampled from the multivariate normal distribution. Opposite sides of all slabs were made symmetric. Further symmetry elements (e.g., mirror planes) were leveraged where feasible to drastically reduce the number of degrees of freedom in larger unit cells.
With the majority of hyperparameters set to default values, MACE trainings were performed with a cutoff radius rmax = 4.0 Å and two hidden layers set to 128 channels for scalar and vector properties each. The maximum number of epochs was set to 1200 with an early stopping patience of 50, and energy and force weights of 1 and 100, respectively. Afterwards, training was run for an additional 300 epochs with an increased energy weight of 1000 and an unchanged force weight.
Notably, as mentioned in the workflow description, training and test data are generated in a manner that allows the resulting MLFF to serve as the backend for CMA-ES runs, instead of DFT codes. This means that the MLFF needs to be able to reliably evaluate configurations far from the equilibrium, which is achieved by utilizing the CMA-ES itself for data generation. At each step, training and test data belong to the same distribution.
To eliminate unusable or obstructive data, all structures with force components larger than 500 eV Å−1, as well as those that do not converge with the chosen parameters, are excluded.
The computation of one CMA-ES trajectory starting from a founder containing 450 atoms and running for up to 1000 generations, with population size λ = 100, required only between one and three hours on one NVIDIA A40 GPU with 46 GiB memory when utilizing MACEfull, depending on early stopping. For that reason, it was possible to freely explore various founders to then select highlights for further investigation using DFT.
After a few cycles of sputtering–annealing (6 × 10−6 mbar Ar, 1 keV, 5–10 μA, 10 min, followed by 1 h at 1000 °C, 6 × 10−6 mbar O2), the surface cleanliness was verified through XPS and STM. The surface stoichiometry was then adjusted via submonolayer deposition of Sr (via molecular-beam epitaxy)54 or TiO2 (via PLD).55 The resulting surface periodicity was verified by LEED and STM. The surface presented in this work was obtained starting from a mixed (4 × 1)/(5 × 1) reconstruction. 1.4 ML Ti was deposited in PLD by keeping the sample at 580 °C in a background oxygen pressure of 6 × 10−6 mbar O2, followed by ramp down at 80 °C min−1.
STM images were acquired in constant-current mode with homemade, electrochemically etched W tips. The tips were prepared in situ by Ar sputtering (1 μA, 30 min). Voltage (up to 10 V) or current pulses (up to 10 nA) were applied while in tunneling contact to reshape the tip and improve resolution. Positive bias voltages correspond to tunneling into the empty states of the sample.
The UMAP in the upper row of Fig. 2(e) illustrates the variety of local Sr environments encountered in randomly selected structures chosen from early generations (10, 25, and 50) along these trajectories (red crosses). In comparison, the local environments in geometry-optimized structures of different unit cell sizes are clearly more uniform (black crosses). Throughout all CMA-ES searches, spatially resolved local uncertainty, along with the loss, served as an indicator for structure stability. Importantly, after adding the structures from the exploration-based active learning in the (2 × 3) cell, Fig. 2(d), to the training data, the MLFF learned to avoid regions leading to the previously observed overlayer instability. Fig. S4 of the ESI† shows uncertainty trajectories for (2 × 3), (2 × 4), and (2 × 5) runs, where no local uncertainty exceeds 3σs. Of particular interest is the (2 × 3) trajectory, which still illustrates that the Sr environment is the most uncertain, but does not escalate anymore (compare to Fig. 3). Importantly, this demonstrates the transferability of the model when extrapolating to reconstruction with larger unit cells.
With this approach, we were able to discover the new candidate structures shown in Fig. 4, namely (2 × 4)d (yellow), (2 × 4)e (orange), c(2 × 8) (blue), and (2 × 5)c (white). All of these structures were then relaxed with VASP and will be compared to experiment in the following. They are available on Zenodo.50
![]() | ||
Fig. 4 Top view of SrTiO3(110) overlayer candidates. (a) and (b) with (2 × 4) bulk periodicity. (c) With centered (2 × 8) periodicity and (d) with (2 × 5) periodicity. The orange and white rectangles and the blue rhombus indicate (parts of) the respective unit cells. Only half of the c(2 × 8) unit cell is outlined. See Fig. S11 of the ESI† for the full picture. |
![]() | ||
Fig. 5 Different regions of the same STM measurement of Ti-rich SrTiO3(110), showing (2 × 4) (orange and yellow), c(2 × 8) (blue), and (2 × 5) (white) motifs. The colored dashed lines mark the border of simulated images, overlaid with 50% transparency, colored solid lines represent the model unit cells or parts thereof in the case of c(2 × 8). Imaging parameters: Vsample bias = +1.8 V, I = 0.04 nA. Simulated images were created using the Tersoff–Hamann approximation.56 Alternative versions of this figure, including without overlays, with fully opaque simulation images or the topmost atoms overlaid, as well as a larger cutout from the experimental image, are shown in the ESI as Fig. S1.† |
In experiments, it is rare to encounter a surface composition that precisely matches the stoichiometry of a given thermodynamically stable surface reconstruction. Rather, the average composition observed in experiments often lies between two specific reconstructions, implying the coexistence of these reconstructions on the surface. Additionally, kinetic limitations may lead to the coexistence of even more surface reconstructions, with local variations in composition, while preserving the overall average surface composition.
To facilitate the investigation of these coexisting reconstructions, founder structures for SrTiO3(110)-(2 × 4) and -(2 × 5) cells were created as described in Section S3 of the ESI.† In short, they were generated by varying initial atom positions and adjusting the stoichiometry (addition or removal of TiO2 units). Details regarding the evolutionary searches, including the number of runs, population sizes λ, and step sizes σ are summarized in the Technical details. All newly proposed structures were tested by comparing the corresponding simulated STM images with the experimental data in Fig. 5. Important criteria for matching were the position of the Sr adatoms and their relative alignment to the TiO ridges.
For (2 × 5) systems, the initial placement of “TiO2 vacancies” resulted in distinct founders, with the vacancies positioned either in-line or out-of-line relative to the overlayer Sr in the [10] direction. All sensible configurations resulting from these founders yielded significantly higher energies than the (2 × 5)b from ref. 25. However, an alternative candidate structure, (2 × 5)c (see Fig. 4), could be identified due to its distinct features. There, in contrast to (2 × 5)b, the overlayer Sr atom is aligned with a TiO ridge rather than being offset. While the energy difference between the two is ΔE = 208 meV per (1 × 1) bulk unit cell in favor of (2 × 5)b, the new (2 × 5)c clearly fits regions of the inhomogeneous surface, as shown in Fig. 5.
The investigation of SrTiO3(110)-(2 × 4) identified two stable surface structures, labeled (2 × 4)d and (2 × 4)e, both shown in Fig. 4. Although the difference in DFT energies between them is vanishingly small – only 2 meV per (1 × 1) bulk unit cell in favor of (2 × 4)d – the arrangement of the overlayer atoms is distinct. The most noticeable differences include the relative position of the overlayer Sr atom with respect to the TiO ridges and the resulting positional changes. Additionally, the centered unit cell c(2 × 8) was found as a candidate structure for explaining regions on the STM measurement showing a shift between Sr positions.
The newly proposed structures (2 × 4)d, (2 × 4)e, c(2 × 8), and (2 × 5)c thus provide previously missing atomistic models, which we successfully matched to the various experimentally observed motifs.
To fine-tune the training data in a further AL step, and thereby enhance model performance, we employed spatially resolved uncertainty estimation to identify underrepresented local environments which global uncertainty measures had failed to resolve. The resulting MLFF, MACEfull, was trained on 2618 structures spanning SrTiO3(110)-(n × m), n ∈ {4, 5}, m ∈ {2, 3}.
We successfully identified two not previously reported candidates for stable (2 × 3) reconstructions. These structures were then used to extrapolate to (2 × 4) and (2 × 5) founder structures for evolutionary exploration. With this approach, we found new stable candidate structures for SrTiO3(110)-(2 × 4) and -(2 × 5), explaining different experimentally observed regions of the inhomogeneous Ti-rich surface. This method could be extended to other multi-element oxides featuring complex, composition-related, and possibly coexisting surface reconstructions characterized by large motifs.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4dd00231h |
This journal is © The Royal Society of Chemistry 2024 |