Zixuan Wanga,
David E. Oliverb,
Andrew J. Bissellb,
Gylen Odlingb,
Colin R. Pulham
a and
Carole A. Morrison
*a
aSchool of Chemistry and EaStCHEM Research School, University of Edinburgh, The King's Buildings, David Brewster Road, Edinburgh, EH9 3FJ, UK. E-mail: c.morrison@ed.ac.uk
bSunamp Ltd., Macmerry Business Park, 1 Satellite Park, Tranent, East Lothian, EH33 1RY, UK
First published on 23rd September 2025
We present a high-throughput data-driven workflow to identify potential heterogeneous nucleating agents from structural databases for phase change materials, such as ice. Our model evaluates the fit between ice Ih and nucleator docked slabs, considering Miller index planes up to (333), thus addressing some of the structural complexities in nucleation by examining crystal morphology features. Bulk water immersion experiments on a set of ten known nucleators set a delineating temperature to distinguish between good and poor nucleation behaviour, which helped derive numerical tolerance limits to allow reliable differentiation on the basis of the number of predicted matching interface models. We then used our algorithm to screen 3500 simple metal oxides and halides taken from the inorganic chemistry structural database (ICSD), and show that just 7% of the former and 3% of the latter were predicted to nucleate ice on the basis of geometric slab matching alone. Subsequent experimental testing of 22 compounds suggested a 64% correct prediction rate, and identified four new ice nucleators (CeO2, WO3, Bi2O3, Ti2O3). Inspired by the ice-nucleating efficiency of copper oxides, we also tested copper tubing with local tap water, and observed sub-cooling suppression, most likely due to copper oxide buildup. Although based on a simple geometric interface matching model, this approach offers an efficient route as a first stage high throughput screen for potential heterogeneous nucleating agents.
The available literature testifies to the complex nature of heterogeneous ice nucleation, with papers citing the importance of crystallographic similarities,14,15 surface chemistries,7,14,16–18 topologies,19 water structuring effects and adsorption strengths,20–22 and suspended solid or liquid particle sizes,23–25 with experimental conditions that range from macroscopic observations on ice formation in the atmosphere,18,25 to those performed in ultra-clean materials chemistry labs.26 Moreover, it is known that different nucleation pathways exist, depending on whether the nucleating particles are immersed in liquid water or suspended in a supersaturated vapour.27,28 The broadness of the field, combined with the variations in experiments conducted, results in variable reporting of ‘good’ or ‘poor’ ice-nucleating ability.
From a theoretical perspective, early reports8,29 often attributed ice-nucleating ability with a zero-lattice mismatch registry, i.e. a close similarity between the unit-cell dimensions of the nucleator and a particular face of the hexagonal phase of ice (ice Ih), typically defined as the basal plane (0001).30 While this has proven effective to account for the well-known ice-nucleating properties of simple compounds such as AgI,8 it is now widely accepted as an over-simplification,31 not least because it does not take into account the chemistry of the nucleator/ice-forming interface. Computer simulation has made significant inroads into providing insights at the atomic level for heterogeneous32 and homogeneous33 ice nucleation. In particular, work by Michaelides et al. has highlighted the importance of understanding surface hydrophobicity, morphology and the variation in the adsorption energy landscape,14,17 as well as considering how ordered water molecule layers build up on a nucleating substrate,5,17 and how the density of the liquid water reduces near the surface.22 More recently, machine-learning techniques trained on images of water-contact layers and the resulting prediction model (IcePic) have demonstrated success at accurately and rapidly predicting heterogeneous ice-nucleating behaviour.24
Herein we have taken a different approach that looks to take advantage of the wealth of potential heterogenous nucleators available through databases such as the international centre for diffraction data (ICDD)34 and the inorganic chemistry structural database (ICSD).35 We seek to generate a geometric docking model that assesses the quality of fit between ice Ih/nucleator docked slabs cleaved along Miller index planes from the respective bulk crystal lattices. While this has similarities to the zero-lattice mismatch approach, it goes beyond the low-index planes to consider the docking of all interfaces (both nucleator and ice Ih) described by the Miller indices up to (333). In this way, we are addressing some of the structural complexity of the nucleation process by considering crystal morphology, where ice crystallites could seed on the faces, edges, corners, defects or other surface features of the nucleating crystal that could be described by these higher Miller-index planes. While this study focuses on ice nucleation in bulk water, our overarching goal is to build a generalisable high-throughput framework for predicting heterogenous nucleation agents for any given phase change material.
Given the variation in the literature regarding experimental set up, our study began with establishing our own experimental benchmarking, via bulk water immersion experiments, on a set of ten widely known effective or poor nucleators for ice that we could readily source. We then derived a data-driven approach capable of identifying new heterogeneous crystal nucleators using geometric interface matching, where the quality of fit between ice Ih/nucleator docked slabs cleaved along Miller index planes from the respective bulk crystal lattices are assessed and ranked. By tightening a set of geometric criteria that describe the fit of the docked nucleator and ice Ih cut planes, the number of matching slab interfaces that remain can act as a guide to the likely classification of a good or poor nucleator for ice Ih. On this basis, we then screened the ICSD for several thousand simple metal oxide and halide structures. Testing the predicted outcomes for 22 compounds showed a 64% success rate. Our procedure has also lead to the discovery of four compounds, along with standard copper tubing, that can act as ice nucleators under immersion conditions.
The identity and phase purity of all compounds were verified by powder X-ray diffraction measurements, through cross-referencing against known structures in the ICDD34 database, PDF-5+ (see SI, Section S1). These crystal structures from the database were extracted as crystallographic information files (CIFs) to be used as input structures for our interface matching workflow, which is described in the following section.
Results from the immersion experiments, conducted according to the Experimental Methods, are shown in Fig. 1(a), with further data presented in the SI (Section S3). All measurements are in line with expectations of effective or poor ice nucleation capability according to literature precedents.10,19 Ice nucleation onset temperatures are known to vary significantly, even for well-documented compounds like AgI,10,43–45 owing to the inherent stochastic nature of the nucleation process and the variability of different experimental protocols. Our set-up affords sufficient reliability to differentiate between effective nucleators (e.g. AgI, Cu2O) and weak or inactive ones (e.g. Al(OH)3, BaF2). This is sufficient for our means, as we look only to define a boundary temperature to delineate between the two behaviours. According to the accuracy limitations afforded by the Polar Bear apparatus and our sample preparations (1 wt% solid loading in 10 mL ultra-pure water), we set the boundary temperature to −4 °C; this temperature distinction will be used to experimentally classify all further compounds as either a good or poor ice nucleator under these immersion conditions. We note that our binary classification is an operational choice specific to the accuracy limits of our experimental set up, and does not replace the droplet-freezing T1/2 values commonly reported in the literature.46 In the absence of a nucleator, our experimental set-up achieves reliable sub-cooling to −12 ± 3 °C (see SI, Section S3). We fully acknowledge that e.g. nucleation chamber experiments47 and drop-freezing assays48 would yield far more reliable nucleation temperatures than we have achieved here.
In the first instance, sets of surfaces were created by cleaving the corresponding bulk crystal lattices along the Miller index planes hkil ≤ 3 for ice Ih and for hk(i)l ≤ 3 for the nucleators to create a pool of 64 non-duplicated surfaces for each crystal lattice. Allowing all ice Ih surfaces to dock on all nucleator surfaces generated a total of 2401 non-duplicated interface models per nucleator.
The geometric matching of two docked surfaces was assessed by searching through integer multiples of the vectors of each surface to find the supercell models that present the smallest unit-cell mismatch. The geometry of each surface is described by two vectors parallel to each slab edge, and
, expressed as (m, n) supercells, such that m·
≈ n·
. These define the new vectors
and
, respectively (see Fig. 3). A reduction scheme was then used to express the vectors in the slab frame of reference, to negate the effects of translation, rotation or reflection of the individual surfaces. The two slabs were aligned by minimising
and
where the subscripts 1 and 2 denote ice Ih and nucleator supercell surfaces, respectively. Following translational alignment, the slabs were then rotated by transformational matrices Ri(θ) to lie parallel to each other. The surface generation, alignment and subsequent docking procedures are summarized in Fig. 3.
Five features were then defined to quantify the quality of fit for the resulting bank of ice Ih/nucleator interfaces (Fig. 4). These were the (i) maximum area overlap, (ii) angle mismatch, (iii) supercell vector mismatch [0] for vector , and the (iv) supercell vector mismatch [1] for vector
, according to eqn (1)–(4). Finally, to temper the (m, n) supercell generation to sensible outcomes compared to the maximum area overlap, we set a maximum value of tolerance for variables m and n, according to eqn (5). Note that the maximum area overlap and the m_n_tolerance are defined to strike a balance between achieving close lattice registry and avoiding unphysically large super-cells. Increasing the size of the super-cell makes geometric matching easier, but also less physically meaningful, since epitaxial interactions become ineffective at very large interface areas. To prevent this, the maximum area overlap feature sets an upper bound on the permissible supercell size, while the m_n_tolerance parameter restricts the integer scaling factors (m, n) used to construct commensurate supercells. Together, these criteria ensure that the generated interface models are both computationally tractable and physically representative of realistic epitaxial relationships, rather than artefacts of excessive cell scaling.
![]() | (1) |
![]() | (2) |
![]() | (3) |
![]() | (4) |
![]() | (5) |
A sensible value for the maximum-area overlap feature (eqn (1)) can be deduced based on the maximum surface area of possible surfaces generated from the bulk ice Ih lattice. This is the largest possible area of vector overlap achieved during slab docking. For this, the upper limit value of (4.5193 Å) ×
(7.3595 Å) × 10 ≈ 330 Å2 (where the multiplier by 10 ensures ample tolerance of surface size differences) was chosen for eqn (1), and was held fixed while the criteria for eqn (2)–(5), which by definition are assumed to adopt values close to zero, were allowed to vary. Tightening the parameters refers to applying stricter numerical thresholds for lattice vector length mismatch and angular deviation. The ‘loose’ and ‘medium’ thresholds were chosen empirically based on literature tolerance ranges observed in epitaxial lattice matching.51 Starting with pre-defined loose values of 0.2°, 0.2, 0.2 and 0.1 for eqn (2)–(5) respectively, the numbers of ice Ih/nucleator surface pairings from the known nucleator dataset that conform to these numerical limits are given in Fig. 5(a). Applying medium-level values of 0.1°, 0.01, 0.01 and 0.1 generates the plot given in Fig. 5(b), while tightening the m_n_tolerance parameter further to 0.01 gives the plot shown in Fig. 5(c). At this point the number of matching interfaces for the effective nucleators remains over ten, while those for the poor nucleators fall below ten (see SI Section S5 for the Miller indices of the matching interface models). Tightening the parameters any further results in a substantial and comprehensive loss in matching interface models; thus these fit criteria represent a probability boundary to differentiate between predicted good and poor ice nucleation behavior.
Of the correct predictions, some have previously been reported as ice nucleators in cloud-chamber experiments. We highlight the very early work by Fukuta,52 who investigated the behavior of many metal salts under vapor-deposition conditions, including MnO2, MgO, CoO, Ag2O, NiO, ZnO, and PbBr2. NiO was trialed for artificial snow production as far back as 1956,53 while more recently lead oxide was highlighted as an anthropogenic climate modifier.54 Early reports by Vonnegut55 suggested that solid solutions of CuI with AgI improve the nucleating ability of the latter under immersion conditions, which was attributed to improvements in the lattice mismatch with the crystal structure of ice Ih. MgO, TiO2, AgI, Al2O3, and SiO2 have attracted attention from materials scientists and computational modelers, who have studied the ice-nucleating abilities of individual faces. For instance, an experimental study on pristine MgO (100) and TiO2 (100) (rutile) suggested the absence of a templating effect,56 which matches our observation that these surfaces do not interface match with ice Ih (see SI, Section S5). The same paper reports that the (110) face of TiO2 (rutile) supports the growth of cubic ice.41 While we specifically matched against ice Ih, our modelling suggests that the (110) TiO2 (rutile) surface only matches against one cleaved surface from ice Ih; instead, the (001), (010), and (011) feature more heavily, geometrically matching with the basal and primary ice Ih faces (see SI, Section S5). Molecular dynamics simulations performed on the (0001) and (100) faces of the β-polymorph of AgI (as studied here) concluded that ice Ih nucleates on the former, but not the latter.57 This also matches the outcome of our study, with the (0001) face pairing with both the basal and primary ice Ih faces, whereas none of the ice surface models interface with the (10
0) face of AgI. For Al2O3, the γ-polymorph (as studied here, although with fairly low crystallinity, see SI, Section S1) is known to be the oxide that forms on aluminum surfaces when exposed to the atmosphere.58 Immersion studies have previously shown that α-Al2O3 is a more effective ice nucleator, and that any effect by γ-Al2O3 is weak.59 This is borne out in our work, where we classed γ-Al2O3 as a weak nucleator (Fig. 1b), which was substantiated by the low number of matching interfaces it presents with ice Ih (Fig. 5(d)); re-running the slab-matching process with the α-polymorph resulted in considerably more matching interfaces, which may be indicative of a higher ice nucleating ability (see SI, Section S5). SiO2 (10
0) has been observed to template ice Ih under immersion freezing conditions.27 While we initially paired this face with multiple ice Ih faces under our loose geometric criteria (Fig. 5(a)), upon tightening the criteria other Miller planes of SiO2 (notably (0001)) were found to match more closely with those of ice Ih (see SI, Section S5). We note that thin films of the nanocomposite SnO2 (cassiterite)/TiO2 (anatase), spin coated with a Krytox grease lubricant have been shown to display anti-icing properties,60 suggesting that ice templates poorly on this substrate. While geometric slab matching for SnO2 and TiO2 (anatase) returns high numbers of matching interfaces (Fig. 5(d) and SI, Section S5), suggesting that both should template for ice Ih, the latter was one of our five wrong assignments, as our bulk water immersion experiments showed that TiO2 (anatase) does not nucleate ice (Fig. 1b). While there are likely to be many reasons why the SnO2/TiO2 nanocomposite inhibits ice growth, it would be of interest to explore if this could be attributed to TiO2 (anatase) dominating the suppression of ice formation. To the best of our knowledge no prior reports have attributed ice nucleation properties to CeO2, WO3, Bi2O3 or Ti2O3, suggesting these could be new ice Ih nucleators under immersion conditions.
While these findings are generally encouraging for a high throughput screening approach for identification of heterogeneous nucleating agents based purely on interface matching, it is important to note that predictions will miss any potential nucleators that do not fulfil the matching criterion, as indeed illustrated by Co3O4, Mn2O3 and Fe2O3. It also does not consider any surface chemistry effects (such as surface polarity), allow for any surface reactions or reconstructions, variation in surface termination, slab stretching/compression, or rank the relative stabilities of the surface models. Our data set highlights a number of false negatives (CoO, CaCO3 (aragonite) and TiO2 (anatase)). MgO displays ambiguous behaviour, despite presenting with one of the highest number of matching interface models with ice Ih. One possible explanation is that the solid is undergoing a surface reaction in water to form Mg(OH)2 (brucite) or a hydrate. Repeating our slab matching approach with Mg(OH)2 (ICSD code 34401), and Mg(OH)2·2H2O (ICSD code 118781) return a total of 45 and 4 slab matching interfaces, respectively, with ice Ih – down considerably from the 81 predicted interfaces with MgO. Without an in-depth experimental validation that explores ice nucleation onto defined nucleator surfaces, it remains unknown whether nucleation actually proceeds via these geometrically-matching interfaces. Nevertheless we note that, reassuringly, the Miller index of the basal face of ice Ih features heavily on our paired slabs list, as do the primary and secondary prismatic faces (see SI, Section S5). Finally, the data presented in Fig. 1 and 5 illustrates an important point that a higher number of matching interface models does not correlate with a greater extent of suppression of supercooling; rather this demonstrates that the given crystal nucleator morphology (the edges, corners and potential defect sites captured by the Miller index planes up to hkl = (333)) are more likely to geometrically match with a corresponding Miller index place for ice Ih.
In an attempt to show whether interface matching offers new information beyond the zero-lattice mismatch approach,30 we have also calculated the mismatch registry parameter for the unit-cell parameters and for each of the matching interface models (see SI, Section S5). The analysis shows that, based on similarities of unit-cell dimensions between the nucleator and ice Ih basal face, just seven of the 32 compounds we explored experimentally would be correctly predicted as an effective or poor nucleator. For the matched interface models, while some of the effective nucleators do return low lattice mismatch values, the majority of the interface pairings do not. Overall, this suggests that interface matching offers a broader search criterion for potential nucleation behavior than the lattice mismatch approach.
It is important to further acknowledge that our geometric slab-matching approach, while efficient as a high-throughput screening tool, does not capture several features that are likely to play a significant role in heterogeneous ice nucleation. For example, the consideration of higher Miller index surfaces in our workflow inevitably includes surface terminations that are energetically costly and thus may be unstable under realistic conditions. In aqueous environments, such surfaces may undergo reconstructions or hydroxylation to alter the interfacial chemistry, thereby changing their propensity to nucleate ice. More sophisticated computational methods, such as density functional theory (DFT) calculations of surface energetics and adsorption geometries, or molecular dynamics (MD) simulations of hydration layer structure and dynamics, would be required to capture these effects in detail. Finally, our current model cannot assess the role of surface polarity, hydration dynamics, or specific adsorption energetics, all of which have been identified as key descriptors in recent simulation studies. Despite these limitations, our results show that geometric matching alone provides a surprisingly effective first filter for identifying candidate nucleators, which can then be prioritised for more detailed computational and experimental investigation.
Finally, given the high performance of copper oxides in our validation list, combined with the recent report that copper oxide nanoparticles act as ice nucleation sites,60,61 we decided to test ‘off-the-shelf’ copper tubing for ice nucleation. This follows given that a copper surface will readily oxidize in contact with air and water, and thus could promote the nucleation of ice in bulk water. This is particularly relevant given that copper is widely used in plumbing, heating ventilation and cooling applications, and electrical transmission power lines, for which the formation of ice contributes to burst water pipes and power outages.62,63 The resulting data show that the copper tubing did indeed induce ice nucleation at −2.3 ± 0.2 °C (compared to −10.3 ± 0.9 °C in its absence). Moreover, the nucleation temperature increased over multiple cycles, potentially correlating with increased oxidation of the metal surface (see SI, Section S7).
For the copper tubing experiments, 1 cm samples of 10-mm diameter copper pipe (BS EN 1057 standard) were added to 20 ml samples of tap water (East Lothian, CaCO3 concentration = 74.23 ppm64) in sealed sample vials and thermally cycled from −20 to +20 °C in the Polar Bear Plus Crystal apparatus using a heating/cooling rate of 1 °C min−1, for 25 cycles. Blank samples in the absence of copper tubing were run alongside, and measurements were repeated in triplicate.
Numerical tolerance limits for the docking model were derived from a training set of ten compounds. Our bulk water freezing experiments were sufficiently reliable to classify each compound correctly as an effective or poor ice nucleator, based on expectations from the literature. Tightening the geometric matching criteria resulted in a fall in the number of matching interface models until differentiation between the two classes was obtained.
The optimized model screened approximately 3500 simple metal oxides and halides from the ICSD for predicted nucleation behaviour. Subsequent experimental measurements of 22 compounds showed a 64% prediction success rate, as defined by the freezing temperature boundary obtained from the experimental training set data. Our workflow also identified four previously unreported ice nucleating agents (CeO2, WO3, Bi2O3, Ti2O3).
Given the high ice nucleating ability demonstrated for copper oxides, we were also inspired to test the nucleating ability of standard copper tubing immersed in samples of local tap water. This was also found to suppress sub-cooling, likely due to the build-up of copper oxides over the timescale of the experiment.
While the approach we have taken here is undoubtedly simplistic, and does not account for many important aspects, such as surface chemistry effects, reactions and reconstructions, it nevertheless demonstrates an acceptable level of success to form the basis for a high throughput computational screening approach to locate potential heterogeneous nucleating agents for further investigation.
Further information is available from the corresponding author on reasonable request.
This journal is © the Owner Societies 2025 |