Open Access Article
Tong Li
a and
Martin T. Dove
*abc
aDepartment of Mechanical Engineering, Guizhou University of Engineering Science, Bijie, 695013, China
bInstitute of Atomic and Molecular Physics, Sichuan University, Chengdu, 610065, China
cSchool of Physical and Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK. E-mail: martin.dove@qmul.ac.uk
First published on 15th April 2026
The purpose of this article is to take away some of the mystery associated with understanding the phenomenon of negative thermal expansion (NTE) in isotropic crystals. For sure, NTE is counter to our textbook intuition about the normal positive thermal expansion of chemical bonds, but there are now sufficient results from experimental and theoretical investigations to enable us to rationalise the existence of NTE in materials with cubic symmetry, and indeed to be able to predict its possible existence in any new material. In this article, we posit that there are four principles that can enable us to rationalise the origin of NTE in any cubic crystal: the existence of a network structure, the ability of the structure to support low-frequency vibrations, the spreading of the important vibrations into a sufficiently large volume of reciprocal space, and the limiting factor of dynamic disorder. We describe these with regard to a number of key examples. We point out that the same principles will operate for anisotropic crystals, but we offer the caution that NTE in anisotropic materials can also arise from factors missing in isotropic materials. We also point out that NTE arises in the face of competing mechanisms that ordinarily would give positive thermal expansion, so the practical existence of NTE arises as a balance between mechanisms pulling in opposite directions. We therefore argue that one should not point to a single simple mechanism for NTE. Instead, NTE is associated with the four principles together with chemical effects that can tilt how these principles are balanced against the drive of the chemical bonds towards positive thermal expansion. We present a practical way to understand these principles applied to the recently identified NTE material NaZr2(PO4)3. There is a long-standing and well-demonstrated importance of glass ceramics for many technical and domestic applications, which practically achieve many of the goals of the NTE materials community. In the light of this, it is our feeling that the key achievement of much of NTE research over the past 30 years has not so much been in terms of finding alternative NTE materials as in what we have been able to learn from these materials, which in turn has enabled us to elucidate the general principles that underpin our understanding of how NTE materials work.
In this article we focus on NTE materials with cubic symmetry. We have the impression that these materials are now better understood than is often appreciated, and our intention here is to present a coherent view of the various factors that can lead to NTE in cubic materials. Indeed, we believe that based on these insights and a few calculations, it ought to be possible to predict whether a given cubic material might show NTE.
It might be thought that crystals with anisotropic symmetry are not so different from those with cubic symmetry, and indeed in the early years after 1996 there was equal interest in a range of anisotropic NTE materials such as the family of materials whose crystal structures are isomorphic with that of orthorhombic Sc2W3O12.10–14 Certainly the mechanisms for NTE in cubic materials that we discuss here apply equally to anisotropic crystals. However, the existence of anisotropy creates two new options for NTE. One is from the effect of elastic anisotropy, which we have discussed in some detail with respect to some specific case studies.15,16 The second is that NTE can be driven by the existence of a structural phase transition, and we have discussed this in detail for the case of Cu2P2O7.17 In both cases NTE can arise quite independently of the mechanisms we discuss here for cubic materials. Some comments in this regard are given in Section 8.
There have been many reviews of NTE,18–31 often from the perspectives of materials and experimental characterisation. In this regard, we particularly recommend the recent comprehensive review of Naike Shi,28 which gives details of many families of NTE materials. We also recommend the Focus article by Coates and Goodwin,32 which discusses how to quantify NTE in isotropic materials, and in so doing provides a good overview of the range of isotropic NTE materials.
Since the publication of the primary reviews of mechanisms from a decade ago,22,33 the general principles have been articulated for a number of key examples, with the role of simulation becoming particularly important. This paper is not intended to be a review per se, but it is offered as an attempt to understand how the various principles apply to different cases. However, it is useful to start with a brief overview of the different cubic materials known to show NTE before we move on to discuss the underlying general principles, not least because we will use many of these materials as examples of how the principles apply in practice.
Here we focus on cubic crystals whose NTE is driven by a standard phonon mechanism (summarised in Section 3.1). These materials have what is often called a network structure, in which structural polyhedra, such as SiO4 and WO4 tetrahedra, or ZrO6 and ScF6 octahedra, are connected to each other at the polyhedral vertices to form an infinite connectivity. Because we focus on the phonon mechanism, we explicitly do not consider NTE that arises from magnetic and electronic effects, except to briefly mention the case of SmB6, because at first sight it looks deceptively like a material that could show NTE for the reasons that we have NTE in the other materials discussed here. NTE arising from magnetic and electronic effects has been reviewed by other authors, which we recommend.21,24,29,34,35
The literature variously reports two types of measurement of thermal expansivity, either by volume, αV = V−1∂V/∂T, or its linear form, αl = l−1∂l/∂T, where l is typically the unit cell edge length. To a good approximation, αV ≃ 3αl. Typical units are 10−6 K−1, which is usually written as MK−1. We have typically converted from published values of αl to αV in this paper when publishing reported measurements of thermal expansivities.
The crystal structure of β-cristobalite, shown in Fig. 1, is based on the cubic diamond structure, with the silicon atoms occupying the sites of the diamond structure, and oxygen atoms occupying, on average, the sites half-way between two silicon atoms. These sites suggest a linear Si–O–Si connection, but almost always in silicates this connection has an angle of 145°. It appears that in practice the positions of the oxygens atoms are dynamically disordered as the SiO4 tetrahedra constantly and dynamically tilt their orientations in order to bend the Si–O–Si linkages.40 Such motions are enabled by the spectrum of rigid unit modes (RUMs), namely normal modes of vibration in which the structural polyhedra, in this case the SiO4 tetrahedra are able to move without distortion, as if they are moving as rigid units.41–45
The crystal structure of ZrW2O8 is shown in Fig. 2. It shows a network of connected ZrO6 octahedra and WO4 tetrahedra. Each WO4 tetrahedron has one non-bridging bond.
Because of the importance of ZrW2O8 there have been many experimental and simulation studies. Briefly we note extensive neutron powder diffraction measurements46,47 for the study of the average structure, and X-ray absorption spectroscopy and total scattering measurements with X-rays and neutrons48–53 for measurements of local structure. Atomic vibrational dynamics have been studied by inelastic neutron and light scattering experiments.54–57 The effects on its thermodynamic properties have been studied by detailed measurements of the heat capacity.56,58 Many simulation studies focussing on the lattice dynamics have been reported,59–65 which have been supported by recent measurements of dispersion curves using neutron inelastic scattering from a single crystal.66
![]() | ||
Fig. 4 Crystal structure of ScF3 (space group Pm m). The fluorine atoms are represented by the green spheres, and the scandium atoms are represented by the pink spheres with shaded octahedra. | ||
NTE was discovered in ReO3 in 200871–73 and in ScF3 two years later.74 The NTE in ReO3 is relatively small and limited to low temperatures (note that an additional higher-temperature NTE has been reported from neutron powder diffraction measurements,72 but this was not confirmed by other studies75 and is likely to be associated with reduction of the oxygen content). On the other hand, NTE in ScF3 has a large magnitude and exists up to around 1100 K. Crystal structure analysis of both ReO3 and ScF3 clearly shows large amplitudes of the transverse motions of the anions.72,74
NTE has also been observed in double-cation derivatives, such as CaZrF6.76–78 The size of the NTE can be larger than that in ScF3 in some cases, but with other compositions it can be reduced, and even such materials can show PTE.
There have been several studies of the local structures of these materials using X-ray absorption spectroscopy,79 X-ray total scattering,79 and neutron total scattering.75,80,81 Dynamics and related aspects have been studied by inelastic X-ray scattering methods.82–84 There are also many reported simulation studies of NTE in ReO3,71,85 ScF3,86–88 and the double cation derivatives,89,90 although different studies frequently do not give consistent calculations of phonon dispersion curves.
Interestingly NTE is not found in the cubic phase of AlF391 or in any of the cubic perovskite phases.
The crystal structure of these phases, shown in Fig. 6, can be described as interpenetrating face-centred cubic arrangement of the metal ions and a body-centred cubic arrangement of the oxygen atoms. But more intuitively, this structure looks like two interpenetrating β-cristobalite structures of corner-linked O(Cu,Ag)4 tetrahedra with linear O–(Cu,Ag)–O bonds. Aspects of the dynamics have been studied by EXAFS96,97 and X-ray total scattering methods.98,99 Several simulation studies of the phonon dynamics have shown the importance of the acoustic modes in the mechanism for NTE.100–102
The crystal structures of Zn(CN)2 and Si(NCN)2 are similar to that of Cu2O, with Zn/Si occupying the tetrahedral site occupied by the oxygen site in Cu2O, and with the linear CN− and NCN2− anions forming the vertices of the tetrahedra and providing bridges between connected tetrahedra. As for Cu2O, the crystal consists of two interpenetrating β-cristobalite networks. We show the crystal structure of Si(NCN)2 in Fig. 7. In the case of Zn(CN)2 the head-to-tail orientations of the CN− anions are disordered;104 practically this gives the reason for showing Si(NCN)2 in Fig. 7.
Zn(CN)2 has been studied by X-ray total scattering experiments.105 Much of the analysis has focussed on dynamics involving twisting and displacement of the molecular anions, but several simulation studies have suggested that the NTE mostly arises from the acoustic modes,106,107 exactly as in Cu2O.
Phillips et al.108 showed that it is possible to form a single-network form of Cd(CN)2. These authors showed that without any absorbed molecules, this phase can give a volumetric thermal expansivity as large as αV = 100.5 MK−1, but with absorbed CCl4 molecules the NTE decreases monotonically with the occupancy, eventually turning into PTE. The volume expansivity for the double-network Cd(CN)2 is −61.2 MK−1,103 which is itself larger than that for Zn(CN)2, −50.7 MK−1.103
More recently there has been interest in double-cation versions of Zn(CN)2, with measurements made for materials of general chemical formula MB(CN)2.109,110 Two examples, AgB(CN)2 and NaB(CN)2, have been shown to have NTE reaching as high as −40 MK−1
109 and −92 MK−1
110 respectively.
Although Si(NCN)2 has an analogous crystal structure to Zn(CN)2, but without disorder and perhaps more flexibility,111 it shows much smaller NTE.112 This is surprising, and we will discuss possible reasons in Section 7.
![]() | ||
Fig. 9 Crystal structure of the zeolite Faujasite (space group Fd m). Oxygen atoms are represented by the red spheres, and silicon atoms are represented by the blue spheres within shaded tetrahedra. | ||
As noted at the start of this article, the existence of NTE in some zeolites had been predicted from simulation and confirmed experimentally8,9 just prior to the surge in interest in NTE following the publication of the work on ZrW2O81 in 1996. Experimental work has confirmed NTE in a number of other zeolites.117–119
The zeolite database120 gives 22 zeolites with cubic crystal structures. That NTE probably occurs in most cubic zeolites has been suggested by simulation studies.121,122 Experimentally, NTE has been confirmed in the cubic zeolites LTA123,124 and FAU.8,125,126
Many of these materials show phase transitions, and in some cases NTE in the low-temperature anisotropic phase has been reported, presumably reflecting spontaneous strains accompanied by the displacive distortions. Data for thermal expansion of the cubic phases are sparse, but some data have been collated recently,127 including older reports of NTE in (NH4)2Cd2(SO4)3 and (NH4)2Mg2(SO4)3.128 NTE has also recently been reported in the cubic langbeinite NaZr2(PO4)3.129
MOF-5 shows a significant NTE,130–133 although in practice relatively few MOFs show NTE. Other examples include UiO-66-type structures,134,135 which have larger secondary building units (the central groups of atoms) than MOF-5; CUB-5 and 3DL-MOF-1,136 which are analogues of MOF-5 with the same secondary building unit but with different linkage organic moieties; and the distinctively different Cu3(BTC)2 (HKUST-1).137,138 Curiously, perhaps, the cubic ZIF-8, which has the same network topology as the zeolite sodalite, does not appear to show NTE, even with favourable doping.139
We remark that metal–organic framework materials differ from cyanide materials in allowing the flexibility of the linker molecular ions and of the secondary building units to contribute to the mechanism for NTE, whereas the cyanide anions are effectively rigid over the relevant energy scales for NTE, and they do not form secondary building units beyond the standard tetrahedra and octahedra. Because of this we will not include metal–organic framework materials beyond the simpler cyanides in this article; they probably warrant a separate review article.
Let us begin this discussion by noting that there are whole families of materials – which here we restrict to oxides for convenience – which appear not to show NTE. These include materials with the fluorite (XO2) or related pyrochlore (X2X2′O7) structures, and with the spinel (X2X′O4) and garnet (Y3X2X3′O12) structures, where we denote ions with tetrahedral or octahedral coordination by X (second ones by X′) and other cations by Y. Even cubic perovskite phases (YXO3) do not appear to show NTE, in contrast to the analogous ReO3 and ScF3.
So are there other possible combinations of ionic oxides that might form as cubic structures? Let us consider the case of fully-linked polyhedra. We denote the tetrahedrally coordinated cation as T and the octahedrally coordinated cation as M. Thus we are searching for oxides of chemical formula MmTnO3m+2n. For oxides, since the oxygen ion is divalent, the anion components carry a total charge of −2(3m + 2n). This will require high oxidation states on the cations. In practical terms, the tetrahedral ions must often be a high-valence p- or d-block species (e.g., Si4+, P5+, Mo6+, and W6+) so that reasonable combinations of M and T can achieve overall neutrality without interstitial compensating cations. One combination is Sc2W3O12 (m = 2, n = 3), where Sc3+ and W6+ exactly balance the oxide framework to give a fully connected, interstitial-free network. However, this forms crystals of orthorhombic symmetry.
It is of course possible to create network structures in which some of the cation charge balance is achieved by some cations occupying large cavity sites. The perovskite family provides a clear example of this, as do members of the leucite (KAlSi2O6) family in their high-temperature phases and the garnet families. None of these families have yet been shown to have NTE in their cubic phases. However, NTE has recently been discovered in the langbeinite (K2Mg2(SO4)3) family, as discussed above.129 The few other combinations we can imagine also tend to form non-cubic structures, such as NASICON (Na1+xZr2P3−xSixO12).
The charge balance is easier for halide anions, because the required cation charge is reduced by half. However, whilst charge balance is easier, finding tetrahedral coordination for metal cations in fluorides is difficult. Most transition metals prefer octahedral geometry in fluorides. BeF4, BF4 and ZnF4 groups are exceptions, but linking them with octahedra to form a 3D network is geometrically rare compared to oxides.
It is not the purpose here to discuss potential new materials with cubic crystal structures that show NTE, but we do wonder whether the likelihood is low. Put another way, have we now discovered all the major families of cubic NTE materials? We of course hope not!
The basis of the Grüneisen theory of thermal expansion140–143 is that a lattice strain,
, will increase the energy of the crystal, typically as the square of strain to lowest order, but will also cause a change in the frequencies of the phonons, which will affect entropy. From symmetry we expect that the change in frequency will, to the lowest order, vary as
. For cubic crystals, we can define the volume strain
, where V0 is the equilibrium volume.
The total free energy is the sum of the vibrational and elastic free energies, denoted as Fω and
respectively.
![]() | (1) |
![]() | (2) |
The elastic free energy, per unit volume, is given as
![]() | (3) |
![]() | (4) |
The equilibrium condition is
. We need to form the derivatives of the component parts. For the vibrational free energy, we have
![]() | (5) |
![]() | (6) |
![]() | (7) |
For the elastic free energy we have
![]() | (8) |
The equilibrium condition,
, gives an equation for the thermal strain:
![]() | (9) |
![]() | (10) |
![]() | (11) |
![]() | (12) |
The mode Grüneisen parameters γλ and their weighting coefficients cλ, and hence the overall Grüneisen parameter
, can be readily calculated in a lattice dynamics calculation to get insights into mechanisms of NTE in specific systems. The assumption is made that the harmonic approximation applies for all values of strain and that the effects of anharmonicity are represented by changes in the force constants in the dynamical matrix caused by the strain. This approach is often called the quasiharmonic approximation (QHA). Direct phonon–phonon anharmonic interactions, whose effect increases with temperature, are neglected in the QHA.
We now want to make a comment about why positive thermal expansivity is the normal property of most materials. That is, from eqn (12), the lattice dynamics of a material usually gives positive values of the overall Grüneisen parameter, eqn (11). The theory of harmonic lattice dynamics – which we outline in a lightweight manner in the Appendix for readers who are less familiar with it – is directly dependent on the force constants between pairs of atoms. If two atoms are separated by distance r, the force constants, and hence vibrational frequencies, associated with any atomic displacements are proportional to ∂2E(r)/∂r2, where E(r) is the energy between two atoms separated by distance r.
If the interactions between atoms are pairwise and dependent only on distances, let us assume that the energy between two atoms follows as E(r) ∝ ±r−k, where k ∼ 12 with a positive coefficient for nearest-neighbour repulsive interactions, k = 6 with a negative coefficient for dispersion interactions, and k = 1 with coefficients of both signs for Coulomb interactions. The second-order derivative will scale as ϕ = ∂2E/∂r2∝ ±r−(k+2). In many crystals, where some of the atoms are on symmetric sites, with their fractional coordinates fixed by symmetry, we can anticipate that most interaction distances scale as
because the atoms do not have much freedom to rearrange their relative positions. In this case, the derivative of ϕ with respect to strain will be equal to
. That is, the values of |ϕ| will decrease with increasing strain, thereby systematically lowering the phonon frequencies and increasing entropy, and hence favouring positive thermal expansion.
As we see in the Appendix, the correct treatment of lattice dynamics is to consider not r as the key variable, but the displacements of the atoms. However, we show in the Appendix that the derivatives of the energy with respect to atomic displacement end up requiring an evaluation of ϕ = ∂2E/∂r2, and so the argument presented here is valid.
Where this argument gives way to NTE, however, is in the special case when the displacement is exactly perpendicular to the direction connecting two atoms. This is the situation when there is a maximum tension effect, as we will discuss in Section 3.3. In this case the relevant derivative has a zero value, and the residual force constant will be given a non-zero value by weaker angular forces. This gives the possibility that the overall force constant might increase with positive strain, and hence with the required negative values of mode Grüneisen parameters to give NTE. We will illustrate this point in Section 3.5.
Higher-frequency bond-stretching modes always show positive values of mode Grüneisen parameters and measurements of a few instantaneous bond lengths using total scattering or EXAFS always show that the bonds display positive thermal expansion.81,144,145 Why this is so can be explored using the Morse potential as a representation of the energy of a chemical bond:
E(r) = D[exp(−2α(r − r0)) − 2 exp(−α(r − r0))]
| (13) |
![]() | (14) |
Let us illustrate the discussion about the balance between contributions to positive and negative thermal expansion by considering the case of ScF3. Its crystal structure is shown in Fig. 12a, where we show the three-dimensional linear Sc–F–Sc connectivities. This figure represents the atoms by their thermal ellipsoids74 and demonstrates the extent of the transverse motions of the fluorine atoms. The distribution of the values of individual mode Grüneisen parameters with frequency as calculated by recent density functional theory (DFT) calculations88 is shown as a two-dimensional colour map in Fig. 12b. This distribution is not untypical for NTE materials, in that the positive values of the mode Grüneisen parameters are uniformly for the higher frequencies – in this case, for frequencies from 7 THz up to the maximum frequency of 19 THz – with the most negative values of the mode Grüneisen parameters for lower frequencies, beginning at 4 THz and with increasingly negative values down to 1 THz.
![]() | ||
Fig. 12 (a) Crystal structure of cubic ScF3 (space group Pm m),74 shown in an alternative representation to that shown previously in Fig. 4. The light blue spheres represent the scandium atoms, and the darker blue ellipsoids represent the fluorine atoms, with the ellipsoids representing the relative extent of thermal motion in three-dimensional space. This representation highlights the significant amplitudes of vibrations of the fluorine atoms in the directions perpendicular to the Sc–F–Sc linkages (transverse vibrations). (b) Distribution of the values of the mode Grüneisen parameters with frequency calculated by DFT methods,88 shown as a colour map. Pink represents zero values, and yellow represents maxima in the distribution. (c) Accumulated overall Grüneisen parameter at high temperature, .88 | ||
It is pertinent to this discussion to explain exactly why we would expect the most negative values of the mode Grüneisen parameters to occur at low frequencies. The intuitive explanation might follow this way. The spectrum of frequencies we see in a crystal for a given wave vector arise by combining all the force constants in different ways, some of which may be large and others small. This of course is what the theory of lattice dynamics does for us – recall that we present a lightweight version of the theory in the Appendix for readers who are less familiar with it – but let us stick to intuition for now. The high-frequency modes will arise from combinations of the force constants that mostly add constructively. On the other hand, the low-frequency modes are likely to involve combinations that give cancellations of large force constants, giving a residual value that is effectively the difference between two large numbers. Instinctively we know that the relative sensitivity of the difference between two large numbers to small changes in these numbers is much higher than that for the sum of two large numbers. Thus when the force constants change due to imposition of a small strain, the lower frequencies in the spectrum are likely to be much more sensitive than the higher frequencies. Thus we intuitively expect that the vibrations of lower frequency will have larger absolute values of their mode Grüneisen parameters.
We can say the same thing more formally from the theory of lattice dynamics. Central to this theory is the quantity called the dynamical matrix, which is described in the Appendix. It is formed from the interatomic harmonic force constants together with relevant phase factors. The important point is that the squares of the mode frequencies, ω2, are obtained as the eigenvalues of the dynamical matrix. The dynamical matrix is Hermitian − its complex conjugate is its transpose – and this means that its eigenvalues are real numbers.
The eigenvalues of a Hermitian matrix that are most sensitive to small changes in the values of the matrix elements are those with the lowest values,147 and this is important when we consider the link between mode Grüneisen parameters and frequencies. Small volume strains will cause small changes in the interatomic-force constants that contribute to the dynamical matrix, and it follows that the largest effects will be on the modes of lowest frequencies. This is the exact mathematical demonstration of the intuitive analysis given above.
This sensitivity of the lower-frequency eigenvalues is then enhanced by the factor of ωλ−2 in the equation for the mode Grüneisen parameter, eqn (7). Thus the distribution of negative values of the mode Grüneisen parameters seen in ScF3, Fig. 12b, at the lower frequencies is typical of most NTE materials for which the same analysis has been performed.
With negative and positive values of the mode Grüneisen parameters occurring over the lower and higher ranges of frequencies respectively, the question is how finely balanced are the two positive and negative contributions. We can evaluate this by performing an accumulated integral of the form
![]() | (15) |
From eqn (11) we see that each contribution is weighted by a factor, cλ, that depends on frequency and temperature, specifically as their ratio ħω/kBT. At low temperature only the low-frequency modes are excited, so the values of cλ preferentially weight the contributions of the low-frequency modes and hence typically of the NTE modes. Upon increasing temperature, the higher-frequency modes are excited, increasing the weighting factor cλ for these modes, and hence increasing the weighting of the terms with positive values of mode Grüneisen parameters to the overall average. This explains why there are many cubic NTE materials where the NTE is only found at lower temperatures (β-cristobalite is our best-known exception,38,39 but there are indications of the same in ZrV2O7 at temperatures just above the first phase transition upon cooling,18 and in the isostructural materials ThP2O7 and UP2O767).
The point is that this shows why asking a question regarding the origin of NTE is very nuanced. One should not point to a given vibration and assign undue significance. For example, it is tempting to point to the RUMs in ScF3 – the normal modes in which ScF6 rotates without distortion, which exist with wave vectors on the edges of the Brillouin zone between the special points
and
41,44,45 – and imagine that they are the origin of NTE in this material. Well, they are important, but the question as to the origin of NTE needs to consider not just why modes with negative values of their Grüneisen parameters exist, but why these modes eventually outweigh the large number of vibrations with positive values. In this regard ScF3 offers us a challenge if we try to understand why it shows NTE: we also need to understand why many other materials with exactly the same features as ScF3 show only PTE. We will return to this question in Section 9.
Let us consider the ceramic network first. If we denote the polyhedral centre atoms as A, and the linkage atoms as B, the tension effect operates on the linear triplet of A–B–A atoms. The A–B bonds are considered to be relatively stiff, and likely the B–A–B right angles are also quite stiff, but the linear A–B–A angles are relatively flexible. Because of the stiffness of the A–B bonds, transverse vibrations of the B atom may actually pull the two A atoms towards each other rather than stretch the A–B bonds. A geometric argument can show that the contraction of the A–A distance in this case will be proportional to the mean-square-amplitude of the vibration,148 which will vary linearly with temperature in a classical approximation. This idea actually predated the onset of the current interest in NTE by four decades,143,149 but was revived with the later work on ZrW2O8.1 It provides a natural foundation for our understanding of NTE, but as we will discuss below (Section 3.6), by itself it does not automatically explain the origin of NTE.
This discussion of the general idea of the tension effect can be extended for the case where the intermediate entity is not a single atom but is a molecular ion, as illustrated in Fig. 13c. For example, we can represent this as A–C–C–A, where C–C represents the molecular anion. In this case, not only can we find a transverse displacement of the C–C molecular anion that will give an analogous contraction of the A–A distance, but a contraction of the A–A distance can also be achieved by rotation of the C–C molecular anion. These two motions are illustrated in Fig. 13d.
We want to add a cautionary note, by way of introducing the next section, that the images in Fig. 13 typically represent optic phonon modes, but often, particularly if there are strong bending forces in the crystal, the tension effect may instead arise mostly from acoustic modes. These may of course include some aspects of the motions shown in Fig. 13b and d, but there are significant differences in how NTE can arise from a tension effect mediated by acoustic and optic modes. Readers should not consider the visualisation of Fig. 13 to represent all of the possible tension-effect mechanisms.
At the outset we should comment on the difference between acoustic and optic modes. In the limit that the wave vector q goes to zero, the acoustic modes are the solutions to the dynamical matrix in which the centre of mass is allowed to move, whereas for the optic modes the centre of mass is fixed. Consider a crystal with 4 symmetrically related atoms in the unit cell, that is, that they have equal mass and displacements with equal magnitudes but with the possibility of positive of negative signs. There are four possible linear combinations of displacements in each unit ell that are consistent with the requirements of symmetry. Denoting the displacements by arrows, the four linear combinations are
Mode 1 is the acoustic mode, and at zero wave vector it reflects a uniform displacement of the crystal. Because this pattern of displacement will not involve stretching or bending of bonds, it incurs no cost in energy, and this mode is therefore of zero frequency at a zero wave vector. Put another way, the displacements of atoms in the acoustic mode are completely in phase. On the other hand, for Modes 2 to 4 the displacements are in some out-of-phase combination. Their displacements sum to zero, thereby maintaining a constant centre of mass. These are the optic modes at a zero wave vector. Each vector has three components in three dimensions, so that we have a total of 12 modes, with three being the acoustic modes. This effectively defines the difference between acoustic and optic modes, exactly and only defined in the limit q → 0.
In the limit q → 0, but not exactly q = 0, the transverse acoustic modes correspond to shearing of the unit cell, with displacements perpendicular to the direction of q, whereas the longitudinal acoustic modes correspond to tensile strains parallel to q. Thus for a cubic crystal with q‖[001], the frequencies of the longitudinal acoustic modes are determined by the tensile elastic constant, ωLA2(q) ∝ C11|q|2, whereas the frequencies of the transverse acoustic modes are determined by the shear elastic constant, ωTA2(q) ∝ C44|q|2.
The distinction between acoustic and optic modes becomes less clear upon increasing |q|, and at the Brillouin zone boundary there may be no distinction. The textbook example of the diatomic chain has a clear distinction between the in-phase and out-of-phase motions of the acoustic and optic modes as q → 0, but at the zone boundary one mode has one atom not moving at all, and the other mode has the other atom not moving. Distinctions may also be blurred upon increasing |q| in more complex crystals as the acoustic modes ‘interact’ and anti-cross with optic modes of the same symmetry, leading to a mixing of mode eigenvectors leading to the acoustic branch in the dispersion diagram showing no signature of the original acoustic mode in its eigenvectors.
We show one example of a tension effect operating due to a transverse acoustic mode in Fig. 14. This image shows the effect of a long-wavelength transverse acoustic mode with the wave vector along one of the [110] directions. It leads to a shearing of the unit cell. To minimise deformation of the two sets of angles and to preserve bond lengths, there is a small twist of the cyanide groups and a commensurate rotation of the octahedra. However, for the subtle reason that from unit cell to unit cell the displacements are progressive, meaning that the twist is actually a positive displacement of both atoms, the motions associated with this acoustic mode are formally distinct from those of an optic mode.
![]() | ||
| Fig. 14 Representation of the deformation of the structure for two-dimensional Prussian blue (shown in Fig. 13c) associated with a long-wavelength transverse acoustic mode whose wave vector is propagating along the diagonal of the diagram. | ||
The motions associated with the transverse acoustic mode are seen in Fig. 14 to act exactly as a tension effect in lowering the volume. We will see below that acoustic modes are responsible for NTE in several cubic materials, including Cu2O (Section 5.4.4) and Zn(CN)2 (Section 5.4.5). We will also see below, Section 3.5, that the importance of acoustic modes for NTE arises naturally in a simple model of the tension effect in a one-dimensional crystal.
It might be asked whether we can predict whether acoustic modes are or are not important in any particular system. Our feeling is that acoustic modes are likely to be important in many systems, both isotropic and anisotropic, and the cases where they are not important, such as in ScF3, are likely to be the exception. The other comment we can make is that in cases where there are many atoms in the unit cell, such as ZrW2O8, the tension effect is spread over many phonon modes, acoustic and optic, so the contributions specifically from the acoustic modes are relatively slight. On the other hand, in Cu2O and Zn(CN)2, the network flexibility enables the acoustic modes to have low frequencies, and thus these are markedly important compared to the relatively few optic modes that contribute to NTE.
Our point is that we should view the transverse acoustic modes as more likely than not to contribute to NTE and that their relative importance will depend on whether the overall flexibility of the network allows for smaller or larger numbers of optic modes that also contribute to NTE. It is surely the case that the role of acoustic modes has been somewhat neglected in our understanding of NTE.
![]() | ||
| Fig. 15 One-dimensional model showing NTE, from Fang et al.150 The model has two types of atoms and three types of potential energy functions. The first, V1(r) with force constant k, represents the stiffness of the bonds, eqn (17). The second, V2 with force constant K, represents bending of the bond within the quasi-rigid unit, eqn (18). The third, V3 with force constant J, represents flexing of two rigid units about their shared vertex, eqn (19). | ||
The model has the minimum number of interactions to create realistic behaviour. The first is a strong radial interaction between nearest-neighbour atoms. Defining the equilibrium distance between atoms as r0, and allowing for a strain
, the distance between any two neighbouring atoms is given as
![]() | (16) |
![]() | (17) |
The second interaction is a term that reflects the stiffness of quasi-rigid units. In principle this could be an angular term, but it is useful to work with the linearised form in terms of the displacements,
![]() | (18) |
The third term is designed to ensure that the RUMs in the model, the vibrational modes in which the quasi-rigid units rotate without distortion, have non-zero frequency, and simply give an energy cost for rotations of the quasi-rigid units. This is expressed in linear form as the transverse flexing of the linear chain, giving an energy cost for nearest-neighbours to move by different amounts,
![]() | (19) |
Here we give solutions to the dynamical matrix for two special cases. The first case is for very small wave vector q. One solution is for the acoustic mode, whose frequency is linear in the wave vector. The other solution is for the optic mode, which has zero group velocity at q = 0, namely ∂ω/∂q|q=0 = 0. The two solutions are
![]() | (20) |
The second case is for the wave vector at the boundary of the Brillouin zone, q = 1/2, where again both solutions are standing waves with zero group velocity,
![]() | (21) |
The full dispersion curves for this model, for strain
and for a small negative value of the strain, together with the second model representing a hybrid structure, are shown in Fig. 16. Recall that these are the modes for transverse motion, and do not include the stiff longitudinal acoustic modes or the high-frequency transverse optic modes. It is clear that all transverse phonons show a lowering of frequency under negative strain, meaning that they all have negative values of their mode Grüneisen parameters.
![]() | ||
Fig. 16 Dispersion curves for the two one-dimensional models described in this text for a given set of values of force constants, together with the density of states (right), from Fang et al.150 The top diagram is for the ceramic model described in the text, and the lower diagram is for the corresponding hybrid model described by Fang et al.150 The results are given for strain as the black curves, and for a small negative value of as the grey curves. | ||
The mode Grüneisen parameters can be calculated from
. Both q → 0 and q = 1/2 give
![]() | (22) |
Fang et al.150 used the simple model to calculate the variation of the overall Grüneisen parameters, heat capacity and thermal expansivity of both the ceramic and hybrid materials. The results for some trial parameters (those used to calculate the dispersion curves shown in Fig. 16) are shown in Fig. 17 and demonstrated that the tension effect encapsulated in the model gives NTE in the manner found experimentally in many systems (as we will discuss below). Actually the intention of this model was not merely to reproduce NTE, but to demonstrate that a system showing NTE will also show pressure-induced softening.121,151,152 It should be noted that a simple three-dimensional model has been applied to ScF3, where we have three interaction terms corresponding to those in eqn (17)–(19).81 This model also showed NTE and will be discussed below because of some insights it has given.
![]() | ||
Fig. 17 Results for the two one-dimensional models, from Fang et al.150 The two graphs show the variation of heat capacity, CV/R, overall Grüneisen parameter, , and the thermal expansivity, α, for reasonable values of the model parameters. | ||
The tension effect, as we have described it, gives a natural mechanism for the existence of negative mode Grüneisen parameters, as supported by the simple model150 described in Section 3.5. But the immediate concern we may have is that the three atoms in the A–B–A linkage are never isolated, which means that we need to consider correlations between motions. Consider the simple two-dimensional model in Fig. 13a. The transverse motion of one B atom requires rotation of the corresponding A–B bonds. Each A atom has three other A–B bonds, so an isolated displacement of a single B atom will cause distortionsof two B–A–B right-angles. It is quite possible that this will cost energy, and not a small energy, which will raise the frequency of any collective vibration showing transverse vibrations unless there are correlated displacements of the other atoms within the AB4 square coordination. What this means is that the energy cost can be avoided if the squares can rotate about the perpendicular direction as if acting as rigid entities, with neighbouring squares rotating in equal and opposite senses. This is in the manner associated with the cubic–tetragonal phase transitions in perovskites, such as SrTiO3, and is the starting point for the RUM model.41,45,153 In a three-dimensional cubic perovskite crystal, neighbouring layers can rotate in the same or opposite sense, because the connecting oxygen atom is not displaced in the vibration, and this means that the RUMs exist for all wave vectors along the edges of the Brillouin zone, from
to
and symmetrically-related directions.
The existence of RUMs can explain why some vibrations have the necessarily low frequencies to give large negative values of mode Grüneisen parameters, but not all systems can support the existence of RUMs. Therefore there may need to be a degree of bond-bending flexibility, but to what extent is not clear a priori. We also need to understand why it is possible for RUMs to give NTE in some systems, such as ScF3 and zeolites, but not in the other related systems such as the cubic perovskites and the cubic leucites.
The related issue concerns the balance between the negative and positive mode Grüneisen parameters. This balance is illustrated when thinking about RUMs. Consider the simple two-dimensional model shown in Fig. 13a. It has a single RUM, with its wave vector at the corner of the two-dimensional Brillouin zone. The overall Grüneisen parameter is a sum over all vibrational modes, which means an integral over all wave vectors. A single point has a vanishingly small contribution to an integral over the area of the two-dimensional Brillouin zone, and in three dimensions a line of wave vectors also gives a vanishingly small contribution to the integral over the space of the three-dimensional Brillouin zone. Thus the contribution to NTE from the RUMs needs to also be expanded outwards in reciprocal space, but how is this possible?
These are amongst the questions we will now discuss, but they are not the only questions. The comparison between ceramic (atomic linkers between polyhedra) and hybrids (molecular linkers) is useful to explore. The existence of the molecular linkers may give more flexibility,154 but to what extent? Perhaps greater network flexibility will lead to more modes with negative Grüneisen parameters, and this might explain why NTE in the Zn(CN)2–Cd(CN)2 system is relatively large,103 but it appears difficult to create larger NTE. Why?
If some new cubic network materials could be found to show NTE, could we have predicted the NTE from the knowledge of the crystal structure, and would we be able to explain the NTE without having to do detailed calculations? Actually we suspect so. After a large body of work that has been carried out during the last few years, although there is not a single general mechanism that we can invoke – well, the tension effect underpins our understanding, but it needs too much else for it to be considered a general mechanism – we do have a set of general principles that are now sufficiently well understood to mean that NTE can be rationalised, in a detailed way, in most systems. In summary, these are
1. We need to have arrangements of atoms giving the possibility of a tension effect; this probably means we have to have a network crystal structure.
2. The vibrational modes giving the tension effect need to be of low frequency; these are the modes most affected by changes in structure. Thus we need to have low-energy flexibility of the network. This may be associated with RUMs, or else with some degree of polyhedral angle flexibility.
3. The modes giving the tension effect need to occupy a sufficient volume of reciprocal space; this may require flexibility of polyhedra in a RUM model.
4. If we have too much flexibility, the thermal motion will lead to dynamic structural disorder, which will limit the extent of NTE. This probably means it is unlikely that a new material will show a much larger extent of NTE than that in the current examples.
In the rest of this article we will articulate these general principles and explain how they are used to understand NTE in the isotropic systems introduced earlier (Section 2.1). We stress, however, that whilst these principles give the possibility of NTE and enable us to understand the mechanism of NTE, they do not automatically give the necessity of NTE occurring in a particular material. We need to always remember that there is a balance between interactions between atoms that will favour PTE, which we highlighted in Section 3.2 and will discuss again in Section 9. Accordingly, we hope that this overview will be able to guide understanding and interpretation of NTE in future studies.
Almost certainly for a crystal to display such an arrangement of atoms means that it will have a network structure, with the central atoms of these near-linear triplets being the linkage atom connecting two structural polyhedra, such as tetrahedra and polyhedra. Examples of network structures are shown in Fig. 1–11. Some examples of the groups of three atoms giving a possible tension effect mechanism are shown in Fig. 18. Not only is the network required to give the connectivities of atoms, but it is necessary for the inward pull of the tension-effect motions to percolate across the crystal; if the relevant groups of atoms were isolated from each other, their motions would not be correlated and therefore could cause contraction over large distances. We see something similar in low-dimensional networks, such as the one-dimensional CuCN155 and the two-dimensional ZnB2(CN)8,16 where the thermal expansion in the perpendicular directions is positive.
None of the examples discussed here and shown in Fig. 1–10 have the structural polyhedra sharing edges or faces. The only example of a possible NTE material where polyhedra share edges is tetragonal ZnF2, which has been reported to have weak area NTE,156 but we were unable to reproduce this in our own neutron diffraction measurements.157 Not all networks need to have fully-connected polyhedra. ZrW2O8 is one example that has non-bridging W–O bonds,1 but this is relatively uncommon in fact. It is a factor that we will discuss in the next section regarding the flexibility of the network. Anticipating the discussion, we would like to remark that the existence of non-bridging bonds within the structural polyhedra does not necessarily mean an increase in the flexibility of a network.
Lattice dynamics calculations on a variety of materials usually show that the bonds in network structures are stiff, giving rise to the highest frequencies.85–88,100,111,158 A few experiments focusing on local structure, namely total scattering50,51,53,79,81 and X-ray absorption spectroscopy,49,53,96,97 show that these bonds tend to show PTE.
Consider a group of atoms with square coordination of the polyhedra in two dimensions, and with a connection between two squares that gives a tension effect, as shown in Fig. 19. The transverse displacement of the shared atom moving alone will bend the angles within the two polyhedra, which is likely to cost energy, Fig. 19a. This cost is removed if the two squares both rotate in concert, as shown in Fig. 19b. Such rotations can occur as RUMs if allowed by the network. The point is that quite probably the bending mode (Fig. 19a) will have rather higher frequency than the RUM (Fig. 19b).
Our qualitative understanding of the role of low frequencies is from the fact that the squared amplitude of atomic motion is proportional to the inverse-square of the frequency, 〈u2〉 ∝ ω−2. That is, the lower the frequency, the larger the corresponding atomic motion. Much of the transverse atomic motion shown by the fluorine atoms in ScF3, Fig. 12a, arises from low frequency modes.
Consider this triangle, which represents two bonds in the tension effect, each of length a0/2, with a root-mean-square displacement of 〈u2〉1/2:
The distance between the two horizontal ends, representing the polyhedra centres, is reduced by the tension effect from a0 to a ≃ a0 − 2〈u2〉/a0 to the first order. The mean-square displacement associated with a vibration of angular frequency ω will be equal to 〈u2〉 = kBT/mω2, where T is the temperature and m is an effective mass. Thus we have a negative linear thermal expansivity equal to α = a0−1∂a/∂T = −2kB/mω2a02. The key point is that the NTE is large for smaller values of ω2, and hence we can immediately see that the tension effect is more likely to arise from motions that cause lowest energy deformations of the structural polyhedra. This argument appears to be edging towards a “RUM theory of NTE”,148 but at this point let us be prudently cautious, because in practice this need not be so.
This argument has actually sidestepped the third principle to be discussed later in Section 6, but we have at least established that requiring the network to permit low frequencies is a plausible factor in allowing it to show NTE. A firmer argument comes by considering a more-formal approach using Grüneisen theory, which we reviewed in Section 3.1. The important quantities are the individual mode Grüneisen parameters, defined in eqn (7). Considering the form of the equation in terms of ω2, we see that γλ is the product of two quantities, ωλ−2 and ∂ωλ2/∂V. Obviously, ωλ−2 is larger for smaller values of ωλ. But using the argument we made earlier, that the eigenvalues of a matrix with lowest-values are the most sensitive values, it also follows that |∂ωλ2/∂V| will be larger for lower frequencies. It follows from both factors that the mode Grüneisen parameters with the largest negative values will be those of the lower frequencies. This was clearly seen in ScF386,88,90 and in most other systems for which there are comparable calculations.33,63,65,101,102,106,107,109,133,159
We next ask the question, can network structures support the existence of low-frequency vibrations?
In fact there are two ways to count constraints and degrees of freedom in network structures, as reviewed recently.45 We prefer the approach of treating the polyhedra as rigid units, when appropriate, giving F = 6 degrees per polyhedron, and C = 3 constraints per shared vertex (from ensuring that the vertices of two connected polyhedra share the same coordinates x, y, z). The other approach is to treat atoms as the dynamic objects, with F = 3, to allow each bond to give one constraint, and to include angular constraints within the polyhedra where appropriate. Here we mostly follow the first approach, but incorporate bond constraints when we consider flexible polyhedra.
The important point is that if the total number of degrees of freedom exceeds the total number of constraints, F > C, the network is flexible in some way. But if the total number of constraints exceeds the total number of degrees of freedom, C > F, the network is over-constrained and hence rigid. In the marginal case, F = C, it is possible that some constraints are made redundant due to symmetry. This point has been discussed in some detail elsewhere in the context of the RUM model of phase transitions.41,45
We start by considering the case where the polyhedra in the network are fully connected, so that the material has the general formula MmTnO3m+2n and contains m MO6 octahedra and n TO4 tetrahedra. In the case where the polyhedra are rigid, we have F = 6m + 6n per formula unit. To count constraints, note that each vertex gives 3/2 constraints per polyhedron (3 total constraints, but shared between two polyhedra). It follows that C = 9m + 6n per formula unit, meaning that C–F = 3m. Thus the network is over-constrained unless m = 0, that is, only for tetrahedral networks might we expect to find RUMs.167 In fact the cubic perovskite network is the only known exception, because the high-symmetry alignment of the octahedra allows for a significant number of constraints to become degenerate. This has been discussed elsewhere.44,45,153
Let us now take the case where some of the atoms in the tetrahedra are non-bridging. Without making it too complicated, we can say that each tetrahedron has q bridging atoms. The chemical formula will be of the form MmTnO3m+n(4−q/2). Of course, we can't have half atoms, so clearly nq is constrained to be an even number. The number of degrees of freedom is the same as in the fully-connected case, but now we see a reduction in the number of constraints to C = 9m + 3nq/2, giving C–F = 3m − (6 − 3q/2)n per formula unit. This becomes flexible in the RUM-sense when C ≤ F. Let us take ZrW2O8 as our example: we have m = 1, n = 2 and q = 3, giving C = F = 18. In searching for new NTE materials, this simple criterion may give a possible guidance.
Now we consider the case where the tetrahedra are rigid, but the octahedra have some degree of bond-bending flexibility. We cite this example because this appears from DFT calculations to be a well-realised case.65,168 Now we have to count slightly differently. We assign the same 6 degrees of freedom to the tetrahedra, but now we only count the degrees of freedom of the central atom in the octahedral sites, so F = 3m + 6n. Each bond in an octahedron acts as a constraint, giving C = 6m for the octahedra. Again, counting constraints for the tetrahedra will depend on how many non-bridging oxygen atoms are there. But we also need to be aware that if tetrahedra are connected to each other, we will need to add 3 constraints for each shared linkage.
Let us compare the cases of ZrW2O8 and ZrV2O7 as a practical example, in the two cases of rigid and flexible ZrO6 octahedra, and with rigid WO4 and VO4 tetrahedra. For the case of rigid octahedra, we have F = 18 per formula unit in both cases. Because ZrV2O7 is fully connected, we have C = 21 per formula unit and hence C–F = 3. On the other hand, both WO4 tetrahedra in ZrW2O8 have one non-bridging bond each, and as we discussed above we have the marginal case of F = C. The RUMs allowed in ZrW2O8, which arise as a result of symmetry causing some constraints to be degenerate, were found to have wave vectors lying on a surface of exotic shape in the three-dimensional reciprocal space.169
When we take the case of flexible octahedra, for both cases we count F = 3 + 12 = 15 per formula unit, allowing for the rigid-body motions of two tetrahedra and the motion of the octahedral cation. For ZrW2O8, the only constraints we need to count are the six bond constraints within the ZrO6 octahedra, giving F–C = 9 per formula unit. On the other hand, in ZrV2O7 we have to add 3 constraints for the linkage between pairs of VO4 tetrahedra, giving F–C = 6 per formula unit. Again, ZrW2O8 will be more flexible than ZrV2O7.
These arguments could be modified if there are non-bridging bonds within the octahedra, and extended to the idea that no polyhedra are rigid, and that instead the only rigid entities are the bonds.80,170 In such a case, for a fully connected network, and where we now consider each atom to have 3 degrees of freedom and the bonds provide the constraints, we have F = 12m + 9n and C = 6m + 4n, so F–C = 6m + 5n. However, this imputes too much flexibility to the network, and one consequence of such a situation is almost certainly that the network will be elastically soft. We do not consider that this situation is likely to be encountered, and we do not consider such an interpretation to be useful.
We should make two concluding comments in this section. The first is that if F = C, as in tetrahedral silica networks, this means that there is exactly zero flexibility unless, as mentioned above, symmetry makes some constraints redundant. In that case, we expect to see some RUMs with wave vectors at specific places in reciprocal space. In our studies of tetrahedral networks we often found that the RUMs lie on special planes in reciprocal space, which are often flat,41,167 but to all our surprise, can also be on well-defined curved surfaces.171 We mentioned above the exotic surface found for ZrW2O8.169 Also surprising was the finding that in low-density structures, like cubic zeolites of highest symmetry, we might find more than one RUM for each wave vector.45,172,173 This gives a very high degree of flexibility. We stress that we do not consider this to be a failure of the Maxwell method160 per se, but instead represents the fact that identifying constraints that are redundant is difficult. Furthermore, an inability to identify redundant constraints in the Maxwell counting will lead to us underestimating the degree of flexibility and never the other way round. The important cases where Maxwell counting overestimates the number of constraints appear to only be for the tetrahedral networks, where Maxwell gives the marginal case F = C, and in the case of corner-sharing octahedra on a cubic lattice (ScF3 and perovskites). We have not identified other cases where there are significant numbers of redundant constraints. This statement applies also to the case of hybrid networks discussed in the next section.
The second comment is that the purpose of this analysis is to identify potential ways in which we might expect to find the necessary low frequencies in a network structure. But we cannot take this argument alone; it remains one of four principles, not a single principle. For example, in comparing the flexibility of ZrV2O7 and ZrW2O8 above, we found that the latter network is more flexible. Yet the NTE in ZrV2O718,67,68 is only slightly lower than that in ZrW2O8,1,2,46 but by changing the chemical composition can significantly change the thermal expansion. The NTE in ZrV2O7 will turn into PTE by doping the vanadium sites with phosphorus.67,68 In fact we will not consider the effects of chemical composition in detail here as a principle. but we will comment on some issues below (Section 9).
For the structural topology of Zn(CN)2, if we treat the tetrahedra and cyanide molecular anions as rigid objects, we have F = 16 per formula unit. The number of constraints, counting by vertices, is C = 12 per formula unit. There are actually two formula units in the primitive unit cell, so for the crystal as a whole we have F–C = 8 per unit cell. This means that we expect to find 8 RUMs for every wave vector. These are three modes in which the rotations of the Zn-centred tetrahedra cause transverse displacements of the cyanide molecular ions, three where the rotations give twists of the cyanide molecular ions, and also both of the transverse acoustic modes.
In principle the 8 RUMs in Zn(CN)2 give the possibility for a tension effect, and hence for strong NTE. However, this assumes that there is a high flexibility at the vertices, as there is in silica, but it appears that there are strong directional forces. Thus the displacement mode has frequencies of around 4 THz, and the twist mode has frequencies of around 8 THz,107 and their contribution to NTE appears to be relatively slight. Instead, it is the two acoustic modes that give the low frequencies that support the strong NTE in Zn(CN)2.
In the case of the Prussian blue network with octahedra connected by shared cyanide molecular anions, the same counting gives F = 21 per M(CN)3 formula unit. The number of constraints is C = 18. This gives F–C = 3, but there are two formula units in the primitive unit cell in the case of chemical composition MM′(CN)6, giving a net F–C = 6. These correspond to two twist modes, two displacement modes, and two transverse acoustic modes.
The cyanide, or NCN2−, connectors give the simplest hybrid materials. In recent years we have seen an explosion in the number of metal–organic framework materials, a few of which crystallise with cubic symmetry and show NTE. Mostly, however, the linkages and linking groups of atoms are generally rather more complicated than we have discussed here. The isoreticular metal–organic frameworks have so-called ‘secondary building units’ instead of polyhedrally-connected atoms, such as O(ZnO3)4 clusters in MOF-5 (IRMOF-1), with the four ZrO4 tetrahedra connected at the central oxygen atom, and with the outer oxygen atoms forming part of the linkage 1,4-benzenedicarboxylate molecular ions, O2C–C6H4–CO2. MOF-5 shows significant NTE.130,131 The low-frequency flexibility in MOF-5 that leads to NTE appears to come from two sources. One is a low-frequency flexing of the 1,4-benzenedicarboxylate molecular ions,131,133 and the other is the transverse acoustic mode.133 However, there are relatively few isoreticular MOFs with cubic symmetry that show NTE.
One family of metal–organic framework materials with metal-centred tetrahedra are the zeolitic imidazolate framework structures. These exploit the fact that the angle subtended by the two bonds to metal cations from the nitrogen atoms of imidazolate molecular ions, C3N2H3− is around 145°, similar to the Si–O–Si angle in silicates. The ZIFs therefore are able to form crystal structures with networks that are also found within the family of zeolite structures. One of the better studied is ZIF-8, which has a cubic crystal structure with the same network topology as the sodalite zeolite structure. In this case the linkage molecular ion is methyl-imidazolate, C3N2CH3H2− (mIm). The thermal expansion across the solid solution from Zn(mIm)2 to Cd(mIm)2 has been found to be positive.139
Zeolites have been found to have surprisingly rich spectra of RUMs.45,167,172 For example, the networks of the two cubic zeolites for which NTE have been observed, LTA123,124 and FAU,8,125,126 both having 4 RUMs for every wave vector,167,172 together with a continuum of quasi-RUMs (QRUMs).45 Such a rich spectrum of low-frequency modes will give the likely origin of a strong tension effect. However, the Si–O–Si angles in these two zeolites are typically around 145°, meaning that, geometrically, the tension effect will not be as strong as in the cases with near-linear bonds. But this is offset by the large number of phonon modes that will give NTE.
The one case where we have detailed phonon studies is for FAU,22 where many of the phonon branches below 4 THz are found to have a significant degree of RUM character and negative values of their mode Grüneisen parameters. The picture though is not as clear-cut as one might have imagined, because of the complexity of the phonon dispersion curves. With many branches at low frequencies, both RUMs and QRUMs, there will be a considerable degree of mixing of eigenvectors, as allowed by symmetry. We therefore cannot point to a single mode as being responsible for NTE.
, and R,
, points in reciprocal space. The second key point is that in all calculations, these RUMs are found to have the largest negative values of the mode Grüneisen parameters. In fact the negative values are typically found to be very large. These modes are the well-established RUMs in this structure. Their atomic motions, namely alternating rotations of the octahedra within each layer (Fig. 19b), clearly gives the transverse motions involved in the tension effect. The facts that the motion requires no significant distortions of the polyhedra, and that there is a very low energy of flexing at the shared vertices, give the low frequency required for the large negative values of the mode Grüneisen parameters. The large negative values of the mode Grüneisen parameters are further enhanced by the relatively high stiffness of the Sc–F bonds, which shows normal positive thermal expansion79,81 and vibrates at the highest frequency (the branch just below 20 THz in the dispersion curves shown in Fig. 20).
![]() | ||
| Fig. 20 Left shows the full set of calculated dispersion curves of ScF3,88 and right shows the same dispersion curves coloured according to the value of the mode Grüneisen parameters, saturating with values −4 (red) and +4 (green), with yellow representing a zero value. The same calculations were used for the generation of the data shown in Fig. 12. | ||
Whilst we have identified a tension-effect mechanism for NTE in ScF3, this is not sufficient to be able to say that we understand the mechanism of NTE in ScF3. To do so, we need also to explain why NTE is not found in any oxide or halide cubic perovskite. Take for example SrTiO3, where extensive data exist to demonstrate clearly that in its cubic phase this material shows normal positive thermal expansion.175–177 Yet its lattice dynamics are remarkably similar to those of ScF3.178 Even more curious is that NTE is not found in AlF391,179 or TiF3.180 We will identify the likely explanation when we discuss the third principle below, Section 6.
A lot of work has been carried out on NTE in the double-cation fluorides of general chemical formula MM′F6, particularly where the one cation is divalent and the other is tetravalent. Experimental measurements have been performed on a number of examples,76,181 as have lattice dynamics calculations89,90,159 and molecular dynamics simulations.158 The picture that emerges is similar to that discussed above regarding ScF3, but some insights from this work will be discussed in Section 6.
![]() | ||
| Fig. 21 Left, calculated phonon dispersion curves of ZrW2O8 using DFT methods. Centre, the phonon branches have been coloured by the corresponding values of their mode Grüneisen parameters, with red representing negative values saturating at values of −6, blue representing positive values saturating for values above +6, and both colours fading to white as values tend towards zero. Right shows the level of matching of the phonon eigenvectors with those of a flexibility model with white to black representing matching from 0–100%.168 This figure has been adapted from the published work of Rimmer et al.65 | ||
One of the representations of the phonon dispersion curves in Fig. 21 has the different branches coloured according to the values of the mode Grüneisen parameters. It is seen that most of the branches for frequencies below 4 THz show negative values, and most of the branches for frequencies above 4 THz show positive values. The contrast with ScF3 shown in Fig. 20 is striking. It is not just that ZrW2O8 has many more phonon branches (132 versus 12), but that the NTE is spread over many phonon branches with wave vectors all across the Brillouin zone.
The second comparison is with a model of the flexibility, in which the WO4 tetrahedra are held to be rigid, as are the Zr–O bonds, but the O–Zr–O angles are completely flexible. The phonon modes from the DFT calculations have been projected onto the vibrations of the flexibility mode using an eigenvector matching algorithm.168 Where the representation of the phonon branch is shaded dark, there is good matching, whereas a lighter shade of grey implies a low level of matching. It is seen that the branches with frequencies below 6 THz, more so below 4 THz, match well to this model, whereas all branches above 6 THz have very weak matching. This implies that the lower frequency modes, the ones that have negative values of their Grüneisen parameters, match this flexibility model well. It also follows that the phonon branches above 6 THz have motions that involve bending of the WO4 tetrahedra. Not shown in the phonon dispersion diagrams of Fig. 21 are another set of modes with frequencies of 22–30 THz, which involve stretching of the Zr–O and W–O bonds. Matching to a flexibility model in which all bonds are rigid but all bond angles are flexible shows a complete match up to 12 THz,168 but this fails to discriminate between rigid and flexible polyhedra.
The flexibility-matching study of the phonon dispersion curves included the case of rigid ZrO6 and WO4 polyhedra, representing the RUM model. This showed no signatures of RUMs for the wave vectors along the symmetry directions, except for the acoustic modes at a very small wave vector. This, however, is the only wave vector close to symmetry directions in which RUMs had been predicted.169 The curved surface of RUMs does not intersect the symmetry directions in reciprocal space at any other point. We suggest, however, that for general points within the Brillouin zone, the RUM eigenvectors are able to mix with the eigenvectors of several other branches, because the phonons with general wave vectors have no symmetry other than the identity operation, and because there are many branches of relatively low frequency.
The conclusion from this study is that the low frequencies that support the negative values of mode Grüneisen parameters are enabled by some degree of flexibility of the ZrO6 octahedra, whilst the WO4 tetrahedra move as rigid objects over this frequency range.
In fact it was found from lattice dynamics calculations102 that the mode eigenvectors of low-frequency modes that have some degree of RUM character also have a high degree of bond-bending flexibility. In fact the analysis of rigidity showed that there is a degree of rigidity of the linear O–Cu–O bond.102 In this case, we can treat the atoms as the objects with 3 degrees of freedom each, and the bonds as constraints, but adding 2 angular constraints for each O–Cu–O bond. Thus F = 9 and C = 8, giving one degree of flexibility per formula unit. Since there are two formula units per unit cell, we have two degrees of flexibility per wave vector. It appears that this flexibility is seen in the acoustic modes and that the NTE primarily arises from the acoustic modes.
This analysis has been quantified in the lattice dynamics results presented in Fig. 22.102 In this figure we compare the complete dispersion curves with the predictions of a flexibility model based on rigid linear O–Cu–O bonds but flexible angles, and with the calculation of mode Grüneisen parameters for each branch in the phonon dispersion curves. The role of acoustic modes giving rise to NTE is clear, as is the correlation with the description of the flexibility of the structure in which O–Cu–O moves as rigid entities but with flexible angles at the centres of the OCu4 tetrahedra. It is important to note that assigning even more flexibility to this structure, for example in which the O–Cu–O angle is also flexible,170 would predict too great a network flexibility and too soft an elastic stiffness.
![]() | ||
| Fig. 22 Left shows the calculated dispersion curves of Cu2O. The same dispersion curves are shown in the centre panel coloured according to the size of the mode Grüneisen parameters, with red representing negative values and saturating at −8, blue representing positive values and saturating at +8, and both colours fading to white as values tend to zero. This suggests that the important modes for NTE are the transverse acoustic modes. The right panel shows a mapping of the phonon dispersion curves onto a flexibility model in which the linear O–Cu–O linkage is kept rigid, and the angular forces are zero. The motions associated with the acoustic modes reflect this rigidity model. This figure has been adapted from the work of Rimmer et al.102 | ||
![]() | ||
| Fig. 23 Left shows the dispersion curves of Zn(CN)2, calculated using a force field parameterised using ab initio calculations, performed using the virtual crystal approximation to simulate the head-to-tail orientational disorder of the cyanide anions. The centre panel shows the same dispersion curves coloured by the size of the mode Grüneisen parameters, where red represents negative values and blue represents positive values, with zero values coloured white. Right shows the mapping to a RUM flexibility model, where red represents rotational RUMs and blue represents acoustic (displacement) RUMs. On the far right is shown the corresponding atomic motions for a small wave vector. This figure has been adapted from the published work of Fang et al.107 | ||
The case of single-network Cd(CN)2 provides an interesting comparison with the double-network phase, not least because NTE is 50% stronger in the single-network case.108 Zn(CN)2 has a primitive lattice, space group Pn
m, with two formula units in the unit cell because of the double network. In the case of the single network, the crystal structure is face-centred cubic, space group Fd
m, also with two formula units per unit cell, but in this case they are part of the same network. Therefore the number of RUMs per wave vector is again 8. The enhanced NTE in the single network most likely arises from the single-network having lower frequencies of the acoustic modes, because there is no resistance from the second interpenetrating network, but this has apparently not been tested.
We noted earlier in our description of the crystal structure, Section 2.1.9, that there is a head-to-tail disorder of the orientations of the cyanide groups. Therefore most calculations have to be performed on a hypothetically ordered structure106 or via parameterised force fields using the virtual crystal approximation.107 In this context, two types of related compounds are interesting for study, because these have ordered orientations. The first is Si(NCN)2, which we showed in Fig. 7 and discussed briefly in Section 2.1.9. The second has the general formula MB(CN)4, where M is a monovalent metal cation. In this case, the boron atoms are bonded to the carbon atoms in the cyanide ion, and the M cations are bonded to the nitrogen atoms.
There are two known cubic forms of MB(CN)4-type structures. They have the same interpenetrating β-cristobalite lattices, but with different alignments of the cations between the two substructures. In one case the M and B cations in the two sublattices are arranged in alternative (100) layers of each type, with space group P
3m. In this structure, one type of cation is located at the origin of the unit cell, and the other in the centre. One example is AgB(CN)4,182 which shows an NTE that is not much smaller than that of Zn(CN)2.109 On the other hand, CuB(CN)4 has the same structure but shows only small NTE.109 Phonon calculations appear to suggest that, as in Zn(CN)2, the acoustic modes play an important role in the NTE, with a contribution also from the lower-frequency optic RUM.109 LiB(CN)4 is another example that shows NTE.110 The other cubic form of this type of structure is demonstrated by NaB(CN)4, which shows a large NTE.110 This has space group Fd
m, and in this case the neighbouring cations in the two sublattices along the [100] direction are of opposite type.110
In the MB(CN)4 materials it is likely that the most rigid entity is the B(CN)4 unit. Our own lattice dynamics calculations of the layer compound ZnB2(CN)4, which has a two-dimensional network of polyhedra connected by the cyanide ions, show that the B(CN)4 units move as rigid objects for all the lower frequencies, with the modes showing significant bending of the C–B–C bond occurring for higher frequencies.16 On the other hand, the tetrahedra formed between the monovalent ions and the nitrogen atom are unlikely to be rigid. The monovalent-cation–nitrogen bond is likely to be formed by relatively weak electrostatic interactions, as in the alkali halides, with the angular rigidity formed from the repulsive interactions between neighbouring nitrogen atoms rather than from any covalent bonding. This may account for the point made by Gao et al.110 about some of the phonon modes showing a lack of rigidity. However, the point is not quantified. Our own unpublished calculations of LiB(CN)4183 indicate that in the lowest-frequency optic mode at a zero wave vector and a frequency of 65.6 cm−1 (1.97 THz), which is part of a branch of optic modes with consistently negative values of the mode Grüneisen parameters, has mode eigenvectors characteristic of a RUM, with no distortions of the B(CN)4 and LiN4 tetrahedra. Instead these two polyhedra rotate with transverse displacements of the cyanide anions, giving rise to an obvious tension effect.
If the BC4 and MN4 polyhedra move as rigid units, the flexibility analysis is identical to that for Zn(CN)2 and Si(NCN)2, showing 2 transverse acoustic RUMs and 6 optic RUMs for every wave vector. If the B(CN)4 unit is the rigid unit, this reduces the problem to the topology of β-cristobalite, albeit with different Bravais lattices and with the flexibility reduced to planes of RUMs in the [110] zones of wave vectors (wave vectors of the form [ζ, ±ζ, ξ]).184 These include a triple-degenerate set of optic modes at a zero wave vector, and 2, 1 and 3 optic RUMs along the [100], [110] and [111] directions respectively. This degree of rigidity explains why one of the lowest-frequency optic modes double in frequency as the wave vector goes from zero (the Γ point) to
(X point) in AgB(CN)4.109 On the other hand, if the MN4 tetrahedra are flexible in terms of bending the N–M–N angle, we would get 5 RUMs per wave vector. It is quite likely that this all-or-nothing approach to rigidity is too extreme, but certainly to quantify the rigidity in the various examples of MB(CN)4 materials would be interesting and important. It is interesting to note that we find a cyanide twisting RUM in LiB(CN)4 at the high frequency of 335 cm−1 (10 THz),183 similar to the same RUM in Zn(CN)4.107
The same discussion will apply to other cyanides, such as the members of the Prussian blue family, Section 2.1.10. As we pointed out earlier, Section 5.3, the Prussian blue network is able to support 6 RUMs per wave vector, of which 2 are transverse acoustic modes. Although not yet clearly demonstrated, we expect that the transverse acoustic modes are most likely to be the important vibrations in giving NTE in these materials.
The second point is that many crystals share both the structure and features of the lattice dynamics of ScF3 yet without showing NTE.91,175–177,179,180 The cubic phase of SrTiO3 has phonon dispersion curves178 that share much in common with those of ScF3,88 without having NTE. Therefore it is not sufficient to have identified the RUMs as giving the mechanism for NTE in ScF3 – we also need to explain why this mechanism is important in some cases but not in others.
We noted earlier (Section 3.6) that the RUMs in ScF3 and in all cubic perovskites lie along the edges of the Brillouin zone.† The important point to understand is that the volume of the line occupied by the wave vectors along the edge of the Brillouin zone is a vanishingly small fraction of the three-dimensional volume of the whole of the Brillouin zone. Thus, although the set of RUMs may contribute extraordinarily large negative Grüneisen parameters to the overall sum, in truth they are large irrelevant by themselves because the number of RUMs is almost infinitesimally small compared to the total number of vibrations.
In practice, however, the dispersion curves form continuous distributions of frequencies in reciprocal space, so the phonons whose wave vectors are very close to the edges of the Brillouin zone will also have low frequencies close to those of the RUMs, with correspondingly-large negative values of mode Grüneisen parameters. As the wave vector moves further from the edge of the Brillouin zone, the frequencies will increase, the modes will have less of the rotational RUM character, and increasingly the mode eigenvectors will include atomic motions that bend the bonds in the ScF6 octahedra. Thus the values of the mode Grüneisen parameters will become less negative, possibly becoming positive. This is seen in the phonon dispersion curves in Fig. 20. The RUMs are the low-frequency curves connecting the M,
, and R,
, points. The R point represents the meeting of three such lines, so the RUM at R is triply degenerate. Going from R → M we see the two modes (doubly degenerate) that cease to be RUMs increase in frequency, with a commensurate lessening of the negative value of the mode Grüneisen parameter. The same is seen in the directions R → Γ, [0,0,0], and M → X,
.
It appears, therefore, that there is a cylindrical volume around the edges of the Brillouin zone, within which there is a sufficient number of wave vectors with RUM-like modes – modes with the necessary low frequencies and tension-effect motions to give negative mode Grüneisen parameters – to enable the crystal to have an overall NTE. The corollary of this is that NTE is not seen in similar systems because the volume of this cylinder around the edges of the Brillouin zone may not be large enough. So what controls the size of this volume?
The size of the relevant volume around the edges of the Brillouin zone is set by how quickly the phonons that are no longer RUMs increase in frequency. In turn, this is set by the forces associated with bending the bonds within the octahedra. That is, the more rigid are the octahedra, the smaller the relevant volume around the edges of the Brillouin zone, and the less likely is the chance for the RUMs to have sufficient weight to give NTE. We tested this idea using the simple model shown in Fig. 24a.81 The model has only three forces. The first force is to control the stiffness of the Sc–F bond. A Morse potential was used for this,
E(r) = D[exp(−2α(r − r0)) − 2 exp(−α(r − r0))],
| (23) |
![]() | (24) |
![]() | (25) |
![]() | ||
| Fig. 24 The simple model for ScF3 used to assess the important of the size of the bond angle flexing forces.81 (a) shows the definition of the model, for which a Morse potential is used to describe the Sc–F bond length, with angular forces for the right-angle F–Sc–F bond angle θ, eqn (24), and for the linear Sc–F–Sc angle φ, eqn (25). The model parameters were tuned to best reproduce the DFT lattice dynamics calculations shown in Fig. 20. (b) Shows the effect of varying the force constant A associated with the Sc–F–Sc angle φ, with the smallest value representing the starting model. (c) Shows the effect of varying the force constant k associated with the F–Sc–F angle θ, again with the smallest value representing the starting model. This figure has been adapted from the work of Dove et al.81 | ||
The parameter A was tuned so that the model gave the value of the frequency of the RUM along the line of wave vectors from R → M, consistent with the DFT calculations of the lattice dynamics.88 The parameter k directly controls the dispersion of the transverse acoustic mode along the Γ → X direction, so it was tuned to give the appropriate value of the elastic constant C44. Whilst the model lacks electrostatic interactions, and whilst it does not contain interactions required to give a non-zero value of the elastic constant C12, it nevertheless gave a full set of phonon dispersion curves in remarkable agreement with the DFT dispersion curves.81
The model was then explored using the molecular dynamics simulation method with varying values of the two force constants A and k. The results for varying the value of A are shown in Fig. 24b. As this bond becomes stiffer, the frequency of the RUM will increase, and so we see the lowering of the NTE, until at a value of A that is 40 times larger than the tuned value the thermal expansion turns positive. Qualitatively this is not surprising, given that the effect of increasing the value of A is to stiffen the structure, and through the raising of the RUM frequencies to decrease the negative values of the mode Grüneisen parameters along the RUM lines – remember the additional factor of ω−2 in eqn (7).
The second set of results, for varying the value of k, are shown in Fig. 24c, and are particularly interesting. As the ScF6 octahedra become more rigid, the NTE decreases, which corresponds to the narrowing of the cylinder of important wave vectors around the edges of the Brillouin zone. What is interesting, certainly in comparison with the variation of the value of A, is that the thermal expansion reaches zero with only a three-fold increase in the value of k.
It is possible to use a variation of this model in which the F–Sc–F right-angle potential, eqn (24), is replaced by a Morse potential for the F–F bond with an equilibrium distance that is
times that for the Sc–F potential. This has the effect of giving a non-zero C12 = C44, which is actually as given by the DFT calculations,88 because compressing a pair of opposing S–F bonds will cause compression of the F–F bonds, which is relaxed by a balanced expansion of the perpendicular Sc–F bonds. The cost is that the value of C11 is also affected by the F–F force constant, thereby perhaps breaking some of the orthogonality of the model, but this might be considered a small price compared to the feeling that C12 = 0 is too unrealistic. We developed such a model tuned by comparing the elastic constants given by this model with those calculated by DFT methods,88 showing a 1% agreement, and with a calculated phonon density of states similar to that calculated by DFT. The thermal expansion of this model is also similar to that given in the DFT calculations. As with the model discussed before, with a direct F–Sc–F angle function,88 we found from the lattice dynamics calculations that the NTE lessened and became PTE when the F–F force constant was increased by the same factor as the increase to the force constant for the F–Sc–F angular potential energy seen in Fig. 24c. Thus the conclusions regarding the importance of the flexibility of the ScF6 octahedra are robust regarding details of the model. We will discuss the use of such models in Section 10.
We can now compare the stiffness of the octahedra in ScF3 with that for some other systems, namely the TiO6 octahedra in SrTiO3 and the PbI6 octahedra in CsPbI3 and methylammonium lead iodide. We have performed studies on these systems145,188,189 using neutron total scattering190 analysed by the Reverse Monte Carlo method.191 From the realistic atomic configurations these studies have given, we have been able to extract the contributions to the atomic motions from stretching and bending of the bonds within the octahedra, and from whole-body rotations of the octahedra, using the GASP method.43,185–187 The results for ScF3 and the model described above,81 methylammonium lead iodide,188 CsPbI3189 and SrTiO3,145 are compared in Fig. 25. It can be seen that ScF3 has the most flexible octahedra; this correlates with the fact that only ScF3 shows NTE, and is consistent with our contention here that the bond-bending flexibility increases the volume of reciprocal space that is required for the RUM-type motions to be significant.
![]() | ||
| Fig. 25 GASP results43,185–187 for some different cubic networks of linked octahedra, ScF3 and its model,81 methylammonium lead iodide (MAPbI3),188 CsPbI3189 and SrTiO3.145 | ||
It is important to note that actually ScF3 is NTE only with a fine degree of sensitivity to the balance between the phonons that give NTE and those that give PTE. This is clearly seen in Fig. 12, particularly Fig. 12c, which shows a very large cancellation between modes with negative and positive values of the mode Grüneisen parameters. One can see that with only a slight reduction in the weighting of the modes with negative values of the mode Grüneisen parameters the thermal expansion will become positive.
This discussion might explain why the related fluorides AlF391,179 and TiF3180 show positive thermal expansion. Although the necessary calculations have not yet been performed, we can learn something from the recent calculations on a range of MF3 structures by Koiso et al.192 These calculations were primarily designed to understand why some materials such as AlF3 show displacive phase transitions whereas ScF3 does not. It was suggested that the RUM-type distortions in AlF3, GaF3 and InF3 arise from a second-order Jahn–Teller interaction that gives distortion of the M–F–M linkage and which is strong enough to overcome the Coulomb repulsion arising from the distortion. As part of this study the authors calculated the phonon dispersion curves of the cubic phase. It is clear from the results for the acoustic modes of AlF3 that the AlF6 octahedra are much stiffer than the ScF6 octahedra, consistent with the previous discussion. On the other hand, YF3 and La3 were predicted to show NTE, and their dispersion curves show that the acoustic modes have low frequencies, indicative of more flexible octahedra.
Confirmation of the same point also comes from the MM′F6 systems. It follows that a cation giving softer octahedra, such as the calcium cation, might lead to an increase in the NTE. This point has been confirmed experimentally76 and by lattice dynamics calculations.90
It has been suggested that we could take this argument to an extreme limit in which it is assumed that there is no bond-bending interaction within the ScF6 octahedra, but that instead the orientations of the Sc–F bonds can librate as independent Einstein oscillators.80
‡ This would mean that the relevant phonons for NTE would be distributed across the whole of the Brillouin zone, but detailed lattice dynamics calculations demonstrate that the relevant modes for NTE do actually lie within a cylinder around the edges of the Brillouin zone.88 Moreover, with such a flexibility we would expect the value of the elastic constant C44 to be extremely low – it isn't!88 Not only is such a model unsatisfactory because it contradicts the elasticity seen in the experimental data for the acoustic phonons,82 it is also not a plausible route towards understanding NTE because it requires that crystals are unrealistically soft.
The key point is that NTE does not arise from a single phonon or a specific set of phonons, but from a sufficiently-large number of phonon modes that their overall contribution to the thermal expansion outweighs the contribution from the many phonons that would normally give PTE. When the important phonons lie on specific wave vectors, or on lines or planes in reciprocal space, it is necessary that their characteristics (low frequency, tension effect motions) bleed outwards into a sufficiently large volume of reciprocal space – our third principle.
For some systems the important phonon modes span across much of reciprocal space. This includes the zeolites, the hybrid cyanide structures, and oxides such as ZrW2O8 (Fig. 21) with flexible octahedra. In this case, the relevant phonon modes do not need to have large negative values of mode Grüneisen parameters, and often they don't – the large sizes seen in ScF386,88 are not common. In this case, several phonon branches with higher frequencies and lower negative values of mode Grüneisen parameters are weighted by the fact that they occur for wave vectors across most of the Brillouin zone.
The case of β-cristobalite presents an interesting challenge. It is surely a flexible structure. Its idealised crystal structure is shown in Fig. 1. This structure has linear Si–O–Si linkages, with large transverse displacements of the oxygen atom,195 characteristic of motions associated with the tension effect (compare with ScF3, Fig. 12a). The large-amplitude motions arise from the RUM flexibility,41,42,196 which in turn leads to the existence of a displacive phase transition.195,196 The presence of a significant number of RUMs in β-cristobalite has been demonstrated by inelastic neutron scattering measurements,196 quantified by GASP analysis of configurations generated from neutron total scattering measurements analysed by RMC,43 confirmed by the diffuse scattering calculated from the RMC configurations,40 and measured by electron diffraction.197
In spite of β-cristobalite appearing to have all the ingredients required for NTE, β-cristobalite shows PTE up to around 1400 K39 and NTE only at higher temperatures,38 Fig. 26. This is remarkable given that Cu2O also crystallises with the β-cristobalite structure, but in this case there are two interpenetrating β-cristobalite lattices with OCu4 tetrahedra, which might be thought to impede the fluctuations required to give NTE. Yet Cu2O shows NTE at lower temperatures95–97 when β-cristobalite does not.
![]() | ||
| Fig. 26 The variation of the lattice parameter of cubic β-cristobalite with temperature, merging data from two sources as open39 and filled38 black circles. The continuous black line is a guide to the eye fitted through the data. The red curve is a calculation from the simulations discussed in Section 7.194 | ||
Another example is Si(NCN)2.112 This has the same crystal structure as Zn(CN)2 (Fig. 7) with ordered NCN2− molecular anions replacing the disordered CN− anions. DFT calculations suggest that Si(NCN)2 has a high degree of flexibility,111 yet the NTE seen in Si(NCN)2112 is much smaller than that in Zn(CN)2.103
It is probable that the high degree of flexibility of both structures leads to the presence of a significant degree of structural disorder. In fact this has been quantified much more for β-cristobalite than that for Si(NCN)2 in terms of structural40,195 and simulation198–200 studies. The total scattering data analysed using the RMC method showed that the O atoms move continuously around an annulus surrounding the Si–Si linkage, with a distribution of positions that look more like a doughnut.40 Prior to this work a model was proposed in which the oxygen atoms occupy six sites on this annulus, but in truth the fitting of this model to the neutron powder diffraction data was little better than that for the idealised model with large anisotropic atomic displacement parameters.195 The RMC results show a much more continuous, and broad, distribution of oxygen positions around the annulus. Put another way, they show a broad distribution of the orientations of the Si–O bonds and hence of the SiO4 tetrahedra.
We have recently posed the question of whether the presence of a significant degree of dynamic structure disorder is responsible for the PTE in β-cristobalite and the relatively low NTE in Si(NCN)2.194 We have constructed some simple models, analogous to the model for ScF3, Fig. 24, both for β-cristobalite and Si(NCN)2, the latter using rigid NCN molecules. The model allows for stretching of the Si–O and Si–N bonds, bending of the O–Si–O and N–Si–N tetrahedral angles (an ideal value of 109.47°), and flexing of the Si–O–Si and Si–NCN angles. The assumption is made that the bond and tetrahedral angle stiffnesses are large, but the second angle is more flexible. The thermal expansion shown by the model was calculated by performing many molecular dynamics simulations across a wide range of temperatures for a range of stiffnesses of the second angle.
We consider here mostly the results for one version of the model for β-cristobalite, in which we have a potential of the form of eqn (24) for the Si–O–Si angle, which we denote as θ, with θ0 = 145°. We have a similar function for the tetrahedral O–Si–O angle, whose force constant is held fixed at a large constant value. We also create a stiff Si–O bond using a Morse function. In this case the Morse function allows for a small positive thermal expansion of the bond.
Results for the variation of the lattice parameter with temperature for different values of the force constant k, from small to large, are shown in Fig. 27. The results show different behaviours for the two extreme values of k, large and very small values, within the results for intermediate values of k showing an interesting interpolation between the two extreme limits. This we need to unpick.
In the case of very small values of k, the network is very floppy, and RUMs will be easily excited. However, it is important to know that the RUMs in β-cristobalite lie on planes in reciprocal space that are normal to the lattice vectors of the form 〈110〉, that is, with wave vectors of the general form {ζ,±ζ,ξ}. Thus the β-cristobalite network has a more constrained RUM flexibility than do the zeolites, and large-amplitude RUM motions will require a small distortion of the tetrahedral Si–O–Si angles. This means that for a small value of k, at low temperatures the crystal structure will align close to its state of maximum volume because that will involve minimal bending of the Si–O–Si angles. What is seen is that there is a rapid drop in the value of the lattice parameter, clearly with a large NTE, until saturating with nearly zero thermal expansion. The saturation is at the value of the lattice parameter at which the orientational disorder of the SiO4 can no longer increase within the constraint of the network.
This result is what was also seen in simulations of the model for θ0 = 0 and also in the simulations of the models for Si(NCN)2 with and without a corresponding angular force. In both cases, the size of the shrinkage decreases as the angle force constant increases. This implies that the relatively low thermal expansion seen in Si(NCN)2 may result from the saturation of the disorder. In the case of Si(NCN)2 there is a phase transition to an unknown crystal structure, so the high NTE this model would predict at low temperatures cannot be observed.
In the other limit of large values of k, the bending term constrains θ to be close to 145°. This causes a small distortion of the SiO4 tetrahedra above the thermal motion, expanding the standard deviation of the O–Si–O tetrahedral angle by about 0.6° at the lowest temperature, above the thermal value of 2°. Upon heating 〈θ〉 remains close to 145°, with a variance that increases with temperature. In this case there is no scope for the operation of a tension effect, because the network is locked into a constrained disordered state, and as a result the system shows only PTE. In fact the linear thermal expansion of the network is about 2/3rd of the thermal expansion of the Si–O bond, showing that fluctuations do reduce the thermal expansion to a small extent, but not by enough to give NTE.
The intermediate cases show a number of very interesting properties. First is that with a not-insignificant value of the Si–O–Si bending force constant k there is a balance between the energy costs of straightening the Si–O–Si angle and bending the O–Si–O tetrahedral angles. In practice this pulls the weight of the distribution of Si–O–Si angles towards 145°, but upon heating the increasing thermal fluctuations actually weight the distribution away from 145° and more towards 180°. This gives rise to the initial positive thermal expansion. However, upon further heating the distribution broadens, which can then only mean spreading away from 180°. However, the pull is not specifically towards 145° but towards larger angles, which then gives rise to NTE, but again saturating exactly as for the case of the weakest value of the Si–O–Si force constant k.
The interesting point from this is that in this model we see an initial PTE at lower temperatures, followed by NTE at higher temperatures. This is exactly as seen in β-cristobalite. We can in fact adjust the parameters of the model to give a reasonable agreement with the experimental data for β-cristobalite, as shown by the red curve in Fig. 26. We can also adjust the parameters to give a good agreement with the NTE seen in Si(NCN)2.194
One very curious feature in Fig. 27 is that the lattice parameters for all values of the Si–O–Si force constant k are identical at a temperature of 598 K. If we increase the value of the force constant associated with the O–Si–O tetrahedral angles there remains a point of concurrency but it moves to higher temperatures. In fact there is a linear – but not proportional – relationship between the concurrency temperature and the value of the O–Si–O force constant.
What this model has shown for both β-cristobalite – which we have discussed in more detail – and Si(NCN)2 is that when we increase the features that can give NTE, as discussed in the first three principles, we have larger thermal fluctuations which give rise to local dynamic disorder and which then limits the size of the NTE. This leads to our fourth principle that the thermal disorder created by the network flexibility that gives NTE ends up limiting the size of the NTE that can be achieved. We might remark that even for materials that show larger NTE, it may well be the case that disorder is still a limiting factor, but this is a point that has not yet been studied.
Experimentally the saturation of the NTE with increased temperature due to disorder will be hard to differentiate from the effects of phonon–phonon anharmonicity increasing the frequencies of the modes associated with the tension effect and from the effects of exciting higher-frequency modes that contribute to PTE with positive mode Grüneisen parameters. The latter effects can be indicated by lattice dynamics calculations.
The effect due to bending of the linkage of three atoms, as in β-cristobalite, may be more common, but again difficult to identify. We can identify cases where we do not expect to see such an effect. According to the recent calculations by Koiso et al.192 it is probably the case that the Sc–F–Sc linkage in ScF3 favours the linear arrangement. Previous work on Zn(CN)2 also suggests that the Zn–(CN) bond also prefers to be linear, which is quite likely to be true for several other cyanide compounds. Thus the NTE in both ScF3 and Zn(CN)2 will not be affected by this type of disorder. On the other hand, given that it appears that AlF3 prefers a bent Al–F–Al linkage,192 it is possible that in addition to the effects from the stiffness of the AlF6 octahedra limiting the impact of the line of RUMs (recall the discussion regarding ScF3 in Section 6), it is possible that the disorder of the positions of the fluorine atoms may also mitigate against NTE in the cubic phase of this material.
It might also be the case that ZrV2O7 shows the same behaviour as that seen in β-cristobalite. The shared oxygen in the pyrovanadate, V2O7, cluster is linear on average but prefers to be bent. Data published by John Evans18 show that ZrV2O7 shows PTE at temperatures immediately above the phase transition at around 375 K, reaching a maximum value of the lattice parameter at a temperature around 10 K higher and then showing NTE at higher temperatures. This resembles the behaviour in β-cristobalite, albeit with different ranges of temperature, and disorder of the orientation of the V–O bonds may be the explanation. The same appears to be the cases or uranium and thorium pyrovanadates.67 The case of PTE in zirconium pyrophosphate, and also titanium pyrophosphate,67 may also be due to orientational disorder of the P–O bonds over all temperatures measured.
It is commonly assumed that the tension effect applies to NTE in anisotropic materials, but this may not always be the case. In particular, there are two other causes of NTE in anisotropic materials, which frequently may be more important. One cause of uniaxial or area NTE is from elastic anisotropy. For anisotropic systems, the Grüneisen equation for thermal expansion in isotropic systems, eqn (12), does not apply. This is a point that is not always appreciated. The correct equation is
![]() | (26) |
j are then defined with respect to the strains rather than volume, formed as
![]() | (27) |
is a component of the second-rank strain tensor, and includes both tensile and shear strains.
It is tempting to assign the same significance to volume and volumetric thermal expansivity αV to an anisotropic crystal showing NTE as to NTE in isotropic materials, but this can be misleading. The volume strain is actually a sum of the individual tensile strains,
![]() | (28) |
cannot be considered in isolation from separate analysis of the individual tensile strain components. For certain, this prevents the formulation of a meaningful calculation of mode Grüneisen parameters.
The point of discussing eqn (26) in this context is that it is perfectly possible for NTE to arise purely from the signs of the components of the elastic compliance tensor with positive values of the overall strain Grüneisen parameters. If the off-diagonal components of the elastic constant tensor C have positive values, it follows that the off-diagonal components of the elastic compliance tensor S = C−1 are necessarily negative, and these can be large enough to give axial or area NTE even when all the overall strain Grüneisen parameters are positive. This point was discussed back in 19724 and has been demonstrated in a few recent studies.15,16,201 One of these cases, the silicate cordierite, is interesting in showing axial NTE, when at some temperatures all three overall strain Grüneisen parameters have negative values and at other temperatures all have positive values.
A second origin of NTE in anisotropic materials is from a structural phase transition, where the symmetry-breaking distortion of the crystal structure is accompanied by spontaneous strains. If these strains are positive, and if the background positive thermal expansion is not too large, the positive spontaneous strains will give the impression of NTE upon heating up to the phase transition. Examples are PbTiO3,202–205 Cu2P2O7,17,206,207 ammonium sulphate,208,209 and Cd2Re2O7.210,211 Interestingly, in the last three of these cases the phase transition appears to be near-tricritical, which means that the NTE actually diverges at the phase transition. This is astonishing and gives the chance to control very large NTE, which compares with the point we have made that isotropic crystals may have a natural limit to the size of their NTE due to dynamic disorder, Section 7.
In these cases, it appears that there is a temptation to immediately perform DFT calculations of Grüneisen parameters. As we have pointed out elsewhere,17 it is highly likely that it will be possible to calculate negative values of the mode Grüneisen parameters, even when using the isotropic formulae. This is a direct consequence of the coupling between strains and order parameter. But the calculation of a negative value of a mode Grüneisen parameter when NTE is associated with a phase transition should never be taken as implying the existence of a tension effect, particularly when the thermal expansion in the high-temperature phase is positive.207,211
The important point is that whilst there may be a tension effect operating in some anisotropic materials, sometimes this may be only for certain strain components. One example is for CuCN,155 where there is a clear tension effect operating along the Cu–CN–Cu chains.212 Graphene is the clearest two-dimensional example, but here the tension effect operates only within a one-atom-thick layer. More interesting, perhaps, is recent work on the network layer material ZnB2(CN)8.16,213 Although the biaxial NTE in this material is caused by the anisotropic elasticity, the origin of a very low overall strain Grüneisen parameter for the relevant area strain has been understood in terms of the same principles outlined in this article.16 However, in other cases, a tension effect may be slight or non-existent, with the major effects arising from the elastic anisotropy or the phase transition. We therefore caution against transferring our clear understanding of NTE in isotropic materials directly to anisotropic materials.
Particularly pertinent is the question of why, in crystal structures that are largely identical but have different chemical composition, this balance can be pushed dramatically from one side to another? For example, why do we see NTE in ScF3 but not in AlF3,179 why in ZrV2O7 but not in ZrP2O7,67 why strongly in AgB(CN)4 but only weakly in CuB(CN)4?109
We have addressed some issues regarding what controls the balance. These include that of the distribution of the wave vectors of the phonons with large negative mode Grüneisen parameters and the relative volume they occupy within the Brillouin zone (Section 3.2). We have also seen from the simple model introduced in Section 3.5 that the size of the negative mode Grüneisen parameters depends on two factors. The first one, in the numerator, is the stiffness of the bond, which in turn determines the coupling of the transverse motion of the central atom in the tension effect to strain. The stiffer the bond, the higher the coupling, leading to promotion of NTE. The second, in the denominator, is the square of the mode frequency, which clearly leads to promotion of NTE if the frequency is low. All of these will be affected by the chemical composition. These factors may all be characterised in some way by the atomic size.109 Larger atoms may result in softer bending potentials that lead to an increase in the relevant volume of reciprocal space and lead to lower frequencies, but at the same time larger atoms may have softer nearest-neighbour interactions, which reduce the operation of the tension effect.
It is not our intention to develop this point further. Our goal in this article has been to review the mechanisms that are required to give NTE, but in practice even with all the mechanisms in place, the issue of how the balance tips between NTE and PTE will depend to a greater or lesser extent on the detailed chemistry. These factors will affect the mechanisms in different and often competing wells, affecting both the numerator and denominator in the equation for thermal expansion of isotropic systems, eqn (12). There is scope for more quantitative research on this issue.
As a side issue, we should address the question of whether in the cases where there is a fine balance – such as in ScF3, as evidenced in Fig. 12 – calculations by slightly different methods may give different results. For example, DFT calculations performed by different software packages, or even by different workers using the same software package, might have different flavours of the exchange–correlation functionals, different pseudopotentials, or even different cutoffs. This is a serious point because often different calculations of the phonon dispersion curves for the same system give different results, as for example for ScF385–87,90,190,214,215 and the related CaZrF6.76,89,90,159 It is often found that the extreme values of mode Grüneisen parameters may vary between different calculations. However, the broad results for the calculation of thermal expansion usually show reasonable qualitative agreement. Another example is for Cu2O,100–102 where different calculations also show reasonable qualitative agreement. It appears that in general the key factors that provide the balance, such as the spread of negative mode Grüneisen parameters across reciprocal space as given by the overall network flexibility and polyhedron flexibility, are more robust, and fine details matter less.
We have developed a simple lattice model for the Zr2(PO4)3 network, setting charges to be zero, and enabling short-range interactions using the Morse function (eqn (23)) for Zr–O, P–O and O–O bonds. The parameters in this minimal model were tuned to give typical frequencies for the bond-stretching and bond-bending modes in both types of polyhedra. Because this system is over-constrained, even without adding interactions to account for bending interactions centred on the oxygen atoms, the 48 flexibility modes acquire reasonable frequencies without giving a gap between them and the other modes. This point can be seen in the phonon density of states shown in Fig. 28a.
Our starting model has tetrahedral O–P–O angles within 2° of the ideal tetrahedral angle. and octahedral O–Zr–O angles within 4° of a right angle. The variance on the P–O and Zr–O bond distances is very small, but the P–O distances show stretching from the ideal distance of the potential energy function – r0 in eqn (23) – of 4%. This gives an indication of the internal stresses experienced by the internal constraints within the network.
The distribution of mode Grüneisen parameters with frequency for our model Zr2(PO4)3 network is shown in Fig. 28b. The largest contribution to NTE clearly comes from modes with frequencies between 1 and 3 THz. These are some of the modes that are flexible in the limit of rigid PO4 tetrahedra and Zr–O bonds and flexible O–Zr–O angles. For frequencies above 8 THz, which is the regime in which we have to account for flexible PO4 tetrahedra and Zr–O bonds, all phonon modes have positive values of their mode Grüneisen parameters.
The combination of the mode Grüneisen parameters to form the overall Grüneisen parameter is shown in Fig. 28c, together with the corresponding linear thermal expansivity. The form of the mode Grüneisen parameters, with a deep minimum at low temperature, is fairly common for an NTE material, but it is always offset by the multiplication by the heat capacity, which falls to zero at zero temperature. The overall Grüneisen parameter remains negative to high temperature, the point at which each mode contributes equally, and so the NTE continues to high temperature, albeit to a slightly lesser extent than at lower temperatures. The experimental linear thermal expansivity also remains negative to higher temperatures,129 but its absolute value is lower than that of our calculated value. We will comment on this below.
The point of this discussion and example is that we have used the ideas articulated in this article to construct an understanding of NTE in a new material, NaZr2(PO4)3.129 We have interpreted the network in terms of rigid PO4 tetrahedra connected to ZrO6 octahedra with rigid Zr–O bonds. With a minimalist force model that allows for flexibility of the PO4 tetrahedra and more rigidity of the ZrO6 octahedra, we have been able to compute the mode Grüneisen parameters and thereby show that the NTE arises from a tension-effect phonon mechanism. The correlation with the flexibility model, in that the low-frequency modes enabled by the flexibility of the network, demonstrates the first two principles outlined in this article. The spread of modes across reciprocal space is enabled by the fact that the flexibility model has no concentration in any particular regions of reciprocal space, in contrast, say, to ScF3.
With a minimalist model we can ask other questions of the crystal structure that are pertinent to NTE. One example is to explore a model where we replace the O–O interactions by bond-bending interactions. As discussed in the case of ScF3, this model does not give a reasonable value of the C12 elastic constant, but for exploring the structure this is not important. One interesting point about the crystal structure of NaZr2(PO4)3 concerns the distribution of the four Zr–O–P angles, which are the angles that give the tension effect. Ideally this angle should be nearly linear, but in the langbeinite structure the angles are more varied, with one as low as around 140°. The lower angles are less effective in transmitting the tension effect than are linear bonds, but in the langbeinite structure the other three angles are around 20° away from linear. It can be seen that changing the relative size of the Zr–O distance compared with the P–O distance can increase all four angles, which should give a stronger tension effect.
The use of minimalist models is probably unexploited in NTE research, yet is an ideal mechanism for understanding the key principles concerning the origin of NTE, as was demonstrated in the case of ScF3.88 Minimalist models actually allow for more immediate contact with the key principles than do more complex models. In fact it is more common to go immediately to DFT calculations. We would like to encourage the use of minimal models first – in fact we can ask the question of whether DFT calculations are able to give much information beyond minimalist models, other than providing a closer quantitative link with experimental results? In the calculations on the Zr2(PO4)3 network we have presented here, we suspect that the minimalist model is sufficient to demonstrate the origin of NTE. We look forward to seeing whether DFT calculations performed on NaZr2(PO4)3 in the near future will add more information beyond what we have presented here.
We commented that the minimalist model over-estimates the size of the NTE compared to experimental value, although it does reproduce the trend towards a constant NTE at higher temperatures with larger NTE at lower temperatures. The over-estimate is probably due to the absence of the sodium cations, which will act to stiffen the lattice and hence increase the value of the bulk modulus K, which appears in the denominator of eqn (12). Does this matter? In terms of understanding, we think that probably not.
It is our contention that due to the efforts of the NTE community over the past three decades, we can say that we now have a detailed and general understanding of NTE in cubic crystals, such that most of the mystery has now been taken away. That is not to say that there are no more open questions. We have articulated above (Section 9) that there is still quantitative work to be done to better understand how chemical composition can change the size of, or even existence of, NTE within the same structure family. What we have provided here is the quantitative theoretical framework into which such an analysis can be woven.
Whilst one aspect of understanding the existence of a property is to explain the factors behind its existence in the known case, two other aspects are predicting behaviour in new materials and also in explaining its non-existence in other cases. Sometimes the latter case is most challenging. For example, we have highlighted the question of why NTE is found in ScF3 but not in AlF3 or in SrTiO3. Our principles have allowed us to understand these systems, and they apply also to other oxides such as spinels and garnets that do not show NTE.
We can comment on the case we mentioned at the outset, SmB6 (Section 2.1.7). Whilst this shows NTE, other members of the family, such as the alkali-earth hexaborides, do not. At first sight these look as if they can show a tension effect, but in practice the network is too rigid and the thermal expansion is dominated by the expansion of the bonds.93 Although it took a calculation to demonstrate this, once it is clear that a certain network is too rigid, it follows that we can understand the absence of NTE in the whole family. The NTE in SmB6 then arises from electronic effects.
We have also raised the challenge of whether there are more families of cubic materials showing NTE to be discovered. We have expressed some pessimism in the case of ceramic oxides, but we would be delighted to be proven wrong, especially if proven wrong several times!
There is also the possibility for new hybrid network materials with cubic symmetry. Many cyanide networks are based on the β-cristobalite or ScF3 structures, but there are many other tetrahedral networks that could possibly be reproduced with cyanide linkers instead of oxygen. The same may be true with other ligands, following the example of the ZIF structures. The four principles we have discussed in this article should apply in these cases, but the network flexibility may be constrained by angular forces, as identified in the cyanide networks.
A decade and a bit after Cora Lind20 posed the question, “[After] two decades of negative thermal expansion research: where do we stand?”, we contend that our understanding has matured considerably. In part this is due to findings associated with new materials, or more often new variants of the existing topologies, but perhaps more importantly from the maturing of supporting calculations and quantitative analysis. We hope this is reflected in this article.
It is often said by authors that one goal of NTE research is to find new materials. Perhaps we might be bold to suggest that it may be unlikely that many new materials will replace the traditional glass ceramics, which have had wide-ranging market applications for many decades. After all, many of the materials we have reviewed are toxic or unstable. But it is our conviction that our understanding of the principles associated with NTE in isotropic crystals will be invaluable in assisting the development of improved high-performance glass ceramics.
![]() | ||
| Fig. 29 One dimensional crystal with four atoms in the unit cell. The horizontal line indicates the span of the unit cell. | ||
Without worrying about issues connected with long-range electrostatic interactions – which are not a problem but make it a little more complicated that is helpful here – we can imagine that the atoms interact through pairwise forces that depend only on the distance between them. Because the crystal is periodic and can be assumed to stretch forever, we will only need to consider atoms in the unit cell at our chosen origin interacting with the atoms in neighbouring unit cells.
Let us assume that we express the energy between two atoms in terms of their separation r, where their equilibrium separation is r0, as a Taylor expansion:
![]() | (29) |
We can write r − r0 in terms of the displacements of the two atoms. Let us denote the displacement of atom j in unit cell ℓ as uj,n, in which case we can write
| r − r0 = uj′,n′ − uj,n | (30) |
![]() | (31) |
![]() | (32) |
![]() | (33) |
We can quickly connect the derivatives of eqn (29) and (32). From our definition of r − r0 above, it follows that ∂r/∂uj′,n′ = +1 and ∂r/∂uj,n = −1. Thus ∂E/∂uj′,n′ = +∂E/∂r and ∂E/∂uj,n = −∂E/∂r. For the case when the two atoms are different, that is when (j′, n′) ≠ (j, n), it follows that
![]() | (34) |
![]() | (35) |
We note in passing that the different signs of the two derivatives can be rationalised when we consider the force constant for two atoms. The harmonic energy is
![]() | (36) |
As we would expect, the energy is zero when uj',n' = uj,n since there is no stretching of the bond and is positive when uj',n' = −uj,n, that is, when the bond is stretched.
The Born–von Kármán theory of lattice dynamics assumes that we can associate the displacements of atoms at time t with travelling waves, so we can write
![]() | (37) |
is the relative displacement of atom j, and q is the wave vector, with magnitude given by wavelength λ as |q| = 2π/λ, where a is the lattice parameter so that the position of the origin of the unit cell is equal to Rn = na, and ej is the relative displacement of atom j in the unit cell, such that
. The approach in the Born–von Kármán theory is to write Newton's equation, force = mass × acceleration, as
![]() | (38) |
When we do the derivatives we get Aexp(i(qna − ωt)) on both sides of the resultant equations; this will cancel, leaving us, after a little shuffling, with
![]() | (39) |
This is now looking like a matrix equation. So let us write a matrix element as
![]() | (40) |
in the definition of ej should now be obvious.
We also pack the variables ej into a column matrix
![]() | (41) |
| ω2e = D(q)·e ⇒ ω2 = eT·D(q)·e | (42) |
There is, in one dimension, as many solutions as there are numbers of atoms, N. So let us put everything into one master equation, such that we define two square matrices
![]() | (43) |
![]() | (44) |
Eqn (42), with N different solutions, can thus be collated as
| Ω(q) = eT(q)·D(q)·e(q) | (45) |
There are a few points to make about eqn (45). The first is that it is a classic eigenvalue equation, such that the diagonal matrix Ω(q) is the array of eigenvalues of D(q), and e(q) is the corresponding array of eigenvectors. Note that we have the normalisation that eT·e is the unit matrix.
The second point to make is that the dynamical matrix D(q) has the property that its complex conjugate is equal to its transpose: D(−q) = DT(q). This is the defining characteristic of what is called a Hermitian matrix; the key property of Hermitian matrices, in the context of the theory of lattice dynamics, is that its eigenvalues, namely the components of the diagonal matrix Ω(q), are purely real, even though the Hermitian matrix itself is complex.
The point is that the individual solutions ω2 for each vibration is a real number, although it can be positive or negative. If all values of ω2 are positive it means that the crystal is stable in the configuration for which the calculation has been performed: it is in its lowest energy state with regard to any small deformation. On the other hand, if one value of ω2 is negative, it means that the crystal is unstable against a deformation that will be exactly characterised by the corresponding column of the eigenvector matrix e. This point is very relevant for our understanding of displacive phase transitions: in the high temperature phase, a positive value of ω2 for one vibration is only sustained by the perturbation from anharmonic interactions,218 whose effect diminishes upon cooling, and the phase transition occurs at the temperature at which the value of ω2 falls to zero. The crystal then undergoes a spontaneous symmetry-breaking distortion that is characterised by the corresponding form of the eigenvector e.
The mathematics can easily be generalised to three dimensions, but at the cost of adding two sets of indices to everything. The wave vector q becomes the three-dimensional vector q, and the displacements uj,n then become the vectors uj,n, where the 3 × 1 array n defines the neighbouring unit cell in three spatial dimensions. The distance from the origin to the lattice point Rn becomes the vector Rn. The individual elements of the force constant matrices,
and
, themselves become 3 × 3 sub-matrices
and
. Thus the dynamical matrix in three dimensions, D(q), becomes a 3N × 3N matrix, with 3N eigenvalues. Writing this out with the new pairs of subscripts merely makes the equations look more opaque, which is why we present the formalism in one dimension, and leave it to the readers' imaginations to appreciate the generalisation to three dimensions.
Footnotes |
| † Note that in the mixed-cation fluorides, MM′F6 discussed in Section 5.4.2, because of the doubling of the unit cell in each direction to give a face-centred Bravais lattice, the RUMs lie along the [100] directions, given that the point at the corner of the Brillouin zone in ScF3 is at the centre of the Brillouin zone for the reciprocal space of the MM′F6 lattice. |
| ‡ The argument is more clearly articulated in the press release that accompanied publication of ref. 80.193 |
| This journal is © The Royal Society of Chemistry 2026 |