Open Access Article
Florian
Kleemiss
a,
Oleg V.
Dolomanov
b,
Michael
Bodensteiner
c,
Norbert
Peyerimhoff
d,
Laura
Midgley
d,
Luc J.
Bourhis
e,
Alessandro
Genoni
f,
Lorraine A.
Malaspina
a,
Dylan
Jayatilaka
g,
John L.
Spencer
h,
Fraser
White
i,
Bernhard
Grundkötter-Stock
j,
Simon
Steinhauer
j,
Dieter
Lentz
j,
Horst
Puschmann
*b and
Simon
Grabowsky
*a
aUniversität Bern, Departement für Chemie und Biochemie, Freiestrasse 3, 3012 Bern, Switzerland. E-mail: simon.grabowsky@dcb.unibe.ch
bOlexSys Ltd, Durham University, South Road, Durham, DH1 3LE, UK. E-mail: horst@olexsys.org
cUniversität Regensburg, Fakultät für Chemie und Pharmazie, Universitätsstr. 31, 93053 Regensburg, Germany
dDurham University, Department of Mathematical Sciences, South Road, Durham, DH1 3LE, UK
eBruker France, 4 Allée Lorentz, Champs-sur-Marne, 77447 Marne-la-Vallée cedex 2, France
fUniversité de Lorraine & CNRS, Laboratoire de Physique et Chimie Théoriques (LPCT), UMR CNRS 7019, 1 Boulevard Arago, 57078 Metz, France
gUniversity of Western Australia, School of Molecular Sciences, 35 Stirling Highway, WA 6009, Perth, Australia
hVictoria University of Wellington, School of Chemical and Physical Sciences, Wellington 6012, New Zealand
iRigaku Europe SE, Hugenottenallee 167, 63263 Neu-Isenburg, Germany
jFreie Universität Berlin, Institut für Chemie und Biochemie, Anorganische Chemie, Fabeckstr. 34/36, 14195 Berlin, Germany
First published on 9th November 2020
The relationship between the structure and the properties of a drug or material is a key concept of chemistry. Knowledge of the three-dimensional structure is considered to be of such importance that almost every report of a new chemical compound is accompanied by an X-ray crystal structure – at least since the 1970s when diffraction equipment became widely available. Crystallographic software of that time was restricted to very limited computing power, and therefore drastic simplifications had to be made. It is these simplifications that make the determination of the correct structure, especially when it comes to hydrogen atoms, virtually impossible. We have devised a robust and fast system where modern chemical structure models replace the old assumptions, leading to correct structures from the model refinement against standard in-house diffraction data using no more than widely available software and desktop computing power. We call this system NoSpherA2 (Non-Spherical Atoms in Olex2). We explain the theoretical background of this technique and demonstrate the far-reaching effects that the improved structure quality that is now routinely available can have on the interpretation of chemical problems exemplified by five selected examples.
Unfortunately, crystallographic methods and software development have not kept up with hardware development. The vast majority of structure refinements are still based on techniques that make use of one crucial simplification that was introduced in the early days of crystallographic refinement: the Independent Atom Model (IAM), in which atoms are assumed to be separate, non-interacting spherical entities.7 Here, we show how an accessible generalized procedure in which quantum chemical calculations coupled with modern crystallographic software can solve a variety of chemical problems using single-crystal diffraction data of any kind.
The diffraction pattern consists of thousands of unintelligible spots on hundreds of images. To make sense of it, it is necessary to resort to a model of the molecule that will give rise to the measured diffraction pattern. This model building is central to any technique based on diffraction, and it is this final model that we call a ‘crystal structure’. Calculating the expected diffraction pattern from this model requires a certain amount of computing power – and this is where the approximation comes in. Calculations are a lot easier if a spherical distribution of electron density around each atom is assumed. This IAM approximation works very well since most of the diffraction is due to the electrons on the atoms themselves – and the quality of diffraction data that could be obtained in the early days of the field was itself rather limited.
Today, X-ray crystallography finds itself in the strange position where the real diffraction pattern arising from the interaction of X-rays with electrons in their real positions can be measured with high precision and accuracy, and yet we still use an approximation in our models that leads to assuming the wrong positions of these electrons.
In a standard structure refinement, where atoms are treated as independent of each other (IAM), the atomic form factors have been calculated theoretically from the spherical electron-density distribution of isolated atoms in the past and are now available in tabulated form.25 Hence, refinements are fast and convenient, but they neglect chemical bonding, as discussed above. Non-spherical structure refinements are either based on approximate pre-calculated and averaged multipole populations or theoretical wavefunctions, which are tailor-made for the compound under investigation. In both cases, non-spherical bonded-atom electron densities are calculated and are then Fourier-transformed to produce non-spherical atomic form factors. However, only in the latter case, the theoretical chemical-bonding information is readily available after the refinement and can be chemically interpreted.
![]() | ||
| Fig. 1 Deformation Hirshfeld densities12 for the carbon (left) and oxygen (right) atoms in the carboxylate group of Gly-L-Ala, i.e. difference between the spherical atomic electron density used in the IAM and the non-spherical Hirshfeld atom density used in HAR (IAM minus HAR). Red = negative, blue = positive. Isovalue = 0.17 eÅ−3. | ||
HAR requires the repeated calculation of a molecular wavefunction, which restricts its applications even further. The overall process can be slow because of the repeated quantum-mechanical step. One approach is to combine HAR with libraries of extremely localized molecular orbitals (ELMOs),38 which has resulted in the HAR-ELMO method.39 While this is very fast, it relies on the availability of pre-calculated molecular orbitals and is therefore not suitable for general use, but is highly relevant for the refinement of proteins which consist of a fixed subset of 20 amino acid building blocks.
Periodic network compounds could not be handled previously because molecular wavefunctions are used – a problem discussed in ref. 40 by analyzing periodic wavefunctions in combination with stockholder partitioning.
Disordered compounds cannot be handled by Tonto, and there are no apparent plans that this feature will be implemented in Tonto in the foreseeable future.
We present here three different compound classes that could not previously have been refined with HAR – but exhibit important chemical-bonding questions:
• Disordered structures (both occupational and conformational disorder).
• Structures in highly symmetric space groups with special positions.
• Structures with heavy elements next to very light elements.
We have refined representative structures of each class using NoSpherA2 as summarized in Table 1.
| Problem | Example | Field of Application |
|---|---|---|
| Occupational disorder | (C6H6O2)(CO2)0.854 | Supramolecular chemistry, host–guest systems, MOFs/COFs |
| Conformational disorder | C10H10N4F2 | Protein crystallography, solvent disorder, macromolecular crystallography |
| High symmetry network compounds | CaF2/(NH4)2B6H6 | Network compounds, crystal design, inorganic structure refinement |
| Compounds containing heavy metals | OsH6(PC12H19)2 | Inorganic and metalorganic compounds, catalytic complex understanding, metal–metal interactions |
The textbook notion that species such as CaF2 consist of spherical ions has rarely been questioned, although there is evidence that either a multipole or a neutral-atom model may describe such compounds better than the conventional ionic model.49,50 Electron and γ-ray diffraction have shown that there is non-sphericity of the valence electron density in KCl, LiF, and MgO.51,52 Specifically for Ca2+, the formally empty d-orbitals can be partially populated to add substantial covalent bonding character to the cation–anion interactions, as, for example, in the putative Zintl-phase of CaSi53 or the calcocenes.54 Here, utilizing HAR, we show that there is significant bond-directed non-spherical valence density present in the crystal structure of CaF2.49
The bonding in polyhedral boranes and borates cannot be explained by 2-center-2-electron Lewis models, but various other concepts such as three-dimensional aromaticity must be invoked.55 Therefore, there are many diffraction- and electron-density-based studies on borates.56 Here, we have chosen (NH4)2B6H6 because of its fundamental character and symmetric crystal packing, but also because in previous studies the quantum-crystallographic description involving the hydrogen atoms was ambiguous.57 However, if the potential of closo-hexaborates for hydrogen-storage applications58 is to be fully exploited, the hydrogen atom geometric and bonding parameters must be determined accurately and precisely, which will be demonstrated in this paper.
Here, we report how the methodological progress manifested in NoSpherA2 allows the successful non-spherical refinement of the osmium atom and the accurate determination of hydrogen atom parameters in bis(diisopropylphenylphosphine) hexahydridoosmium, OsH6(PC12H19)2,62 referenced against results from neutron diffraction of the same compound.63 Limits of the X-ray diffraction experiment and its resolution truncation are also revealed and discussed.
Besides the NoSpherA2 method developments discussed below, the advantages gained within HAR by switching from the Tonto to the olex2.refine refinement engine are manifold and significant. It allows HAR to access the use of restraints and constraints, hydrogen atom riding models, correct crystallographic description of special positions, refinement of partial occupancies, twinning models, solvent disorder treatment via BYPASS, different choices of weighting schemes, and many more options. In olex2.refine, refinements are carried out in F2, not in F as previously done in Tonto. Advantages of the use of the Olex2 GUI include the automatic generation and validation of crystallographic information files (CIFs), the generation of maps and plots, and the straightforward generation of completed molecules or clusters of symmetry-related molecules on the screen, which are then used in the refinement as input.
in the reciprocal lattice. The structure factor is obtained as a finite sum involving the atomic form factors (also known as ‘atomic scattering factors’) fhkl,j of each atom j in the unit cell. The atomic form factors are, in turn, derived from the atomic electron density distributions via a Fourier transform. Natoms is the number of atoms in the unit cell,
j is the position vector of atom j, and Tj is the Debye–Waller factor of atom j, that is, an exponential function involving the atomic displacement parameters of atom j. These values appear in the equation describing the crystallographic model (eqn (1)):![]() | (1) |
In the classical Independent Atom Model (IAM), the atoms are considered independent non-interacting entities, and their electron densities are spherical functions depending only on the atomic type. The resulting form factors are listed in tables.25 In a model that takes interatomic interactions into account, the atomic electron densities – translated to the origin – are no longer spherical functions and are dependent on the geometry of the whole molecule. Therefore, the corresponding atomic form factors are no longer real, but complex-valued functions of the Miller indices.
The idea behind the flexible approach to non-spherical structure refinement that underlies NoSpherA2 is to provide these complex-valued form factors of each atom within the unit cell via a table given in a file provided from plugin software (.tsc-file).65 The underlying total electron density can be provided by quantum-mechanical calculations or other sources.
For molecules that are related by symmetry within the unit cell, only the form factors of atoms within the asymmetric unit are required, since the geometric transformation from an atom in this asymmetric unit to an equivalent atom in the unit cell corresponds to a transformation of the Miller indices of the corresponding non-spherical atomic form factor. Therefore, it is necessary to pay attention that all Miller indices resulting from transformations for all equivalent atoms are provided in the .tsc-file.
Moreover, the least-squares minimization used in non-spherical structure refinement requires not only information about the non-spherical form factors themselves but also about their partial derivatives. To avoid the time-consuming process of additional .tsc-files at close-by configurations, for which the numerical densities of the individual atoms vary only very little under tiny changes of their atom positions, we neglect these small variations in our calculations of the partial derivatives. The validity of this approximation will be discussed in more detail in a forthcoming publication.66 The mathematical details of this general and flexible approach as well as the precise format of the .tsc-files are discussed in the ESI.†
![]() | ||
| Fig. 2 Flowchart of the process behind HAR controlled by NoSpherA2 and a visual representation of the GUI for NoSpherA2 inside Olex2. | ||
Every aspect of the entire process can be controlled directly from within the Olex2 GUI. Depending on the quantum mechanical package used, different choices affecting the molecular wavefunction calculation will be available. The required basis sets are provided by Olex2 and contain all relevant elements of the periodic table. A detailed description of the available options is provided in Section 5.
The calculation of atomic form factors is different from previous HAR implementations, and this has the biggest impact on heavy elements. Integrable grids are calculated with an adaption of numgrid71 which uses a Lindh–Malmqvist–Gagliardi radial description72 and a modern implementation of the spherical Lebedev quadrature procedure.73,74 Becke partitioning75 with a hardness factor of 3 is applied; and for the Hirshfeld partitioning33 spherical atomic electron densities calculated from Thakkar densities,76 represented by Slater-type functions, are used. More details are provided in Section 5.
| ID | Type | Program | QM | Partitioning | Weighting scheme |
|---|---|---|---|---|---|
| i | IAM | olex2.refine | — | — | 1/σ2(F2) |
| ii | HAR | Tonto | Tonto | Tonto | 1/σ(F) |
| iii | HAR | NoSpherA2 | Tonto | Tonto | 1/σ2(F2) |
| iv | HAR | NoSpherA2 | ORCA | Tonto | 1/σ2(F2) |
| v | HAR | NoSpherA2 | ORCA | Tonto | Shelxl-type |
| vi | HAR | NoSpherA2 | ORCA | NoSpherA2 | 1/σ2(F2) |
In all models and datasets, the hydrogen atom positions and displacement parameters were freely refined. In all HAR models, hydrogen atoms were refined anisotropically. Fig. 3 shows an indicative comparison between the residual density distributions after a standard (IAM) refinement and a HAR with model vi. After IAM, distinctive positive residual densities remain on the covalent bonds and in the lone pairs of the oxygen atoms. After HAR, the bonding and lone-pair densities have been fully accounted for by the non-spherical atomic form factors (compare deformation Hirshfeld density representations in Fig. 1).
The tested models iii to vi introduce an increasing amount and combination of parameters that distinguish the new NoSpherA2 HARs from the traditional Tonto HARs (model ii). Summarizing the validation results (Tables S2–33 and Fig. S5–10, ESI†), all HARs present the same accuracy and precision relative to the neutron-diffraction results regardless of the way the model was generated. The exchange of the least-squares refinement in Tonto with olex2.refine through the .tsc files provides statistically identical results, with all differences being far below the measurement uncertainty. However, the results cannot be numerically identical because of the differences in the procedure discussed in Section 2 (such as refinement in F2 instead of F, different Becke grids used, etc.). We find that the iteratively updated ShelXL-type weighting scheme is advantageous concerning the derivation of ADPs when compared to neutron-diffraction results. The combination of ORCA and NoSpherA2 also seems to produce a slightly closer agreement with the neutron-diffraction results for all parameters compared to the Tonto-derived results. Certainly, the results are produced much faster with ORCA and NoSpherA2 (183 seconds) relative to Tonto (884 seconds, Gly-L-Ala at 23 K, 6 CPUs), and additionally, the grid density is higher in NoSpherA2. The final refinements used for producing the results shown in Section 4 are based on model vi as this is indicated as the most promising combination of settings: HAR in NoSpherA2 using ORCA-wavefunctions and NoSpherA2-partitioning with a Shelxl-type weighting scheme.
| Selected parameters | (C6H6O2) (CO2)0.854 | C10H10N4F2 | CaF2 | (NH4)2(B6H6) | OsH6(PC12H19)2 |
|---|---|---|---|---|---|
| Space group |
R (trigonal) |
P21/n (monoclinic) |
Fm m (cubic) |
Fm m (cubic) |
P21/n (monoclinic) |
| No. of unique reflections | 857 | 2975 | 96 | 364 | 13 109 |
| d min/Å (radiation source) | 0.58 (Mo) | 0.70 (Mo) | 0.40 (Ag) | 0.40 (Mo) | 0.58 (Mo) |
| # param's (const/rest), IAM vs. HAR | 51(3/0) vs. 70(1/0) | 204(7/0) vs. 258(1/4) | 3(0/0) vs. 3(0/0) | 11(0/0) vs. 11(0/0) | 421(0/0) vs. 636(0/6) |
| Level of theory | PBE/def2-TZVPP | PBE/def2-TZVPP | PBE/def2-TZVPP | PBE/def2-TZVPP | PBE-DKH2/x2c-TZVPP |
| Charge/multiplicity | 0/1 | 0/1 | +18/1 | +6/1 | 0/1 |
| R int/% | 1.73 | 5.36 | 6.73 | 2.41 | 5.82 |
| R1 (IAM)/% | 3.26 | 3.89 | 1.31 | 1.84 | 2.11 |
| R1 (HAR)/% | 1.45 | 2.14 | 1.14 | 0.95 | 1.92 |
| Δρ (IAM)/eÅ−3 | 0.581/−0.179 | 0.402/−0.313 | 0.690/−0.476 | 0.179/−0.216 | 1.128/−1.093 |
| Δρ (HAR)/eÅ−3 | 0.227/−0.254 | 0.175/−0.216 | 0.686/−0.415 | 0.119/−0.066 | 1.167/−0.938 |
• R-factors and residual densities are significantly lower for HAR in comparison to IAM refinements.
• R-factors after HAR are very low, regardless of Rint values, space group, resolution of measured data or the presence of disorder.
Many studies focus on carbon dioxide inside the hydroquinone voids for fuel and energy science, but the occupancy (or filling ratio) of the voids is unclear.46,47,80,82 In the example of the HQ-CO2 compound shown here, the occupancy of the entity in the void was refined in HAR to be precisely 0.854(2) (Fig. 4(a)). The experimental details are provided in Section 5.
![]() | ||
| Fig. 4 HAR-refined hydroquinone-CO2 clathrate structure (HQ-CO2) with anisotropic displacement parameters depicted at 80% probability level. The cluster of the guest CO2 molecule with the 12 surrounding HQ molecules encapsulating CO2 inside a void is shown as used in the wavefunction calculation underlying the non-spherical form factor generation. (a) Final refined geometry and partial occupation number of CO2. (b) Hirshfeld surface representation of CO2 inside the void mapped with the property dnorm. Color scale from 0.08 (white) to 0.77 (blue). Generated with CrystalExplorer.83 | ||
Not only was it possible to determine the occupancy of CO2 precisely, but all hydrogen-atom positions and anisotropic displacement parameters were obtained accurately and precisely. There is one symmetry-independent hydrogen bond that closes (by symmetry) the two six-membered rings that encapsulate the void (Fig. 4). HAR-refined geometric details of this hydrogen bond identify it as short and strong: d(O⋯O) = 2.6805(3) Å, d(O–H) = 0.963(6) Å, d(H⋯O) = 1.724(6), a(O–H⋯O) = 171.7(6)°. Three void channels are intersecting the unit cell (see Fig. S1†). Having accurate and precise hydrogen-atom positions from HAR, the volume of each void could be estimated with the software CrystalExplorer83 to be 68.5 Å3.84 In total, 16% of the unit cell are guest-accessible void volumes. The Hirshfeld surface85 of the CO2 molecule encloses a volume of 44.4 Å3, which means that the CO2 molecule fits into this particular void but is presumably not closely bound. Further analysis of the Hirshfeld surface using the property dnorm mapped onto it (Fig. 4(b)) reveals that there are no contacts closer than the sum of the van-der-Waals radii of the atoms in CO2 and the atoms of the host structure; in fact, dnorm is positive throughout the entire range. This implies that there are only weak van-der-Waals and dispersion forces between host and guest, enabling the guest molecule to travel through the void channels. Nevertheless, the CO2 molecule is not dynamically disordered in this crystal structure – it is just not always present in the void.
Fig. 5(b) shows a dynamic deformation density map in the main molecular plane, i.e. the difference electron density of the HAR and the IAM including the effect of refined atomic displacement parameters. The map confirms that all the details of chemical bonding can be analyzed from this HAR disorder refinement, also in the disordered region where several displacement ellipsoids overlap. Moreover, the lone pair regions at nitrogen and fluorine atoms are accurately shaped.
Fig. 6(b)–(d) show unambiguously that the description of CaF2 as an ionic salt with spherical ions is incorrect. There are significant charge concentrations in the deformation density maps (Fig. 6(b) and (d)) and charge localization in the electron localizability (ELI-D) map (Fig. 6(c)) directed along the Ca–F interactions. Hence, non-spherical, bond-directed valence electron density distribution caused by polarization, charge transfer, and electron-density deformation play a significant role in the bonding in CaF2. In an analysis of the wavefunction within the framework of QTAIM86 the atomic charges imply significant charge transfer from F− to Ca2+ (+1.74 e (Ca) and −0.87 e (F)), there is a Ca–F bond critical point with an electron-density value of 0.22 eÅ−3 and a Laplacian value of 4.43 eÅ−5, as well as a delocalization index87 of 0.14, which can be interpreted as the partial bond order.49 It remains to be clarified by more detailed follow-up studies to which extent covalency, d-orbital population, or core deformation interplay to support the ionic framework of the crystal structure. However, we note that covalency and ionicity are not necessarily opposing forces but might be two sides of the same coin.88,89
The maps in Fig. 6(b)–(d) are based on the theoretical cluster wavefunction, and represent the input for the non-spherical structure refinement, most directly represented by the deformation Hirshfeld density plots in Fig. 6(d), the difference between the IAM and HAR densities used in the refinement. This information can be used according to ref. 90 to directly show the significance of the non-spherical signal in the X-ray diffraction data, supporting similar findings by electron and γ-ray diffraction.51,52 For this purpose, in Fig. 7 the difference between the calculated Fc and the measured Fo structure factor magnitudes in both models IAM and HAR is plotted relative to their resolution; weighted by their standard uncertainty σ in Fig. 7(a) and (b), or by their absolute magnitude in Fig. 7(c) and (d). The non-spherical valence density signal is expected to be more relevant for the low-order reflections. Consequently, for a compound such as alanine with many strongly covalent bonds, the IAM shows large discrepancies between the measured and modeled structure factor magnitudes below ca. 0.8 Å resolution, whereas HAR does not show such model insufficiencies probed by the experiment (Fig. 7(b) and (d)). For CaF2, the same systematic effect is not as strongly pronounced, but it is clearly present for the 8 to 10 lowest-order reflections out of a total of 96 reflections in this data set. This shows that HAR can reveal the degree of non-sphericity by model comparison with the X-ray diffraction experiment directly, which was so far believed to be only possible for the more precise convergent-beam electron diffraction experiment. In turn, the model of spherical ions is not suited to describe the ions in CaF2.49,50
![]() | ||
| Fig. 7 Differences of observed and calculated structure factors versus resolution for CaF2 and L-Ala for the IAM and the HAR models. | ||
The shortcomings of the IAM model impact on the refined parameters because the neglected non-sphericity must be absorbed by the weighting scheme and the atomic displacement parameters. The coefficients of the weighting scheme are a = 0.0217 and b = 0.3133 in the IAM, while the coefficients in HAR are reduced to a = 0.0175 and b = 0.0607. This trend of significantly smaller weighting scheme factors is observed for all other refinements, too. The two refinable Uiso values are 0.00337(5)/0.00495(9) Å2 for Ca/F in the IAM, which change to 0.00325(7)/0.00488(9) Å2 if the simple 1/σ2 weighting scheme is used. The differences in HAR are slightly smaller: 0.00334(4)/0.00505(6) Å2vs. 0.00328(5)/0.00502(7) Å2.
Accurate X–H distances are crucial for the derivation of properties such as charge concentrations and localizations related to the special bonding situation in boranes and borates. As an example of possibilities inside Olex2 for bonding analysis after HAR, Fig. 8(c) and (d) show two different ELI-D maps as cut-planes intersecting the B–B–B faces and in the central B–B–B–B square. The ELI-D maxima are clearly outside the bond axes and are delocalized around the boron polyhedron, which is in line with previous theoretical calculations that show the ELF/ELI polyhedron being dual/complementary to the structural B6H62− polyhedron, within the theory of three-dimensional aromaticity.57,94,95
In all HARs reported in Table 4, the Os–H distances were refined freely, while some restraints on the hydrogen atom anisotropic displacement parameters were applied, and one hydrogen atom was refined isotropically. Extinction correction was also applied during the refinements. Calculations were performed using the B3LYP, M06-2X, and PBE DFT functionals,99 always with the DKH2 relativistic extension and the basis set x2c-TZVPP. This series was repeated for a subset of the reflections (pruned at d = 0.7 Å), as there is a significant rise in Rint for data beyond this resolution. In Table 4, the R-value, the residual density minimum and maximum values, and the timing for the QM step are compared. More details and model differences are deposited as CIFs with the paper.
| B3LYP full | B3LYP 0.7 Å | M06-2X full | M06-2X 0.7 Å | PBE full | PBE 0.7 Å | |
|---|---|---|---|---|---|---|
| R1/% | 1.94 | 1.20 | 1.93 | 1.19 | 1.92 | 1.19 |
| Δρ/eÅ−3 | 1.210/−0.708 | 0.592/−0.317 | 1.210/−0.681 | 0.581/−0.318 | 1.167/−0.686 | 0.583/−0.317 |
| t of QM step/s | 639 | 1335 | 579 | |||
The geometry and the displacement parameters of the refinement using the full resolution are shown in Fig. 9. The sum of covalent radii100 of Os and H is 1.61 Å, which is significantly longer than the distances resulting from the HAR refinement (Fig. 7(b), average 1.554 Å, average standard uncertainty 0.014 Å). However, the corresponding neutron-diffraction experiment yielded longer Os–H bonds, on average 1.649 Å.62 The isotropic IAM refinement, in turn, yielded much shorter Os–H bond lengths (1.510 Å). When the resolution cut-off from 0.58 to 0.7 Å was used, the average HAR-derived bond distance and the average standard uncertainty remain constant at 1.555 Å and 0.015 Å. This means that the significant difference in R-value and maximum residual-density value located at the Os core between the two resolutions (Table 4) are unrelated to the Fourier truncation error and do not impact on the hydrogen atom treatment. It is unclear whether the advantage of having more information from higher resolution data or the disadvantage of compromising on the overall data quality by including more high-resolution data prevail over the other.
In summary, the HAR results for those hydrogen atom parameters in OsH6(PC12H19)2 that are bonded directly to the Os atom are improved relative to the IAM results but are still significantly less accurate and less precise than those for compounds involving only lighter elements. To understand whether the insufficient match with the neutron-diffraction derived Os–H bond lengths is caused by problems in the HAR methodology or complications of X-ray diffraction experiments on heavy-element containing species, we calculated and refined a theoretical structure factor set of OsH6(PC12H19)2 based on the neutron-derived geometry at the same level of theory as used in HAR. A dynamic set of structure factors was obtained by a combination of ORCA, the .tsc routine in NoSpherA2, and the .fcf routine in Olex2 up to the same resolution (0.58 Å) as the experimental structure factor file. In addition to the information on the displacement parameters, the theoretical structure factors include information on anomalous dispersion based on the Sasaki table.101 Uncertainties were set to 0.001 of the calculated intensities.
The structure was solved from scratch on the basis of the theoretical structure factors. A subsequent IAM refinement resulted in shortened Os–H distances (av. 1.574 Å) in comparison to the input structure (av. 1.649 Å). This means that the IAM model is insufficient and cannot produce the input parameters even for theoretical data. This is reflected in the residual density distributions depicted in Fig. 10(a) where unmodelled Os–H bonding density and overestimated Os core density is visible at levels as high as 0.58 and −0.26 eÅ−3. After HAR at PBE-DKH2/x2c-TZVPP with high integration accuracy, coordinates and atomic displacement parameters agree exactly to the last digit with the input values (see CIFs deposited as ESI†), and the residual electron density has vanished (0.003 eÅ−3, see Fig. 10(b)). These results imply that the experimental X-ray diffraction data and not the HAR model are the reason for the inaccurate determination of the Os–H bond lengths discussed above.
It is worth noting that the OsH6(PC12H19)2 measurement used here is not of especially inferior quality, but rather represents a standard measurement as it is nowadays routinely obtained for service measurements of coordination and organometallic compounds. Therefore, we use this example to have a closer look at the problem of truncation effects caused by limited resolution because it was shown recently in ref. 97 that even at resolutions as high as d = 0.20 Å, core and outer-core electron-density distributions of a mercury hydride cannot be reproduced at all from structure factors, regardless of the sophistication of the quantum-crystallographic model. Fig. 10(c) shows a detailed view of the electron-density map obtained by the Fourier transform of the structure factor set calculated for the Os hydride at d = 0.58 Å resolution. The area around the nucleus is highly positive (in fact so highly positive that the values cause some trouble for the color mapping). Further out, two shells of highly negative electron density values (red and yellow) are separated by another highly positive shell (blue). These are the so-called Fourier truncation ripples, and it is worth remembering how severe the effect is, with highly significant physically meaningless negative electron-density areas near the core of the heavy element. The third shell of positive electron-density values (light blue) is still as high as 5 to 10 eÅ−3. It is located at a distance of about 1.5 Å from the Os nucleus and therefore overlaps with the hydrogen atom electron densities, which makes the localization of hydrogen atoms bonded to heavy elements and the refinement of their parameters so difficult.
These truncation ripples cannot be avoided because of the nature of the X-ray diffraction experiment and its resolution limitation. They can only be mitigated by extremely high resolution which is not available at home sources so far; and at synchrotron sources, radiation damage becomes much more likely to occur. Whenever there are small errors in the Fourier series of the calculated structure factors, the steep gradients between the highly positive and negative electron density regions cause huge errors and consequently large residual electron density effects. Therefore, a much better understanding of the physical background of effects such as absorption, fluorescence, radiation damage, anharmonic motion, thermal diffuse scattering, and other systematic effects impacting on heavy elements is needed, leading to more sophisticated correction procedures39,96,102 and improved diffractometer hardware and software technology.
The final refinement result is independent of the QM software used, but different QM programs provide different sets of features. If Tonto is chosen, a cluster of self-consistent Hirshfeld point charges and dipoles can be used to simulate the crystal field, and the radius of this cluster can be specified in the GUI. If ORCA is chosen, the resolution of identity (RI) and chain of spheres (COSX) approximations in meta-GGA functionals speed up the calculations without loss of accuracy of the refinement results. They are set by default, but the convergence threshold and convergence strategy can be controlled. Computational resources (number of CPUs and memory) can be allocated and might result in different gains of calculation speed in the different QM codes. In the future, we envisage the possibility to send calculations from the NoSpherA2 GUI to a supercomputer infrastructure for further speed gains.
Crystallographic options concerning the hydrogen atom treatment in the refinement are only convenient switches since the least-squares procedure is controlled via the Olex2 GUI in the same way an IAM Shelxl or olex2.refine refinement is controlled, including all the options such as riding models or restraints. Here, the tick boxes “H Aniso” will set all the hydrogen atoms to anisotropic displacement parameters at the start of the refinement, and “No Afix” removes all the previously fixed hydrogen atom parameters to ensure that HAR refines all parameters freely. “DISP” automatically adds Δf′ and Δf′′ values for the anomalous dispersion correction from the Sasaki table101 according to the wavelength specified in the input .ins or .cif files.
The calculation of grids is based on preselected levels of accuracy (low, normal, high, max; tick-box “Integr. Accuracy”), of which “normal” is usually sufficient, but “high” was used to reproduce the atomic electron density of the isolated osmium atom (Z = 76) with an integrated accuracy of better than 0.0001 e. According to the choice of integration accuracy used here for the atomic form factor calculation, a corresponding accuracy will also be selected for the wavefunction calculation in the QM software.
All HARs presented in the Results and Discussion part (Section 4) were carried out with the following settings: source of .tsc: ORCA; basis set: def2-TZVPP (except Os where x2c-TZVPP was used); method: PBE; relativistics: not used (except for Os); SCF Conv. Thresh.: NormalSCF; SCF Conv. Strategy: NormalConv; H Aniso: activated (if not mentioned otherwise); No Afix: activated; updated Shelxl-type weighting scheme: activated; automatic HAR; Integr. Accuracy: high; EXTI: not activated; DISP: activated. Cluster charges were not used for the simulation of the crystal field. Only for validation purposes and for the Os-containing compound, settings were varied. Other refinement details are shown in Table 3 as well as in the Crystallographic Information Files (CIFs) deposited with the Cambridge Structural and Inorganic Crystal Structure Database (CCDC-2034385 to 2034389, 2035147 to 2035148).
The setup for benchmarking the heavy element refinements was a 4-core 2.81 GHz hyperthreaded laptop with 16 GB RAM and no solid-state drives using 7 threads for ORCA. A significant part of the calculation was the evaluation and saving of relativistic integrals, which is similar for all functionals (approx. 125 s in these calculations). Apart from that, the PBE calculation was significantly faster than the MO6-2X calculation (161% longer), and still faster than the B3LYP calculation (13% longer). Furthermore, the results of the refinement using PBE are very similar to those of the more sophisticated functionals, so the use of PBE for all-purpose refinements is suggested, even when heavy elements are involved. In contrast, the application of relativistic corrections is imperative, and all-electron x2c basis sets lower than triple-zeta are not recommended. Therefore, all results discussed in Section 4.3 refer to PBE-DKH2/x2c-TZVPP HARs.
The CO2 molecule in HQ-CO2 is located in a special crystallographic position. For the wavefunction calculations, we constructed a cluster of 12 hydroquinone molecules around the trapped CO2 molecule enclosing the void (Fig. 4), but only a fraction of that cluster was refined as the crystallographic asymmetric unit, which consists of half a molecule of hydroquinone and half a CO2 molecule (Z = 9, Z′ = 0.5 formula units).
A high-resolution X-ray diffraction experiment of CaF2, obtained as a small single-crystal from a fluorite mineralogical sample, was performed in-house on a Rigaku Synergy-S diffractometer equipped with a Hypix6000 detector at 100 K using Ag-Kα radiation. A single-crystal of ammonium hexahydrohexaborate (NH4)2B6H6 was synthesized for this study according to the procedure described in ref. 105 to yield the hexaborate anion as a sodium salt and subsequently yielding (NH4)2B6H6 after aqueous workup at pH 10 with ammonium chloride in solution. Single crystals were obtained by evaporation of the solvent after filtration. It was measured to high resolution using a Rigaku Synergy diffractometer equipped with a Pilatus 300 K detector at 100 K using Mo-Kα radiation.
The compound bis(diisopropylphenylphosphine) hexahydridoosmium, OsH6(PC12H19)2, was synthesized according to ref. 106 a suitable single crystal was measured to medium resolution on a Rigaku SuperNova EosS2 diffractometer with a CCD detector at 120 K using Mo-Kα radiation. Further crystallographic and measurement details are given in Tables 2 and S1.†
HARs in NoSpherA2 are significantly faster than previous implementations of HAR and often more accurate. In principle, any modern quantum-mechanical software can now be used for the theoretical steps, demonstrated here by the use of ORCA, which is a QM software freely available for academic use. The PBE DFT method with triple-zeta basis sets is recommended for a good balance between accuracy and speed. This is even true for heavy transition metals, where relativistic extensions are necessary. NoSpherA2 also enabled us to pin down the problems in the crystallographic refinement of heavy elements and show pathways for further developments in the field.
Since the format file underlying NoSpherA2 is entirely general, interfacing Olex2 with other flavors of X-ray non-spherical structure refinement is simple and straightforward. First tests have been made for the HAR-ELMO method39 and the multipole databank software Discamb.29 Moreover, any of these kinds of non-spherical structure refinement are not restricted to X-radiation. First tests towards the non-spherical refinement of electron-diffraction data have been made inside and outside107NoSpherA2.
Footnotes |
| † Electronic supplementary information (ESI) available: CIFs of all refinements, further electron-density analysis of compound HQ-CO2, theoretical background of non-spherical refinements in NoSpherA2 and the tsc file format, graphs and tables documenting the validation procedure described in Section 3. CCDC 2034385–2034389, 2035147 and 2035148. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/d0sc05526c |
| ‡ Three main theories can each be used for both strategies:17,18 multipole modelling,19,20 maximum entropy methods21 and X-ray wavefunction refinement.22 The latter is part of the original definition of quantum crystallography (QCr),23 whereas all theories are discussed in the generalized field of QCr.24 |
| This journal is © The Royal Society of Chemistry 2021 |