Open Access Article
Kathrin
Rübartsch
a,
Leeroy
Hendrickx
ab,
Ettore
Bartalucci
ab,
Maksim
Plekhanov
c,
Mirijam
Zobel
c,
Carsten
Bolm
d,
Maria
Fyta
*ef and
Thomas
Wiegand
*ab
aInstitute of Technical and Macromolecular Chemistry, RWTH Aachen University, Worringerweg 2, 52074 Aachen, Germany
bMax Planck Institute for Chemical Energy Conversion, Stiftstr. 34-36, 45470 Mülheim/Ruhr, Germany. E-mail: thomas.wiegand@cec.mpg.de
cInstitute of Crystallography, RWTH Aachen University, Jägerstraße 17-19, 52066 Aachen, Germany
dInstitute of Organic Chemistry, RWTH Aachen University, Landoltweg 1, 52074 Aachen, Germany
eComputational Biotechnology, RWTH Aachen University, Worringerweg 3, 52074 Aachen, Germany. E-mail: mfyta@biotec.rwth-aachen.de
fCenter for Computational Life Sciences (CCLS), RWTH Aachen University, Aachen, Germany
First published on 17th February 2026
Carbon-13 nuclear magnetic resonance (NMR) spectroscopic fingerprints enable the unambiguous identification of a chemical compound. Ab initio calculations of NMR magnetic-shielding tensors are crucial for facilitating spectral resonance assignment, but require an accuracy on the order of one ppm or even less for carbon-13 for distinguishing structurally rather similar compounds, such as structural isomers and polymorphs. Using quantum-mechanical calculations within the density functional theory (DFT) approach and solid-state NMR spectroscopy under magic-angle spinning (MAS) conditions, we herein explore whether the accuracy of current DFT levels is sufficient to distinguish between three distinct solid phases of the amino acid serine using computer simulations. In that vein, enantiopure L-serine, L-serine monohydrate, and racemic DL-serine have been studied. Using solid-state calculations employing periodic boundary conditions, we computed NMR observables, such as 13C isotropic magnetic-shielding values and 13C magnetic-shielding anisotropy parameters of these phases. Different levels of DFT theory utilizing distinct exchange correlation functionals were tested in optimizing the hydrogen-atom positions prior to NMR observable calculations, which significantly improved the agreement with the experimental values. The comparison of the measured and the calculated observables confirmed the distinction between the serine phases not only experimentally but also by DFT calculations. We also compare the DFT-calculated NMR observables with predictions from a recently proposed machine learning (ML) approach. Our analysis clearly revealed that a rather low-cost exchange correlation functional in periodic solid-state DFT calculations reaches the desired accuracy of 13C magnetic-shielding tensor calculations, provided that highly accurate initial structures are applied. These results underline that complementary experimental and computational studies can provide key insights into molecular systems and the interactions therein, enabling the distinction of structurally similar compounds (often polymorphs), which is of particular importance in pharmaceutics.
The accuracy of ab initio calculations of NMR magnetic-shielding tensors is crucial for supporting spectral resonance assignment, particularly for distinguishing polymorphs, often with minor structural differences.13,24Ab initio calculations of NMR observables are typically performed in the DFT framework25 using either Gaussian- or Slater-type orbitals.26 In most molecular species, high-level gas phase calculations on monomers (or smaller molecular clusters) are often sufficient, since the NMR observables are mainly dominated by local properties. However, particularly in cases where intermolecular interactions are of importance, solid-state calculations using plane-wave basis sets need to be performed,27–30 which involves significantly higher computational costs, often restricting the DFT calculations to a lower level-of-theory and limiting the accuracy of the results.31 In such methods, the gauge-including projector-augmented wave (GIPAW) approach is used.32–35 Alternatively, quantum mechanics/molecular mechanics calculations have also been reported to compute solid-state NMR observables for molecular solids.31 In the past few years, machine learning models have also entered the field, enabling a rather efficient prediction of magnetic-shielding parameters.36–38
Some of us have recently studied three distinct phases of the amino acid serine in the context of mechanochemical molecular-recognition processes.39 The enantiopure phase of serine (L- and D-serine, respectively, both giving identical 13C solid-state NMR spectra40) shows a different 13C spectroscopic fingerprint than the racemic phase (DL-serine) with average 13C chemical-shift differences on the order of 2 ppm.39 The third serine phase we have observed in the course of our studies was the monohydrate L-serine phase (L-serine·H2O),41 which has been subjected to detailed solid-state NMR investigations before.42 The chemical-shift values (1H and 13C) of several amino acids43 have been calculated using GIPAW DFT with the PBE functional in previous work.44,45 These values were further corrected using the isolated-molecule correction scheme, in which the differences in isotropic magnetic-shielding values at a PBE and PBE0 level-of-theory for a single molecule are calculated.38,46 A mean absolute error of around 1.5 ppm was determined for 13C isotropic magnetic-shielding values.43 Recently, a very detailed study employed the experimentally determined 13C isotropic chemical-shielding values of 20 solid amino acids (the L-enantiomers have been studied) for benchmarking periodic GIPAW calculations with GGA, meta-GGA, and hybrid DFT functionals.20 Interestingly, best agreement between the experimental and calculated values has been achieved for the GGA level-of-theory (PBE functional).20
We herein study whether the current accuracy of plane-wave DFT calculations is sufficient to unambiguously distinguish the three serine phases mentioned above using 13C magnetic-shielding tensor calculations. Based on initial benchmarks on the main DFT parameters and detailed test calculations for the different structures and exchange correlation functionals, we propose a strategy for periodic DFT calculations of interacting molecular systems. We do not intend to benchmark, as recently described in an excellent article,20 but aim at illustrating, for our specific example, the potential of plane-wave periodic DFT calculations in calculating 13C magnetic-shielding tensors, enabling the distinction of polymorphs. Our studies particularly highlight the importance of an accurate starting structure for such calculations (as often provided by single-crystal X-ray or neutron diffraction),47 which indeed allows the distinction between the separate serine phases, thus reproducing experimentally observed trends.
| CAS number | Purity | Manufacturer | |
|---|---|---|---|
| L-Serine | 56-45-1 | 99% | abcr GmbH (Karlsruhe, Germany) |
| DL-Serine | 302-84-1 | 99% | abcr GmbH (Karlsruhe, Germany) |
| Water | — | De-ionized | — |
The crystal structures of serine considered in this work were taken from the Cambridge Structural Database (CSD), with access codes LSERIN18, LSERMH10, and DLSERN11.62,63Table 2 provides an overview of the corresponding CSD IDs and original publications, showing which structure was used to represent the crystalline phases L-serine, L-serine monohydrate, and DL-serine, respectively.
If not reported otherwise, all three structures were geometrically optimized by allowing only the hydrogen atoms to move freely. For the calculation of NMR parameters, two distinct approaches were employed: (i) direct computation of magnetic-shielding tensors using the database structure without any relaxation with DFT and (ii) prior optimization of hydrogen-atom positions using various levels of theory, that is, various exchange correlation (XC) DFT functionals, while keeping the unit-cell parameters fixed. Specifically, we performed various calculations of the three structures and the different XC functionals without or with structural optimization. We refer to these as ‘no opt.’ or ‘X’ (X = functional), respectively. For comparison with the DFT calculations, we applied the machine learning (ML) model ShiftML3 to compute magnetic shielding tensors,36 using either the structures as directly obtained from the CSD or geometries resulting from hydrogen-atom optimization at the B3LYP level-of-theory using VASP.
Fig. 1 shows the 1H–13C cross-polarization (CP) spectra we have reported previously,39,41 indeed showing that DL-serine and L-serine can be distinguished based on their characteristic spectral fingerprints. Chemical-shift differences of up to 3 ppm have been observed for the individual carbon atoms (for the isotropic 13C chemical-shift values, see Table 3). We also prepared the L-serine monohydrate sample (in the following denoted as L-serine·H2O) using a RAM device, whose Cα and Cβ chemical-shift values resemble those of DL-serine. The carboxylic acid group carbon atoms, though, allow an unambiguous distinction of the three phases. Herein, we also explore the accuracy of plane-wave periodic DFT calculations in distinguishing such phases in computer simulations, requiring accuracies in the calculated 13C isotropic chemical-shift values of around 1–2 ppm.
![]() | ||
| Fig. 1 1H–13C CP-MAS NMR spectra of DL-serine (purple, bottom), L-serine·H2O (orange, middle) and L-serine (blue, top) revealing distinct differences in the isotropic 13C chemical-shift values of the phases. All spectra have been recorded at a magnetic-field strength of 11.7 T and a MAS frequency of 17.0 kHz. L-Serine·H2O contains a small amount of L-serine. The spectra have already been published in our previous work, see ref. 39 and 41. The truncation visible in some of the spectra is caused by too short data acquisition times caused by probe limitations during high power 1H decoupling. In addition, representative structures of the different serine phases based on the single-crystal structures (CSD codes DLSERN11, LSERMH10 and LSERIN18 for DL-serine, L-serine·H2O and L-serine, respectively) are given. In the molecular stuctures hydrogen, carbon, oxygen, and nitrogen atoms are depicted in white, grey, red, and blue, respectively. | ||
| δ exp.iso (COO−)/ppm | δ exp.iso (Cα)/ppm | δ exp.iso (Cβ)/ppm | |
|---|---|---|---|
| L-Serine | 175.2 | 55.9 | 63.0 |
| L-Serine·H2O | 176.0 | 58.6 | 61.4 |
| DL-Serine | 177.1 | 58.7 | 61.0 |
A central goal of this work is to compare the anisotropy of the magnetic-shielding tensor between experiment and DFT calculations. The anisotropy reveals important information about the local chemical environment of the nucleus-of-interest,64,65 can be linked directly to electronic structures,66,67 even enabling the understanding of chemical reactivities (for a recent review article, see ref. 68), and can reveal insights into molecular dynamics.69 The magnetic-shielding interaction is described by a 3 × 3 tensor, which contains in the laboratory frame (LAB, in which the external magnetic field, B0, points along the z-axis) nine independent components (see eqn (1)). By using rotation matrices involving the Euler angles α, β, and γ, this matrix can be diagonalized in a molecule-fixed frame (the principal axis system, PAS) and three independent components remain:
![]() | (1) |
In solution, due to the fast Brownian motion, only the isotropic chemical-shift value (note that the chemical shift is obtained from the magnetic shielding by referencing to a standard compound) can be measured (which is 1/3 of the trace of the above reported tensor). In contrast to this, solid-state NMR allows the determination of all three principal-axis system components, for instance, by recording slow-spinning MAS spectra and simulating their lineshapes to envelopes of the static powder spectra70 or by recording multidimensional isotropic-anisotropic correlation experiments.71–74 In that vein, according to eqn (1), we determined the 13C chemical-shift anisotropy (CSA) by recording slow-spinning MAS experiments (employing an MAS frequency of 2.0 kHz) at two different magnetic-field strengths (for the spectra at 11.7 T, see Fig. 2, and the spectra at 16.4 T are provided in Fig. S4). Lineshape simulations allow the determination of the span (Ω) and skew (κ) parameters (using the Herzfeld–Berger convention23). The respective results are summarized in Table 4, reporting only the values for the carboxylic group carbon, which exhibits the highest CSA. Rather similar Ω-values of around 130 ppm have been observed for the three different serine phases, whereas the κ-values range between −0.04 and 0.22. The results for Cα and Cβ are displayed in Tables S3 and S4 of the SI, respectively.
| Ω exp. (COO−)/ppm | κ exp. (COO−) | |
|---|---|---|
| L-Serine | 130.8 [131.3] | −0.04 [−0.03] |
| L-Serine·H2O | 129.9 [n.d.] | 0.2 [n.d.] |
| DL-Serine | 135.8 [133.3] | 0.09 [0.09] |
We next attempted to establish a protocol for determining the agreement between the experimental data provided in Tables 3 and 4 and calculations of NMR magnetic-shielding tensors using DFT calculations under periodic-boundary conditions. As a matter of fact, finding a decent starting structure remains crucial for the calculations. We thus recorded the PXRD pattern of all three serine phases for which NMR spectra have been recorded. Crystal structures taken from the Cambridge Crystallographic database75 were used for comparing the experimental and back-predicted patterns (Fig. 3). For L-serine, the structure LSERIN1862 matches the diffraction pattern. For DL-serine, the best match has been observed for DLSERN11.63 For L-serine·H2O, a good agreement with the structure LSERMH10 has been found, taken into account its decomposition into L-serine over time.63 All samples exhibit strong texture effects, resulting in changes in relative peak intensities. We concluded that the identified crystal structures revealed good agreement with the experimental PXRD pattern and decided to use those geometries in the complementary DFT calculations. The hydrogen-atom positions, however, were optimized in most cases prior to the NMR parameter calculations, as will be outlined further below. Note that while the structure of L-serine has been determined by X-ray diffraction, the structures of DL-serine and L-serine·H2O were determined by neutron diffraction, already allowing us to assume more accurate hydrogen-atom positions (vide infra).
![]() | ||
| Fig. 3 Comparison of experimental PXRD patterns of the three serine phases (a) L-serine, (b) L-serine·H2O and (c) DL-serine to the back-predicted ones using published single-crystal structures (see Table 2 for the corresponding CSD entries used). | ||
![]() | ||
| Fig. 4 “Heat maps” comparing experimental and DFT-calculated isotropic 13C chemical-shift values. The differences between experimental and DFT-calculated values for the difference in the chemical-shift values between the carboxyl acid group carbon and Cα (a), between the carboxyl acid group carbon and Cβ (b) and between Cβ and Cα (c) for the three different serine phases are plotted in units of ppm. “Heat maps” of the CSA parameters Ω (d) and κ (e) only for COO− as a function of the DFT functional. In all cases (except no opt.) the hydrogen-atom positions were geometrically optimized in VASP before the calculation of the 13C magnetic-shielding tensors using different DFT functionals, as indicated in the figure. In VASP (highlighted by the green lines), calculations of 13C magnetic-shielding tensors have been performed in all cases with the PBE DFT functional. As comparison and separated by the purple lines, the results obtained by using the machine learning model ShiftML3 are shown.36 | ||
In general, the calculated chemical-shift difference between COO− and Cα was found to be in better agreement with the experimental data than the corresponding difference between COO− and Cβ and between Cβ and Cα. For calculations involving the relaxation of the hydrogen-atom positions only, the deviation between calculated and experimental chemical-shift differences varied depending on the level-of-theory used during the optimization. Except for DL-serine, the best agreement with experiment was achieved for hybrid functionals, such as B3LYP. The minimal deviations between computation and the experiment for the Cβ–Cα chemical-shift difference have been determined to 2.6 ppm for L-serine (B3LYP), 1.4 ppm for L-serine·H2O (B3LYP) and 1.2 ppm for DL-serine (HSE06). As noted above, smaller deviations ranging between 0.2 and −1.1 ppm for the minimal values are obtained for the COO−–Cα difference. Larger deviations involving Cβ chemical-shift values are attributed to higher uncertainties in their DFT-computed isotropic chemical-shift value, possibly as a consequence of intermolecular hydrogen-bond formation of the neighbored hydroxyl group. The overall better agreement for DL-serine and L-serine·H2O compared to L-serine might point to the more accurate refinement of hydrogen-atom positions via neutron diffraction, leading to structures of higher quality than in the case of L-serine. These results highlight the crucial role of hydrogen-atom positioning in accurately predicting the isotropic 13C chemical-shift differences, as already indicated in previous studies (for some examples, see ref. 28–30, 34, 35, and 76–78). For instance, 13C magnetic-shielding tensor GIPAW calculations for α-glycine using different crystal structures revealed the best agreement with experimental values in most cases by only geometrically optimizing the hydrogen atom positions, instead of an all-atom optimization.76 Whether a full geometry optimization even improves the agreement with experimental chemical-shift values further is under debate (as, for instance, such calculations would yield the geometry only at 0 K) and should be decided on a case-to-case basis as proposed in ref. 30. In the case of the serine phase studied herein, optimizing the hydrogen-atom positions seems to be sufficient,77 as full geometry optimization even leads to larger deviations from experimental values (vide infra).
We have also compared our experimental values with those obtained by the ML model ShiftML3, in which a ML model was trained on a database including periodic DFT-calculated magnetic-shielding tensors achieved at the PBE level-of-theory.36 Similar to the observations for the DFT calculations, the deviations to the experimental values decrease for the geometry-optimized structures (Fig. 4a, b and c), pointing again to the importance of an accurate structure. Interestingly, the DFT calculated and the ML predicted data in this Fig. 4 show a different relationship relative to the experimental data. Specifically, the DFT calculations match better the experimental measurements in the case of the COO−–Cα differences, while the ML model better captures the COO−–Cβ differences. Both methods reveal a similar accuracy for the Cβ–Cα differences. Taking the low computational cost of ShiftML3 into consideration, though, this algorithm already pinpoints the power of ML in predicting 13C magnetic-shielding tensors and it will probably even further improve with the increase of the accuracy in DFT-calculations of magnetic-shielding tensors (and thus the training data sets).
We next turn to the anisotropy of the 13C magnetic-shielding tensors of the three serine phases. As mentioned above, only the carboxylic acid carbon is analyzed as the span-value is largest in magnitude and thus determined with much higher precision experimentally. The comparison in Ω- (Fig. 4c) and κ-values (Fig. 4d) between calculation and experiment, again using different DFT-functionals for the geometry optimization of the hydrogen-atom positions, reveals in all cases an overestimation of the Ω-values with the largest deviations of 4.9 and 5.7 ppm for L-serine·H2O and L-serine without initial constrained geometry optimization. While the deviations in Ω-values between calculation and experiment reduce significantly for L-serine upon optimizing the hydrogen atom positions (minimal deviation of 2.7 ppm, PBE), the values for L-serine·H2O and DL-serine do not change significantly (minimal deviations of 4.2 and 2.6 ppm, respectively). A similar trend is observed for the κ-values with minimal deviations of −0.04 (PBE), −0.09 (PBE) and −0.04 (PBE) for L-serine, L-serine·H2O and DL-serine, respectively. Interestingly, the PBE GGA-functional revealed the best agreement between simulation and experiment in all cases. ShiftML3, as in the case of the DFT, overestimates the Ω-values, whereas for the κ-values in all cases, larger deviations are obtained by such ML-approaches as compared to the DFT calculations. Note that a geometry optimization allowing all atoms in the molecules to relax (PBE level-of-theory) provides a poorer agreement with nearly all of the respective experimental NMR observables in our cases instead of optimizing the hydrogen atoms only (see Fig. S5, S8).
Fig. 5 shows schematically the 13C NMR spectra for all three serine phases using the DFT-calculated isotropic magnetic-shielding values obtained after optimizing the hydrogen atoms at a B3LYP level-of-theory (for the further DFT functionals employed herein, see Fig. S6). When compared with the experimental spectra in Fig. 1, the computed 13C chemical-shift values of the carboxylic acid group carbon atom reproduce the experimental order, with DL-serine having the highest chemical-shift value and L-serine the lowest. The same is observed for the Cβ–Cα chemical-shift difference, with L-serine showing the largest value experimentally and DL-serine the smallest. Even the small deviations observed experimentally between L-serine·H2O and DL-serine are qualitatively reproduced correctly. Although perfect reproduction of the experimental data is not possible (i.e. the differences in carboxyl group carbon chemical-shift values are overestimated; see Fig. 5), the three distinct serine phases can be qualitatively distinguished by means of DFT calculations. The same holds for the ShiftML3 predictions (Fig. S7). Note, though, that an optimization of the hydrogen atoms is required prior to the calculation of NMR observables, as discussed above (for the simulated spectra after full geometry optimization, see Fig. S8).
The scripts and data used in this manuscript will be made available through a public Radar4Chem repository hosted at DOI: https://doi.org/10.22000/7022jhfh8mwq5tyk.
| This journal is © the Owner Societies 2026 |