Tommaso
Giovannini
and
Chiara
Cappelli
*
Scuola Normale Superiore, Piazza dei Cavalieri 7, 56126 Pisa, Italy. E-mail: chiara.cappelli@sns.it
First published on 19th April 2023
Molecular spectral signals can be significantly altered by solvent effects. Among the many theoretical approaches to this problem, continuum and atomistic solvation models have emerged as the most effective for properly describing solvent effects on the spectroscopic signal. In this feature article, we review the continuum and atomistic descriptions as applied to the calculation of molecular spectra, by detailing the similarities and differences between the two approaches from the formal point of view and by analyzing their advantages and disadvantages from the computational point of view. Various spectral signals, of increasing complexity, are considered and illustrative examples, selected to exacerbate the differences between the two approaches, are discussed.
In parallel, computational spectroscopy, i.e. the calculation of spectral properties by means of computational methods, has become an invaluable tool in most fields of chemical research.6 In fact, thanks to the high level of accuracy that has been reached in the reproduction of experimental data in the condensed phase, it often provides a theoretical rationalization of many experimental findings, yielding truly synergistic investigations.7–9 In fact, although many algorithms of increasing accuracy have been proposed and tested for systems in the gas phase,10–12 the large majority of molecular spectra are routinely measured in the condensed phase, i.e. when chemical systems are dissolved in a solvent or other kinds of external environments (biological matrix, polymeric materials, crystal phase etc.). Indeed, molecular structure and response to external electromagnetic fields can be significantly altered by the environment, thus making the simulation for isolated systems mostly inappropriate for a reliable comparison with experimental findings.13–16
Effective theoretical modeling of spectroscopy in the condensed phase requires catching the physico-chemical features of the simultaneous interaction of a chemical system with the environment and the external radiation field.1,17–20 Since spectroscopy arises from the interaction between the molecule and the radiation, which is in general driven by the electronic component, reliable modeling needs to resort to Quantum-Mechanical (QM) descriptions. However, while such approaches may be feasible for small-to-medium molecules in the gas phase, for systems in the condensed phase a “brute-force” description, i.e. which treats all atoms at the QM level, is unfeasible due to the enormous number of degrees of freedom that would need to be taken into account. However, such a description would be useless in most cases; in fact, the electronic structure of the environment would be described, however it commonly does not drive the spectral signal of chemical systems in the condensed phase.21,22 In fact, it is universally recognized that molecular spectra are modified but not determined by the environment, and this is indeed an important concept, which lays the foundations for the spectroscopic identification of molecular structures.
Focused models1,13,19,23–25 are the most successful class of approaches to computational spectroscopy in the condensed phase. There, the focus is the target molecule (solute in case of solutions) and the key is the accurate modeling of molecule/environment interactions and their consequences on the molecular structure and properties.17,18,20,26 The intrinsic spectral properties of the environment are not modelled, which corresponds to assuming molecular properties to be local properties of the molecule, which can be modified but not determined by the presence of the environment.
The most renowned class of focused models belongs to the family of multiscale (multilayer) QM/classical approaches.23 The latter have had great success in modern chemical research because they can be effectively coupled with most QM descriptions, ranging from semi-empirical methods27,28 to Density Functional Theory (DFT) or Wavefunction Theory.29–32 Remarkably, there is no substantial increase in computational cost with respect to the corresponding QM calculation of the isolated molecule because the number of degrees of freedom to be treated at the QM level does not change. Also, the machinery of Quantum Chemistry can be exploited to obtain the desired spectral signals in the same way as they are calculated for isolated systems.17–19 In fact, additional environment-related terms are introduced in the molecular Hamiltonian, which results in extra terms appearing in the formulation of energy derivatives and response equations.19
The main difference among the various types of focused QM/classical models stays in the way the environment is modelled. Implicit (continuum) or explicit (atomistic) approaches have been proposed: the former is especially suited to treat solvents,13 whereas fully atomistic descriptions may treat all kinds of environments, pending suitable parametrizations.1,2,27,33,34 In continuum solvation approaches, the atomistic nature of the solvent is neglected and the solvent is modelled in terms of a continuum dielectric, featuring specific dielectric properties. The Polarizable Continuum Model (PCM),13 has been particularly successful in the context of the present paper, because it has been extended to a large variety of spectral signals and molecular properties.17,18 A clear bottleneck of the QM/PCM approach, and in general of all implicit approaches, is their severe limitation in describing specific and directional solute–solvent interactions, such as hydrogen bonding. To this end, explicit approaches have been developed, belonging to the family of QM/Molecular Mechanics (QM/MM) methods.1,19,23,35,36 There, the target molecule is still described at the QM level, but the environment (solvent) is treated by resorting to MM force fields. QM/MM fully atomistic descriptions are not gained for free: in fact, whereas continuum models include implicitly the statistical average of all possible solute–solvent configurations, QM/MM approaches require the explicit sampling of the solute–solvent phase space.1,37 However, they provide a dynamic description of the solvation phenomenon, which basic continuum formulations lack.
In this feature article, we discuss continuum and atomistic solvent descriptions as applied to the calculation of the molecular spectra. First, we detail similarities and differences between the two approaches from the formal point of view, followed by a discussion of the advantages and disadvantages of the two methods, also from the computational point of view. Then, we discuss how to specify the models to catch the physics of the solvated system and transfer it to the computation of various spectral signals, of increasing complexity. Finally, some specific examples are discussed, especially suited to illustrate the concepts highlighted in the theoretical discussion. Some conclusions end the manuscript.
In modern implicit approaches, the solvent is still described in terms of a homogeneous polarizable continuum, featuring an electric permittivity constant ε.13,41,42 However, the simplistic approach proposed by Onsager is ameliorated in two directions. First, the solute is not approximated by a point-like dipole but is treated in terms of a quantum mechanical wavefunction/density. Second, the solute is accommodated in a molecular-shaped cavity, which is generally constructed in terms of interlocking spheres centered on each solute atom.43,44 The presence of the solute density polarizes the dielectric, yielding the formation of an apparent charge density on the cavity surface, which back-polarizes the solute, consistently with the concept of reaction field introduced by Onsager.13 The surface charge density is named apparent because it only exists in the presence of the solute; in the computational practice, it is discretized in a set of apparent surface charges (ASC), which are the solution of the Poisson equation with appropriate boundary conditions.45,46
The interaction between the solute and solvent is generally limited to electrostatics and mutual polarization terms; their inclusion into the QM/PCM Hamiltonian results in the remarkable feature that solvent-depend terms propagate to the calculation of molecular properties and spectroscopies through response equations/derivatives.17,18,47 Note that the limitation to electrostatic terms is a reasonably good approximation for polar solvents with large ε, however non-electrostatic (Pauli repulsion and dispersion) should also be considered, for instance by means of one of the methods which have been developed for this purpose.25,48–53
Clearly, in implicit models solvent molecules are not actually considered; thus, solute–solvent interactions, e.g. Hydrogen Bonding, are neglected. However, one of the most attractive aspects of continuum models is that their description can systematically be improved, for instance by including some (few) solvent molecules, in the so-called “cluster” or supermolecule models.54 Solvent molecules can be treated at the same QM level of the solute, or a lower level, for instance by resorting to ONIOM schemes.55 Clearly, in this way first-shell specific solute–solvent interactions can be described, however, it is not straightforward to define the number of solvent molecules to include, and especially their positions, i.e. how many configurations need to be considered to get reliable modeling.
A way to solve this problem, with additional advantages that will be discussed in the following presentation, consists of fully recovering the solvent atomistic nature and moving to explicit modeling, i.e. to QM/MM.35 There, the solute is described at the QM level, whereas the environment is described by resorting to MM force fields, therefore the problem shifts to the description of the interaction between the QM and MM portions. In this respect, different approximations have been proposed:35
• Mechanical embedding: the QM/MM coupling term is treated at the MM level only, therefore it is independent of the QM electronic density. The coupling is expressed in terms of dispersion–repulsion potentials (usually, the Lennard-Jones potential) and classical electrostatics. Thus, the contribution of the surrounding environment to the energy and properties of the QM portion is only indirect.
• Electrostatic embedding: MM atoms are endowed with fixed atomic charges, that produce an electric potential, which polarizes the QM electronic density. From the point of view of the QM solute, electrostatic embedding introduces a new term in the molecular Hamiltonian, that is the interaction between the potential generated by the MM fixed charges and the electron density. Dispersion – repulsion contributions are usually considered by resorting to classical potentials.
• Polarizable embedding: the mutual polarization between MM and QM portions is explicitly taken into account, thus recovering the concept of reaction field. Polarizable embedding work in such a way that the MM force field contains a response term, which modifies electrostatics due to the presence of the QM density. A polarization term is included in the QM Hamiltonian to represent the interaction between the electronic density and the MM portion. Also in this case, dispersion-repulsion contributions are treated in terms of classical potentials.
Among the several QM/Polarizable Embedding schemes that have been proposed in the literature,19,39,56–64 the most similar to the continuum approach, from the formal point of view, is the QM/Fluctuating Charge (QM/FQ) method,1,19,26 which was pioneered by Bryce et al.65 There, the environment is described by exploiting a set of charges, which vary as a response to the external potential due to both the other MM atoms and the QM portion. In parallel, FQs polarize the QM density, so that a mutual polarization is established. While the description of the environment in terms of electric charges directly recalls the continuum picture, in FQ the charge transfer between the atoms is governed by the electronegativity equalization principle,66 which states that, at the equilibrium, the electronegativity of all atoms needs to be the same. Therefore, while continuum approaches are entirely based on electrostatics, the FQ force field finds its theoretical foundations in conceptual density functional theory (CDFT).67 In fact, it is defined in terms of electronegativities and chemical hardnesses, i.e. energy first and second derivatives with respect to the number of particles, which are quantities well-defined in CDFT framework.
The formal similarity of the two approaches, and especially their formulation in terms of electric charges, make the way the energy and molecular properties are formulated very similar. For instance, by taking QM/PCM as an example of continuum solvation approaches, the interaction between the QM and classical portions is expressed as follows:
![]() | (1) |
![]() | (2) |
To end this section, it is worth highlighting that a three-layer approach, combining QM/FQ with an outer PCM (spherical) layer (QM/FQ/PCM), has been proposed and extended to the calculation of several spectral signals.15,70–72 In this case, extensive numerical testing has evidenced little effect of the PCM layer on computed properties/spectra.73,74 This is mainly due to the fact that in all applications a sufficiently large number of explicit solvent molecules around the solute have been considered. Thus the PCM layer simply results in a sort of confining potential, with only a scarce role in computed values.
Conceptually, solvent effects can be categorized as follows:
• Direct effects: the introduction of a solvent-dependent term in the QM/classical Hamiltonian modifies the solute ground state and propagates to spectral properties because solvent contributions also appear in response equations, such as Coupled-Perturbed Hartree–Fock (CPHF) or Kohn–Sham (CPKS) equations, or in the formulation of analytical energy derivatives.19,26
• Indirect effects: the solvent modifies minimum geometry(ies). Also, it may change the population of conformers, or give rise to conformations that cannot be accessible in the gas-phase.
The specific formulation of direct effects, i.e. the resulting terms entering the equations to be solved, depends on the actual solvation model (either implicit or atomistic) and the QM level that is adopted to describe the solute. Generally, while highly correlated and/or multireference approaches can be fruitfully exploited to study electronic spectra, vibrational, vibronic, magnetic and mixed electric/magnetic spectroscopies are usually computed at the DFT level, which constitutes a good compromise between computational cost and accuracy.88
Indirect (geometrical/conformational) effects are instead treated by resorting to different strategies in the two families of methods. In implicit calculations, the minimum geometries are determined through geometry optimization on the solute's potential energy surface (PES) as redefined with the inclusion of solvent-depending terms in the Hamiltonian/energy. Also, the molecular cavity is re-adapted to the diverse geometries that correspond to minima, however the dynamics of the solvation phenomenon (i.e. the solvent geometrical rearrangement around the solute) is discarded, giving rise so a substantially “static” picture. Differently, atomistic descriptions require a full configurational sampling of the solute–solvent phase-space, which may be achieved by resorting to classical or ab initio molecular dynamics (MDs) simulations, Montecarlo sampling, and other advanced sampling techniques.89 In this way, the dynamics of the solvent around the solute and their mutual geometrical conformation are taken into account. Obviously, the description of indirect effects that results from implicit/explicit approaches might be drastically different. Generally, explicit models yield more realistic descriptions, especially in the case of strongly interacting solute–solvent couples, dominated for instance by hydrogen-bonding effects.
The account of indirect solvent effects on spectral properties has relevant consequences from the computational point of view. In fact, implicit approaches follow the same strategies as QM methods for gas phase systems: the calculation of the spectral property is performed on each minimum conformation and then averaged according to Boltzmann populations computed by including solvent effects in the evaluation of (free) energies. Therefore, the cost of the calculation basically stays the same as for isolated systems, if self-consistent-field (SCF) descriptions (HF or DFT) of the QM portion are exploited. A different approach is followed in atomistic simulations. First, limiting the evaluation of the spectral properties on a single structure for each solute conformer surrounded by the solvent placed at specific geometrical positions is unphysical, because realistically the solvent configurations around the solute need to be sampled. For this reason, the spectroscopic signal is calculated on a set of uncorrelated structures taken from the sampling, whose number is selected so as to ensure the convergence of the property. Therefore, while in implicit descriptions the number of calculations equals the number of conformers, atomistic treatments require to evaluate the property of a number of structures that may vary from a few tens to several thousand.7,15,40 As a consequence, the total computational cost of explicit modeling is roughly n times larger than the corresponding implicit description (where n is the number of snapshots). These two procedures also highlight an important conceptual difference between the two approaches: in atomistic modeling, the average over solute–solvent configurations is explicitly performed, whereas in implicit methods the average is intrinsically considered by the solvent permittivity constant, which is by itself an averaged quantity.
The measured spectrum is the result of the evolution in time of the solute density interacting with the external field, which oscillates within a specific frequency range (UV-Vis, IR, MHz ecc.). This range determines how the spectrum is actually computed at the various QM levels, i.e. the formalism and equations that need to be solved. However, the QM theoretical framework needs to be coupled with the solvent evolution in time under the effect of the probing field. In particular, the solvent may only partially follow the solute which evolves under the effect of the field, because some of the solvent degrees of freedom can follow or stay frozen depending on the timescale associated with the solute excitation.
Also, modifications in the external electromagnetic field caused by the presence of the solvent need to be considered. This concept dates back to the Onsager model,38 where the “local field” is first introduced. In that case, it can be analytically evaluated for both static and dynamic fields, in terms of permittivity constants. In modern implicit solvation schemes, the presence of a molecular-shaped cavity and the description of the solute at full QM level yield local field factors not only depending on the solvent dielectric properties but also on the solute, and the specific spectral bands.17,18,90–92 Local field effects have been specified to many response and spectral properties treated by means of implicit solvation models, notably improving the agreement between calculated and experimental values. Much less attention has been paid so far to the formulation of local field effects for atomistic solvation models.93,94
Differently, in polarizable QM/MM approaches nonequilibrium is automatically described by readjusting the MM polarizable quantities (multiples) to the transition density (we specify here the discussion to TDHF/TDDFT).20,36,59,94,98 The other solvent degrees of freedom are kept frozen by fixing the geometry of the frames. Methods based on a combination of fixed multipoles and polarizable dipoles (such as Induced Dipoles and AMOEBA) have a clear correspondence with the implicit picture.56,62,63 In fact, the “fast” degrees of freedom are modeled in terms of polarizable dipoles, which are equilibrated to the transition density, whereas the fixed multipoles represent the “slow” degrees of freedom. These concepts are more subtle in the case of QM/FQ, where static and polarization contributions are calculated together. However, the static contribution can be seen as arising from differences in atomic electronegativities, while the polarization contribution is due to the chemical hardness, which enters the definition of the FQ matrix.20,99 It is also clear that since in electrostatic embedding the MM multipolar distribution is fixed and does not depend on the QM density, neither nonequilibrium nor direct contributions to response equations are described, thus yielding an unrealistic picture of the electronic excitation.20
As stated above, polarizable approaches introduce additional terms in the QM Hamiltonian. When extending the models to the treatment of absorption spectra, it comes out that state-specific and state-average pictures deriving from the linear response formalism in the SCF context, which in vacuo coincide, become different. Two complementary formalisms can be exploited,19,36,57,59,62,98,100–102 depending on which density the MM polarizable variables are adjusted to. In the linear response (LR) regime, they linearly respond to the transition density,98,103 while, in state-specific formulations,104–106 they are adjusted to the excited-state electronic density, for instance by resorting to a first-order correction (corrected linear response – cLR), which has been proposed for both implicit and explicit approaches.98,103
From the physical point of view, while in the LR regime dynamic solute–solvent interactions (similar to dispersion interactions) are described, cLR takes into account the relaxation of the solute density in a specific excited state. For this reason, the two formalisms can be seen as complementary and simultaneously considered, such as in the so-called cLR2 approach.34,107
The above discussion assumes the system to be in a nonequilibrium condition. However, once the solvent degrees of freedom fully relax to the solute's excited state electronic density, a novel equilibrium condition arises. This is only achievable if the excited state lifetime is long enough to allow solvent reorganization. Generally, the latter occurs on the order of picoseconds, i.e. it is slower than most excited state lifetimes. Solvent reorganization takes place in case of emission phenomena, such as fluorescence. In this case, the solute–solvent couple is at equilibrium in the excited state, and once vertical emission takes place a new nonequilibrium situation arises because the solvent slow degrees of freedom stay fixed to the excited state. In order to consistently describe this phenomenon, implicit approaches resort to geometry optimization (and cavity adjustment) in the excited state geometry.13 In discrete models, excited state equilibration is much more difficult to describe because the commonly exploited classical methods to sample the phase space are designed for the ground state. To solve this problem, excited state QM/MM MD simulations can be exploited, but their computational cost might become prohibitive. A pragmatical alternative implies the parametrization of classical force fields to properly describe the solute's excited state, however this topic has received so far only a little attention, so established protocols still need to be defined.108,109
Within the harmonic approximation, it is assumed that the molecule lies in the PES minimum, which within the focused approach is found by only optimizing the solute's geometry. This corresponds to assuming that the spectral signal is determined by the solute, and only modified by the environment. QM/MM geometries, which as already mentioned above, are obtained from a sampling of the solute–solvent phase-space, are not necessarily associated with energy minima. Consistently with the focused approach, QM/MM vibrational properties are computed by optimizing the solute geometry but keeping the structure of the environment fixed (i.e. the position of solvent molecules).73,74,116 From the theoretical point of view this procedure complies with the so-called Partial Hessian Vibrational Approach (PHVA),117 thus only the QM–QM portion of the Hessian matrix is retained, whereas the MM–QM, QM–MM, and MM–MM blocks are discarded.
From the computational point of view, simulating vibrational properties by using QM/MM approaches is more demanding as compared to implicit descriptions. This is mainly due to the fact that the geometries obtained from the sampling might be far from solute minima, and might require a large computational cost (in terms of the number of iterations) to converge. However, the QM/MM description of vibrational spectra that is achieved compares much better with experimental findings, especially in the case of aqueous solutions (vide infra).1 Notice also that attempts to reduce the computational cost associated with the calculation of the QM/MM Hessian have been recently proposed in different contexts (vibronic and Resonance Raman spectra),118,119 by projecting out the out-of-equilibrium normal modes, which are generally associated with the flexible dihedral angles. The selection of such modes is far from trivial, therefore it may strongly benefit from novel techniques to automatically select torsional modes.120
Finally, it is worth mentioning that specific spectral regions may be dominated by anharmonic effects.121 However, due to the large computational cost associated with anharmonic calculations, they are generally computed with implicit solvation approaches.122
It is also worth remarking that in either QM/continuum or QM/atomistic descriptions, the interaction between the QM and the classical portion is retained as purely electrostatic (see also eqn (1) and (2)). Non-electrostatic interactions (exchange, repulsion, dispersion) are generally discarded. However, while this assumption is reasonable for electric perturbations, it is much less justified for magnetic perturbations, where density fluctuation and confinement may play a relevant role up to generate induced densities in the environment. To the best of our knowledge, such effects have only marginally been taken into account in QM/classical approaches. As for QM/FQ, it has recently been challenged to describe EPR hyperfine coupling constants with the inclusion of no-electrostatic solute–solvent interactions,83 however full extension to magnetic properties and extensive testing are still missing.
RR signals and the environment are so tightly connected that RR is often exploited to evaluate solvent reorganization energies following electronic transitions.136 The presence of the solvent influences the positions of the peaks, which are related to modifications in vibrational energies, which in turn follow from changes in the PES. Therefore, reliable computational protocols need to be able to model at the same time and with similar accuracy the effects of the solvent on electronic transition energies, geometries, frequencies, and vibrational wave functions. Clearly, this means that solvent terms enter the calculation of the molecular electronic wave function, and alter the solute PES, i.e. equilibrium geometries, vibrational frequencies, and normal modes. Also, solvent terms need to be included in the evaluation of the excited state properties which are required to evaluate RR spectral intensities, i.e., transition dipole moments, excited state forces, and possibly Hessian, depending on the theoretical level which is exploited to describe RR signals.131
Within the implicit solvation framework, the nature of RR as a mixed vibrational–electronic spectroscopy calls for care when considering the solvent time evolution, in order to correctly account for the dynamical aspects of the solute–solvent interaction. The RR polarizability, which rules spectral intensities, originates from the time evolution of a vibrational wave packet on the excited state PES. Depending on the time scale of such an evolution, some of the solvent's nuclear degrees of freedom may remain static. A possible way to simulate this regime with implicit modeling in an approximate way (see ref. 137 for more details) consists of keeping the molecular cavity fixed to the solute GS equilibrium geometry and model normal models and frequencies within the vibrational nonequilibrium regime. The (electronic) nonequilibrium excitation energy derivative is then computed by also keeping the cavity fixed. Generally speaking, univocal assessment of the correct (nuclear + electronic) solvation regime in modeling RR spectra is not straightforward. Different combinations have been tested in ref. 137 leading to considerable differences in computed spectra. In addition, the solute–solvent interaction term acting on the RR polarizability is modeled by considering a dephasing term that can no longer be attributed solely to the finite lifetime of the molecular excited electronic state but should take into account the experimental conditions.137 The above discussion should have highlighted the complexity of RR signals, and their consequences on the computational approach. Clearly, in order to properly simulate RR of solvated systems by means of an explicit (polarizable) QM/MM approach, the latter needs to be able to tackle at the same time and with the same level of accuracy, the GS electronic and vibrational degrees of freedom of the system, as well as its excited states. Remarkably, QM/FQ has such features, therefore it has recently been successfully applied to the calculation of RR.8,134,135 Due to the full atomistic nature of the model, nonequilibrium effects are automatically considered by the computational protocol, in line with what has been discussed in Section 3.1 for electronic spectra.
Chiroptical spectroscopies are another fascinating class of mixed electric, magnetic and/or vibrational responses.138 Examples are Electronic Circular Dichroism (ECD), Optical Rotation (OR), Vibrational Circular Dichroism (VCD) and Raman Optical Activity (ROA).138–140 These kinds of spectroscopies are generally exploited to assign the absolute configuration of chiral molecules, especially molecular drugs, and for this purpose need to be assisted by molecular simulations.26,141,142 The effect of the solvent on the measured signal can be drastic, i.e. the solvent can even change the sign of the measured properties or the sign of specific bands with respect to simulations for the isolated systems.26,72 Such complexity comes together with the complexity of the signals to be simulated, which require the simultaneous consideration of electronic and magnetic perturbations, further coupled to geometrical perturbations for VCD/ROA.139 Also, all perturbations need to be treated at the same level of accuracy, to avoid unphysical descriptions. While QM/PCM has been extended to the analytical evaluation of all aforementioned chiroptical properties and spectroscopies,143–145 to the best of our knowledge QM/FQ is the only polarizable QM/MM method able to treat all these signals, remarkably through analytical algorithms.19,73,74,116
The strategy to model chiral spectra derives from the nature of the spectral signal. In fact, in the case of ECD, the theoretical framework does not require the calculation of spectra on energy minima, therefore the same strategy followed for purely electronic or magnetic spectroscopies can be exploited.91,146 This again means that continuum calculations are generally performed on GS PES minima, whereas QM/MM ECD calculations may be performed on structures extracted from the GS sampling without any further optimization.147,148 As for direct solvent effects, additional terms arising from the presence of the solvent need to be considered in the calculation of the response tensors to the presence of the external circularly polarized light, by simultaneously taking into account the electric and magnetic components. Also, the description of solvent nonequilibrium is the same as electronic absorption spectroscopy, then follows the same strategies discussed in Section 3.1.91
Moving to VCD or ROA, the spectral signal arises from the interaction of the polarized light (electric and magnetic components) with the vibrational degrees of freedom of the molecule, which lies on the GS. Therefore, the same consideration as IR and Raman spectra hold. Then, to comply with the harmonic approximation, it is necessary for the calculations to be performed on PES minima. Reaction and local field effects need also to be included in the calculation of QM/PCM VCD and ROA tensors, as well as a proper description of solvent vibrational nonequilibrium effects, similar to what is discussed in Section 3.2.73,116,144,145
Before deepening the discussion of the selected examples, it is worth pointing out that continuum solvation models are nowadays available in most quantum chemistry software, both commercial and open-source, so that various kinds of spectra can be computed in a plethora of solvents. Open-source libraries, such as PCMSolver,149 are also freely available and interfaced with many codes. Unfortunately, the same does not hold for polarizable QM/MM approaches, which are only sporadically included in quantum chemistry codes or in open-source libraries.150 Most implementations, however, are limited to a single specific force field, thus limiting wide applicability. A remarkable exception is ADF151,152 which is part of the Amsterdam Modeling Suite (AMS),153 where three different polarizable force fields, either based on FQ or induced dipoles63 are available and extended to several kinds of spectra, thus allowing for a consistent comparison between the various approaches.40 Also, the current implementation in ADF, and the related graphical interface, guide the user along the whole computational protocol which needs to be followed to perform (polarizable) QM/MM spectral calculations.154
![]() | ||
Fig. 1 (a) Graphical depiction of BET HOMO and LUMO in aqueous solution. (b) Experimental and computed QM/PCM, QM/EE, QM/FQ absorption energies as a function of solvent polarity ϕ. (c) Experimental and computed QM/PCM, QM/EE, QM/FQ solvatochromic shifts with respect to the in vacuo absorption (experimental: 1.66 eV; computed: 1.18 eV). QM level: CAM-B3LYP/6-31+G*. Figure reproduced from ref. 34. |
In ref. 34, BET absorption spectrum has been computed at the QM/PCM, QM/EE and QM/FQ levels of theory, as dissolved in dioxane (DIO, polarity index ϕ = 0.164), acetonitrile (ACN, ϕ = 0.460), methanol (MET, ϕ = 0.762) and water (WTR, ϕ = 1.000). The solvent polarity scale ϕ is defined in terms of the solvatochromic shift exhibited by BET lowest π → π* electronic excitation, which has a charge-transfer character (see Fig. 1a). In Fig. 1b, computed and experimental156,157 absorption energies as a function of the solvent polarity are graphically depicted. Among the tested computational approaches, the polarizable atomistic QM/FQ is the only one that is able to reproduce the experimental behavior, while both the non-polarizable QM/EE and the implicit QM/PCM values flatten out as the solvent polarity increases. The differences between the tested methods can be further appreciated by comparing solvatochromic shifts, i.e. the differences between the absorption maxima in solution and in gas-phase (see Fig. 1c). First, we note that QM/PCM cannot correctly reproduce the experimental solvatochromic shifts for the most polar and protic solvents. The non-polarizable QM/EE, instead, gives values that are in very good agreement with the reference data, in particular for DIO, ACN and MET. The polarizable QM/FQ approach predicts solvatochromic shifts which are generally larger than experiments. However, it is worth noting that all approaches limit the QM-classical interaction to purely electrostatic forces. However, solute–solvent Pauli repulsion can play a relevant role, providing quantum confinement in the electronic density. For instance, if these effects are taken into account (for instance by using the approach developed by the authors in ref. 80), for aqueous BET the estimated error moves from 0.46 eV to 0.1 eV, overall reducing computed solvatochromic shifts. Therefore, the QM/EE good performance reported above is probably caused by error cancellation, due to the fact that both Pauli repulsion and solute–solvent polarization effects are neglected. Notice that very good agreement between computed and experimental values is also reported by the polarizable method exploited in ref. 62, which is based on a multipolar expansion of the QM/MM interaction.
Similar outcomes result from the study of the n → π* transition of pyrimidine (see Fig. 2a), which is a rather rigid molecule.98,158–161 In this case, the values are averaged over 200 snapshots that have been extracted from MD simulations where pyrimidine is kept fixed to the QM/PCM optimized geometry. In this case, we focus simply on vacuo-to-water solvatochromic shifts, as computed by using purely implicit QM/PCM or explicit polarizable QM/FQ and non-polarizable QM/EE approaches. From the inspection of Fig. 2b, it is clear that while QM/PCM cannot reproduce the experimental reference values, QM/EE apparently yields solvatochromic shifts in agreement with experiments. Similarly to the previous case, the polarizable QM/FQ, instead, overestimates, in absolute value, the experimental solvatochromic shift. However, an almost perfect agreement is obtained if Pauli repulsion effects are included. Indeed, quantum confinement has a drastic effect on calculated solvatochromic shifts, affecting the data by about 30/40% on average.
It is worth remarking that a systematic study on the computational prediction of vacuo-to-water solvatochromic shifts by exploiting both non-polarizable QM/EE, and polarizable QM/FQ, QM/FQFμ, QM/DRF (fixed charges and induced dipoles in the MM portion) has recently been proposed.40 The results reveal that none of the approaches outperform the others and that errors strongly depend on the nature of the molecular transition to be described, thus highlighting that the description of solvatochromic shifts is indeed a delicate task.
To conclude the discussion on electronic spectra, we focus on the fluorescence spectrum of acetone in an aqueous solution, which is an illustrative example of the complexity associated with emission phenomena. Although acetone is a rather small molecule, in aqueous solution it undergoes a geometrical distortion in its first excited state geometry (see Fig. 3), which is associated with a n → π* transition. In fact, while in the ground state, the improper dihedral angle δ oscillates around 0 degrees, in the excited state, the geometry is distorted and δ oscillates between ±20 degrees. Therefore, to properly predict the fluorescence spectrum of aqueous acetone, the solute–solvent phase-space in the excited state needs to be accurately sampled. To this end, classical MD simulations can be exploited, however they need to be based on force fields that are suitably refined to reproduce ES energies, gradients, and Hessian, together with a correct accounting for changes in the solute electronic density.
![]() | ||
Fig. 3 Experimental and computed QM/PCM, QM/FQ and QM/FQ + rep emission vacuo-to-water solvatochromic shifts for acetone in aqueous solution (see also ref. 108). QM level: CAM-B3LYP/aug-cc-pVDZ. |
The emission solvatochromic shifts associated with the π* → n transition of acetone in aqueous solution as calculated at the QM/PCM and QM/FQ levels are graphically reported in Fig. 3. QM/FQ values are obtained by averaging over a set of uncorrelated snapshots extracted from classical MD simulations. Effects of Pauli repulsion contributions are also considered for QM/FQ (+Rep) again by means of the method proposed by the authors in ref. 80. Indeed, repulsive interactions affect the averaged QM/FQ emission energy, thus yielding a final computed shift of about ∼−0.01 eV. This value is in almost perfect agreement with the experimental shift (see Fig. 3), showing that QM/FQ + rep is the only method that can model the experimental negative solvatochromic shift. Remarkably, a positive shift is reported instead by both QM/FQ and QM/PCM.
![]() | ||
Fig. 4 Experimental162 and calculated QM/PCM, QM/FQ and QM/FQFμ IR spectra of gallic acid in aqueous solution (QM/PCM FWHM = 10 cm−1; QM/MM FWHM = 4 cm−1). QM/PCM, QM/FQ and QM/FQFμ stick spectra are also depicted as grey lines. QM level: B3LYP/6-311++G**. |
The quality of the reproduction of experimental data163 is also different for the two approaches. In fact, the experimental spectrum is characterized by three bands in the region 1200–1500 cm−1, which involve a combination of different normal modes, and by the presence of a well-separated peak at 1000 cm−1 (C–OH bending) and three low-in-intensity bands (1500–1800 cm−1), that are associated to composite C–OH bending modes of the hydroxyl groups and the acidic CO stretching. While the general features of the experimental spectrum are generally reproduced by all solvation models, the intensity of the band at about 1350 cm−1 (the most intense in the experimental spectrum) is correctly predicted only by QM/FQFμ. Remarkably, it is interesting to note that as it is reported in ref. 162, the intensity of this peak is correctly reproduced by exploiting a supermolecule approach, where 8 water molecules are included in the QM region in QM/PCM calculations. Therefore, specific and non-electrostatic interactions probably play a substantial role in this spectral region, therefore purely electrostatic approaches, such as QM/PCM, QM/FQ and QM/FQFμ cannot perfectly reproduce the experimental findings. Finally, the inhomogeneous band broadening that is predicted by both atomistic approaches is in very good agreement with the experimental data, in particular for the patterns in the spectral region between 1200 and 1400 cm−1.
As an additional example, we move to the vibrational absorption spectrum of methyl-lactate (MLac) in aqueous solution.74 QM/PCM predict three main conformations (see red dots in Fig. 5b),74,164 as a consequence of the flexibility of the dihedral angles δ1 and δ2 (see Fig. 5a). A completely different conformational distribution arises from MD simulations (see Fig. 5b). In fact, MLac tends to cluster around a conformer, which is however absent in the QM/PCM conformational analysis. Such a difference is due to the formation of hydrogen bonding interactions between MLac and water in MD simulation, whereas QM/PCM structures are mainly characterized by intra-molecular hydrogen bonding, due to the absence of explicit solvent molecules in the PCM description. The resulting MLac IR and Raman spectra can be compared with the experiments. By first considering the IR spectrum (Fig. 5c, we note that QM/PCM and QM/FQ yield a completely different description of band relative intensities and broadening. The region between 1200–1300 cm−1, which involves OH group vibrational motions, is almost perfectly reproduced by QM/FQ. On the contrary, the QM/PCM spectrum is characterized by an intensity pattern that disagrees with the experimental spectrum. Similar findings also apply to the Raman spectrum (see Fig. 5d), for which QM/PCM and QM/FQ predict different relative intensity patterns, especially in the region between 1200–1300 cm−1. This confirms once more the benefit of atomistic descriptions of the environment.
![]() | ||
Fig. 5 (a) MLac molecular structure. δ1 and δ2 dihedral angles are also highlighted. (b) MD population map as a function of δ1 and δ2 (blue triangles). The three QM/PCM conformations (red circles) and their populations are also given. (c and d) Experimental164,165 and computed QM/PCM and QM/FQ IR (c) and Raman (d, 488 nm) spectra of aqueous MLac (see also ref. 74). QM level: B3LYP/aug-cc-pVDZ. Figure is reproduced from ref. 1. |
![]() | ||
Fig. 6 Computed QM/PCM, QM/FQ/PCM, and QM/QMw/PCM solvent-induced shifts on chemical shifts (ppm) for pyrimidine in aqueous solutions.72 Values for hydrogen (a) and heavy (b) atoms are shown (only non-equivalent atoms are considered). Chemical shifts are computed on the minimal cluster depicted in panel (a). QM level: B3LYP/6-311+ +G(2df,2pd) |
Among the different representations of solvent effects, QM/QMw/PCM can be considered the most accurate and can be taken as a reference, because all interactions between the solute and the two water molecules are retained at the QM level. To quantitatively evaluate solvent effects, solvent-induced shifts as compared to gas-phase results are analyzed and graphically depicted in Fig. 6a and b for hydrogen and heavy atoms, respectively. While solvent has a minor impact on both hydrogen and carbon atoms, it increases the shielding of the nitrogen atom by about 30%. This is not surprising considering that the two explicit water molecules are hydrogen bonded to pyrimidine through the nitrogen atom. Also, QM/FQ/PCM behaves particularly well as compared to the QM/QMw/PCM reference, reporting a root mean square error (RMS) of just 0.4 ppm. Differently, QM/PCM predicts an RMS of 15.2 ppm. Such an error primarily arises from the wrong prediction of the NMR shielding for nitrogen atoms (Fig. 6b), which are instead almost perfectly described by QM/FQ/PCM (the deviation is of just 0.3 ppm). Such findings confirm the benefit of atomistic descriptions to accurately describe hydrogen-bonded systems. To end the discussion, computed solvent shifts can be compared to experimental data. Notice that experimental reference data for calculating solvent shifts are reported for a cyclohexane solution (i.e. not for the isolated molecule), which can be reasonably described by means of QM/PCM,166 due to the absence of strong/specific solute–solvent interactions. The agreement with the experimental value reported in ref. 167 for nitrogen (16.84 ppm) is particularly good (12.73 ppm), especially considering that in our modeling solute–solvent interactions are limited to the electrostatic component, while, as explained in Section 3.3, non-electrostatic forces are expected to play a relevant role in magnetic properties.
![]() | ||
Fig. 7 Experimental and computed QM/PCM, QM/FQ Raman (a) and Resonance Raman (b) spectra of NMA dissolved in aqueous solution. QM level: B3LYP/aug-cc-pVDZ. Figure is adapted from ref. 8. |
![]() | ||
Fig. 8 (a) Computed QM/FQ/PCM OR (589 nm) for aqueous (R)-MOXY as a function of time (MD snapshot). (b) Computed QM/PCM and average QM/FQ/PCM OR at 589 nm of aqueous (R)-MOXY. Vibrational corrections computed at the QM/PCM level with the inclusion of vibrational nonequilibrium effects are also added to QM/FQ/PCM results (+NeqVib). Experimental data reproduced from ref. 175 (Exp1) and ref. 173 (Exp2) are also reported. |
The final QM/FQ/PCM value is obtained by averaging computed values on a number of snapshots extracted from MD simulations. Differently from the spectral properties mentioned in the previous sections, where a few hundred snapshots guarantee the convergence of the average property, for OR 2000 frames are necessary, thus clearly increasing the computational cost as compared to the static QM/PCM calculation, which only requires a single calculation (R-MOXY features only a single minimum conformation in the PCM PES). The above discussion clearly highlights the complexity of modeling OR, especially when the solute is dissolved in strongly interacting environments, such as aqueous solutions. This can be further appreciated by observing the QM/FQ raw data associated with OR calculations on the single snapshots extracted from the MD simulation (see Fig. 8a). It is evident that not only does the magnitude of the computed OR vary as a function of the considered frame (oscillating between 100 and −100 degrees, while the average falls around −1.2 degrees), but also the sign. The success of the atomistic QM/FQ/PCM approach is the result of the accurate modeling of the physico-chemical features which dominate this property, which are in this case hydrogen bonding interactions and dynamic solvation effects. Similar findings have been reported in ref. 174, where the closest water molecules are included in the QM region, at a much larger computational cost.
To further demonstrate how sensitive chiroptical spectroscopies are to the external environment, we report the case of the Vibrational Circular Dichroism (VCD) spectrum of zwitterionic (L)-alanine (ALA) in aqueous solution.116,176 As stated above, VCD is the chiral analogous of the IR spectrum and is obtained by irradiating the sample with left and right circularly polarized light; the spectrum results from the difference between the absorption of the two radiations in the infrared region.140 Alanine experimental VCD spectrum is characterized by a significant (+, −, +) pattern in the region between 1250 and 1450 cm−1. The three bands are associated to normal modes which involve the symmetric stretching of the CO2− group and the orthogonal methine CH bending mode. QM/FQ/PCM rotational strengths calculated for each snapshot are reported in Fig. 9, middle panel: a very large variability, in both sign and magnitude, is clearly reported, in line with what has been commented above for MOXY OR. This highlights once more the difficulties related to the prediction of chiroptical spectra of systems embedded in aqueous solution. Similar to the gallic acid IR spectrum, also in this case QM/PCM features much poorer spectral details (see Fig. 9, top panel).
![]() | ||
Fig. 9 QM/PCM (blue), QM/FQ/PCM (dark green) and experimental (red)176 VCD spectra of aqueous (L)-alanine. QM/PCM and QM/FQ/PCM raw spectra are also plotted as grey lines. QM level: B3LYP/6-311++G**. |
As a result, the static QM/PCM and the dynamic QM/FQ/PCM predict completely different spectra, also with regard to sign alternation patterns (see Fig. 9). In fact, QM/PCM fails at reproducing the experimental sign pattern, reporting a (−, −, +, +) sign alternation which is not compatible with the experimental spectrum of the L enantiomer. On the other hand, the agreement between QM/FQ/PCM and experimental data is almost perfect and indeed particularly impressive if compared to QM/PCM data. The only slight disagreement between QM/FQ/PCM and experimental spectra stays in the relative intensities of the bands in the region between 1300 and 1500 cm−1. However, the sign is correctly reproduced. It is finally worth pointing out that the dissimilarities between the implicit and explicit descriptions in the prediction of VCD spectra, not only arise from the diverse accounting of the solvent effects on the spectroscopic signal but are also the consequence of explicit solvent effects on the molecular geometry, which is in this case reoptimized to comply with the harmonic approximation.
This journal is © The Royal Society of Chemistry 2023 |