Open Access Article
Ekaterina
Zossimova
*ab,
Johannes
Fiedler
c,
Frank
Vollmer
a and
Michael
Walter
bde
aDepartment of Physics and Astronomy, Living Systems Institute, University of Exeter, EX4 4QD, Exeter, UK. E-mail: ez216@exeter.ac.uk
bFreiburg Center for Interactive Materials and Bioinspired Technologies (FIT), University of Freiburg, D-79110 Freiburg, Germany
cDepartment of Physics and Technology, University of Bergen, Allégaten 55, 5007 Bergen, Norway
dCluster of Excellence livMatS @ FIT, Freiburg, Germany
eFraunhofer IWM, MikroTribologie Centrum μTC, Freiburg, Germany
First published on 26th February 2024
Optical whispering gallery mode biosensors are able to detect single molecules through effects of their polarizability. We address the factors that affect the polarizability of amino acids, which are the building blocks of life, via electronic structure theory. Amino acids are detected in aqueous environments, where their polarizability is different compared to the gasphase due to solvent effects. Solvent effects include structural changes, protonation and the local field enhancement through the solvent (water). We analyse the impact of these effects and find that all contribute to an increased effective polarizability in the solvent. We also address the excess polarizability relative to the displaced water cavity and develop a hybrid quantum-classical model that is in good agreement with self-consistent calculations. We apply our model to calculate the excess polarizability of 20 proteinogenic amino acids and determine the minimum resolution required to distinguish the different molecules and their ionised conformers based on their polarizability.
Amino acids are the building blocks for proteins that are central for living systems. It is essential to understand how to categorise and potentially differentiate amino acids7 using optical biosensors due to their diverse applications in healthcare, environmental monitoring, biotechnology, and scientific research.8,9 Amino acids are particularly sensitive to solvent effects due to their amphoteric behaviour. At low pH, the amino group (–NH2) acts as a base and accepts a proton from the solvent. At high pH, the carboxyl group (–COOH) donates a proton to the surrounding medium.
Within first-order perturbation theory, the shift of the WGM resonance is proportional to the dipole polarizability of the molecule in excess of the displaced solvent,10i.e. the excess polarizability of the molecule
. Numerous methods exist to calculate the gasphase polarizability α of molecules, including additive atomistic models,11,12 electronic structure methods13,14 and machine learning algorithms15–17 as the polarizability is central for the understanding of e.g. weak van der Waals interactions.18–20 However, it is not so obvious how to transform
when the molecule is in a solvent21,22 due to several screening and field enhancement effects.23,24 Developing an accurate model for
can help to decode the biosensor signals and improve the specificity of polarizability-based biosensing techniques.
Density functional theory (DFT) is a quantum-mechanical method that is used to calculate molecular properties from their electronic density. DFT intrinsically accounts for molecular charge and can be conveniently combined with implicit solvent models to study solvated single molecules where the solvent is modelled as a continuum.25–27 The inclusion of solvent effects is essential as the protonation state of the amino acid can be expected to affect its polarizability. Previous studies have shown that implicit solvent models are able to describe the relative energetic stability of different conformers and protonation states of molecules in aqueous environments.28–31
Here, we model the solvation shell encompassing the molecule as a smooth cavity32 to obtain optimised solvated structures of 20 proteinogenic amino acids, including their different protonation states. The smooth cavity model does not distinguish between the externally applied field and the local field seen by the solute molecule. The local fields are required to calculate
and can be included using the classical Onsager model.33 This leads to a self-consistent equation for the polarizability that must be solved numerically.24 Alternatively, we also propose a hybrid quantum-classical model for
that combines DFT calculations for molecular polarizability (without local field corrections) and scattering effects through a 3-layer cavity model for finite-sized particles. We implement both approaches in GPAW DFT code34,35 and find that they give comparable results for
. However, the hybrid quantum-classical model outperforms the numerical model in terms of computational speed and efficiency.
| μ = μ0 + αF, | (1) |
![]() | (2) |
![]() | (3) |
The WGM laser typically probes molecules using near-infrared to visible light to avoid radiation damage on the molecular level.2 The associated frequencies are well below the electronic excitations of amino acids,37 such that we can consider their real-valued response functions in the electrostatic limit,38i.e. α(ωWGM) ≈ Re[α(ω ↦ 0)] ≡ α. This defines the electronic static polarizability of the molecule, which should represent a good approximation at the frequency of the exciting light.22,39 We further assume that the molecules are very small compared to the wavelength of the WGM laser, such that the spatial variation of the biosensing field in the range of the molecule can be neglected.
Under these conditions, the linearity in the external field F allows us to calculate α as
![]() | (4) |
There are changes for molecules in a solvent,21,23,40,41 where the best model to describe these effects is not settled in the literature.42 The influence of the solvent can be separated conceptually into two main factors:22 first, the solvent affects the electron density of the molecules; second, it changes the local field FL acting at the position of the molecule.23,33 Generally, the effective polarizability within the solvent α* is defined analogously to eqn (1) as22
![]() | (5) |
Using α-gasphase as a reference, we can determine αPCMg,ij = [ΔμPCMg(FL)]i/FLj within the polarizable continuum model (PCM) using the gasphase nuclear structure to isolate the effect of the solvent on the electronic density. Fig. 1a shows that this merely scales the gasphase polarizability by 9% consistently. The increase in the PCM polarizability compared to the gasphase is commonly found in the literature,43,44 but was criticised by some authors.45 The FA of the gasphase structures is not changed within the PCM, as shown by the green data points in Fig. 1b. The FA is thus solely caused by structural asymmetries within the molecule.
We next consider conformational and structural changes of the amino acids in water. Amino acids have charge-neutral functional groups in the gasphase, whereas these become ionised in the solvent. Most of the proteinogenic amino acids are zwitterions at physiological pH and several are either anions (aspartic acid, glutamic acid) or cations (arginine, lysine). The unbalanced charge distribution across the ionised conformers is more sensitive to the external field, resulting in a larger polarizability for the solvated vs. gasphase structures in the PCM solvent (αPCMvs. αPCMg in Fig. 1a). The polarizability increase due to conformational change is approximately 5.5% on average, resulting in a total scaling factor of 1.15 relative to α-gasphase.
The conformational and structural changes also affect the FA, as shown by the orange data points in Fig. 1b. The solvated structures tend to have a higher polarizability anisotropy compared to the gasphase structures. Only histidine (H+), aspartic acid (D−, D) and leucine (L) show a lower fractional anisotropy for the solvated structures compared to the gasphase structures in the PCM solvent. These molecules correspond to the orange data points below the dashed identity line in Fig. 1b.
Finally, we have to consider changes in the local field due to the formation of a cavity around the solute molecule. This is not accounted for in our electronic structure calculations of αPCM. We turn to a classical cavity model, originally formulated by Onsager,33 to calculate the field enhancement within the cavity.
The solute molecule is described as a polarizable point dipole in the center of a spherical cavity with volume V. The solvent surrounding the cavity is described by the PCM with relative permittivity ε. The local field acting on the solute molecule can be represented as a superposition of a cavity field and reaction field, the latter of which depends recursively on the polarizability of the solute molecule. This leads to an equation for the local field enhancement, η = FL/F, which can be written as23,24,33
![]() | (6) |
![]() | (7) |
![]() | (8) |
We solve eqn (5) and (6) self-consistently. In the first step of the self-consistent cycle, we start with an initial guess for α*
| α* ≈ αPCMηPCM | (9) |
, respectively. We can express the excess polarizability
in terms of the polarizability densities as![]() | (10) |
. Assuming that the solvent is water, we can calculate
by simulating a single explicit water molecule in an otherwise implicit background of water, following the self-consistent approach described above. The cavity volume of the solute molecule V and single explicit water molecule Vw can be calculated in GPAW using the Kirkwood–Buff method.47 We find that
from numerical calculations.
We note that
can also be calculated analytically using the polarizability density of water from the Lorentz–Lorenz equation,
![]() | (11) |
![]() | (12) |
, which simplifies to![]() | (13) |
using the permittivity of water at optical frequencies, which is the same as our numerical result from the self-consistent calculation.
The results for the excess polarizability of amino acids using the self-consistent numerical model in eqn (10) are shown in Fig. 3a. Generally, the excess polarizability of the amino acids tends to be smaller than the gasphase polarizability, except for tryptophan, where the polarizability enhancement due to solvent effects overcompensates the negative contributions of the displaced water (see Fig. S2 in ESI†).
![]() | (14) |
![]() | (15) |
Note, that the polarizabilities defined in eqn (14) and (15) are scaled by a factor of (4πε0)−1 to get volumetric units.
The polarizability can be derived from the viewpoint of the field as a scattering process at the outer boundary of a multi-layer cavity using the scattering Green function for a spherically layered system.48,49 This leads to an equation for the excess polarizability, given by21
![]() | (16) |
We can replace the hard sphere in the classical model with a real molecule as shown in Fig. 2b. In doing so, we substitute the polarizability of the molecule without local field corrections αPCM for αs in eqn (16) and show that
![]() | (17) |
in eqn (13). We can use Rc3 = 3V/4π to convert between the cavity radius and volume.
The numerical model in eqn (10) and the hybrid quantum-classical model in eqn (17) give very similar results for the excess polarizability, as shown in Fig. 3a. The benefit of using eqn (17) is that the excess polarizability is expressed in terms of χPCM, which is less computationally expensive to calculate than the self-consistent polarizability density χ* that appears in eqn (10). For the set of 20 amino acids considered here, it takes approximately 7 iterations of the self-consistent equation to converge the result for α*. Therefore, the calculation of χPCM is approximately 7 times faster than χ*, on average.
Returning to eqn (17), we can define the excess polarizability density as
. We can express eqn (17) as a quadratic polynomial in terms of χPCM by using a Taylor series expansion for ηPCM in the limit of small fRχPCM as ηPCM ≈ fC(1 + fRχPCM). This lets us express
in powers of χPCM as
![]() | (18) |
The quadratic equation for
agrees very well with the amino acids data, as shown in Fig. 4.
is generally smaller than χPCM in the range 0.00 < χPCM < 0.12, although the two quantities become comparable as χPCM increases due to the approximately quadratic scaling of the reaction field term. The excess polarizability density of water is zero since water is the background medium.
![]() | ||
| Fig. 4 Solid blue line: quadratic approximation for the excess polarizability density, see eqn (18). Dotted blue line: linear approximation for the excess polarizability density (no reaction field, fR = 0). The reference values for water are shown as orange dashed lines for comparison [χw = 0.0488 from the Clausius–Mossotti relation in eqn (11)]. | ||
, assuming that the binding site of the molecule can be well controlled in experiments. The polarizability roughly correlates with the molecular volume, as shown in Fig. 3b. Glycine (G) is the smallest molecule studied and has the lowest excess polarizability; tryptophan (W) is the largest molecule studied and has the highest excess polarizability. Assuming that the biosensing field has the same intensity at the position of both molecules, we would expect glycine to produce the smallest shift in the biosensor signal and tryptophan the largest. The biosensor would need to resolve a polarizability difference of approximately 20 Å3 to distinguish the two molecules with extreme polarizabilities. Other molecules, such as cysteine (C) and lysine (K+) would be more difficult to resolve based on
, despite the significant difference in cavity volume.
We use glutamic acid as an example to compare how other polarizability models consider the effect of charge. Glutamic acid plays an important role in neuroscience, the food industry (e.g. monosodium glutamate), and energy production in the body where glutamate is involved in various metabolic pathways. Glutamic acid has two carboxyl groups (pK1 = 2.13, pK2 = 4.31) and an amine group (pK3 = 9.67).50 The net charge of glutamic acid is q = −1 at physiological pH but can vary between q = +1 in a highly acidic environment (fully protonated) and q = −2 in a highly alkaline environment (fully deprotonated). The actual protonation state for a given pH can be evaluated using the Henderson–Hasselbalch equation51–53 as detailed in ESI†.
Optimised structures of solvated glutamic acid ions are used to compare different polarizability models in describing the effect of pH in Fig. 5. The direct DFT calculation through the hybrid quantum-classical model shows the expected behaviour of increasing polarizability with increasing charge. This effect cannot be captured with additive atomistic models, such as the Thole model,11 because there the molecular polarizability always scales with the number of atoms only. The AlphaML (B3LYP) machine learning algorithm also cannot capture this effect because it has been trained on a dataset of neutral molecules,54,55 where the effect of charge would not be evident. The polarizability difference between conjugate acid–base pairs in these atomistic (and machine learning) models is roughly equal to the polarizability of a hydrogen atom αH = 0.7 Å3, but no hydrogen atoms are removed from the amino acids in reality. The hybrid model thus finds a much larger polarizability difference between conjugate acid–base pairs (between 1.7–2.5 Å3) with the opposite charge dependence.
![]() | ||
| Fig. 5 Polarizability of glutamic acid as a function of pH. The hybrid quantum-classical calculation includes solvent effects and is sensitive to the electronic density of the molecule. The Thole model11 is an additive atomistic model and can only capture the effect of adding or removing a hydrogen atom between conjugate acid–base pairs. The machine learning algorithm15,16 (AlphaML) has not been trained on charged molecules and captures the same effect as the Thole model. | ||
Our results will help to decode the signals from single molecule biosensing experiments, where it is important to accurately quantify the excess polarizability of single molecules. The solute molecule can be identified through
by measuring the magnitude of the biosensor signals. The fractional anisotropy can also be used to identify single molecules by measuring the shift ratio between transverse electric (TE) and transverse magnetic (TM) modes.56–58 Some of the molecules, such as cysteine (C), asparagine (N) and valine (V), may be hard to distinguish based on
alone. However, the TE to TM shift ratio provides a second molecular fingerprint to help differentiate and categorise the biosensor signals. The shift ratio is expected to be larger for a molecule with high anisotropy and can be referenced against the fractional anisotropy data† to help identify the target molecules. The results presented here form a guide on the experimental sensitivity required to differentiate between different amino acids using WGM-based biosensors. The methods presented can also be applied to other molecules that are of physiological significance.
The electronic structure of the molecules is described with the GPAW DFT package.34,35,61 The electron density and Kohn–Sham states are described within the projector augmented wave (PAW) method, where the smooth parts are represented on real space grids. The grid spacing in the simulation is 0.2 Å for representing the smooth wavefunctions within the PAW description. The size of the simulation box ensures that each atom is at least 4 Å away from its boundary. We use the PBE exchange–correlation functional62 in the calculation of the DFT energy. The aqueous environment is described by the polarizable continuum model (PCM).32
The amino acid conformers are relaxed using the Broyden–Fletcher–Goldfarb–Shanno algorithm as implemented in the atomic simulation environment (ASE)63 until the maximal force on each atom is below 0.05 eV Å−1, corresponding to the structure of the molecule in the next local minimum of the energy landscape. Here, we use the static permittivity of water (εstat = 78.36)64 in the ground state DFT calculations. The resulting amino acid conformers are sorted according to their relative energies. We consider different protonation states separately, since the net charge can fluctuate depending on the pH of the buffer solution. We find that charge neutral amino acids prefer the zwitterionic form in an aqueous environment. We further consider different protonation states of arginine (R), lysine (K), histidine (H), glutamic acid (E) and aspartic acid (D), which exist predominantly in their ionised form at neutral pH. We use the lowest energy conformers in the polarizability calculations and note the minor variation due to different gasphase and solvated conformers, see Fig. S4 and S5 in ESI,† respectively.
The molecular polarizabilities are calculated from eqn (4) using the finite-field method with external field strength |F| = 0.01 V Å−1. This field strength is tested to be sufficiently small to avoid higher-order contributions and still large enough to avoid numerical difficulties.65 The calculation using finite fields does not have convergence issues in contrast to perturbation theory approaches (see ESI† for the comparison between linear response time-dependent DFT (TDDFT)66 and the finite field method). We apply the same approach to calculate the polarizability in an aqueous environment αPCM using eqn (5) and the local field strength, |FL| = η|F|. Here, we use the optical permittivity of water (εopt = 1.77)67 in the solvation model for the calculation of αPCM because the biosensor typically probes the molecules at optical to near-infrared frequencies, where the permittivity of water is equal to the squared refractive index.
Our description relies on the application of the PCM, which does not account for explicit water molecules that may interact with and stick to charged groups of the amino acids. Such situations can be modelled by including explicit water molecules within the solute region.68,69 However, including explicit water molecules raises the question of how to separate the polarizability of the amino acids from that of the attached water molecules. Wannier orbitals may be a solution,70,71 but this approach largely influences the calculated polarizabilities (see Table S3 in ESI†). Furthermore, we note that an implicit description of the solvent is consistent with the 3-layer cavity model that is used to describe the scattering of the fields (see Fig. 2), where the background medium is represented by its permittivity, ε. It is not immediately clear how this description has to be adjusted in the case of additional explicit water molecules. Answering these questions is beyond the scope of the presented work and has to be assigned to future investigations.
Footnote |
| † Electronic supplementary information (ESI) available: Tabulated polarizabilities of amino acids; comparison of linear response TDDFT polarizabilities versus finite field polarizabilities; relative energetic stability of amino acid conformers in gasphase and solvent; polarizabilities determined from Wannier orbitals. See DOI: https://doi.org/10.1039/d3nr05396b |
| This journal is © The Royal Society of Chemistry 2024 |