Open Access Article
Janko Čivić
*a,
Hideaki Mizuno
b and
Jeremy N. Harvey
*a
aDepartment of Chemistry, KU Leuven – Celestijnenlaan 200F, Box 2404, 3001 Leuven, Belgium. E-mail: janko.civic@kuleuven.be; jeremy.harvey@kuleuven.be
bDepartment of Chemistry, KU Leuven – Celestijnenlaan 200G, Box 2403, 3001 Leuven, Belgium
First published on 4th November 2025
Photoinduced decarboxylation of acidic amino acids near the chromophore is observed in various fluorescent protein variants, including avGFP, PA-GFP, DsRed, LSSmOrange, and PSLSSmKate. While a mechanism has been proposed and theoretically supported for GFPs, its generalization to other variants remains unclear. This study investigates the decarboxylation mechanism across different fluorescent proteins, with a particular focus on the red fluorescent proteins DsRed and PSLSSmKate, to establish a structure–function relationship. A benchmark study was conducted to assess various excited-state methods, basis sets, and embedding approaches. Excited-state QM/MM calculations using TD-DFT, ADC(2), and CC2 methods were employed to explore the excited states of multiple fluorescent protein variants. Our findings provide new insights into the decarboxylation mechanism in fluorescent proteins, contributing to the broader field of protein photophysics and potentially informing the design of novel fluorescent probes.
Nearly all FPs share a characteristic β-barrel fold that shields the chromophore from the environment. In avGFP, the chromophore forms from three residues (Ser, Tyr, Gly) and consists of a π-conjugated hydroxybenzylidene-imidazolinone system.3 Despite this conserved architecture, mutations in or near the chromophore can produce variants with diverse photophysical properties (Fig. 1).
Apart from fluorescence, many FPs undergo light-induced chemical transformations of the chromophore or surrounding residues.4 These transformations correspond to changes in physical properties, which may be undesirable when they cause loss of fluorescence, but can also be advantageous in certain contexts, for example in advanced imaging techniques such as super-resolution microscopy.5 Because these processes occur on ultrafast timescales and involve electronically excited states, they are challenging to study experimentally, and the mechanisms of many of these reactions remain incompletely understood.
One specific transformation is the photoinduced decarboxylation of acidic residues near the chromophore. This reaction has been reported in a range of FPs, including avGFP itself.6–12 For GFPs, irradiation at high energies leads to decarboxylation of Glu222, and both one-photon and multiphoton excitation routes have been demonstrated.6,13,14 The generally accepted mechanism (Fig. 2) involves as a key step excited-state electron transfer (ET) from deprotonated Glu222 to the neutral chromophore, generating a glutamate radical that subsequently loses CO2. Computational studies support this picture: Ding et al.15 showed that C–C bond cleavage can proceed spontaneously once Glu222 is oxidized, while Grigorenko et al.16 identified glutamate-to-chromophore charge transfer (CT) excited states at energies consistent with experiments.
![]() | ||
| Fig. 2 Overview of the proposed mechanism for photoinduced decarboxylation in GFPs.6 | ||
Whether this mechanism generalizes to other FPs remains an open question. In red FPs such as DsRed, photoinduced decarboxylation of Glu215 has been observed following pulsed laser excitation.8 However, because the DsRed chromophore is anionic, ET from a negatively charged glutamate would yield a dianionic intermediate, which should in principle be unfavorable. Alternative mechanisms have been proposed, for example involving an initial ET step from the chromophore to a nearby positively charged residue.17
Another informative variant for understanding photoinduced decarboxylation is LSSmKate1, which does not undergo this process but can be converted into a variant that does through just two point mutations (PSLSSmKate).9 Reported crystal structures suggest that the differences between these variants arise from rearrangements in the hydrogen-bond network around the chromophore. To our knowledge, no computational studies have yet been performed on these variants, but such calculations could provide valuable insights into the structure–function relationship underlying photoinduced decarboxylation.
In this work, we address these open questions by systematically comparing six FP variants: four that undergo photoinduced decarboxylation and two that do not (Fig. 1). Using QM/MM, we optimized their structures and computed vertical excitation energies with different electronic structure methods. We then analyzed the excited states to identify glutamate-to-chromophore CT states and assess whether their energetic accessibility can explain why some variants undergo photoinduced decarboxylation while others do not. A benchmark study was also performed to evaluate the sensitivity of these CT states to QM/MM methodological choices, including basis set size, embedding schemes, and QM region size. Together, our analysis provides new insights into the mechanism of photoinduced decarboxylation and suggests possible strategies for the rational design of photoactivatable FPs.
For residues with double occupancy, only the dominant conformation was selected. Crystallized water molecules within 8 Å of the chromophore were retained, while all other co-crystallized molecules were removed. Protonation of the structure was performed using the reduce program from AmberTools23.22 Protonation states of residues near the chromophore were manually checked and compared with literature reports. The protonated protein was solvated in an octahedral box, with a minimum distance of 12 Å between the protein and the box edge, in 0.15 M NaCl. Additional sodium or chloride ions were added to neutralize the system's charge. The TIP3P23 water model was used, along with the ff19SB24 force field for the protein residues. TIP3P was chosen for practical reasons, as the recommended OPC25 water model contains dummy atoms that complicate QM/MM calculations. Partial charges of the chromophore were developed using the restrained electrostatic potential (RESP) method.26 The chromophore structure was extracted from the PDB file and capped with methylamine (NME) and acetyl (ACE) groups. Its geometry was optimized using B3LYP/6-31G*, while the positions of the terminal carbon atoms in the capping groups were kept fixed. The electrostatic potential was calculated with HF/6-31G* in the gas phase, and the RESP fitting of partial charges was performed using the resp program from AmberTools23.22 These QM calculations were performed with Gaussian16.27 Bonded parameters were based on the amber general force field (GAFF2).28
The structure of the solvated protein was relaxed through a multi-step procedure (Table S1) using sander or pmemd.cuda from Amber22.29–32 The system was first minimized, then heated to 300 K, and equilibrated to a pressure of 1 bar. It was then cooled under constant volume to 50 K before final minimization. During all steps, the chromophore and residues within 3 Å of the chromophore were restrained with a force constant of 500 kcal mol−1 Å−2. Temperature control was achieved with a Langevin thermostat, while pressure was maintained with an isotropic Berendsen barostat. Periodic boundary conditions (PBCs) were applied, using the particle mesh Ewald method33 for electrostatic interactions with a 10 Å cutoff for short-range interactions. The SHAKE algorithm34 was employed to constrain bonds involving hydrogens during the heating, cooling, and pressure equilibration steps. Next, constraints on the chromophore region were removed, and the non-periodic system was minimized using QM/MM, with atoms beyond 30 Å from the chromophore constrained. Non-bonded interactions were calculated for all atom pairs, and all MM atoms were included for electrostatic embedding. The QM calculation employed the composite PBEh-3c method35 through the Amber22 QM/MM interface36 with Orca.37 The QM region included the chromophore, the side chain of the nearby glutamic acid residue prone to decarboxylation, and other interacting residues (Table S2). Geometry optimization proceeded until the root mean square of the Cartesian gradient elements was less than 0.01 kcal mol−1 Å−1.
Two different embedding potentials were tested: an electrostatic potential using Amber point charges from the system preparation, and a polarizable potential where the environment is represented by multipoles and polarizabilities developed using the PyFraME38 Python package. In polarizable embedding calculations, to reduce computational cost, only solvent water molecules within 3 Å of any protein atom were retained, while the remaining waters were removed. Grabarek and Andruniów demonstrated that including more distant water molecules has minimal impact on calculated excitation energies of GFP chromophores.39 The protein was divided into smaller fragments using the molecular fractionation with conjugate caps (MFCC) scheme.40,41 For all residues not included in the QM region, atom-centered multipoles up to the second order and anisotropic dipole–dipole polarizabilities were determined using the localized properties (LoProp) method,42 with the LoProp script43 for the Dalton program44,45 and B3LYP, employing an atomic natural orbital (ANO) recontraction of the 6-31+G* basis set. Charges of MM sites closer than 1.4 Å to the QM region were shifted to the nearest MM sites, while other embedding parameters on those sites were removed.
Excited-state QM/MM calculations were conducted using time-dependent density functional theory (TD-DFT) with the CAM-B3LYP46 and ωB97X-D47 functionals, as well as the CC248 and ADC(2)49 methods. The resolution of identity (RI) approximation was employed for all methods. Several basis sets of the def2-type50–52 were tested. Grimme D3(BJ) dispersion53,54 was included in TD-DFT calculations. Additionally, the frozen-core and restricted virtual space (RVS) approximation was tested for ADC(2) and CC2 by freezing virtual orbitals above 60 eV, a threshold previously suggested as suitable for FPs.55 These calculations were executed with Turbomole 7.8.56–59 The character of excited states was analyzed with the TheoDORE analysis package.60
To investigate the impact of thermal fluctuations on the vertical excitation energies in PA-GFP, ground-state QM/MM MD simulations were performed for 10 ps with a 1 fs timestep, starting from the final structure of the constant-pressure equilibration (step 4 in Table S1). The QM region included only the chromophore with carbonyl and amine capping groups, described using the PBEh-3c method.35 The SHAKE algorithm34 was used to constrain bonds involving hydrogen atoms in the MM region. PBCs were used with particle-mesh Ewald for computing long-range electrostatic interactions of the MM region with a 10 Å cutoff for short-range nonbonded interactions. The particle mesh Ewald method for treating long-range electrostatic QM–MM and QM–QM interactions in periodic systems is not supported for the selected combination of methods, so a 15 Å real-space cutoff was used for all nonbonded interactions involving the QM region. Excited-state CAM-B3LYP/def2-TZVP calculations were carried out using QM region 2 (Table S3) and electrostatic embedding, on 100 evenly spaced snapshots from the last 5 ps of simulation.
Selected approaches were also applied on QM/MM optimized fluorescent protein structures of LSSmOrange, EGFP, DsRed, LSSmKate1, and PSLSSmKate. The quantum regions included the chromophore, the Glu residue at the decarboxylation-prone position, and portions of other residues that form hydrogen bonds with the conjugated part of the chromophore or the Glu side chain (Table S3).
Given the large number of calculations performed, and to minimize subjectivity, it was necessary to define a quantitative and automated criterion for identifying Glu222–chromophore CT states. To this end, we used the TheoDORE analysis package60 to analyze the one-electron transition density matrix, focusing on predefined fragments of the system. This approach enables quantification of the CT character of each excited state. Fig. 4 shows an example hole/electron population plot generated with TheoDORE, which provides a quantitative representation of the information one could obtain from plotting the natural transition orbitals (NTOs) of the excited states. We classified a state as a Glu222–chromophore CT state if the Glu222 side chain contributes more than 40% to the hole population and the conjugated part of the chromophore contributes more than 80% to the electron population. These thresholds were determined empirically. In most cases we find three distinct Glu222–chromophore CT states, corresponding to ET from each of the three highest occupied molecular orbitals of the carboxylate group. However, we sometimes observed that the CT character was distributed over multiple states. This observation motivated the choice of a somewhat lower threshold (40%) for the hole population of Glu222. Nevertheless, in most cases, this contribution was significantly higher, and the conclusions of the subsequent analysis are not sensitive to the exact threshold chosen.
![]() | ||
| Fig. 4 Fragment hole/electron population plot generated with TheoDORE60 for a QM/MM excited-state calculation of PA-GFP (QM region 3 from Table S3 and polarizable embedding for the MM region) at the CC2/def2-TZVP level of theory. The most significant NTO pair is shown for the first excited state and the Glu222–chromophore CT states. | ||
Calculating higher excited states using correlated methods such as CC2 and ADC(2) with a triple-zeta basis set and a QM system of approximately 100 atoms is not possible without additional approximations. A common approach is the restricted virtual space (RVS) approximation, where a selected number of high-lying molecular orbitals are excluded from the excited-state energy calculations. We assessed its impact on PA-GFP using the smallest QM region and electrostatic embedding by systematically varying the energy cutoff for orbital exclusion (Fig. 5 and Tables S6, S7). For the first excited state, the effect is negligible for both methods, even when discarding more than half the orbitals, consistent with previous findings.55 In contrast, the Glu222–chromophore CT states are more sensitive. ADC(2) shows larger errors (up to −0.25 eV) than CC2 (up to −0.14 eV). CC2 also benefits from error cancellation, giving very small errors at a 60 eV cutoff. Importantly, these errors remain similar when the QM region is enlarged or when polarizable embedding is applied (Table S5). Overall, while the RVS approximation introduces non-negligible errors for CT states, it remains suitable for semi-quantitative purposes where deviations of a few tenths of an eV are acceptable, as in our case, where precise experimental data on the positioning of the CT states is lacking.
Next, we examined the effect of increasing the basis set from def2-TZVP to def2-TZVPD, which includes diffuse functions (Fig. S3). Due to the increased computational cost, this analysis was performed only at the TD-DFT level. The results show that the inclusion of diffuse functions leads to only a minor change of approximately ±0.05 eV in both the first excited and Glu222-chromophore CT state energies. This effect is consistent across different embedding types and QM region sizes. Given the substantial increase in computational cost, the use of diffuse functions is not beneficial in this context.
Another aspect we examined is the impact of modeling the protein environment using polarizable embedding compared to electrostatic embedding (Fig. 6 and Fig. S4). For S1, the impact of polarizable embedding relative to electrostatic embedding is somewhat unpredictable in terms of the sign of the shift (ranging between −0.1 and 0.2 eV), depending on the QM region (Fig. S4). For the CT states, a more uniform trend is observed, with polarizable embedding raising the energy irregardless of the computational method. For the smallest QM region the shift is very large, around 1.5 eV, but it reduces to approximately 0.1 eV for the largest QM region. For QM region 3 (104 atoms), the largest system we can use with correlated methods, the effect is between 0.2 and 0.3 eV. This is not negligible and suggests that the use of polarizable embedding is worthwhile, which we assume to be the more accurate approach. However, the shift is systematic, so even electrostatic embedding calculations are useful for qualitative insights or relative comparisons.
Additionally, we tested a larger solvent shell of 10 Å for polarizable embedding to assess the effect of the 3 Å cutoff. This increased the excitation energy of the lowest CT state by 0.12 eV (CAM-B3LYP/def2-TZVP level of theory with QM region 3), further increasing the difference relative to electrostatic embedding. This indicates that the 3 Å cutoff introduces a modest but non-negligible effect. However, as a larger solvent shell results in a considerable increase in computational cost, we restricted all subsequent calculations with polarizable embedding to the 3 Å cutoff. The sensitivity to the cutoff may deserve more attention in future studies.
Finally, we discuss the impact of the QM region choice. Definitions of the QM regions can be found in Table S3 and Fig. S2. The results with polarizable embedding indicate that the shifts in S1 induced by increasing the QM region are generally small, between −0.05 and 0.05 eV. In contrast, a very different trend is observed for the CT states. Expanding the QM region from 1 to 2 induces a significant shift of up to −0.7 eV (Fig. 7 and Fig. S5). Interestingly, further increases in the QM region have negligible effects (less than ±0.05 eV), despite the fact that the increase from QM region 2 to QM region 3 adds 34 atoms, and further to QM region 4 additional 75 atoms. This suggests that accurately modeling the CT states requires at least an accurate description of the environment around Glu222, while the chromophore environment is of lesser importance.
![]() | ||
| Fig. 7 Impact of increasing the QM region on the lowest Glu222-chromophore CT state in PA-GFP, calculated with def2-TZVP and polarizable embedding. QM regions are defined in Table S3 and Fig. S2. | ||
Now that the impact of all the different choices has been discussed, we present the results of some selected calculations in Table 1 in order to compare the results with experimental data. TD-DFT overestimates the first excited state energy by around 0.3 eV, consistent with previous reports.62 CC2 and ADC(2) perform better.
| Method | S1/ eV | CT / eV |
|---|---|---|
| CC2 | 3.31 | 4.69, 4.69, 5.39 |
| ADC(2) | 3.21 | 4.42, 4.51, 5.13 |
| CAM-B3LYP | 3.43 | 5.08, 5.13, 5.23, 5.65 |
| ωB97X-D | 3.44 | 5.30, 5.37, 5.83 |
| Experiment | 3.107 | ∼4.6514 |
Due to the limited data, it is hard to exactly position the experimental energy of the Glu222-chromophore CT state. However, some inference is possible. The most comprehensive experimental study of this process is by Langhojer et al.14 They studied the T203V mutant of GFP where, similarly to PA-GFP, the neutral chromophore form is dominant. They report efficient decarboxylation with sequential 400 nm (3.10 eV) followed by 800 nm (1.55 eV) excitation populating a higher excited state. The CT state could be populated either directly from S1 or from a higher locally excited chromophore state, but in either case, its energy should be around 4.65 eV. Both ADC(2) (Fig. S9) and CC2 (Fig. S10) predict an energy close to this, encouraging the use of these methods to study other variants. The NTO orbital pairs for CT states calculated with CC2 are shown in Fig. 4. TD-DFT overestimates the CT state energy, with the CAM-B3LYP functional (Fig. S7) yielding results closer to those of correlated methods than ωB97X-D (Fig. S8). Nevertheless, due to its low computational cost, TD-DFT can be useful for obtaining qualitative insights.
![]() | ||
| Fig. 8 Vertical excitation energies of the S1 and lowest Glu222-chromophore CT state during ground-state QM/MM MD simulations of PA-GFP. | ||
Having established a reliable computational protocol, we applied it to other systems where the decarboxylation mechanism remains less well understood. These calculations were performed on the CC2/def2-TZVP level of theory with polarizable embedding, an equivalent of the third QM region, and the RVS approximation with freezing all virtual orbitals above 60 eV.
After the QM/MM optimization, the chromophore environment remained close to the crystal structure (Fig. 9 and Fig. S13). The only notable difference was the displacement of a structural water molecule (residue 433 in the crystal structure) by 3.6 Å, allowing it to form a hydrogen bond with Glu215. This suggests that this water molecule may occupy multiple positions, or that additional water molecules near the chromophore were not resolved in the crystal structure.
![]() | ||
| Fig. 9 Comparison of QM/MM optimized (orange) and crystal (white, PDB: 4Q7R20) structure of LSSmOrange. The chromophore and nearby amino acid side chains are shown as sticks and nearby water molecules as spheres. Hydrogen atoms are omitted for clarity. | ||
Fragment decomposition of electron and hole populations (Fig. 10 and Fig. S14) from excited states calculated using CC2/def2-TZVP identified Glu215–chromophore CT states at 4.06, 4.49, and 4.91 eV. Experimental data to determine whether excitation occurs via one- or two-photon absorption is not available. However, assuming a two-photon process, as reported for PA-GFP with a 400 nm femtosecond laser,14 an energy of 6.2 eV is delivered to the system. Since the calculated CT state energies lie well below this value, a mechanism involving ET from Glu215 to the chromophore, analogous to that proposed for PA-GFP appears plausible. Interestingly, compared to PA-GFP, the energy of the lowest CT state is lowered by more than 0.5 eV, likely due to differences in the chromophore and/or the environment surrounding the Glu residue prone to decarboxylation.
N−C
O dihedral angle of 87°. In contrast, the crystal structure features two conformations with similar occupancies: one s-cis and one s-trans. A similar conformational change was previously observed in another computational study of DsRed.67 This discrepancy has been attributed to the refinement of the original crystal structure, while a more accurate QM-based refinement results in the gauche conformation.68 Additional structural differences include the shortening of the H-bond between the chromophore and Lys163 from 3.1 to 2.6 Å, and the lengthening of the H-bond between the phenolate oxygen of the chromophore and a water molecule from 2.7 to 3.1 Å.
![]() | ||
| Fig. 11 QM/MM optimized structure of DsRed. The QM region used in the excited-state calculations is shown as sticks. The remainder of the system was described with polarizable embedding. | ||
Habuchi et al.8 reported photoinduced decarboxylation of Glu215 in DsRed upon illumination with a nanosecond-pulsed 532 nm laser. This step was proposed to precede chromophore cis/trans isomerization by creating free volume around the chromophore. The decarboxylation was supported by mass spectrometry, which revealed a 44 Da mass loss consistent with CO2 release, and by FT-IR spectroscopy, which suggested that the decarboxylated residue is an anionic glutamate. Kinetic measurements further indicated that the process involves two consecutive one photon absorptions. Drobizhev et al.17 later showed that decarboxylation in DsRed2, a variant with similar optical properties to DsRed, requires absorption of three 790 nm photons (totaling 4.7 eV). Given the expected lower electron affinity of the anionic chromophore, they argued that the initial triggering step is not ET from Glu215 to the chromophore (as proposed for GFPs), but rather ET from the chromophore to a nearby positively charged residue, such as Lys70.
To evaluate this hypothesis, we calculated vertical excitation energies on the optimized DsRed structure (Fig. 11) using CC2/def2-TZVP. Fragment decomposition of electron and hole populations of all ten computed excited states is shown in Fig. S25. Notably, state 9 (4.91 eV) exhibits dominant CT character from Glu215 to the anionic chromophore (Fig. 12). The calculated excitation energy of 4.91 eV is close to the experimentally reported threshold for decarboxylation (∼4.7 eV) and is not much higher than the 4.69 eV we calculated for PA-GFP. This suggests that ET from Glu215 to the anionic chromophore is possible in DsRed, supporting a mechanism similar to that proposed for GFPs.
![]() | ||
| Fig. 12 NTOs of the Glu215-chromophore CT state in DsRed calculated with CC2/def2-TZVP. The rest of the system, not shown here, was described with polarizable embedding. | ||
Although this may seem unexpected given the chromophore's negative charge, the presence of several nearby positively charged residues (Fig. 11) likely provides significant stabilization of the dianionic state. Such dianionic states in red fluorescent proteins have previously been proposed as intermediates potentially involved in bleaching and phototoxicity,69 although it remained unclear whether other amino acid residues could serve as electron donors.
Among the ten calculated states, only state 10 appeared to exhibit ET from the chromophore to another residue (Fig. S25). However, closer inspection of the NTOs (Fig. S27) indicates that this state is likely an artifact caused by electron spill-out into the MM environment, a known issue in QM/MM calculations and previously reported for polarizable embedding calculations of FPs.70 We note that the previously discussed state 9 does not exhibit such artifacts. Due to the computational cost of CC2, we did not compute more than ten excited states. However, calculation of 20 excited states using CAM-B3LYP/def2-TZVP with the same QM region and polarizable embedding confirmed the presence of Glu215-chromophore CT states, but still did not reveal any CT states from the chromophore to the environment besides those attributable to electron spill-out (Fig. S28). To assess whether nearby aromatic residues (Trp93, Trp141, Phe177) could act as electron acceptors, we individually added their side chains to the QM region and repeated the excited-state calculations. No states involving ET to these residues were observed (Fig. S29–S31).
In summary, our results suggest that photoinduced decarboxylation in DsRed likely follows a similar mechanism as in GFPs, involving ET from Glu215 to the anionic chromophore. ET from the chromophore to surrounding residues appears unlikely. Our focus was specifically on the initial ET step, which is likely the trigger for decarboxylation. However, it would also be interesting to investigate the subsequent steps, as their mechanisms remain poorly understood, even in the case of GFPs.
As in the previous systems, the choice of the protonation state of the chromophore and of the Glu residue prone to decarboxylation is crucial. Based on the absorption around 450 nm, it is reasonable to assume that the chromophore exists predominantly in a neutral state in both variants. In LSSmKate1, Glu215 is likely deprotonated since, as based on the crystal structure, it forms an H-bond with the positively charged Arg195. After QM/MM geometry optimization, the structure surrounding the chromophore remained similar to the crystal structure (Fig. 13 and Fig. S33), with the most significant change being the displacement of a structural water (residue 285 in the crystal structure) by 4.39 Å to form a hydrogen bond with Glu215. A similar movement was also observed for LSSmOrange.
![]() | ||
| Fig. 13 Comparison of QM/MM optimized (pink) and crystal (white) structures of LSSmKate121 (left) and PSLSSmKate9 with neutral (middle) and deprotonated (right) Glu215. The chromophore and nearby amino acid residues are shown as sticks and nearby water molecules as spheres. Hydrogen atoms are omitted for clarity. | ||
In PSLSSmKate, the authors suggest that Glu215 is neutral because, in the crystal structure, one oxygen of its carboxylate group lies within H-bonding distance (3.12 Å) of the N2 nitrogen of the chromophore's imidazolinone ring.9 The relatively low resolution of the crystal structure (2.70 Å) may limit the accuracy of the reported interatomic distances, so we have prepared PSLSSmKate systems with Glu215 modeled in both neutral and deprotonated states. In both cases, after QM/MM geometry optimization, the structure around the chromophore remained generally similar to the crystal structure (Fig. 13 and Fig. S38), making it difficult to determine which Glu215 protonation state is more likely based on structure alone. In the neutral Glu215 system, the H-bond between Glu215 and the N2 nitrogen of the chromophore's imidazolinone ring increased to 3.38 Å with an angle of 146°, indicating a weaker H-bond than initially inferred from the crystal structure. Additionally, Glu215 was no longer connected to Arg197 via a structural water molecule. In the deprotonated Glu215 system, Glu215 remained connected to both Arg197 and Lys70 through a structural water molecule, with the most notable change being the displacement of a structural water (residue 327 in the crystal structure) by 4.23 Å to form an H-bond with Glu215, similar to the movements observed in LSSmKate1 and LSSmOrange.
The CC2/def2-TZVP excitation spectra of LSSmKate1 and the two PSLSSmKate systems were compared, focusing on the energies of the Glu215–chromophore CT states (Table 2 and Tables S34, S39, S42). In the case of PSLSSmKate with deprotonated Glu215, the Glu215–chromophore CT states are more than 1.1 eV lower in energy than in LSSmKate1, despite both systems having the same structure and protonation states of the chromophore and Glu215. The energy of the lowest CT state is 3.66 eV and its non-zero oscillator strength of 0.02 may suggest it is possible to populate it directly with prolonged 405 nm (3.06 eV) illumination. This indicates that the loss of direct contact between Arg195 and Glu215 in PSLSSmKate (Fig. 13) significantly lowers the CT state energy, suggesting that only differences in the immediate environment of Glu215 are sufficient to explain the distinct photoactivation properties of the two variants. For neutral Glu215, however, no CT states are found up to 5.17 eV, and in that case our results cannot account for the differences between the two variants. Assuming deprotonated Glu215 in PSLSSmKate reproduces the experimental 0.1 eV blue shift of the first excited state relative to LSSmKate1, whereas assuming neutral Glu215 instead predicts a 0.2 eV red shift. Taken together, these results point toward deprotonated Glu215 as the more likely protonation state, although a more complex mechanism involving proton-coupled ET with neutral Glu215 cannot be ruled out.
The CT state energies in LSSmKate1 are very similar to those calculated for DsRed, which has been reported to undergo photoinduced decarboxylation. This possible discrepancy may be explained by differences in the experimental laser setups: an LED diode array was used for LSSmKate1, whereas a nanosecond pulsed laser was employed for DsRed. If the process requires multiphoton absorption, it is possible that the LED source cannot efficiently induce two-photon absorption, while the pulsed laser can. Thus, reexamining LSSmKate1 with a high-intensity pulsed laser could potentially reveal some degree of photoactivation.
Comparison of six FPs allowed us to propose a structure–function relationship for photoinduced decarboxylation. Our results reinforce the previously suggested mechanism in which ET from deprotonated glutamic acid to the neutral chromophore is the key step in PA-GFP, and show that the same mechanism is also possible in DsRed despite its anionic chromophore. Differences between LSSmKate1 and PSLSSmKate further illustrate how the local environment of the glutamate residue significantly influences the energies of the CT states, providing a possible explanation for their distinct photoactivation properties. These findings suggest that in the rational design of photoactivatable FPs, the microenvironment of the glutamic acid residue is a promising tuning target.
Key open questions remain, particularly how the CT state is initially populated. Addressing this requires going beyond our model of calculating vertical excitation energies at the ground-state minimum, ideally by performing non-adiabatic excited-state QM/MM MD simulations. The steps following CO2 loss also remain poorly understood and warrant further study.
Altogether, this work advances both the computational modeling of FPs and the mechanistic understanding of photoinduced decarboxylation. It provides a possible strategy for a more rational and targeted design of photoactivatable proteins. Additionally, the six prepared FP systems could be very useful for future benchmarking studies. Finally, we believe that the use of TheoDORE60 for the rapid analysis of large numbers of excited states could be useful for exploring the mechanisms of other photoinduced reactions.
Additional files supporting the paper and enabling replication of the results are available through the KU Leuven Research Data Repository (https://doi.org/10.48804/OTLZEI); including classical molecular dynamics parameter files; initial and optimized system structures; trajectory of 100 snapshots from QM/MM MD simulations of PA-GFP; and QM region coordinates with their corresponding embedding potentials used in excited-state QM/MM calculations.
| This journal is © the Owner Societies 2025 |