Erik
Díaz-Cervantes
ab,
Juvencio
Robles
c,
Miquel
Solà
b and
Marcel
Swart
*bd
aDepartamento de Alimentos, Centro Interdisciplinario del Noreste, Universidad de Guanajuato, 37975 Tierra Blanca, Guanajuato, Mexico
bInstitut de Química Computacional i Catàlisi (IQCC) and Departament de Química, Universitat de Girona, 17003 Girona, Spain. E-mail: marcel.swart@udg.edu
cDepartamento de Farmacia, Universidad de Guanajuato, Noria Alta S/N, Gto. 36050, Guanajuato, Mexico
dICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
First published on 7th February 2023
Given the importance of serine proteases for biochemical processes, we have studied the peptide bond rupture mechanism using three sequential scale models as representations of the KLK5 enzyme (a protein overexpressed in ovarian cancer). The first model contains the basic functional groups of the residues that conform to the catalytic triad present in serine proteases; the second model contains some additional residues and, finally, the last representation includes all atoms of the KLK5 protein together with 10.000 explicit water molecules. This separation into three scale models allows us to separate the intrinsic reactivity of the catalytic triad from the process taking place in the enzyme. The methodologies employed in this work include full DFT calculations with a dielectric continuum in the first two models and a multi-level setup with a Quantum Mechanics/Molecular Mechanics (QM/MM) partition in the whole protein system. Our results show that the peptide-bond rupture mechanism is a stepwise process involving two proton transfer reactions. The rate-determining step is the second proton transfer from the imidazole group to the amidic nitrogen of the substrate. In addition, we find that the simplest model does not provide accurate results compared to the full protein system. This can be attributed to the electronic stabilization conferred by the residues around the reaction site. Interestingly, the energy profile obtained with the second scale model with additional residues shows the same trends as the full system and could therefore be considered an appropriate model system. It could be used for studying the peptide bond rupture mechanism in case full QM/MM calculations cannot be performed, or as a rapid tool for screening purposes.
Three principal residues constitute the active site of serine proteases (Ser, Asp, and His), the so-called catalytic triad.6–10 They form part of a network of hydrogen bonds within the protein, which originates at the Asp residue and ends with the Ser residue.11 The role that these residues and the H-bond network play in the reaction mechanism (one of the primary aims of biochemical studies12,13) was analyzed in experimental studies on peptide bond ruptures in serine proteases; they observed the formation of one particular pose of the triad (Fig. 1A), which plays a role in the proposed mechanism.14,15
One member of the protein serine family (trypsin) was studied by a combination of molecular dynamics (MD) and multi-level Quantum Mechanics/Molecular Mechanics (QM/MM) by Kato and co-workers.16 They described the possible peptide bond rupture mechanism and observed an intermediate in the reaction pathway, which is stabilized by an oxyanion hole. Note that in this study the effect of additional residues and the vibrational frequencies to confirm the reaction path were neglected; for this reason, it is necessary to propose a sequential scale model study, which focuses on the importance of these additional residues. In particular, determining whether it is sufficient to include only these or, alternatively, the full protein environment is necessary.
Here, we are proposing three sequential scale models for the study of the peptide bond rupture mechanism in serine proteases, based on the kallikrein-related peptidase 5 (KLK5), a protein overexpressed in ovarian cancer.17,18 It plays a key role in stratum conium desquamation,19,20 and has been related to the carcinogenesis process, and possesses the ability to act as a novel cancer biomarker.21,22 The goal of this work is three-fold: first, we aim to study the reaction mechanism of the peptide bond rupture taking place in the active site of KLK5, including confirmation of the stationary points through vibrational frequency analysis. Second, we compare the reaction paths obtained with three representations of KLK5, going from a small set of active site residues, through an extended representation of the active site, and finally a real model based on the full protein. This comparison will highlight the role of the protein environment in the peptide bond rupture mechanism. Third, through an energy decomposition analysis we investigate the chemical origins of the barriers involved.
To locate the active sites in the protein, specifically the catalytic triad, we performed an in silico molecular coupling, using the Molegro Virtual Docker suite (MVD)24 with the MolDock25 score function.
The reaction mechanism of the peptide bond rupture is described using three different scale models as representations of KLK5 for the hydrolysis of the substrate (Fig. 2). The first model, m1, consists of 47 atoms (including the substrate, see Fig. 3) and contains the minimum number of components of the catalytic triad: the nucleophilic oxygen (acting as the oxygen of Ser), the cis-urocanic acid26 as the electron donor system (simulating the behavior of Asp and His) and two amine groups as an oxyanion hole. This model includes a dielectric continuum solvent model for the simulation of the enzyme environment (methanol was chosen as the solvent, to represent the hydrophilic active site cavity). The same dielectric continuum solvent model was used in the second representation of KLK5 with 78 atoms (see Fig. 4). This second scale model, m2, has more realistic residues for the catalytic triad and, additionally, contains some residues of KLK5 (His57, Asp102, Gly193, Asp194, and Ser195) that were proposed as relevant for the reaction mechanism. Finally, we use a “real” system based on the atoms extracted from the PDB file mentioned above (m3 model, Fig. 5). This model contains 13968 atoms, and is divided in two regions: the QM region (treated with the S12g/TZ2P27 method), consisting of 127 atoms and with the principal residues from the catalytic triad (His57, Asp102, Gly193, Asp194, and Ser195) plus the binding residues that are involved in appropriately positioning the substrate in the active site (Gln192, Ser214, Trp215, and Gly216) and the MM region (treated with the AMBER force field28), which consists of the rest of residues together with 10.000 molecules of water.
Fig. 3 Representation of the m1 model; grey spheres depict carbon atoms, white spheres represent hydrogen atoms, red spheres represent oxygen atoms and blue spheres represent nitrogen atoms. |
Fig. 4 Representation of the m2 model. Grey spheres depict carbon atoms, white spheres represent hydrogen atoms, red spheres represent oxygen atoms, and blue spheres represent nitrogen atoms. |
All geometry optimizations of stationary points (reactants, transition states, intermediates, and products) were performed with DFT at the S12g/TZ2P27 level, using the QUILD29 program (models m1 and m2) and the NEWQMMM code (model m3) from the Amsterdam Density Functional (ADF)30 package. In the NEWQMMM calculations, the cut-off between QM and MM was chosen at sensible positions, thereby avoiding the rupture of complex bonds (aromatic, double bonds, etc.). Solvent effects were included with the COSMO31 approach for the first two models considering methanol as the solvent. In contrast, explicit water molecules were included in the m3 model. Thermal and entropy effects were not considered to enable a fair comparison with the m3 model.
Moreover, to analyze in detail the interactions between the protein and the substrate along the reaction path, we carried out an energy decomposition analysis of all stationary points, considering the factorization of the whole interaction energy (ΔEint) as follows:
ΔE = ΔEprep + ΔEint | (1) |
ΔEint = ΔEelstat + ΔEoi + ΔEPauli + ΔEDisp | (2) |
Fig. 6 Proposed substrate into the active site of KLK5. The blue lines depict a hydrophobic surface and the red lines represent a hydrophilic surface. |
Fig. 7 depicts the reaction paths obtained for each proposed model. In all cases, we found two transition states (TSs) and an intermediate, which is stabilized by the oxyanion hole. The first proton transfer, from the nucleophilic oxygen to the nitrogen of imidazole, has an activation barrier of 28.0 kcal mol−1 for the m1 model; this value is reduced to 15.5 kcal mol−1 for m2 and 5.0 kcal mol−1 for m3, showing the stabilization conferred by the surrounded functional groups in the m2 model and the total residues in the m3 system, which are not present in m1.
Concerning the intermediate, we observe similar behavior to that of TS1, with stabilization due to the surrounding functional groups acting on the oxyanion hole, as discussed widely by Warshel and co-workers.36–38 They concluded that the oxyanion hole plays a key role in charge stabilization, distributing the electronic density in the functional groups around the reaction site.34–36 For this stationary point, we observed energy differences of 6.9 kcal mol−1 between m3 and m2, and a higher difference, by a factor of almost three, between the energies of m3 and m1.
On the other hand, the TS2 corresponding to the second proton transfer has a much lower energy (12.6 kcal mol−1) than TS1 in the m1 model. In contrast, models m2 and m3 yield higher energy for TS2 than TS1. As a whole, the rate-determining step according to the m1 model is the first proton transfer with an overall energy barrier of 28.0 kcal mol−1. In contrast, for m2 and m3, the rate-determining step is the second proton transfer with energy barriers of 21.7 (m2) and 8.5 (m3) kcal mol−1. From a qualitative point of view, the m2 model provides results closer to the “real model” m3. The proposed oxyanion hole for m2 can simulate the behavior of the protein electronic environment in the proton transfer between the nitrogen of the imidazole and the nitrogen of the substrate better than the functional groups of the m1 model.
Finally, the relative energy of the products obtained through the three models is very similar, varying only by 2.9 kcal mol−1 between m1 and m3, and 2.0 kcal mol−1 between m1 and m2, being in all the cases exothermic reactions. Although the m2 and m3 models differ by about 10 kcal mol−1 in the energy barriers, the qualitative trends given by these two models are the same, whereas those of the m1 model clearly disagree when compared to m3.
For the above reasons, the m2 model has been selected to perform a more detailed study into the electronic density behavior of the highest occupied molecular orbital (HOMO) along the reaction path (see Fig. 8). The MO plots show that the HOMO electronic density in the reactants is localized on the electron donor system site. When the first proton transfer occurs in the TS1, the electronic density goes to the substrate site and the nucleophilic oxygen. In the intermediate, the HOMO is located in part at the substrate but mainly in the oxyanion hole site. As expected, the HOMO follows the negatively charged fragment generated during the proton transfer process. In the second proton transfer process, crossing TS2, the HOMO electronic density begins to migrate to the electron donor system, but mainly remains at the substrate site. Finally, when the peptide bond rupture is done, in the products, the lobes of the HOMO move to the electron donor system, as depicted in Fig. 8.
Table 1 shows the energy decomposition analysis (EDA) across the reaction path, computed through eqn (1) and (2), respectively, and considering as the baseline the reactant energies. Note that the fragments were defined in each path step considering the main involved functional groups. The preparation energy (ΔΔEprep) plays a key role in the rate-determining step selection. It contributes mostly to the higher energy of TS2 compared to TS1, leading to this second step being rate-determining (see Table 1); the solvation energy contributes most to this. On the other hand, the orbital interactions are essential to stabilize stationary points, being higher in the case of the transition states and lower in the reactants and products. The ΔΔEPauli, stabilizes the intermediate. In other words, if the Pauli repulsion decreases in a stationary point, a stable state is promoted. Finally, the classical electrostatic interaction ΔΔEelstat is lower in the transition states and higher at the minima (reactants, intermediate and products).
ΔΔEPauli | ΔΔEelstat | ΔΔEoi | ΔΔEdisp | ΔΔEint | ΔΔEprep | ΔΔE | |
---|---|---|---|---|---|---|---|
a The ΔE is the dissociation energy, which is split into the preparation energy ΔEprep and the interaction energy ΔEint; the latter in turn is split into the classical electrostatic interaction ΔEelstat, the orbital interaction energy ΔEoi, the dispersion energy ΔEdisp, and the Pauli repulsion ΔEPauli. | |||||||
React | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
TS1 | −82.92 | −28.29 | 117.89 | 0.10 | 6.78 | 9.03 | 15.81 |
Inter | −160.92 | 4.81 | 158.13 | 0.72 | 2.74 | 8.08 | 10.82 |
TS2 | −265.75 | 1.99 | 263.04 | −0.33 | −1.05 | 22.72 | 21.67 |
Product | −213.31 | 65.83 | 131.17 | 5.60 | −10.71 | 4.16 | −6.55 |
The rupture of peptide bonds can be modelled with a minimal set of residues, obtaining qualitative results of the reaction path. Moreover, in the small models, transition states can be characterized through the frequencies and finding the imaginary frequency that promotes the formation and rupture of peptide bonds. Due to the computational requirements, calculation of the frequencies of the full system, which consists of thousands of atoms, is prohibitive. Therefore, we suggest characterizing the transition states with smaller models.
However, the rate-determining step of the reaction cannot be determined by using small model systems, such as in the case with the smallest model proposed in this work, m1. Not surprisingly, the functional groups around the reaction site are essential in a quantitative study of the reaction mechanism of an enzyme. At the same time, we find that the energy needed to break the peptide bond presents a small variation of ca. 2 kcal mol−1 between the m1 and m2 models, and the difference with the last m3 model is only 1 kcal mol−1.
Moreover, through an energy decomposition analysis study we observed that the component which contributes most to the rate-determining step is the preparation energy ΔEprep (and within that, the solvation energy). The interaction energy between the fragments is overall almost zero due to the opposite effects of Pauli repulsion, ΔEPauli, which has a stabilizing effect along the reaction path, and orbital interactions, ΔEoi, which has a destabilizing effect on it.
To conclude, we show that with medium-sized models like m2 it is possible to provide qualitatively meaningful results, albeit with an overestimation of the barriers. Model m2 can be therefore be used for rapid screening studies on the peptide bond rupture in this kind of protein that is overexpressed in ovarian cancer and used in some environmental applications, which demonstrates the importance of proposing a minimally sufficient model to simulate a reaction mechanism in biological systems.
This journal is © the Owner Societies 2023 |