Carlos A.
Ramos-Guzmán
,
J. Javier
Ruiz-Pernía
* and
Iñaki
Tuñón
*
Departamento de Química Física, Universidad de Valencia, 46100 Burjassot, Spain. E-mail: ignacio.tunon@uv.es; j.javier.ruiz@uv.es
First published on 29th January 2021
The irreversible inhibition of the main protease of SARS-CoV-2 by a Michael acceptor known as N3 has been investigated using multiscale methods. The noncovalent enzyme–inhibitor complex was simulated using classical molecular dynamics techniques and the pose of the inhibitor in the active site was compared to that of the natural substrate, a peptide containing the Gln–Ser scissile bond. The formation of the covalent enzyme–inhibitor complex was then simulated using hybrid QM/MM free energy methods. After binding, the reaction mechanism was found to be composed of two steps: (i) the activation of the catalytic dyad (Cys145 and His41) to form an ion pair and (ii) a Michael addition where the attack of the Sγ atom of Cys145 to the Cβ atom of the inhibitor precedes the water-mediated proton transfer from His41 to the Cα atom. The microscopic description of protease inhibition by N3 obtained from our simulations is strongly supported by the excellent agreement between the estimated activation free energy and the value derived from kinetic experiments. Comparison with the acylation reaction of a peptide substrate suggests that N3-based inhibitors could be improved by adding chemical modifications that could facilitate the formation of the catalytic dyad ion pair.
Several lead compounds have already been demonstrated to be effective at inhibiting the activity of SARS-CoV-2 3CLpro, including Michael acceptors,4 α-ketoamides,5 carbamoyl derivatives6 and aldehydes.7 N3 is a Michael acceptor, an α,β-unsaturated carbonyl compound, that was designed as an inhibitor of the 3CLpro of several coronaviruses, including SARS-CoV and MERS-CoV8 and that has been demonstrated to have inhibitory activity against the ortholog enzyme of SARS-CoV-2.4 This compound has a chemical structure similar to that of a peptide, the natural substrate of the enzyme (see Fig. 1). However, the microscopic details of 3CLpro inhibition by N3 are still unclear.
Fig. 1 Chemical structure of a peptide substrate of SARS-CoV-2 3CLpro (top) and of the N3 inhibitor (bottom). The scissile bond of the peptide is placed between Gln-P1 and Ser-P1′. |
Kinetic experiments showed that N3 is a potent time-dependent irreversible inhibitor of SARS-CoV-2 3CLpro that follows the next kinetic scheme:4
(1) |
In a first stage, the inhibitor reversibly binds into the active site of the enzyme forming a noncovalent complex (EI) with a dissociation constant (KI = k2/k1). Afterwards, the inhibitor irreversibly reacts with the enzyme, with a rate constant k3, to give a stable acylenzyme (E–I). This acylenzyme is characterized by the formation of a covalent bond between the Sγ atom of Cys145 and the Cβ atom of the inhibitor, as observed in the X-ray structure of the inhibited enzyme.4
N3, or any of the other 3CLpro inhibitors characterized until now, can be used as a starting point for the development of an efficient drug for the treatment of COVID-19. One of the steps in this development is the optimization of the thermodynamic (binding) and kinetic properties of the inhibitor. This improvement should be based in the microscopic knowledge of the inhibition process, which in part relies on the details provided by simulations of the enzyme and the complex formed with the inhibitor. The analysis of the reaction step requires of the use of QM/MM potentials, which are adequate to describe bond forming and breaking processes. QM/MM techniques have been already employed to study, at microscopic level, the SARS-CoV-2 3CLpro hydrolysis mechanism with a natural peptide9 and a modified peptide having a fluorescent tag as leaving group10 as substrates. We here use these methods to investigate the inhibition process of this enzyme by N3. The atomistic details provide here could be applied to improve the design of future drugs based in this compound.
Fig. 2 Molecular dynamics simulation of the noncovalent EI complex between N3 and SARS-CoV-2 3CLpro. (a) N3 in the active site of the enzyme, showing the location of the catalytic dyad. (b) Probability densities of the distances from the Cys145-Sγ atom to the Cβ carbon atom of the substrate, in red, and to the Nε atom of His41, in blue. (c) Fraction of hydrogen bond contacts between residues of N3 and a peptide substrate9 and those of the protease. A hydrogen bond contact is counted when the donor–acceptor distance is <3.8 Å and the hydrogen bond angle is >120°. |
Fig. 2c displays the fraction of hydrogen bond interactions established between protein residues and the different groups of the inhibitor and the peptide substrate. 3CLpro presents an absolute requirement for Gln at P1 position.21 As seen in Fig. 2c the P1 residue of the peptide substrate is the one establishing more hydrogen bond interactions with the enzyme.9 In particular, main chain atoms of Gln-P1 form hydrogen bonds with residues Gly143, Ser144 and His164, while the side chain is accommodated through hydrogen bond contacts with Phe140, Leu141, His163 and Glu166. In N3, the Gln residue is substituted by a γ-lactam ring at P1 position, which essentially reproduces the same hydrogen bond interactions. The interaction pattern of the P2–P5 groups is also quite similar in the inhibitor and the peptide substrate, which could explain the affinity between the enzyme and the inhibitor. Important differences appear at the P1′ site, where the serine residue of the peptide substrate is substituted in the inhibitor by a benzyl ester group. While the main chain O atom of Ser-P1′ forms hydrogen bonds with the amide group of Gly143 and the side chain of Asn142, the hydroxyl group of its side chain can contact the catalytic dyad (Cys145 and His41). In the inhibitor, the carbonyl O atom of the P1′ group can form a hydrogen bond contact with the main chain NH of Gly143. The terminal benzyl group can establish a CH⋯π interaction with the methyl group of Asn142 side chain, while the other side of the ring remains solvent exposed. Because of its mobility, this benzyl ester group can also establish interactions with threonine residues placed at positions 24–26 and nearby residues. It is also noticeable to remark that the strong interactions established by Gly-P2′ of the peptide substrate with Thr25 and Thr26 are significantly weakened or absent in the inhibitor, opening a way to improve the binding affinity between the protease and N3.
Fig. 3 Proton transfer from Cys145 to His41 in SARS-CoV-2 3CLpro. (a) Free energy profile for the transformation from the neutral catalytic dyad (left) to the ion pair (IP, right) in the EI complex with N3 (red line); in the complex with the peptide substrate9 (blue line) and in the apo enzyme9 (green line). (b) Snapshot of the IP configuration in the EI complex showing a water molecule placed between His41 and the inhibitor. (c) Snapshot of the IP configuration in the apo enzyme. |
Because of the comparatively larger free energy cost of forming the IP from the noncovalent EI complex with the N3 inhibitor found in our study, we investigated first the possibility of a reaction mechanism for the formation of the covalent E–I complex that does not involve a proton transfer from Cys145 to His41. In such a mechanism the sulfhydryl proton is directly transferred to the Cα atom of N3 while the Sγ atom attacks on the Cβ one (see Fig. S3†). However, the activation free energy found for this mechanism (about 50 kcal mol−1) is too high and incompatible with the observed inhibition rates (see below).
We thus explored a reaction mechanism for the formation of the E–I acylenzyme from the IP form. This mechanism implies the proton transfer from the Nε atom of His41 to the Cα atom of the inhibitor, mediated by the water molecule placed in between, and the nucleophilic attach of the Sγ atom of Cys145 to the Cβ atom of N3 (see Fig. 4a). The results obtained for the MFEP corresponding to this mechanism at the B3LYPD3/6-31+G*/MM level are shown in Fig. 4b and c. According to the free energy profile the reaction proceeds via two Transition States (TS1 and TS2) separated by a shallow intermediate (see Fig. 4b). TS1 is the rate-limiting one with a free energy 10.6 kcal mol−1 higher than the IP, while the free energy difference corresponding to TS2 is 9.4 kcal mol−1. The evolution of the CVs used to define the multidimensional free energy surface (Fig. 4c) shows that TS1 is associated to the nucleophilic attack of the Sγ atom to the Cβ atom and the change of the bond between Cα and Cβ atoms from double to single. The Sγ–Cβ distance at TS1 has been reduced from 3.3 to 2.33 Å, while the Cβ–Cα distance has been slightly increased from 1.34 to 1.41 Å (see Fig. 4d). TS2 corresponds to the proton transfer from His41 to the neighbor water molecule and from this to the Cα atom, being the first proton transfer more advanced than the second one (see Fig. 4e). At TS2 the Sγ–Cβ bond is significantly shorter (1.91 Å) while the Cβ–Cα distance has been elongated up to a value close to that of a single bond (1.50 Å). Note that in this mechanism the sequence of nucleophilic attack and proton transfer is just the reverse of that observed for the acylation mechanism of the peptide substrate, where the proton transfer to the N atom of the scissile bond precedes the nucleophilic attack on the carbonyl carbon atom.9 Finally, the reaction product, where a proton has been transferred to the Cα atom, is shown in Fig. 4f. The Sγ–Cβ bond distance found at the E–I complex (1.85 Å) is close to the value found in the X-ray structure of the inhibited enzyme (1.77 Å).4 The overlap between the QM/MM and X-ray structures is shown in Fig. S4.†
In order to check the robustness of our mechanistic proposal, the string calculation was repeated using the M06-2X functional with the same basis set and D3 corrections. The resulting free energy profile was almost identical to the B3LYP one, both from the energetic and structural points of view: the geometries and energies of the transition states were very similar at both theoretical levels, as can be seen in Fig. S5.† This result confirms the adequacy of the B3LYP functional for the present Michael addition, in spite of the reported limitations of this functional to correctly describe some enolate or carbanion intermediates.23,24 Note that these species are not strictly found in the proposed mechanism because of the proton transfer from His41 to the substrate. It must be also stressed that at both theoretical levels, the string converges to a mechanism evolving from the IP to the covalent product, confirming that, in agreement with our previous work,10 the IP is a metastable species from which the most favorable mechanism may proceed.
A complete representation of the free energy path from the noncovalent complex (EI) to the covalent one (E–I) is provided in Fig. 5. According to this free energy profile, resulting from the combination of those presented in Fig. 3 and 4, the transformation from the noncovalent EI complex to the covalent one (E–I) is an exothermic process. The free energy difference between E–I and IP (Fig. 4b) is −25.7 kcal mol−1, while the free energy difference between IP and EI is 10.7 kcal mol−1 (Fig. 3a). Combining these two values, our simulations predict that the covalently bonded E–I complex is −15.0 kcal mol−1 more stable than the noncovalent EI complex, which agrees with the observed irreversibility of the inhibition process of 3CLpro by N3.4
Fig. 5 Free energy profile for the whole transformation of the noncovalent complex (EI) into the covalent one (E–I) through formation of the IP. |
Regarding the inactivation rate (k3 in eqn (1)), our simulations predict that the associated activation free energy results also from the sum of two contributions: the free energy cost of creating the IP form from EI (10.7 kcal mol−1, Fig. 3a) plus the activation free energy of TS1 relative to IP (10.6 kcal mol−1, Fig. 4b). This gives in a total activation free energy of 21.3 kcal mol−1, which according to Transition State Theory (see eqn (S1) in SI†) corresponds to a rate constant of 1.9 × 10−3 s−1 at 300 K. Unfortunately, only the second-order rate constant (k3/KI) and not the inactivation rate constant of SARS-CoV-2 3CLpro by N3 has been estimated.4 However, the inactivation rate constant by N3 (k3) was determined for the highly similar ortholog protease of SARS-CoV.8 In this case the activation free energy derived from the rate constant measured at 303 K (3.1 × 10−3 s−1) is 21.2 kcal mol−1. Comparison between SARS-CoV and SARS-CoV-2 main proteases seems appropriate considering that they present identical active sites, the same substrate specificity and very similar reaction rate constants for the hydrolysis of peptides.25 The order of magnitude predicted for the k3 rate constant seems also correct when compared to the values determined for the main proteases of other coronaviruses.8 Even if the rate constant for SARS-CoV-2 inhibition would be one order of magnitude faster, which could account for the rapid inhibition reported experimentally,4 the resulting activation free energy for the SARS-CoV-2 enzyme would only be 1.4 kcal mol−1 smaller than the reported value for the SARS-CoV one. Lastly, our prediction for the rate constant is compatible with the experimental estimation of k3/KI for the SARS-CoV-2 enzyme (11300 M−1 s−1).4 Combination of our predicted k3 with the reported estimation for k3/KI gives a KI of ∼0.2 μM, a value similar to those determined for the inhibition of other coronaviruses's proteases with N3.8 The excellent agreement between the experimental values and our theoretical estimation strongly supports our mechanistic proposal for the inhibition of SARS-CoV-2 3CLpro by a Michael acceptor.
The QM/MM study of Moliner and coworkers found an activation free energy of 11.2 kcal mol−1 and a reaction free energy of −17.9 kcal mol−1.20 While the latter value is close to our findings (−15.0 kcal mol−1), the former departs significantly from our estimation (21.3 kcal mol−1). Their activation free energy provides a rate constant of ∼104 s−1, significantly larger than the aforementioned value measured for the homologous SARS-CoV protease. In their mechanistic proposal the proton transfer from His41 to the inhibitor is direct and not water-mediated. However, this mechanistic difference does not explain the gap between their and ours calculated activation free energies. In their simulations the rate-limiting TS corresponds, as in our case, to the Sγ–Cβ bond formation and the free energy difference with the IP is 9.9 kcal mol−1, very close to our value of 10.6 kcal mol−1 (see Fig. 4b). The main difference between our results and those reported by Moliner and coworkers is found in the first part of the process, the free energy cost of forming the IP from the EI complex, 10.7 and 1.3 kcal mol−1, respectively. Differences between the two works may arise from the different QM levels of theory, MM forcefields or to the sampling of different enzymatic configurations (their exploration of the mechanism started from the E–I complex while we started from the noncovalent EI complex); factors that could affect the relative stability of the neutral and ionic forms of the catalytic dyad. According to our previous discussion, we think that our simulations provides a general picture (see Fig. 5) in better agreement with current experimental results.
Roughly speaking, in our simulations the two steps presented in Fig. 3 and 4, IP formation and Michael addition, contribute similarly to the activation free energy of the inhibition process (about 10 kcal mol−1 each of them). This suggests that the kinetic properties of inhibitors can be improved also by stabilizing the ligand-bound ion pair state. It has been already suggested for other related 3CL proteases (from MERS and SARS-CoV) that stabilization of a charged catalytic dyad could promote catalysis.26 For the SARS-CoV-2 protease it has been shown that inhibitors can shift the protonation state of some residues, not only the catalytic dyad but also other residues found in the vicinity of the active site.27 In principle, a possible strategy is the introduction of chemical groups in the inhibitor structure that imitate the role played by Ser-P1′ in the natural substrate. The hydroxyl group of this residue can make contacts with the catalytic dyad that, together with the presence of solvent molecules, can contribute to stabilize the IP.9 Interestingly, the position of the water molecule in the rate limiting TS structure found in this work (TS1, see Fig. 4d) could be useful to assist in the design of inhibitors that favor this stabilization process. In this sense, a recently reported potent inhibitor of SARS-CoV-2 3CLpro (PF-00835231)28 presents a hydroxyl group that matches the position of the water molecule in TS1 (see Fig. S6† for an overlap of the X-ray structure of the enzyme inhibited by PF-00835231and TS1). This observation illustrates the insights offered by mechanistic studies for the design of new inhibitors.
Molecular dynamics simulations of the noncovalent EI complex show that the inhibitor mimics the interactions established by the P1–P5 residues of the natural substrate. Our analysis also shows that an interesting strategy to improve a potential inhibitor based in N3 could be the introduction of chemical changes in the benzyl ester group in such a way that could restore the interactions that the P2′ group of the peptide substrate establishes with Thr25, Thr26 and Gly143. This change could increase the affinity between the inhibitor and the protein, reducing the dissociation constant KI.
Regarding the formation process of the covalent E–I complex, our simulations show that the inhibition mechanism of SARS-CoV-2 3CLpro by a Michael acceptor involves two steps after binding the inhibitor: (i) the activation of the catalytic dyad by means of the formation of an ion pair and (ii) a Michael addition process where Cys145 attacks to the Cβ atom of the Michael acceptor and a proton is transferred, water mediated, from His41 to the Cα atom of the inhibitor. The contribution of each of the two steps to the activation free energy of the inhibition process is roughly the same (about 10 kcal mol−1) and thus inhibition kinetics can be favored by reducing either of the two contributions. The free energy cost to form the IP is substantially smaller in the enzymatic complex with the peptide substrate than when N3 is present in the active site (about 5.9 kcal mol−1, according to Fig. 3a) and the activation free energy for the acylation of a peptide substrate is also significantly smaller than for the N3 inhibitor (by about the same quantity reflected in Fig. 3a).9 This clearly suggests that, in order to improve the kinetic behavior of newly designed inhibitors (increasing k3), attention must be paid to the formation of the IP. In this sense, the structures found along our reaction path, an in particular the rate-limiting transition state, could be useful to guide that design.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc04978f |
This journal is © The Royal Society of Chemistry 2021 |