Walter
Alvarado†
a,
Gustavo R.
Perez-Lemus†
b,
Cintia A.
Menéndez†
b,
Fabian
Byléhn†
b and
Juan J.
de Pablo
*b
aBiophysical Sciences, University of Chicago, 929 E. 57th Street, Chicago, IL 60637, USA. E-mail: walt@uchicago.edu
bPritzker School of Molecular Engineering, University of Chicago, 5640 S. Ellis Avenue, Chicago, IL 60637, USA
First published on 15th September 2021
The interactions between the receptor binding domain (RBD) of SARS-CoV-2 and the angiotensin-converting enzyme 2 (ACE2) are crucial for viral entry and subsequent replication. Given the large and featureless contact surfaces between both proteins, finding a suitable small-molecule drug that can bind and disrupt regulatory pathways has remained a challenge. A promising therapeutic alternative has been the use of small compounds that bind at the protein–protein interface or at distal “hotspots” and induce conformational changes that inhibit or stabilize protein–protein interactions (PPIs). In this work, we conduct large-scale all-atom explicit solvent simulations of the top scoring compounds from a recent large-scale high-throughput docking screening to investigate their interaction with the RBD domain of the spike (S) protein in complex with ACE2. We identify several promising candidates that exhibit a large negative free energy of binding to the RBD/ACE2 complex based on ab initio thermodynamic integration calculations. A systematic structural analysis of two glycosylation profiles of the RBD/ACE2 complex reveals the important role glycans play in modulating the binding of small-molecules. Cross correlation, fluctuation and strain analysis identify several of these compounds as effective PPI modulators that inhibit or stabilize the protein–protein interactions of RBD/ACE2 based on the glycosylation profile. Among them, luteolin, a drug currently approved for asthma, exhibits an intense allosteric effect when it binds to a previously unidentified distal binding site of the RBD domain far from the RBD and ACE2 interface which may serve as a potential target for future drug discovery.
Design, System, ApplicationThe design of small-molecules that target the spike protein of SARS-CoV-2 has been the focus of recent studies searching for an effective treatment strategy against COVID-19. In this work, molecular dynamics simulations are used to study how several previously identified therapeutics alter the structure and dynamics of the spike protein. We analyze the strain and molecular stiffness induced by each small molecule upon binding, and identify a previously uncharacterized distal binding site that induces significant allosteric conformational changes to the protein–protein interaction between the subunits of the spike protein and the active site of the angiotensin-converting enzyme 2 (ACE2), a key binding partner required for host cell infectivity. The fundamental understanding obtained in this study may enable the development and engineering of novel small-molecules that disrupt viral entry, and provides an informed perspective on how allosteric modulators serve as a template for the design of suitable therapeutics. |
The surface-anchored viral spike (S) glycoprotein mediates coronavirus entry into host cells. The S protein is a type I transmembrane protein comprising two functionally distinct regions, S1 and S2, that mediate receptor binding and membrane fusion, respectively. Similar to SARS-CoV, responsible for the 2003 SARS outbreak in Asia, the S1 subunit of the SARS-CoV-2 engages the same angiotensin-converting enzyme 2 (ACE2) receptor for host cell entry and utilizes the serine protease TMPRSS2 for S protein priming.2 Of the two regions, the S2 subunit is the most conserved among different coronavirus genera and nine residues involved in the interaction between ACE2 and the receptor binding domain (RBD) of the S1 subunit are conserved between SARS-CoV and SARS-CoV-2.3,4
Specific RBD-receptor binding determines host cell infection and has been the target of several drug design efforts. Recent flow cytometry studies have shown that SARS-CoV-2 RBD exhibits a higher binding affinity to human ACE2 (hACE2) and bat ACE2 (bACE2) receptors than SARS-CoV RBD.5 Deletions in the RBD of closely related S protein sequences have also been shown to inhibit binding to hACE2.6 In addition, several studies indicate that the RBD is an attractive antigen for specific antibody detection.7,8 Taken together, these results suggest that SARS-CoV-2 RBD is a promising target for the development of novel vaccines and antiviral drugs.
Recent studies have identified key surface contacts that can be leveraged to disrupt the RBD/ACE2 protein–protein interaction (PPI).9,10 PPIs are physical contacts of high specificity between proteins that play an important role in cellular function.11 PPIs are characterized by their large, flat, and featureless binding interfaces, as in the case of the RBD/ACE2 complex. One major challenge in PPI inhibition is designing small molecules that can compete with the high binding affinity of a natural protein partner. In addition, the large and flat nature of the protein interaction surface often lacks clear binding pockets or grooves that can act as binding sites for smaller ligands. A promising alternative is the identification or engineering of allosteric modulators that stabilize protein–protein interactions, thereby interfering with the downstream pathways they mediate.12 The key strategy in this approach involves indirectly affecting PPI interfaces by targeting distal binding sites that are structurally distinct. Allosteric modulators induce a conformational change that either inhibits or stabilizes association with another protein and have shown promise in the design of suitable therapeutics.13,14
In this work, we focus on a set of small-molecule drugs that can be repurposed for the therapeutic treatment of COVID-19. Specifically, we focus on the top ten scoring compounds proposed in a recent high-throughput supercomputer-based docking screen that was performed in vacuum and relied on minimization of the potential energy of the ACE/RBD complex.15 Here we perform molecular simulations in explicit water and determine the relative and absolute binding free energies of each drug at various binding locations and with different glycosylation profiles. We find the affinity of the ligands to the complex to be different than previously predicted on the basis of energy minimization, with several of the top candidate drugs unbinding from the protein over the course of nanosecond long trajectories. We study the effect glycans have on binding affinity for several ligands by comparing two glycoforms of the RBD/ACE2 complex: a simple and “Abundant” model. Notably, our free energy results show that for an RBD/ACE2 complex glycosylated with small sugar moieties, high-affinity binding ligands stabilize the binding between RBD and ACE2. An RBD/ACE complex with glycans that were determined as the most abundant from a glycoproteomics study conducted by Zhao et al. was analyzed for comparison.16 The presence of these complex-glycans affect the binding affinity of several drug candidates and their mechanism of action. We then analyze the strain and molecular stiffness induced by each small molecule upon binding and identify a “hotspot”, comprising only a few amino acid residues, as a potential target for drug discovery. Surprisingly, binding of luteolin – one of the drugs considered here, to this distal site, induces a profound allosteric conformational change to residues near the RBD/ACE2 binding interface, which disrupts non-native contacts at the protein–protein interface. We also provide additional quantitative insights into the binding mechanism of luteolin to the RBD/ACE2 complex using cross-correlation analysis and principal component analysis (PCA). We conclude with a discussion of our findings and several suggestions for future experimental studies.
xj − xi = F(x0,j − x0,i) | (1) |
(2) |
(3) |
(4) |
For this study, a 10 Å radius around each Cα atom for our analysis was considered. A simulation of RBD/ACE2 without ligand was used as reference for strain calculations to elucidate the binding of several ligands at different sites.
Mathematically, PCs are evaluated by diagonalizing the correlation matrix Cij,
RTCR = Λ | (5) |
Cij = 〈(xi − 〈xi〉)(xj − 〈xj〉)〉 | (6) |
Multiple binding free energy calculations were conducted to determine the binding affinity between the RBD/ACE2 complex and each small ligand (Table 1). The results were compared to those of the high-throughput screening.15 We observed standard deviations ranging from 0.60–0.91 and 0.16–0.53 between our docking binding energies to that of the previous screening study for the simple and “Abundant” models, respectively. This could be due to the differences in the structures used (homology modeled vs. crystal structure) and the fact that important disulfide bonds and glycans groups were accounted for in our structure but were omitted in the Smith et al. analysis.15 The absolute binding free energies between ligand and protein reported here were determined using thermodynamic integration (TI) in the presence of explicit water. In our TI protocol, each ligand atom is treated as a softcore atom and is subsequently “removed” in a one-step alchemical cycle both in solution and in complex. This alchemical method, also referred to as free energy perturbation (FEP), allows for the calculation of absolute binding free energies (ABFE). We observed larger binding energies with standard deviations as high as 10.79 for the simple glycan structure and 5.04 for the “Abundant” glycoform in our free energy calculations than the docking binding energies. Of the three ligands studied, luteolin showed the highest binding affinity for both glycoforms of the RBD/ACE2 complex. Luteolin was also the only ligand to bind at the distal site compared to the other candidates which were found to have a stronger binding affinity for the binding interface of RBD and ACE2 (Fig. 1).
Molecule | Oak ridge ΔG (kcal mol−1) | Docking ΔG (kcal mol−1) | TI ΔG (kcal mol−1) | RBD/ACE2 ΔΔG (kcal mol−1) |
---|---|---|---|---|
Luteolin | −7.4 | −6.19 | −11.31 ± 0.62 | −13.93 ± 1.63 |
Protirelin | −7.3 | −8.45 | −1.22 ± 5.79 | −11.22 ± 5.10 |
Nitrofurantoin | −7.2 | −9.22 | −28.98 ± 4.83 | −13.96 ± 2.84 |
Sapropterin | −7.1 | −5.67 | −6.10 ± 1.21 | −18.89 ± 3.64 |
Vidarabine | −7.1 | −6.24 | −3.39 ± 1.70 | −17.77 ± 1.66 |
Eriodictyol | −7.1 | −7.86 | −9.36 ± 2.41 | −15.99 ± 3.53 |
To determine how each compound affects S protein and ACE2 binding, we calculated the free energy of binding between the RBD domain of the S-protein and the ACE2 receptor in the presence of each ligand using the molecular mechanics/generalized Born surface area (MM/GBSA) method. Tables 1 and 2 show the difference in binding free energy with ligand present versus without (ΔΔGBinding) for the simple and “Abundant” glycan models of the complex. In the case of the simple glycan model, our results show that every ligand enhances the binding free energy between the RBD domain and ACE2, acting as a binding stabilizer for the complex. Sapropterin, which binds inside ACE2 (Fig. 1), appears to be the most potent stabilizer of the analyzed group. For the “Abundant” glycan model, our results show that Isoniazid acts as an allosteric inhibitor as indicated by the positive binding energy between the RBD domain and ACE2 (ΔΔGBinding) when the ligand is present. In contrast, eriodictyol was observed to increase the binding affinity of the RBD domain to ACE2 thereby acting more as an allosteric stabilizer. Luteolin was observed to have negligible effect on the binding energy between RBD and ACE2. We hypothesize that inhibition/stabilization of the complex induced by these small molecules may serve to disrupt downstream events such as S protein priming by TMPRSS2, an essential step for cellular entry by the virus.
Molecule | Oak ridge ΔG (kcal mol−1) | Docking ΔG (kcal mol−1) | TI ΔG (kcal mol−1) | RBD/ACE2 ΔΔG (kcal mol−1) |
---|---|---|---|---|
Luteolin | −7.4 | −7.71 | −15.04 ± 5.04 | 0.0027 ± 8.94 |
Isoniazid | −7.3 | −6.57 | −7.29 ± 4.54 | 11.87 ± 7.26 |
Eriodictyol | −7.1 | −8.16 | −12.65 ± 2.36 | −8.06 ± 9.92 |
We study the changes of non-native contacts between the S protein and ACE2 receptor upon ligand binding by comparing the non-native contact maps of each compound (Fig. 3). Similar changes in non-native contacts are observed for compounds eriodictyol and nitrofurantoin, which bind at the RBD/ACE2 interface (Fig. 3A and B), and sapropterin, which binds inside ACE2 (Fig. 3C). Interestingly, binding of luteolin (Fig. 3D) to the distal binding region shows a clear difference in the number of contacts between the S protein and ACE2 receptor. These results suggest that an allosteric effect is induced when luteolin binds to the distal binding site.
Fig. 4 shows the estimated root mean squared fluctuations and the shear strain on each residue of the RBD/ACE2 complex for several ligands. Strain is a natural quantity to study local protein deformations, which are a good indicator of mechanical allosteric coupling between two binding events on the same protein. This deformation upon ligand binding is calculated as the shear strain measured relative to the average conformation from all frames of the RBD/ACE2 sans ligand. Average root mean square fluctuations (RMSF), a measure of the displacement of a residue relative to the initial crystal structure (PDB: 6M0J), suggest high levels of deviations for RBD residues Asn481, Gly482 and Val483 near the binding interface of ACE2, particularly for the luteolin ligand (Fig. 4A and 5A). Strain analysis also shows a significant peak at these residues (Fig. 4B and 5B). Consistent with the results of Fig. 3D, this profound allosteric effect is induced when luteolin is bound to the distal RBD binding site for both the simple and “Abundant” glycan forms of the RBD/ACE2 complex.
The formation of cross-correlation networks allows the transmission of information when the binding of a molecule at one site of the protein induces a change in local structure elsewhere in the protein. From our structural analysis, luteolin was the only ligand to induce a conformational change to the RBD protein after binding to a distal site. To validate the extent by which the atomic fluctuations of the complex are correlated with one another in the presence of luteolin, a dynamic cross-correlation network analysis was conducted using the Bio3D software.23 Anti-correlations between −0.40 and −0.60 are represented as blue lines between several distant regions of the RBD, as seen in Fig. 5C and D (ACE2 has been omitted for clarity). A comparison of simulations of the RBD/ACE2 with and without luteolin shows the disruption of the dynamic cross-correlation network by this drug. The disappearance of elements of the network, which include anti-correlated sites between this distal binding site and the RBD/ACE2 interface, may be critical in mediating allosteric transitions and ACE2 binding.
Principal component analysis (PCA) was used to gain quantitative insights into the binding of luteolin to the RBD/ACE2 complex. The orthogonal eigenvectors of the resulting principal components (PCs) describe the maximal variance of the distributions of structures. Details of the data processing are described in the Methods section. The two dominant principal components (PCs) were sufficient to describe the observed conformational changes at the RBD/ACE2 interface. The first principal component (PC1) corresponds to the rotational motion of the RBD/ACE2 complex. The second principal component (PC2) captures the conformational change observed at the RBD/ACE2 interface induced by Luteolin binding (Fig. 6). As previously discussed, residues near the binding site were observed to be anti-correlated to residues near the ACE2 binding site. Binding of luteolin disrupts this cross-correlation network, thereby inducing a conformational shift near the ACE2 binding site. Conformational changes to the RBD protein, as captured by PCA, confirm not only correlated motions of the RBD protein but suggest that the “hotspot” identified here may be a useful target in the discovery and design of new therapeutics that modulate the RBD/ACE2 protein–protein interface. Taken together, our results suggest that luteolin may serve as an important small-molecule allosteric modulator that affects RBD/ACE2 protein dynamics and may provide an alternative therapeutic approach.
In this study, we have focused on determining the binding mechanism and associated conformational changes in the presence of the top compounds from a docking screen recently conducted by Smith et al.15 After filtering compounds by their residence time in complex through MD simulations, rigorous binding free energy calculations were conducted in explicit water using thermodynamic integration. Our results have revealed a range of free energies and binding positions that were not anticipated in the original docking studies. The majority of such positions lie far from the interface, and the free energy extrema correspond to the shallow interface region of binding (Tables 1 and 2). The relative RBD/ACE2 binding free energy changes induced by the top ligands considered here indicate a binding enhancement of the RBD/ACE2 complex, even for drugs bound in the interface region. This enhancement is accompanied by a modification of the non-native contact maps of the protein interface, which lead to changes in the manner in which the proteins interact. Taken together, our results suggest that these ligands could not act as ordinary competitive inhibitors for RBD, since there is no binding disruption, but, given the contact modification and the energy differences with respect to the pure RBD/ACE2 complex, we propose that they may inhibit the system by serving as allosteric or direct PPI stabilizers.12,28
PPIs with large, flat and featureless surfaces, as in the case in the RBD/ACE2 complex, lack good drug-binding pockets for ligands. Recent approaches have shown that allosteric modulators provide an alternative strategy to target PPIs such as the RBD/ACE2 complex. Our results have uncovered pronounced effects at the RBD/ACE2 interface upon ligand binding at a distal site. In particular, upon luteolin binding, RMSF and strain analysis unveil significant levels of fluctuations and strain in RBD regions away from the binding site. In addition, cross-correlational analysis reveals the disruption of anti-correlated motions between the luteolin binding site and the binding interface of RBD/ACE2. Future research will require investigating the possible binding motifs for luteolin and its effects on S protein binding to the ACE2 receptor. Beyond the finding that luteolin might serve to inhibit viral entry, the discovery of this distal binding site offers potential for the design and engineering of future therapeutics for COVID-19.
Footnote |
† These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2022 |