Zheyi Liu,‡a Wenxiang Zhang,‡ab Binwen Sun,ac Yaolu Ma,ac Min He,ac Yuanjiang Panb and Fangjun Wang*ac
a CAS Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China. E-mail: wangfj@dicp.ac.cn
b Department of Chemistry, Zhejiang University, Hangzhou 310027, China
c University of Chinese Academy of Sciences, Beijing 100049, China
First published on 23rd November 2020
Probing the conformational and functional hotspot sites within aqueous native protein complexes is still a challenging task. Herein, a mass spectrometry (MS)-based two-step isotope labeling-lysine reactivity profiling (TILLRP) strategy is developed to quantify the reactivities of lysine residues and probe the molecular details of protein–protein interactions as well as evaluate the conformational interventions by small-molecule active compounds. The hotspot lysine sites that are crucial to the SARS-CoV-2 S1–ACE2 combination could be successfully probed, such as S1 Lys417 and Lys444. Significant alteration of the reactivities of lysine residues at the interaction interface of S1-RBD Lys386–Lys462 was observed during the formation of complexes, which might be utilized as indicators for investigating the S1-ACE2 dynamic recognition and intervention at the molecular level in high throughput.
The lysine residue (K) bears a positively charged amino group on its side chain under native physiological conditions. It can interact with negatively charged amino acids such as aspartic acid (D) and glutamic acid (E), or with anionic ligands, to form salt bridges or hydrogen bonds, which are essential for stabilizing protein structure and maintaining protein activity.9–12 Therefore, many lysine residues are protein function centers and are involved in protein structure modulation and biological function regulation such as kinase activation.13–17 In addition, the extent of protein conformational modulation is directly related to the change in lysine reactivity in the modulated protein regions.18–22
Recently, a novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has spread worldwide and causes the atypical viral pneumonia COVID-19.23,24 Both structural modelling and experimental evidence have demonstrated the strong interaction of the spike (S) glycoprotein of SARS-CoV-2 with the human cellular receptor angiotensin-converting enzyme 2 (ACE2), which is crucial to the SARS-CoV-2 invasion of host cells.25–27 The S glycoprotein contains two functional subunits, S1 and S2, which bind the membrane receptor ACE2 and fuse the viral and cellular membranes, respectively.25,28,29 The structures of the SARS-CoV-2 S trimer and the ACE2–S1 receptor binding domain (RBD) complex have been successfully determined by cryo-EM.25,28,29 However, it is still challenging to probe the molecular details of the dynamic interactions and pharmaceutical interventions of S1 with ACE2 in aqueous solution in high throughput. Recently, much effort has been devoted to the molecular docking (MD) prediction of potential inhibitors for SARS-CoV-2 protein targets as well as the human membrane receptor ACE2.30–34 There is an urgent need for a high-throughput and highly sensitive experimental strategy to monitor the dynamic interaction and intervention of SARS-CoV-2 S1 with ACE2, which might promote the discovery and development of effective therapeutic agents.
Herein, we developed a two-step isotope labeling-lysine reactivity profiling (TILLRP) strategy to probe the conformational hotspot sites within native protein–protein interactions. Briefly, the protein sample is labeled with heavy and light isotopic dimethyl tags under native and denatured conditions in sequence. After comprehensive proteomics quantification, the labeling reactivities of lysine residues in native states could be confidently quantified (Fig. S1†). We applied the TILLRP strategy to monitor the dynamic conformational hotspots of the recognition and intervention of SARS-CoV-2 S1 with the ACE2 receptor (Fig. 1). The labeling reactivities of lysine residues at the complex interaction interface of S1-RBD Lys386–Lys462 are directly related to the protein complex formation and might be utilized as indicators for investigating the S1–ACE2 dynamic recognition and intervention. This TILLRP strategy exhibits capability in probing and evaluating the dynamic conformational alterations of S1–ACE2 recognition and intervention by exogenous compounds at the molecular level in high throughput.
Bovine serum albumin (BSA) was first utilized to benchmark the TILLRP strategy. The normalized labeling efficiency (NLE) values of 23 BSA lysine residues were obtained and were highly heterogeneous (Fig. 2A and B). The NLE values of sites Lys76, Lys322, and Lys499 are lower than 0.25, indicating strong interactions within their proximal microenvironments (Fig. 2C–E). In contrast, the site Lys350 could be almost completely labeled (NLE = 0.98) and no interaction could be observed in its native microenvironment (Fig. S2A†). One-third of the charged residues in native proteins are involved in salt bridges, and a distance below 4 Å between the charged groups is a criterion for judging the formation of strong salt-bridge interactions.35,36 We summarized the distances between lysine residues and their proximal acidic residues in the BSA structure (Fig. S2B†). A good correlation between the NLE values and the distances could be observed, demonstrating that the relative reactivities (labeling efficiencies) of lysine residues depend on their proximal salt-bridge interactions (Fig. S2B†). In contrast, the lysine NLE values exhibit a poor relationship with solvent-accessible surface areas (SASAs), which is different from previous protein footprinting strategies (Fig. S2C†).37,38
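The 4 Å salt-bridge criterion quoted above can be illustrated with a minimal sketch. It assumes that the relevant distance is measured between the lysine side-chain nitrogen and the carboxylate oxygens of nearby Asp/Glu residues; the function names and the coordinates are hypothetical placeholders and are not taken from the BSA structure.

```python
import math

SALT_BRIDGE_CUTOFF = 4.0  # Å, distance criterion quoted in the text

def min_acidic_distance(lys_nitrogen, acidic_oxygens):
    """Shortest distance (Å) from a lysine side-chain N to any acidic O atom."""
    return min(math.dist(lys_nitrogen, oxygen) for oxygen in acidic_oxygens)

def has_salt_bridge(lys_nitrogen, acidic_oxygens, cutoff=SALT_BRIDGE_CUTOFF):
    """True if any acidic oxygen lies closer than the cutoff."""
    return min_acidic_distance(lys_nitrogen, acidic_oxygens) < cutoff

# Hypothetical coordinates (Å), for illustration only.
lys_n = (0.0, 0.0, 0.0)
oxygens = [(3.0, 0.0, 0.0), (0.0, 5.0, 0.0)]
print(min_acidic_distance(lys_n, oxygens), has_salt_bridge(lys_n, oxygens))
```

In practice the coordinates would be read from a structure file such as the PDB entries cited in Fig. 2, and the resulting distances compared with the measured NLE values.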
Fig. 2 Lysine reactivity profiling of bovine and human serum albumins by using the TILLRP strategy. (A) The NLE values of 23 BSA lysine residues; (B) the conformational distribution of the quantified lysine sites; (C–F) the proximal microenvironments of Lys76, Lys322, and Lys499 in BSA (PDB: 4f5s), and Lys500 in HSA (PDB: 1ao6).
The relative reactivities of lysine residues in the human serum albumin (HSA) were further investigated. Most of the lysine NLE values quantified in both purified and serum native HSA are consistent, indicating that the serum native structure is mainly maintained in its purified form (Fig. S3A†). Further, most of the conserved lysine residues between HSA and BSA exhibit similar NLE values except for Lys41, Lys51, Lys500, and Lys545 (Fig. S3B†). Compared to the BSA Lys499, HSA Lys500 is not engaged in any proximal interaction (Fig. 2F).
Next, the relative reactivities of lysine sites in catechol-O-methyl transferase (COMT) were analyzed before and after binding of the small-molecule inhibitor tolcapone. Only the NLE of Lys144 decreased significantly, from 0.68 to 0.13, while little effect was observed for the other lysine residues (Fig. S4A†). COMT Lys144 is known as the crucial site that mediates the binding of COMT with a small-molecule inhibitor (Fig. S4B†),39,40 demonstrating that lysine reactivity is sensitive to conformational alterations induced by exogenous small molecules. Overall, the relative reactivities (NLE) of lysine residues could be utilized to probe the conformational interactions and interventions of native proteins.
The relative reactivities of lysine residues within SARS-CoV-2 S1 and human ACE2 were systematically analyzed before and after the formation of the specific-recognition complex. The structure of the S1 protein is relatively loose and flexible for membrane receptor adaptation and combination, while the structure of ACE2 is more compact with many helix-rich regions.28 Thus, the NLE values of ACE2 lysine residues were generally lower than those of S1 under identical labeling conditions (Fig. 3A). For example, Lys288 within the ACE2 peptidase domain (PD), which has strong proximal interactions, exhibits a relatively low NLE value (0.01), while the NLE of Lys444 within the S1 receptor-binding domain (RBD) is as high as 0.90 because little proximal interaction exists in its native state (Fig. S5†).
After the formation of the S1–ACE2 complex, the regions of conformational alteration could be clearly observed, as the lysine NLE values obtained in these regions changed significantly (Fig. 3A). The sequence region Lys386–Lys462 of the S1-RBD exhibits the most significant decreases in NLE values. This region is known as the flexible loop region of S1-RBD that spans the α1 helix of ACE2-PD, and is thus the direct interaction interface of S1 in the formation of the complex.28 Interestingly, the NLE values of S1-RBD Lys417 and Lys444 decreased from 0.40 and 0.91 to 0.16 and 0.24, respectively, after binding to ACE2. In the native conformation of the trimeric S glycoprotein,26 Lys444 is located at the top of the S1 trimer, and no interaction is found in the proximal microenvironment of its side chain (Fig. S6†). During the combination with ACE2, the conformation of one of the RBDs changes from the closed (all RBD down) or semi-closed state (one RBD up) to a relatively stable open state (RBD up) and combines with ACE2.29 The side chain of Lys444 in this conformation then forms a salt bridge with the Gly447 backbone (Fig. S7†). The decrease of NLE from 0.91 to 0.24 can reflect the conformational difference of the Lys444 local microenvironment during the combination of S1 and ACE2. Similarly, Lys417 can form at most one salt bridge, with Asn370 on the adjacent chain, in the native conformation of the S protein (Fig. S6†). After the combination with ACE2, the conformation of the S1-RBD region changes to a relatively stable open state,26,41 and the Lys417 site could then form salt bridges with both ACE2 Asp30 and S1 Leu455 (Fig. S7†). This is highly consistent with the Lys417 NLE value decreasing from 0.40 to 0.16 during the formation of the S1–ACE2 complex. Thus, the Lys417 site is a crucial hotspot in the interaction and combination of S1 and ACE2. The substitution of Val404 in the SARS-CoV-RBD with Lys417 in the SARS-CoV-2-RBD is also recognized as one of the most important substitutions that enhance the binding affinity for ACE2.28
For the other part of ACE2, we discovered that the overall conformation of ACE2 from the N-terminal PD to C-terminal collectrin-like domain (CLD) becomes more relaxed after the formation of the S1–ACE2 complex as many lysine NLE values in these domains exhibited consistent increasing trends (Fig. 3A). The most significant changes of labeling reactivity were observed for residues Lys625 and Lys657 in the ACE2 neck domain, which is located between the PD and transmembrane helix and mainly mediates the dimerization of ACE2. The ACE2 dimer has two conformations, open and closed, while only the closed state is retained after the combination with S1-RBD.28 Thus, the increase of the NLE values of most of the lysine residues might be induced by the ACE2 dimer conformation change. Interestingly, little reactivity change was observed for the lysine residues within the direct interaction interface regions of ACE2 Lys31–Lys68 and Lys353–Lys363 during the formation of the S1–ACE2 complex. This might be because the α1 and α2 helices of ACE2 are relatively compact and stable and the combination of flexible S1-RBD will not modulate their conformations significantly.
Overall, the conformational details of the S1–ACE2 dynamic interaction, including the direct interaction interface and related conformational changes, could be probed by the TILLRP strategy with high sensitivity and high residue resolution. As the interaction interface of S1-RBD is modulated much more significantly, as described above, we speculate that the relative reactivities (NLE values) of lysine residues within S1-RBD Lys386–Lys462 could be utilized as indicators for evaluating the dynamic recognition and intervention of the S1–ACE2 complex.
First, glycyrrhizic acid-induced conformational effects were investigated by using the TILLRP strategy. Interestingly, the NLE value of Lys444 within the interaction interface of S1-RBD Lys386–Lys462 exhibited a further decreasing trend after the introduction of glycyrrhizic acid into the S1–ACE2 solution (Fig. 4). In contrast, little intervention was observed on the overall structure of ACE2, suggesting that the target of glycyrrhizic acid might be S1. Although glycyrrhizic acid exhibits significant intervention on the S1 conformation in the local regions of Lys77, Lys129, Lys182, Lys310, and Lys444 (t-test p < 0.05, |δNLE| ≥ 0.1), we think the S1–ACE2 complex could still be retained, as the reactivities of most of the lysine residues at the S1-RBD interaction interface rarely exhibit an increasing trend. The results obtained in MD simulations also demonstrate that the proximal interactions around Lys77, Lys182, Lys310, and Lys444 are enhanced after the combination with glycyrrhizic acid (Fig. S8†), consistent with the decrease in their NLE values.
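The selection criterion stated above (t-test value below 0.05 and |δNLE| ≥ 0.1) can be sketched as a simple filter. This is only an illustration under stated assumptions: δNLE is taken as the mean NLE with compound minus the mean NLE without compound, the p-value is assumed to come from a t-test computed elsewhere, and the replicate values and function names are hypothetical.

```python
from statistics import mean

def delta_nle(nle_treated, nle_control):
    """Assumed definition: mean NLE with compound minus mean NLE without it."""
    return mean(nle_treated) - mean(nle_control)

def is_significant(delta, p_value, p_cutoff=0.05, delta_cutoff=0.1):
    """Apply both thresholds quoted in the text: p < 0.05 and |dNLE| >= 0.1."""
    return p_value < p_cutoff and abs(delta) >= delta_cutoff

# Hypothetical replicate NLE values for a single lysine site.
control = [0.23, 0.24, 0.25]
treated = [0.10, 0.12, 0.14]
d = delta_nle(treated, control)
# The p-value is supplied by the caller (hypothetical value here).
print(round(d, 2), is_significant(d, p_value=0.01))
```

A site is flagged only when both conditions hold, so a large but statistically unsupported shift, or a reproducible but very small shift, is not reported.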
Fig. 4 The δNLE of lysine residues in S1–ACE2 complexes induced by the treatment of exogenous small-molecule compounds.
In the case of hesperetin, most of the conformational effects were also focused on the S1 protein (Fig. 4 and S9†). Briefly, the reactivities of Lys77, Lys97, Lys206, Lys310, and Lys557 decreased significantly, implying that the local regions of these lysine residues are modulated by hesperetin (Fig. S9†). However, lysine residues at the direct interaction interface of S1-RBD Lys386–Lys462 were barely disturbed by hesperetin. Similar results were also observed when promethazine, one of the reported active drugs for SARS-CoV,45 was incubated with the S1–ACE2 solution (Fig. 4). Other active molecules including scutellarin, nicotinamide and sulfobutyl ether-β-cyclodextrin46 were also evaluated in a similar manner (Fig. S10†). Although some significant conformational effects on the S1–ACE2 complex could be observed with high sensitivity, none of the compounds obviously altered the reactivities of lysine residues at the direct interaction interface of S1-RBD Lys386–Lys462. Therefore, the TILLRP strategy might be a useful experimental method for evaluating the pharmaceutical effects of active compounds on the S1–ACE2 complex at the molecular level.
This novel TILLRP strategy was applied to investigate the dynamic conformational hotspots of S1–ACE2 complex recognition and intervention in native aqueous solution. The direct interaction interface of S1–ACE2 recognition and related conformational changes could be successfully monitored. We suggest that the NLE values of lysine residues at the interaction interface of S1-RBD Lys386–Lys462 might be utilized as indicators for investigating S1–ACE2 recognition and intervention. Although no efficient active compound was discovered to disrupt the formation of the S1–ACE2 complex directly, dynamic conformational interventions on S1–ACE2 induced by small molecules such as glycyrrhizic acid could be successfully monitored with high sensitivity and high throughput. Furthermore, we think that the TILLRP strategy could also be applied to investigate the pharmaceutical interventions of active compounds on other SARS-CoV-2 protein targets, owing to its broad applicability with little limitation on protein types. Overall, we believe that the TILLRP strategy might provide a promising experimental alternative for the evaluation of potential anti-SARS-CoV-2 drugs at the molecular level.
The recombinant SARS-CoV-2 spike glycoprotein S1, human ACE2 and the S1–ACE2 complex samples were diluted to 0.3 mg mL−1 with 50 mM phosphate buffer (pH 7.4). The protein samples were first labeled with 5 mM 13CD2O and 10 mM NaCNBD3 for 5 min in native aqueous solution, and the labeling reaction was quenched by adding a 5-fold sample volume of precipitation buffer. The second step, isotope labeling under denatured conditions, was identical to that of the serum samples as described above.
The protein samples of spike glycoprotein S1, ACE2 and the S1–ACE2 complex were analyzed using an Orbitrap Fusion Lumos Tribrid MS coupled to a Vanquish Flex HPLC system (Thermo Fisher Scientific). Briefly, 0.2 μg of protein digest was first loaded onto a 5 cm × 150 μm C18 trap column (5 μm C18, SunChrom) and separated on a 15 cm × 150 μm C18 capillary column (1.9 μm C18, Dr Maisch) at a flow rate of 0.3 μL min−1. The mass spectrometer was operated in positive mode in a data-dependent manner. The MS1 spectra were collected in the Orbitrap at a resolution of 120 000. The MS2 spectra were collected in a ‘top-speed’ manner (3 seconds) at a resolution of 15 000 in the Orbitrap. Precursor ions with a charge state of 2 to 5 were isolated with an isolation window of 1.4 m/z and subjected to HCD with a normalized energy of 28%. Dynamic exclusion was enabled with an exclusion time of 60 s.
The native labeling efficiency (NLE) values were calculated via the formula $\mathrm{NLE} = 1 - I_\mathrm{L}/(I_\mathrm{H} + I_\mathrm{M} + I_\mathrm{L})$, where $I_\mathrm{H}$, $I_\mathrm{M}$ and $I_\mathrm{L}$ are the intensities of the heavy, medium and light labeled forms, respectively, of the peptides containing the corresponding lysine residue.
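The formula above can be sketched as a small helper. This is a minimal illustration, not the authors' processing pipeline: the function name is hypothetical, and the intensities in the example are made-up numbers in arbitrary units.

```python
def native_labeling_efficiency(i_heavy: float, i_medium: float, i_light: float) -> float:
    """Compute NLE = 1 - I_L / (I_H + I_M + I_L) for one lysine site."""
    total = i_heavy + i_medium + i_light
    if total <= 0:
        raise ValueError("total peptide intensity must be positive")
    return 1.0 - i_light / total

# Hypothetical intensities (arbitrary units), for illustration only.
example_nle = native_labeling_efficiency(i_heavy=6.0e6, i_medium=2.0e6, i_light=2.0e6)
print(round(example_nle, 2))  # 0.8
```

With this definition, a site detected only in the light form gives NLE = 0, and a site with no light form gives NLE = 1, matching the range of values reported in the text.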
Using AutoDockTools (version 1.5.6),48 each lysine site with a significant NLE difference between the free and complexed states of S1–ACE2 was defined as a flexible residue and saved, while the rest of the corresponding protein structure was kept rigid; the small-molecule compounds (glycyrrhizic acid (ZINC96015174), hesperetin (ZINC39092), nicotinamide (ZINC5878), promethazine (ZINC20250), scutellarin (ZINC21992916) and sulfobutyl ether-β-cyclodextrin (Captisol)) were saved after their torsions were set. These three parts were saved in *.pdbqt format. Using AutoDock Vina (version 1.1.2),49 each lysine site of each chain was docked three times. The docking parameters were as follows: center_xyz was set to the position of the side-chain N atom of each lysine, the grid box size was set to 30 × 30 × 30 Å, num_modes was set to 20, and exhaustiveness was set to 30. To improve the accuracy of the simulated small-molecule interaction regions, the docking conformations were required to be as consistent as possible with the trend of the NLE changes. The top-ranked docking conformations by binding affinity were selected as representative results. Protein and ligand structures were rendered with PyMOL.
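The docking parameters listed above map onto the key = value configuration file that AutoDock Vina reads via its --config option. The sketch below only assembles such a file as text; the file names and the box-center coordinates are hypothetical placeholders, and the helper function is illustrative rather than part of the published workflow.

```python
def vina_config(receptor, flex, ligand, center,
                box=30.0, num_modes=20, exhaustiveness=30):
    """Build the text of an AutoDock Vina config file from the stated parameters."""
    cx, cy, cz = center  # position of the lysine side-chain N atom (Å)
    lines = [
        f"receptor = {receptor}",   # rigid part of the protein (*.pdbqt)
        f"flex = {flex}",           # flexible lysine residue (*.pdbqt)
        f"ligand = {ligand}",       # small-molecule compound (*.pdbqt)
        f"center_x = {cx:.3f}",
        f"center_y = {cy:.3f}",
        f"center_z = {cz:.3f}",
        f"size_x = {box:.1f}",      # 30 x 30 x 30 Å search box
        f"size_y = {box:.1f}",
        f"size_z = {box:.1f}",
        f"num_modes = {num_modes}",
        f"exhaustiveness = {exhaustiveness}",
    ]
    return "\n".join(lines) + "\n"

# Hypothetical file names and coordinates, for illustration only.
print(vina_config("s1_rigid.pdbqt", "s1_lys444_flex.pdbqt",
                  "glycyrrhizic_acid.pdbqt", center=(1.0, 2.0, 3.0)))
```

Writing one such file per lysine site and chain, and running each three times, would reproduce the replicate structure described in the text.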
Footnotes
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc05330a
‡ These authors contributed equally to this work.
This journal is © The Royal Society of Chemistry 2021