Yan
Huang‡
a,
Hui
Luo‡
ac,
Yihui
Jin‡
a,
Yuheng
Ma
a,
Yan
Zhao
a,
Xin
Gao
a,
Yuting
Zhao
a,
Xiao
Qi
a,
Guodong
Liang
*ac,
Lu
Ga
*a,
Gang
Li
*a and
Jie
Yang
*b
aKey Laboratory for Candidate Drug Design and Screening Based on Chemical Biology, College of Pharmacy, Inner Mongolia Medical University, Hohhot, P.R. China. E-mail: lgd08502214@163.com
bNMPA Key Laboratory for Research and Evaluation of Drug Metabolism, Guangdong Provincial Key Laboratory of New Drug Screening, Guangdong-Hongkong-Macao Joint Laboratory for New Drug Screening, School of Pharmaceutical Sciences, Southern Medical University, Guangzhou, P.R. China. E-mail: yj528@smu.edu.cn
cBeijing Institute of Pharmacology and Toxicology, Beijing, P.R. China
First published on 31st October 2024
Human Immunodeficiency Virus (HIV) has continued to endanger human health for decades and has a substantial impact on global health defence. Peptide-based fusion inhibitors, as an integral part of Highly Active Anti-Retroviral Therapy (HAART), are effective in preventing and controlling the AIDS epidemic. Nevertheless, the current market leader, Enfuvirtide, is facing numerous challenges in clinical application. We herein devised a cutting-edge development strategy leveraging SWISS-MODEL and HDOCK, enabling the design of artificial N-peptides. The most active compound, IZNP02QE, surpassed the positive control by demonstrating remarkable nanomolar-level inhibitory activity against HIV-1. Mechanistic investigations unveiled IZNP02QE's ability to disrupt the crucial endogenous 6-helix bundle (6-HB) by forming heteropolymers, underscoring its potential as a novel anti-HIV-1 agent. This work not only pioneers a novel design methodology for N-peptides but also opens up the possibility of a CADD strategy for designing peptide-based fusion inhibitors.
The Food and Drug Administration (FDA) has approved mainly four categories of AIDS therapeutic agents:1,2 Reverse Transcriptase Inhibitors, Protease Inhibitors, Integrase Inhibitors, and Entry Inhibitors. Highly Active Anti-Retroviral Therapy (HAART), commonly known as “cocktail therapy”, involves two or more combinations of the above anti-HIV medications and is currently recognized as the most effective treatment for AIDS.3 The fusion inhibitor Enfuvirtide (also known as T20 in the entry inhibitor classification) is commonly used in cocktail therapies to block the membrane fusion process between HIV and host cells, thus acting in the early lifecycle of the virus. Fusion inhibitors are increasingly prominent in AIDS treatment, offering renewed hope and expanded possibilities.
The membrane fusion process by which HIV enters host cells is mainly dependent on the surface envelope spike, consisting of the envelope glycoprotein gp120 and the transmembrane protein gp41.4 Initially, gp120 mediates the attachment of HIV to the CD4+ receptor on the target cell membrane, triggering a conformational shift that activates gp41 upon binding to either the CXCR4 or CCR5 co-receptors. Subsequently, the fusion peptide portion in gp41 inserts into the target cell membrane, and stretches out to form the pre-hairpin structure. The pre-hairpin stretching-deforms the adjacent N-terminal heptad repeat (NHR) region to create the trimeric core (N-trimer) with widely distributed hydrophobic pockets on the surface, and the three helices in the downstream C-terminal heptad repeat (CHR) region are subsequently inserted into the hydrophobic pockets of the NHR with reverse folding to form the stable six-stranded helix bundle structure (6-HB). 6-HB drives the accomplishment of the virus–host cell membrane fusion process, initiating the entry of viral genetic material into the cell. Fusion inhibitors intervene in the formation of endogenous 6-HB, thereby blocking membrane fusion and exerting antiviral effects.5 Conventionally, peptide-based fusion inhibitors, derived from the constituent 6-HB sequences, are generally categorized into N-peptides targeting the CHR region and C-peptides targeting the NHR region.6 Enfuvirtide, a classic C-peptide inhibitor on the market for years, has encountered clinical resistance and various issues, including a short half-life in vivo and high medication costs.7,8 Therefore, there is an urgent need for more advanced peptide-based fusion inhibitors to effectively solve the abovementioned clinical problems.
N-peptides are emerging peptide-based fusion inhibitors and are expected to overcome the limitations of Enfuvirtide. It is presently assumed that N-peptides need to be spontaneously assembled into an N-trimer-like structure under solution or physiological conditions to exert anti-HIV activity, and Lai and Li et al. have already provided a workaround to construct the N-peptide active conformation through stapled amide bonds (also known as isopeptide bonds) among the helices.9,10 In this paper, we have selected (IZ10N24N)3, a prominent N-peptide, as the lead candidate for further development. Specifically, we innovatively harnessed SWISS-MODEL for homology modeling of N-peptides and HDOCK for docking simulations, providing vital guidance for optimizing the sequence design of our lead compound. Armed with the abovementioned Computer-Aided Drug Design (CADD)-aided rational design approach, we embarked on the synthesis, evaluation of inhibitory efficacy, and elucidation of the mechanistic underpinnings of N-peptides. Ultimately, we obtained IZNP02QE, which showed higher anti-HIV-1 activity than the positive control T20 and demonstrated the anti-HIV-1 mechanism by interrupting natural 6-HB formation. Our findings not only contribute a novel N-peptide inhibitor with a distinct structure and enhanced activity but also pave the way for CADD-aided design of peptide-based fusion inhibitors, addressing a previous gap in this field.
![]() | ||
Fig. 1 The N-peptide design schema. The hydrophobic pocket domain is shown in purple. The isopeptide bond formation site is shown in red. |
The realm of CADD has long been instrumental in advancing drug research and development, yet a notable gap exists in the availability of comprehensive, peptide-centric tools tailored for peptide-based fusion inhibitor design. With the increasing maturity of protein/peptide structure modeling tools and docking tools such as HDOCK and ZDOCK for studying the binding between protein/peptide(receptor) and protein/peptide(ligand),13,14 we believe that CADD is more feasible for the development of peptide-based fusion inhibitors. In this context, our approach centers around leveraging SWISS-MODEL for homology modeling of the lead compound (IZ10N24N)3, designated IZNP0, and subsequently harnessing HDOCK for precise docking simulations to assess the potential of N-peptides to bind the CHR target region. This integrated CADD strategy guides the design of novel sequences, using SWISS-MODEL for structural refinement and HDOCK for docking verification, ensuring the rationality and potential efficacy. The new sequences were designed under the above CADD guidance and the design rationality of the sequences was verified cyclically with SWISS-MODEL and HDOCK. The N-peptide design idea is as follows (Fig. 1):
(1) The SWISS-MODEL tool was used to construct a spatial model of IZNP0, and the HDOCK server was used to dock IZNP0 with the target. The results showed that the IZNP0 model was able to create a hydrophobic pocket similar to the natural N36 peptide, and the N-peptide bound to the CHR region in an N-trimer-like structure; macroscopically, the N-peptide hydrophobic pocket matched with the CHR hydrophobic residues, and microscopically, the e,g site amino acid side chains of the N-peptide sequence could interact with the a,d,e site amino acid side chains of the CHR region. Therefore, the b,c,f sites of the N-peptide sequence as non-target binding regions can be further modified. We mutated the sequences at the b,f sites into Glu acid and Lys residues for the formation of an E–K salt bridge at the (i,i + 4) position to enhance N-peptide helicity,15 at the c site into Lys residues to improve the N-peptide solubility, and at the a,d sites into Ile residues that facilitate N-trimer assembly, resulting in the novel N-peptides IZNP01EK, IZNP02EK, and IZNP03EK and the completion of SWISS-MODEL modeling and HDOCK docking.
(2) We mutated the sequences at the c site into Gln residues to improve the N-peptide solubility and exclude the positive charge effect of lysine residues on N-peptides, and at the a,d sites and b,f sites as described above, resulting in the novel N-peptides IZNP01EQ, IZNP02EQ, and IZNP03EQ and the completion of SWISS-MODEL modeling and HDOCK docking.
(3) Based on the sequence design, SWISS-MODEL modeling, and HDOCK docking of the above two groups, the third round of N-peptide design aims to explore the sequence modifiable regularity: ① expanding the mutation region to obtain IZNP04EQ; ② shortening the N-peptide functional region to retain only the hydrophobic pocket sequence to obtain IZNP02EQ-7; ③ the effect of double-pocket domains on N-peptide anti-HIV-1 activity was investigated by introducing both hydrophobic pocket and hydrophobic subpocket sequences to obtain IZNP02EQ + 7; ④ the Ei–Ki+4 salt bridge was replaced with Ei–Ki+3 to investigate the effect of different positions of the salt bridge on N-peptide anti-HIV-1 activity to obtain IZNP02QE. IZNP04EQ, IZNP02EQ-7, IZNP02EQ + 7, IZNP02QE all underwent SWISS-MODEL modeling and HDOCK docking (Table 1).
N-peptides | SWISS-MODEL | HDOCK (C34 as target) | ||
---|---|---|---|---|
GMQEa | QMEANDisCob | Docking scorec | Confidence scored | |
a GMQE is a template-based quality assessment method that mainly considers the relationship between the target protein and the template protein. Its value is between 0 and 1, with values closer to 1 indicating better modelling quality. b QMEANDisCo is a composite scoring function based on global (i.e., whole structure) and local (i.e., per residue) absolute quality estimates for a single model. Its value is between 0 and 1, with values closer to 1 indicating better modelling quality. c Docking Score: the docking scores are calculated using our knowledge-based iterative scoring functions, ITScorePP or ITScorePR. A more negative docking score means a more possible binding model. d Confidence Score: when the confidence score is above 0.7, the two molecules are very likely to bind; when the confidence score is between 0.5 and 0.7, the two molecules are possibly able to bind; when the confidence score is below 0.5, the two molecules are unlikely to bind. Confidence_score = 1.0/[1.0 + e0.02×(Docking_Score+150)]. | ||||
N36 | — | — | −371.13 (Max) | 0.9881 (Max) |
IZNP0 | 0.71 | 0.71 ± 0.09 | −203.80 | 0.7457 |
IZNP01EK | 0.77 | 0.84 ± 0.09 | −166.21 | 0.5803 |
IZNP02EK | 0.78 | 0.83 ± 0.08 | −217.41 | 0.7938 |
IZNP03EK | 0.72 | 0.80 ± 0.09 | −202.66 | 0.7414 |
IZNP01EQ | 0.79 | 0.84 ± 0.09 | −204.38 | 0.7479 |
IZNP02EQ | 0.69 | 0.69 ± 0.09 | −213.50 | 0.7807 |
IZNP03EQ | 0.73 | 0.81 ± 0.09 | −207.81 | 0.7606 |
IZNP04QE | 0.72 | 0.86 ± 0.09 | × | × |
IZNP02EQ-7 | × | × | × | × |
IZNP02EQ + 7 | 0.77 | 0.77 ± 0.08 | −196.91 | 0.7187 |
IZNP02QE | 0.78 | 0.84 ± 0.08 | −241.04 | 0.8607 |
The HDOCK server is a protein/peptide(receptor)–protein/peptide(ligand) docking platform based on a hybrid algorithm of template-based modeling and ab initio free docking (https://hdock.phys.hust.edu.cn/). Within this platform, the Docking Score and Confidence Score are pivotal metrics for evaluating the efficacy of molecular docking. The interaction between N-peptides and the CHR target is essentially a protein–protein interaction, so we used the HDOCK server to assess the ability and details. The output parameters of HDOCK are Docking Score and Confidence Score. The values of −371.13 and 0.988 for natural N36(ligand) and natural C34(receptor) are defined as Docking ScoreMax and Confidence ScoreMax, respectively. The closer the docking score between artificial N-peptide and receptor C34 is to Docking ScoreMax and Confidence ScoreMax, the stronger the binding. As a result, the IZNP0/C34 Docking Score was −203.80, with a Confidence Score higher than 0.7, and IZNP0 could build hydrophobic grooves in space for filling in C34 and its “WWI modif” (Fig. 2B), which indicated that IZNP0 is able to bio-engage with C34 as a reference to the HDOCK parameter standard.17 The SWISS-MODEL model as well as the HDOCK docking results implied that the lead IZNP0 is suitable for further development via CADD.
Next, in the N-peptide designed sequences, all IZNP01EK, IZNP02EK, IZNP03EK, IZNP01EQ, IZNP02EQ and IZNP03EQ could be modeled depending on SWISS-MODEL, and the output scores were higher than or equal to those of IZNP0 (Table 2), which theorizes that regular site-specific mutations in N-peptides did not affect the N-trimer-like structure. In the IZNPnEK series (n = 01, 02, 03), the HDOCK results showed that IZNP01EK exhibited an undesirably low confidence score below 0.7, while the hydrophobic pocket regions of IZNP02EK and IZNP03EK did not match the C34 “WWI modif” (Fig. S11†); presumably all had weak target binding ability. In the IZNPnEQ series (n = 01, 02, 03), the HDOCK output scores are closer to those of IZNP0 (Table 2), with the tangible hydrophobic pockets and feasible C34 matching (Fig. S12†), which may indicate relatively strong target binding ability.
Furthermore, we selected IZNP02EQ (with the lowest score) to investigate N-peptide sequence modifiable regularity and obtained IZNP04EQ, IZNP02EQ-7, IZNP02EQ + 7, and IZNP02QE by sequence mutation. Unfortunately, IZNP02EQ-7 could not be modelled because the sequence was too short without a high similarity template structure, and the docking algorithm result between IZNP04EQ and the receptor was erroneous, presumably due to peptide–target mismatch. Meanwhile, IZNP02EQ + 7 and IZNP02QE could be successfully analysed by SWISS-MODEL and HDOCK, with IZNP02QE showing predicted binding efficacy (Table 2).
Based on the above CADD results, IZNP02EQ-7 was not synthesised due to the inability to assemble a trimer model. The IZNP04EQ intermediate cannot undergo the isopeptide bond reaction, so it was not ultimately obtained. In summary, we ended up synthesising 9 novel N-peptides, which are IZNP0, IZNP01EK, IZNP02EK, IZNP03EK, IZNP01EQ, IZNP02EQ, IZNP03EQ, IZNP02EQ + 7, and IZNP02QE.
![]() | ||
Fig. 3 Inhibitory activity of peptides on HIV-1 Env-mediated cell–cell fusion. (A) Inhibition by N-peptides. (B) Inhibition by T20 and IZNP02QE. (C) Inhibition by (3HRN23)3 and IZNP02QE. |
Under the N-PAGE conditions, the advanced structure of peptides or peptide complexes can be maintained, therefore, with great significance to explore the interaction between the N-peptide and its target.20 The C34, due to its small molecular weight and negative charge, exhibited a single band towards the lower edge (lane 1), while the N36 and IZNP02QE are positively charged and consequently did not show any bands (lane 2 and lane 4). The C34/N36 showed a 6-HB complex band (lane 3). Following the incubation of the IZNP02QE and C34 mixed solution, the C34 band became lighter (lane 5). In lane 6, lane 7, and lane 8, as the increasing concentration of IZNP02QE, the 6-HB complex band got progressively darker (Fig. 5A), indicating that IZNP02QE is known as binding to the 6-HB target and forms “unidentified mixtures”. The SE-HPLC presented experimental findings consistent with N-PAGE (Fig. 5B), and the retention time (RT) of peptides or peptide complexes is only volume-dependent.21 Neither N36 nor IZNP02QE had peaks because of aggregation and deposition in the SE-HPLC. The RTC34 is about 9.5 minutes point, while the RTC34/N36 is about 7.6 minutes point as a larger 6-HB complex. The peaks of IZNP02QE/C34/N36 grew as the concentration multiplicity of IZNP02QE increase, which means that the so-called “unidentified mixtures” indeed formed.
![]() | ||
Fig. 5 (A) N-PAGE analysis of IZNP02QE with the target peptide. (B) SE-HPLC analysis of IZNP02QE with the target peptide (IZNP02QE/C34/N36 represents the IZNP02QE, C34 and N36 mixed solution). |
Furthermore, we thus applied sedimentation velocity analysis (SVA) to analyze the IZNP02QE/C34/N36 state in PBS solution. In Fig. 6, C34/N36 had a M.W. of 24.1 kDa, whereas IZNP02QE/C34/N36 had M.Ws. of 24.2 kDa and 39.1 kDa. 24.2 kDa is in proximity to M.W.C34/N36, and 39.1 kDa signifies that IZNP02QE/C34/N36 assembles into multimeric complexes (Fig. 6A). We speculate that IZNP02QE can interact with C34/N36 and combine into a multimeric form, thereby preventing the formation of endogenous 6-HB (Fig. 6B).18
![]() | ||
Fig. 6 (A) SVA results of IZNP02QE with the target peptide. (B) The plausible interaction mechanisms potentially correlated with N-peptides. |
The preceding results conclusively demonstrate that IZNP02QE can definitely interact with the HIV-1 gp41 NHR region, fostering the formation of stable helical multibundles. This process hinders the endogenous assembly of 6-HB complexes, effectively inhibiting HIV-1 replication.
The HDOCK server is a tool for assessing protein/peptide(receptor)–protein/peptide(ligand) interactions, providing docking scores and confidence scores that are capable of assessing the likelihood and the strength between ligand–receptor binding. We docked the N-peptides obtained from SWISS-MODEL modeling as the ligands and the HIV-1 gp41 subunit CHR region as the receptor using the HDOCK server, and screened for the ligand–receptor optimal binding conformations. Based on the docking results between the lead peptide IZNP0 and the target, we boldly modified the N-peptide sequences, verified the possibility of binding to the target using the HDOCK server, and succeeded in obtaining IZNP02QE with optimal anti-HIV-1 activity. It is worth pointing out that the hydrophobic pocket of IZNP02QE did not match the C34 “WWI” modification (Fig. S13†), and perhaps it exhibits potential activity by a novel binding mode to the target, which needs to be further explored in the future!
To date, a standardized CADD blueprint tailored for HIV-1 peptide-based fusion inhibitors remains elusive. We combined SWISS-MODEL with HDOCK in tandem for the development of anti-HIV peptides and proved their feasibility, the first of its kind! In fact, there are many other methods for peptide homology modeling such as AlphaFold, Modeller, YASARA, I-TASSER, etc., and many docking tools for protein/peptide(receptor)–protein/peptide(ligand) docking, such as GalaxyPepDock, MDockPeP, HPEPDOCK, CABS-dock, pepATTRACT, and AutoDock CrankPep (ADCP).22 How to choose the applicable CADD method according to the research needs is a question worth pondering for researchers on their way to the novel peptide-based fusion inhibitors. Still, SWISS-MODEL and HDOCK are currently available to researchers for free, with fast data analysis and highly feasible results, which is the reason why we finally chose them after screening multiple homology modeling and molecular docking tools.
Admittedly, N peptides are still very lagging behind relative to C peptides, with no marketed agents, but there has been a gradual potential to move closer to candidates. The current R&D strategies for N-peptides mainly include site-mutagenesis strategies, self-assembly strategies with chimeric tool peptides, strategies for constructing covalent bonds among coiled-coil helixes (disulfide bonds or isopeptide bonds), and small molecule backbone stapling strategies.6,23–25 Therefore, we need to explore more approaches for facilitating N-peptide R&D and discover more lead compounds with novel structures for N-peptide growth.
The peptide side-chain thioesterified modification and the isopeptide bond cross-linking reaction are performed as follows:10 ①A catalyzer is added to the resin for the O-allyl group removal, which consists of tetrakis (triphenylphosphine) palladium, 5,5-dimethyl-1,3-cyclohexanedione in a DMF/THF mixed solution (v:
v = 1
:
1). In this reaction, the nitrogen stream was used to avoid contact with air, and the reaction vessel was wrapped with tin foil to protect it from light. After 4–6 h of reaction, the resin was washed with 0.5% sodium diethyldithiocarbamate trihydrate/DMF solution. Then, 1 eq. of 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride, 1.5 eq. of HOBT, and 4 eq. of benzyl sulfide were added to complete the side-chain thioesterification. Subsequently, the crude peptides were cleaved from the Rink-Amide resin using the above lysis solution. All thioesterified peptides were purified to >90% purity. ②Briefly, the thioesterified peptide was dissolved in PBS (50 mM, pH 7.4)/H2O/CH3CN (v
:
v
:
v = 5
:
4
:
1) and incubated for 24–72 h at 37 °C. The reaction was completed, acetonitrile was evaporated and the product was purified using RP-HPLC.
The molecular weight (M.W.) of the pure peptides was confirmed by Fourier transform ion cyclotron resonance mass spectrometry (FT-ICR-MS, Bruker Daltonics, Germany) (Fig. S15–S24†).
The graphical abstract was created using Figdraw (https://www.figdraw.com).
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4ob01620c |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2025 |