Shaojie Maad,
Shepei Tana,
Danqing Fang*b,
Rong Zhanga,
Shengfu Zhoua,
Wenjuan Wu*a and
Kangcheng Zhengc
aDepartment of Physical Chemistry, College of Pharmacy, Guangdong Pharmaceutical University, Guangzhou 510006, PR China. E-mail: wuwenjuan83@126.com; Tel: +86 2039352119
bDepartment of Cardiothoracic Surgery, Affiliated Second Hospital of Guangzhou Medical University, Guangzhou 510260, PR China
cSchool of Chemistry and Chemical Engineering, SunYat-Sen University, Guangzhou 510275, PR China
dDepartment of Pharmacy, Kangda College of Nanjing Medical University, Lianyungang 222000, PR China
First published on 9th September 2015
Nuclear factor-κB (NF-κB) and activator protein-1 (AP-1) are promising targets for a number of immunoinflammatory diseases, including asthma, psoriasis, rheumatoid arthritis, and transplant rejection. In this study, based on a dataset consisting of 127 pyrimidine/quinazoline-based derivatives as dual NF-κB/AP-1 inhibitors, an integrated computational protocol, including the three-dimensional quantitative structure–activity relationship (3D-QSAR), molecular docking and molecular dynamics (MD) simulations, was performed to explore the influence of the structural features on the NF-κB and AP-1 inhibitory activities and design derivatives with improved potency. The obtained CoMFA (comparative molecular field analysis) model exhibited satisfactory internal and external predictability. The most probable binding sites of the two receptors have been identified by docking and MD simulations, showing that they just locate on the joint regions between NF-κB (or AP-1) and DNA, wherein inhibitors can effectively prevent free NF-κB (or AP-1) from binding to DNA. At the same time, the key residues/deoxynucleotides for achieving strong binding were also revealed by docking studies, and the detailed dynamic binding process and binding modes of the inhibitors with different activities were determined by MD simulations. The binding free energies are in good agreement with the experimental bioactivities. The decomposition of binding free energies by MM-GBSA suggests that the hydrophobic interactions play an important role for the binding of compounds to NF-κB and AP-1. The results presented here can provide significant insight into the development of novel potential dual NF-κB/AP-1 inhibitors.
As a result, a variety of NF-κB or AP-1 inhibitors with different scaffolds have been developed;9,10 however, very few compounds are known to inhibit both NF-κB and AP-1. Recently, Palanki and co-workers synthesized a series of pyrimidine/quinazoline-based derivatives and assessed their activities.11–14 They discovered that these compounds potently inhibited NF-κB and AP-1 proteins at low nanomolar concentrations, demonstrating the great potential of developing pyrimidine/quinazoline-based derivatives as a novel class of dual NF-κB/AP-1 inhibitors for inflammatory disease therapy. Moreover, there is now abundant evidence that dual or multi-targeted inhibitors, usually inhibiting different cell pathways or compensatory mechanisms, might be more effective than selective inhibitors, so the search for dual NF-κB/AP-1 inhibitors is very attractive.15 Although some progress has been made in experimental research studies, to date theoretical studies on the structural factors of these compounds, which affect their anti-inflammatory activities as well as the inhibitory mechanisms of these compounds toward NF-κB and AP-1 kinases, are limited.
In recent years, 3D-QSAR, molecular docking and molecular dynamics (MD) simulations have been widely and successfully applied to guide the design of lead compounds.16,17 3D-QSAR models can help to determine the key structural features affecting the activities and understand the nonbonding interaction characteristics between the drug molecule and target, because they are vivid and robust.18–20 Moreover, molecular docking is an approach to predict the possible orientations of a ligand in the active site of a receptor and to study protein–ligand interactions.21–23 In addition, MD simulation is a useful methodology for providing vivid pictures to depict the fluctuations and conformational changes of molecules and allowing further investigation of the interaction mechanisms of a protein complex with a ligand at the atomic level.24 Therefore, a combined 3D-QSAR, molecular docking and MD simulation study can offer deep insight into understanding the structural features of ligand–receptor interactions.
In this study, a novel series of pyrimidine/quinazoline-based derivatives acting as dual NF-κB/AP-1 inhibitors were selected to perform an integrated computational protocol using molecular docking, MD simulations and 3D-QSAR methods. The purpose was to develop a rational predictive model and to investigate the interaction details between these compounds and NF-κB (and AP-1). The optimum 3D-QSAR CoMFA models were established, and the key structural features contributing to the inhibitory activities were also identified. Then, the orientations and the probable binding modes of these compounds interacting with both NF-κB and AP-1 were located by docking and MD simulations. We hope that the obtained results can guide rational design of novel and more efficacious dual NF-κB/AP-1 inhibitors and offer some reference for experiment study.
IC50) values and used as dependent variables in the 3D-QSAR study.
![]() | ||
| Fig. 1 General structural formula and numbering of pyrimidine (a, compounds 1–104) and quinzoline derivatives (c, compounds 105–127), and template molecules (b, compound 50 and d, compound 109). | ||
The 3D-structures of pyrimidine/quinazoline-based derivatives were constructed by the sketch molecule module in the Sybyl 6.9 software. Structural energy minimization was carried out using the Powell gradient algorithm and the Tripos force field with a convergence criterion of 0.001 kcal mol−1 Å−1 and a maximum of 1000 iterations MMFF94 charges were assigned to each inhibitor.26 The minimized structure was used as the initial conformation for molecular docking.
The X-ray crystal structures of NF-κB (1NFK) and AP-1 (2H7H) were obtained from the Protein Data Bank and used to dock. At the beginning of docking, all the water and subunits were removed, hydrogen atoms and AMBERFF99 charges were added to the protein, and only hydrogen positions were energy-minimized in 10
000 cycles with the Powell method in SYBYL 6.9. Next, the surface of the protein was calculated with the DMS program. To obtain possible binding sites, some spheres are generated and selected by the Sphgen module of the DOCK 6.0 program. At last, all compounds were flexibly docked into the binding sites wherein the protein is considered as rigid. The box size, the grid space, energy cutoff distance, and max orientation were set as 8 Å, 0.3 Å, 12 Å and 10
000, respectively.28
The 3D-QSAR equations were generated using the partial least square (PLS) statistical method. The PLS algorithm with the leave-one-out (LOO) cross-validation method was exploited to yield the highest cross-validation correlation coefficient (q2) and the optimum number of components N. The non-cross-validation methods were appraised by the conventional correlation coefficient R2, standard error of estimates (SEE), and F value. To further assess the robustness and statistical validity of the derived models, bootstrapping analysis for 100 runs was also performed.31,32
To assess the predictive abilities of the 3D-QSAR models generated from the training set, the biological activities of 39 compounds in the external test set were predicted. The predictive power of the models is judged based on the predictive correlation coefficient (Rpred2) calculated by the following equation: Rpred2 = (SD − PRESS)/SD, where SD is the sum of the squared derivations between the actual activities of the test set compounds and the mean activity of the training set compounds, and PRESS is the sum of the squared derivations between the actual and predicted activities of the test set compounds.33
Subsequently, two-stage energy minimizations were performed to avoid possible steric stress. First, each complex was fixed with a restraint constant of 2.0 kcal mol−1 Å−1 and the water molecules and sodium ions were minimized with the steepest descent (SD) method for 2000 steps followed by the conjugated gradient (CG) method for 3000 steps. Second, the whole relaxed system was optimized by 5000 steps via steepest descent minimization and 5000 steps via conjugated gradient minimization. Then, each complex was gradually heated from 0 to 300 K in 200 ps with a weak constraint of 1.0 kmol mol−1 Å−2 at a constant volume and equilibrated for 500 ps at 300 K and 1 atm. Finally, a 10 ns production MD simulation was performed in a NPT (constant composition, T = 300 K and P = 1.0 atm) ensemble. During these steps, the particle mesh Ewald (PME) method was used to treat the long-range electrostatic interactions with a non-bonded cutoff of 8.0 Å, and the SHAKE algorithm was turned on to constrain all covalent bonds involving hydrogen atoms with 2 fs time step.38,39 Coordinated trajectories were recorded every 1 ps and the stability of the complexes was checked from the root mean square deviation (RMSD).
| ΔGbind = Gcomplex − (Gprotein + Gligand) = ΔGMM + ΔGsol − TΔS | (1) |
The molecular mechanics gas-phase free energy (ΔGMM) contains the van der Waals energy (ΔGvdw) and electrostatic (ΔGele) energy:
| ΔGMM = ΔGvdw + ΔGele | (2) |
The solvation free energy (ΔGsol) is the sum of the electrostatic solvation, including the polar solvation free energy (ΔGele,sol) and the nonpolar solvation free energy (ΔGnonpol,sol):
| ΔGsol = ΔGele,sol + ΔGnonpol,sol | (3) |
| ΔGnonpol,sol = γ × SASA + β | (4) |
For discerning the difference of the binding modes of these complexes, the binding free energies were decomposed to each residue using the MM-GBSA method. Each inhibitor–residue pair includes four energy terms: a van der Waals contribution (ΔGvdw), an electrostatic contribution (ΔGele), a polar solvation contribution (ΔGele,sol) and a nonpolar solvation contribution (ΔGnonpol,sol), which can be summarized in the following equation:
| ΔGinhibitor–residue = ΔGvdw + ΔGele + ΔGele,sol + ΔGnonpol,sol | (5) |
![]() | ||
| Fig. 2 Possible binding sites of the pyrimidine/quinazoline-based derivatives on NF-κB (a) and AP-1 (b) binding to DNA, in which the green represents NF-κB and AP-1 protein and yellow indicates DNA. | ||
To find the optimal binding sites of the two receptors, docking studies were also carried out on the dataset. All studied compounds were docked into the possible binding sites of NF-κB and AP-1, and the energy scores of these different binding sites are listed in ESI Table S2,† wherein no precise correlations could be found between docking scores and pIC50 values. This is not surprising, because the experimental pIC50 values are very complicated, depending on not only the binding energy, but also many other factors. The average values of the energy scores are −61.7, −37.1, −45.4, −59.2 and −14.4, −14.6, −21.7 kJ mol−1 from site 1 to 4 for NF-κB and site 5 to 7 for AP-1, respectively. Obviously, the binding sites 1 and 7 have lower average values of the energy scores than sites 2–4 and 5–6, respectively. Therefore, site 1 and site 7 may be the most likely binding sites for NF-κB and AP-1, respectively.
To further ensure the rationality of binding sites of the two receptors, we selected the abovementioned seven binding modes of NF-κB and AP-1 with the highest active compound 109 as initial conformations for following 8.0 ns MD simulations.
First, to ascertain whether the MD simulations were stable and converged, the RMSD values of protein backbone atoms with respect to the seven starting systems are analyzed and displayed in Fig. 3(a) and (b). The plots show that seven systems reach equilibrium after 0.5 ns. The mean RMSD values were 1.7 Å, 2.2 Å, 1.3 Å, 1.8 Å, 2.1 Å, 1.6 Å and 1.4 Å, and the relative RMSD fluctuations were very small. These results suggest that the conformations of seven systems were relatively stable throughout the MD simulations. Then, the binding free-energy calculations for the 200 snapshots of the last 1.0 ns MD simulations were also carried out using the MM-PBSA method and the corresponding results are −36.68, −16.60, −21.20 and −36.04 kcal mol−1 from site 1 to 4 for NF-κB, and −29.98, −25.40, and −38.11 kcal mol−1 for site 5 to 7 of AP-1, respectively. Therefore, it is easy to observe that the trend of energies for the MD simulations is almost the same as that for the docking, and site 1 and site 7, having the lowest binding free energies, are the most favorable binding sites for NF-κB and AP-1, respectively. Thus, sites 1 and 7 were selected for further analysis as stated below.
![]() | ||
| Fig. 3 RMSD of backbone atoms of the studied complexes during MD simulations. (a) Complexes 109-1NKF in sites 1–4 of NF-κB, (b) complexes 109-2H7H in sites 5–7 of AP-1, (c) complexes 109-1NFK and 50-1NFK and (d) complexes 109-2H7H and 50-2H7H. | ||
Compared with the abovementioned docking results for NF-κB and AP-1, we find that key residues in site 1 (NF-κB) or site 7 (AP-1) (such as Ala243/13, Ser208/16, His144/Arg17, Lys144/20, and Leu206/21) are almost the same, demonstrating that the binding conformation and interaction with NF-κB of the target compound 109 are similar to those with AP-1. Therefore, there is no doubt that these compounds act as dual NF-κB/AP-1 inhibitors.
| Statistical parameters | CoMFA |
|---|---|
| a Note: N is the optimal number of components, q2 is the square of leave-one-out (LOO) cross-validation coefficient, R2 is the square of non-cross-validation coefficient, SEE is the standard error of estimation, F is the F-test value, Rbs2 is the mean R2 of bootstrapping analysis (100 runs) and SDbs is the mean standard deviation by bootstrapping analysis. | |
| R2 | 0.92 |
| N | 5 |
| q2 | 0.72 |
| SEE | 0.28 |
| F | 160.2 |
| Rbs2 | 0.95 |
| SDbs | 0.01 |
| Rpred2 | 0.71 |
Furthermore, the test set of 39 inhibitors was used to verify the efficacy of the CoMFA model, wherein a predictive coefficient Rpred2 of 0.71 was achieved, further demonstrating that our CoMFA model has a satisfactory predictive ability. The predicted pIC50 values and the residual values of compounds for the CoMFA model are listed in SEI Table S1.† The plot of the predicted pIC50 values by CoMFA versus the actual ones is depicted in Fig. 5, wherein most points are evenly distributed along the line Y = X, implying that the 3D-QSAR CoMFA model is of good quality.
![]() | ||
| Fig. 5 Plots of the predicted versus actual values using the training set (triangle) and test set (dot) based on the CoMFA models. | ||
![]() | ||
| Fig. 6 CoMFA contour maps of the highly active compound 109, in which the purple and black represent NF-κB and AP-1 protein, respectively. (a) Steric contour map and (b) electrostatic contour map. | ||
The electrostatic contour map of CoMFA is displayed in Fig. 6(b). A big red contour and a small blue contour near the 1′-position and 3′-position of ring-C suggests that introducing a high electronegative group to the 1′-position or electropositive group to 3′-position of ring-C may improve the activity. This may be attributed to the electrostatic interactions between the electronegative S atom of ring-C and NH3+ group of Lys144 in NF-κB (or Arg17 in AP-1).
Most of the excellent derivatives (105, 108–116) possess an electron-withdrawing S atom at the 1′-position, moreover, those with high electronegative N or O atoms at the 3′-position of ring-C (75–77, 50, 59, 27 and 10) are mostly inactive compounds. Comparing derivative 1 with 2, as well as 26 with 27, their activity discrepancies can be explained by this red or blue contour. For substituent R2, two blue contours embedded in the 2′′-, 5′′-positions of ring D and near the terminal H atoms of –CH3 at 3′′-substituent of ring D and C7-, C8-positions of ring B imply that electropositive groups at these positions can improve the activity. Therefore, compounds 67, 69, 70 have an order for the activity as 69 > 67 > 70, with the corresponding 3′′-substituent of ring D –CH3, –H and –Cl, respectively. Comparing compound 67 with 72 as well as 101 with 98, their activity discrepancies can be explained by these blue contours. Moreover, it can be easily observed that four red contours are near the N12, O13 and O14 atoms, which mean some electronegative groups at these positions are favorable. It indicates that electron-withdrawing groups or atoms linking to ring D would increase the activity. Compounds 50, 59, and 75–77, bearing the electropositive C or H atom at 12-position, with –CF3 or H as substituent R2 are the most inactive compounds. In addition, there is a bigger blue contour in the vicinity of C9- and C10-substituents of ring-B, suggesting the importance of electropositive atoms or groups on this region. Most of the excellent derivatives (108–113, 119 and 120) possess –OCH3 linking to ring-B, in which –CH3 with electropositivity just falls into the blue areas. Similarly, the fact that compounds 108 and 109 are more active than compounds 114 and 115 can also be interpreted. Compounds 122, 123 and 124 have an order for the activity as 122 > 123 > 124, with the corresponding C8-substituent -SMe, -SOMe, -SO2Me, respectively.
From the abovementioned results, we can deduce that the medium-sized substituent R1 with electronegative group on 1′-position is favorable for the activity. In Fig. 4(a) and (c), it can be easily observed that the terminal of R1 is close to the surface of protein and the negative charge can be replenished with the positive charge around S atom. As for R2, it is also shown that bulky and electronegative groups linking to 2′- and 5′-positions of ring D would increase the activity. In comparison with Fig. 4(a) and (c), there are large binding pockets accommodating R2 in two receptors. Thus, we can conclude that the structural features of binding sites for two enzymes are consistent with the results of 3D-QSAR. We can further validate that sites 1 and 7 may be the optimum binding sites for NF-kB and AP-1, respectively.
![]() | ||
| Fig. 7 RMSFs of each residue of the protein for the four systems. (a) Complexes 109-1NFK and 50-1NFK and (b) complexes 109-2H7H and 50-2H7H. | ||
The hydrogen bond interaction is also one of the important forces stabilizing the binding of a ligand to a receptor. For the four systems, the hydrogen bond interactions from MD simulations are listed in Table 2. The hydrogen bond was characterized by distance (<3.5 Å) and orientation (the angle D–H⋯A > 120°). From Table 2, we can see that in NF-κB systems, compound 109 (or 50) can form one strong H-bond with Asp206 (or Tyr57) and another relatively weak H-bond with Lys144 (or Asp206), because of the different structural scaffolds of the two ligands. In the 109-2H7H system, deoxynucleotide DC210 solidly formed a hydrogen bond with compound 109, with one occasionally formed hydrogen bond observed between DG208 and compound 109, instead of the long distance H-bond (3.4 Å) between N1 and Lys20 in the docking study.
| System | Donor | Acceptor | Occupancy (%) | Distance (Å) | Angle (°) |
|---|---|---|---|---|---|
| 109-1NFK | Asp206@OD2 | Ligand@N12-H | 87.45 | 2.878 | 34.18 |
| Ligand@O13 | Lys144@NZ-HZ2 | 11.20 | 2.933 | 26.38 | |
| 50-1NFK | Ligand@O9 | Tyr57@OH-HH | 97.12 | 3.043 | 28.86 |
| Asp206@OD2 | Ligand@N7-H | 21.4 | 3.009 | 28.56 | |
| 109-2H7H | Ligand@O14 | DC210@N4-H42 | 67.80 | 3.092 | 39.00 |
| DG208@O2P | Ligand @N12-H | 7.00 | 3.040 | 19.61 |
To gain further insight into the detailed ligand–receptor interactions, the binding free energy was decomposed to ligand–residue pairs using the MM-GBSA approach. From Fig. 8(a), it can be seen that almost all residues energetically contribute more for the binding of compound 109 than that of compound 50, especially residues Asp206, Leu207, Ala242 and Pro243. As the latter three residues are nonpolar, it is reasonable to propose that there are strong van der Waals interactions between inhibitors and these residues. Moreover, the hydrogen bond interactions between Asp206 and inhibitors, which had been validated by docking studies also played key roles in the binding. Moreover, we can also find that the amino acids exhibited more favorable interaction contributions to the binding process of two NF-κB complexes than those of deoxynucleotides. For AP-1 [Fig. 8(b)], it is shown that the residues Ala13, Ser16, Arg17, DG208 and DC210 are mainly responsible for the energy difference between 109 and 50. Obviously, the amino acids and deoxynucleotides have all important influences on the binding free energies for the two compounds.
![]() | ||
| Fig. 8 Free energy decomposition plots for the four systems. (a) Complexes 109-1NFK and 50-1NFK and (b) complexes 109-2H7H and 50-2H7H. | ||
The abovementioned analyses showed that hydrophobic interactions play a very important role for stabilizing these dual NF-κB/AP-1 inhibitors to the receptors.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c5ra10831d |
| This journal is © The Royal Society of Chemistry 2015 |