Open Access Article
Bharat Kumar Reddy Sanapalli‡
*a,
Vidyasrilekha Yele‡
b,
Srikanth Jupudi
*b and
Veera Venkata Satyanarayana Reddy Karri
a
aDepartment of Pharmaceutics, JSS College of Pharmacy, JSS Academy of Higher Education & Research, Ooty, Tamil Nadu-643001, India. E-mail: bharathsanapalli@yahoo.in
bDepartment of Pharmaceutical Chemistry, JSS College of Pharmacy, JSS Academy of Higher Education & Research, Ooty, Tamil Nadu-643001, India
First published on 6th August 2021
MMP-9 is a calcium-dependent zinc endopeptidase that plays a crucial role in various diseases and is a ubiquitous target for many classes of drugs. The availability of MMP-9 crystal structure in combination with aryl sulfonamide anthranilate hydroxamate inhibitor facilitates to accentuate the computer-aided screening of MMP-9 inhibitors with the presumed binding mode. In the current study, ligand-based pharmacophore modeling and 3D-QSAR analysis were performed using 67 reported MMP-9 inhibitors possessing pIC50 in the range of 5.221 to 9.000. The established five-point hypothesis model DDHRR_1 was statistically validated using various parameters R2 (0.9076), Q2 (0.8170), and F value (83.5) at a partial least square of four. Hypothesis validation and enrichment analysis were performed for the generated hypothesis. Further, Y-scrambling and Xternal validation using mean-absolute error-based criteria were performed to evaluate the reliability of the model. Docking in the XP mode and binding free energy was calculated for 67 selected ligands to explore the key binding interactions and binding affinity against the MMP-9 enzyme. Additionally, high-throughput virtual screening was carried out for 2.3 million chemical molecules to explore the potential virtual hits, and their predicted activity was calculated. Thus, the results obtained aid in developing novel MMP-9 inhibitors with significant activity and binding affinity.
The crystal structure of MMP-9 consists of a catalytic domain with three α-helices and five stranded β-sheets. The catalytic pocket contains an active site with two zinc and five calcium ions. The zinc ions in the domain are coordinated by three histidine residues (His401, His405, His411) and glutamic acid residue (Glu402).17 All members of the MMP family differ in their S1 selectivity pocket and chain length. The selectivity loop of MMP-9 has no regular secondary structure and is often mobile, undergoing conformational modifications on ligand binding. Therefore, the sequence of the S1 loop, structural variability, and the cocrystal ligand nature contribute to the observed overall shape and size of the selectivity loop pockets in different MMPs.18 Unfortunately, the electron density at the side chain of residue Arg424 is very poor and oriented into the S1 loop, forming a smaller enclosed catalytic pocket in the MMP-9 crystal structure.19 The asymmetric crystal structure (PDB: 5I12) contains a single chain A, with sequence length 157. It contains a bound ligand, arylsulfonamide anthranilate hydroxamate, the resolution is 1.59 Å, and the selectivity pocket of the protein lacks electron density.20
Besides arylsulfonamide anthranilate hydroxamate, various bioactive moieties have been investigated as potent MMP-9 inhibitors. These include hydroxamates,19 hydroxyquinoline,21 curcuminoid pyrazoles,22 4-thiazolidione,23 tetrahydro β-carboline,24 4-phenoxybenzenesulfonyl pyrrolidine,25 5-substituted pyrimidine-2,4,6-triones,26 ethynylthiophene sulfonamido-based hydroxamates,27 amidine-based thiazole,28 caffeic acid amides,5 and anthranilic acid,29–31 based derivatives. Inhibition of MMP-9 activity has been explored for the advancement of novel small molecules, but a potent inhibitor of MMP-9 has not been reported yet. Although MMP-9 inhibitors have achieved promising therapeutic effects in pre-and peri-metastatic settings, in clinical trials (performed on advanced-stage patients), these inhibitors had serious side effects such as musculoskeletal pain and inflammation due to the broad inhibition of MMPs.
The availability of high-resolution crystal structure of MMP-9 investigated experimentally to develop potential inhibitors with fewer side effects32 using computational approaches.20 Amongst various medicinal chemistry approaches, pharmacophore-based drug design was a more efficient technique for identifying hits or designing novel compounds.33 In this article, we combined ligand-based pharmacophore modeling, 3D-QSAR, molecular docking in the extra-precision (XP) mode, binding free energy and molecular dynamic (MD) studies to discover key interactions between the protein and ligands that are accountable for MMP-9 inhibitory activity. Further, the validation of the model displayed significant predictive power for the in vitro activity (pIC50). The highest active molecule was nominated for MD simulation of 100 ns using GROMACS 2018.1 to validate the proposed binding pose. In addition, in silico virtual screening was performed to determine the novel hits against MMP-9 protein.
| Hypothesis | Survival | Site | Vector | Selectivity | Survival-inactive | Volume | Matches |
|---|---|---|---|---|---|---|---|
| a D: hydrogen bond donor; H: hydrophobic; R: ring aromatic. | |||||||
| DDHRR_1 | 5.639 | 1 | 0.962 | 2.089 | 1.379 | 0.809 | 7 |
| DDHRR_4 | 5.561 | 0.951 | 0.991 | 2.097 | 1.125 | 0.667 | 7 |
| DHRRR_2 | 5.521 | 0.832 | 0.966 | 2.084 | 1.375 | 0.788 | 7 |
| ADHRR_1 | 5.605 | 1 | 0.958 | 2.005 | 1.356 | 0.797 | 7 |
| ADHRR_2 | 5.523 | 0.956 | 0.958 | 2.005 | 1.476 | 1.387 | 6 |
A total of 20 variant hypotheses were developed, keeping five as a maximum number of sites while 50% as actives. Amongst 20 generated hypotheses, the best five were selected based on the survival score, survival inactive, site score, volume, vector, energy terms, and the number of matches (Table 1). The test ligands aligned according to the generated five-point best-fitted hypothesis factor did not affect the model DDHRR_1 model. The angles and distances were calculated among various pharmacophoric features of the selected model and are represented in Fig. 2(a and b) (ESI, Tables S2a and b†). van der Waals model of the aligned training set ligands was employed to generate 3D-QSAR model. A statistically significant model for the DDHRR_1 hypothesis was created based on the 46 training set ligands via PLS regression. The incremental predictivity and statistical significance were detected up to PLS factor four (Table 2). However, further, increase in the PLS statistics and predictive ability. The best-fitted hypothesis model DDHRR_1 was validated by the predicted activity of the test set, which is not included in the model development. Besides, ligands pharmacophoric features arrangement and recognition in space were evaluated using contour plots.
![]() | ||
| Fig. 2 Pharmacophore model DDHRR_1 (a) intersite angles in Å unit (b) intersite distances in Å unit, between the pharmacophoric points. | ||
| PLS | SD | R2 | F | P | Stability | Q2 | RMSE | Pearson-R |
|---|---|---|---|---|---|---|---|---|
| a PLS: partial least square; SD: standard deviation; R2: regression coefficient; F: test statistic for F-tests; P: level of significance; Q2: cross-validated correlation coefficient; RMSE: root-mean-square error; Pearson-R: Pearson product-moment correlation coefficient. | ||||||||
| 1 | 0.5778 | 0.7216 | 95.9 | 8.07 × 10−12 | 0.854 | 0.6712 | 0.64 | 0.8340 |
| 2 | 0.4696 | 0.8212 | 82.6 | 3.5 × 10−14 | 0.812 | 0.7343 | 0.57 | 0.8744 |
| 3 | 0.4172 | 0.8627 | 73.3 | 3.63 × 10−15 | 0.766 | 0.7650 | 0.54 | 0.9011 |
| 4 | 0.3473 | 0.9076 | 83.5 | 4.27 × 10−17 | 0.709 | 0.8170 | 0.48 | 0.9454 |
Moreover, Y-scrambling or randomization test was carried out using a training set to determine the reliability of the generated DDHRR_1 model.37 All independent variables (molecular descriptors) were kept unchanged during the model generation, while the dependent variable (activity data) was randomized throughout the study. After performing 10 Y-scrambling tests, the models with low R2 and Q2 values were considered reliable (ESI, Table S3†).
The prediction quality and systematic errors in the developed hypothesis DDHRR_1 were evaluated by the mean absolute error (MAE)-based criteria method using Xternal Validation plus 1.1 tool recognized by the DTC software lab (https://dtclab.webs.com/sofwafre-tools). All the external validation parameters of the hypothesis were also computed using the MAE-based criterion, which classifies the prediction quality of the model into ‘good’, ‘bad’ and ‘moderate’ based on their evaluation parameters such as standard deviation (SD), MAE, and absolute error (AE). The standard deviation of absolute-error (σAE) is well-defined as follows: (a) bad predictions (MAE + 3 × σAE > 0.25 × training set range and MAE > 0.15 × training set range); (b) good predictions (MAE + 3 × σAE ≤ 0.2 × training set range and MAE ≤ 0.1 × training set range) and moderate predictions are the ones that do not fall under either of the above criteria. Almost 5% of ligands with high AE values were eliminated using the MAE-based criteria method to overwhelm the outlier predictions possibility. About 95% of compounds were employed to determine the external data set of poorly predicted data, whereas, the predictivity of the model was penalized using 100% of MAE-based criteria.38 Besides, to determine the presence of systematic error prediction, absolute value means of the number of negative prediction errors (nNE), negative prediction errors (MNE), the average absolute prediction errors (AAE), the absolute value of average prediction errors (AE), the mean of positive prediction errors (MPE), number of positive prediction errors (nPE) were employed using Xternal validation plus 1.1 tool.39,40 The results obtained using the current method are considered satisfactory if any one or more of the above-mentioned prediction errors obeys the rules demarcated using the MAE-based criterion method (ESI, Table S4†).
Minimization of ligands was achieved using the local optimization feature in the Prime module. The continuum solvent model (VSGB), OPLS3e force field, and rotamer search algorithm were used for computing the binding free energies of the ligands.
Virtual screening includes three systematic layers viz., HTVS, standard precision (SP) docking, XP docking. These three precisions of docking were employed to obtain a potential lead molecule with rapid speed. Although HTVS screens a large set of molecules rapidly, the sampling methods were confined, and the outcome could not be interpreted directly. Thus, the ligands screened by HTVS were further docked using SP, which provides a suitable binding pose from a large pool of ligands. Additionally, the best 10% of SP molecules were selected for docking in the XP mode and analyzed using an XP visualizer. The first five molecules were chosen for free energy calculations based on their Glide score, Glide E-model, and Glide energy.
The binding free energy (ΔGbind) was calculated as follows:
| ΔGbind = ΔEgas + ΔGsolv − TΔSconf |
| ΔEgas = ΔEvdW + ΔEelec |
Solvation free energy (ΔGsolv) is the sum of a polar solvation free energy (ΔGNP) and electrostatic solvation free energy (ΔGPB):50
| ΔGsolv = ΔGNP + ΔGPB |
In enrichment analysis, the hypothesis model DDHRR_1 could determine 100% active compounds on the hit list. The results obtained from the enrichment analysis in terms of the recovered actives from the top 1–20% of decoy set molecules that rank-ordered with respect to fitness score of the generated pharmacophore model (ESI, Tables S7–S9 and Fig. S1†). From the results, it is clear that recovery of the 50% actives is possible from the top 5% of the decoy set with decent enrichment values. RIE was computed for the hypothesis models to evaluate the ranking of active set contribution in the enrichment study. The obtained RIE value 13.07 for the best-fitted model DDHRR_1 directed the superior ranking over random distribution. To estimate the DDHRR_1 performance, the area under the accumulation curve (AUC) of the receiver operating characteristic (ROC) curve was plotted as a reliable metric. The generated DDHRR_1 model achieved a good value of ROC (0.70) and AUC (0.83) (Table 3 and ESI, Fig. S1†).
| Pharmacophore model | AUC | RIE | ROC | Phase hypo score | BedROC160.9 |
|---|---|---|---|---|---|
| DDHRR_1 | 0.83 | 13.07 | 0.70 | 1.06 | 0.84 |
| DDHRR_4 | 0.83 | 13.07 | 0.70 | 1.05 | 0.84 |
| DHRRR_2 | 0.83 | 13.07 | 0.70 | 1.05 | 0.83 |
| ADHRR_1 | 0.83 | 13.07 | 0.70 | 1.05 | 0.80 |
| ADHRR_2 | 0.83 | 13.07 | 0.70 | 1.04 | 0.80 |
Further, to determine the reliability and robustness of the predicted model DDHRR_1, the Y-scrambling method was used, which guarantees that the generated hypothesis is reliable and not occurring by chance. Y-scrambling test also validates the training data set sufficiency by comparing the non-randomized score obtained with the scores of non-randomized data. If the similarity persists between the predicted activity of the original model and the randomized model, then the observation set is insufficient to sustenance the generated model. However, the new 3D-QSAR models were reported to have low Q2 and R2 values after numerous repetitions (ESI, Table S3†). The results show that the DDHRR_1 model is noticeably significant and reliable and was in good agreement with the calculated statistical parameters.
A total of 21 external test molecules were found to cover a response range of 2.3332 logarithmic units. The prediction quality of the model generated was found to be ‘good’ in harmony with MAE-based criteria (ESI, Table S4†). The predicted residuals MAE and MAE + 3 × σAE were observed at 0.3734 and 1.1394, respectively, after eliminating 5% test set objects with high residual values. To judge the predictivity of the model, threshold values such as 0.7820 (training set range × 0.1), 1.5639 (training set range × 0.2), 1.1729 (training set range × 0.15), 1.9549 (training set range × 0.25) were used. As per the prediction error of MAE-based criteria, the generated 3D-QSAR model was found to be reliable. Besides, to evaluate the systematic errors in the generated model, it is merely necessary to investigate the prediction errors of test compounds. The prediction errors were calculated and the results obtained are |MPE/MNE| (1.1445), AAE − |AE| (0.2669), nNE/nPE (0.5672), nPE/nNE (1.7632), |MNE/MPE| (0.8738) (ESI, Table S3†). The R2 value was found to be 0.8936 between predicted and observed values (ESI, Table S4†).
Hydrogen-bond donating pharmacophoric feature was evaluated for active ligands (Fig. 5a) and inactive ligand (Fig. 5b) by visualizing contour plots. The appearance of blue cubes at position four of the biphenyl ring system exhibited favorable contribution of electron-donating groups –OH (D8-pharmacophoric feature) and –NHOH (D7-pharmacophoric feature). In contrast, red cubes on the aromatic ring of the biphenyl system and at position two of the biphenyl ring were unfavorable because of the negative contribution to the MMP-9 inhibitory activity. In the case of inactive 66, the presence of red cubes on the 8-hydroxy quinoline ring demonstrates the non-preference of electron-donating pharmacophoric features with an unfavorable contribution.
The hydrophobic group is a vital pharmacophoric feature that is accountable for the MMP-9 inhibitory activity. The favorable and unfavorable hydrophobic features for active 5 (Fig. 5c) and inactive 66 (Fig. 5d) were analyzed using contour plots. In this hypothesis, hydrophobicity (H9-pharmacophoric feature) was evaluated for both inactive and active compounds. The appearance of blue cubes at positions two and six of the biphenyl ring specifies the preference of hydrophobic feature. Further, the presence of the methyl group at position two is substantially favorable for the inhibitory activity of MMP-9. From this, it is clear that the appearance of blue cubes at these positions is vital for MMP-9 inhibition. However, the appearance of red cubes at position four of the biphenyl ring system was non-crucial for the MMP-9 inhibitory activity. In the case of inactive 66, red cubes around the hydroxy (OH) group of 8-hydroxyquinoline indicate the non-preference of hydrophobic features with an unfavorable contribution. The existence of red cubes on the bridge carbons of the fused ring system (8-hydroxyquinoline) also exemplifies the unfavorable contribution of the hydrophobic feature. Further, the appearance of blue cubes at position two of 8-hydroxyquinoline shows the favorable hydrophobic feature for the inhibitory activity of MMP-9.
Electron withdrawing pharmacophoric feature is another component that potentially impacts inhibitory activity. In the case of active compound 5 (Fig. 5e), the existence of blue cubes around the OH and NHOH at position four of the biphenyl ring system exemplifies the favorable contribution of electron-withdrawing groups at these positions. Moreover, OH and NHOH groups at these positions were observed to be indispensable for the MMP-9 inhibition, as evidenced by the high active compounds (pIC50 range = 8.30103 to 9.000) compared to low active compound 66 (pIC50 = 5.221849). However, the lower activity of compound 66 (Fig. 5f) might be because of the absence of electron-withdrawing features around its nucleus. The explanation was provided by the appearance of red cubes around the pyridine ring of the 8-hydroxyquinoline.
The glide score (Gscore) of ligand 5 was found to be −8.414 kcal mol−1 and exhibited good binding affinity with the receptor. From the docked pose of active ligand 5, the –NH present at position four of the biphenyl ring system displayed a hydrogen-bonding interaction with the backbone carbonyl group of Ala189 (–NH
O
C
, 1.80 Å). A second hydrogen-bond interaction was found between the
C
O of the sulphonyl group attached to the biphenyl ring system at position three and side-chain –NH of His190 (
C
O
HN–, 1.91 Å). Another hydrogen-bonding interaction was found between the OH of Glu227 and NHOH of ligand 5 (H
OH, 1.75 Å). Further, three π–π interactions were observed between the electron cloud of aromatic rings present in the active ligand and electron density residues such as Tyr179, His190, and Phe192 (Fig. 6).
![]() | ||
| Fig. 6 Glide XP-docked poses of compound 5, in the catalytic pocket of MMP-9 (PDB ID: 5I12). Dotted line (blue): π–π stacking interaction, dotted line (green): hydrogen bonding interaction. | ||
Binding free energy of the selected compounds 1–67/5I12 docked complexes were ranked using Prime, Molecular Mechanics Generalized Born Surface Area (MM-GBSA) approach (ESI, Table S6†). The binding free energy (ΔGbind) values were observed to range from 87.69 to −42.23 kcal mol−1. In addition, the van der Waals energy term (ΔGvdw = −66.300 to −30.350 kcal mol−1) is a major favorable contributor. In comparison, coulombic energy (ΔGcoul = −0.800 to −67.22 kcal mol−1) was a moderately favorable contributor to the MMP-9 inhibitory activity. Covalent energy term (ΔGcov = −2.67 to 14.58 kcal mol−1) strongly disfavors for the inhibitory activity.
![]() | ||
| Fig. 7 RMSD (Å) of (a) C atoms (b) protein backbone of 5/5I12 complex. | ||
![]() | ||
| Fig. 8 RMSF (Å) of (a) C atoms (b) protein backbone of 5/5I12 complex. | ||
In contrast, the protein backbone showed higher fluctuation at 0.32 and 0.34 nm, which covered almost 500–750 atoms of a receptor (Fig. 8b). Overall, protein residual fluctuations in the 5/5I12 complex were found to be minimal. The radius of gyration (RG) is indicative of the compaction level in the protein structure. The result indicates the constant stability of the 5/5I12 complex throughout the MD simulation study (Fig. 9).
![]() | ||
| Fig. 9 Radius of gyration of the 5/5I12 complex during 100 ns MD simulation. | ||
H bond formation/deformation depicts the number of hydrogen bonding interactions formed or broken during the MD simulation study. During 100 ns of study, hydrogen bond number was constant, indicating the molecular or structural stability of compound 5 with MMP-9 (PDB: 5I12). Almost six hydrogen bonds were found to be constant throughout the study. In contrast, extra two hydrogen bonds appeared from 60 to 75 ns (Fig. 10). The binding of the ligand with the protein residues is relatively stable, and electrostatic energy was found to be a key driving force for the binding stability of the 5/5I12 complex.
![]() | ||
| Fig. 10 Total number of hydrogen bonding interactions exhibited between compound 5 and 5I12 (MMP-9). | ||
Further, the continuous contribution of hydrogen bonding interactions in the binding pose analysis indicates that active compound 5 possesses stable interaction with the MMP-9 protein. The 3D structure of the 5/5I12 complex displayed a key hydrogen bonding interaction with the residues Tyr179, Leu187, His190, Phe192, Pro193. In addition, compound 5 was also stabilized by forming a halogen bond with Leu187 (ESI, Fig. S4†). All these interacting residues exhibited a radius of gyration in the range of 0.16 to 0.28 nm indicating minimal fluctuations and greater stability throughout the MD simulation study (ESI, Fig. S5†).
The total energy (potential, kinetic energies) was calculated. The graph was plotted using a three-dimensional Xmgrace plotting tool to know the stability of the 5/5I12 complex after the addition of water molecules and ions. The total energy of the complex should be persistent during the MD simulation study as it is a summation of both the potential and kinetic energy of the residues. Potential energy levels must be increasing or constant to exhibit structural stability. At the same time, the kinetic energy level represents the general confusion of protein structure. The total energy was found to be constant throughout the MD study (Fig. 11).
![]() | ||
| Fig. 11 Total energy of the 5/5I12 complex during the 100 ns MD simulation. | ||
The 5/5I12 complex was snapshotted at every 20 ns intervals of 100 ns MD simulation trajectory and was superimposed to evaluate the binding stability of compound 5 (ESI, Fig. S6†). It showed that the compound 5 positions bound in MMP-9 are more confined and stable because of the binding pocket accessible volume, which further dictates that ligand interactions inside the catalytic pocket must be stronger and more specific to stabilize the system throughout the study.
Further, RMSD was calculated for conformations obtained through MD, pharmacophore, and docking poses. Similar orientation was observed with compound 5 after XP-docking and MD pose (ESI, Fig. S7†); conformations of compound 5 after MD pose and DDHRR_1 (ESI, Fig. S8†); conformations of compound 5 of XP-docking pose and DDHRR_1 (ESI, Fig. S9†).
| van der Waal interactions (kcal mol−1) | Electrostatic energy (kcal mol−1) | Polar solvation energy (kcal mol−1) | Solvent accessible surface area energy (kcal mol−1) | Binding energy (kcal mol−1) |
|---|---|---|---|---|
| −103.870 ± 41.78 | −16.633 ± 9.308 | 36.499 ± 23.749 | 9.678 ± 4.00 | −93.698 ± 34.65 |
C
O group of the VH1 and Glu227. Further, the molecule was stabilized by π–π interaction between the electron cloud of the aromatic nucleus present in the molecule and His226, His230, His236 (ESI, Fig. S13†). The virtual hit VH2 (CACPD2011a-0000241403) is also embedded within the catalytic pocket of MMP-9, which was supported by its glide score (−9.948 kcal mol−1), binding free energy (−62.97 kcal mol−1), and predicted activity (7.562). VH2 exhibited a single hydrogen bonding interaction with –NH of benzimidazole nucleus and Met247. In addition, VH2 was stabilized by forming a π–π interaction between the electron cloud of the aromatic nucleus present in the molecule and His226 residue (ESI, Fig. S14†).
Further, a 50 ns MD simulation was carried out for the complexes VH1/5I12 and VH2/5I12 to determine the stability of the complex. The RMSD and RMSF were calculated against the complexes (VH1/5I12 and VH2/5I12); graphs were plotted using the three-dimensional Xmgrace plotting tool to compare the stability of protein backbone and Cα atoms. The RMSD for the VH1/5I12 complex, Cα atoms, and the protein backbone atoms remain stable throughout equilibration but showed minimal fluctuations at 20 ns in the MD simulation study (ESI, Fig. S15†). The RMSF plot of complex VH1/5I12 revealed that Cα atoms and protein backbone atoms displayed fluctuations at 0.4 and 0.42 nm, respectively (ESI, Fig. S16†).
The RMSD for the VH2/5I12 complex, Cα atoms, and protein backbone atoms remain stable throughout equilibration but showed minimal fluctuations from 10 to 18 ns in the MD simulation study (ESI, Fig. S17†). The RMSF plot of the complex VH2/5I12 revealed that Cα atoms and protein backbone atoms displayed fluctuations at 0.42 and 0.44 nm, respectively (ESI, Fig. S18†).
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1ra03891e |
| ‡ Authors contributed equally. |
| This journal is © The Royal Society of Chemistry 2021 |