Dual mechanism of HIV-1 integrase and RNase H inhibition by diketo derivatives – a computational study

Development of novel therapeutics for treatment of HIV infections is a very challenging process due to the high rate of viral mutation. On this basis, inhibition of more than one HIV replication pathway is a potential efficient way to obtain control over the HIV progression. In the present study we have performed computational analyses in order to investigate the dual inhibitory action of a set of diketo derivatives (carboxylic acid and esters) against RNase H (RNH) and integrase (IN). Docking studies performed with these compounds revealed that the interaction between the ligands and magnesium ions and the surrounding amino acids/water within the protein are important for the dual inhibitory activity of these compounds. Moreover, from a binding mode analysis, the carboxylic acid (series 8) and ester (series 7) derivatives showed distinct binding patterns in RNH and IN, meaning that all compounds bind with magnesium ions through oxygen atoms of the ligands (either enol or carboxylate); however, the orientation of the hydrophobic tail of the ligand is quite different in both systems. Additional validation using a small dataset also strengthens this binding mode hypothesis. The results reported here could be useful for design or screening of small molecules against IN and RNH activity for the development of effective drugs for HIV treatment.


Introduction
UNAIDS (Joint United Nations Program on HIV/AIDS) estimates that currently more than 34 million people worldwide are infected with HIV-1 and that 2.5 million new HIV infections occur every year. 1 Aer introduction of the HAART (highly active anti-retroviral therapy) concepts at the 11th International Conference on AIDS at Vancouver (British Columbia) in 1996 there has been great progress in HIV therapy.This concept suggests that the combination of several antiretroviral drugs slowdown HIV replications and that this combinational therapy is more effective than mono-drug therapy in order to treat HIV. 2 Although AIDS related mortality has been reduced by 24% (1.7 million in 2011) compared to 2005 data (2.3 million), the development of improved anti-HIV regiments is still required.To control HIV progression, several viable chemo-targets have been identied in the HIV replication cycle; 3,4 however, from a pharmaceutical point of view, only reverse transcriptase (RT) and protease (PR) have been successful targets for HIV therapy.More than 50% of the currently marketed drugs being used for HIV therapy belong to these target classes.However, due to an increasing drug resistance to HIV-1 strains, 5 considerable attention has in recent years been paid to other target sites within the HIV replication process 6-8 e.g., integrase (IN) and RT associated RNase H (RNH), 9 which both are essential for viral replication.
Reverse transcriptase (RT) is an enzyme which reverse transcripts the viral genome (single strand RNA) into double strand DNA (dsDNA) through RNA-DNA hybrid formation using polymerase and RNH domains.Integrase takes over the dsDNA for integration with the genome of the host cell.1][12] All RT inhibitors approved by the FDA for the treatment of HIV infections particularly inhibit at the polymerase domain; 13 however, there are a large number of RNH inhibitors reported and some are even entered into clinical trial even though none of these inhibitors have yet reached the market.On the other hand, two IN inhibitors, named Raltegravir and Dolutegravir, were recently approved by the FDA in 2007 and 2013, respectively, for HIV treatment. 14,15IV-1 IN is a 32 kDa protein composed of three structural domains: the N-terminal domain (NTD) (residues 1-50), the catalytic core domain (CCD) (residues 51-212) and the C-terminal domain (CTD) (residues 213-288).Among the retroviral classes, the catalytic core domain is highly conserved and CTD plays an essential role in the enzymatic activity.][18][19][20][21] RNH is one of the two domains of the p66 (66 kDa) subunit of reverse transcriptase.From mutation and X-ray crystallographic studies, the structure of the RNH domain has been well characterized.][24][25] Although the overall structural folds for RNH and IN are quite different, the topology of the catalytic sites are very similar, e.g. both RNH and IN are composed of DDDE and DDE conserved residues, respectively, in addition to catalytically active magnesium ions (Mg 2+ ) and water molecules.Both enzymes execute their catalytic mechanism through the Mg 2+ ions which coordinate to the carboxylate groups of the conserved residues.It has been shown that mutation of any of these residues abolishes the catalytic activity.This is because these residues provide a favorable environment for stabilizing metals which is essential for a proper binding and positioning of the substrate and also important for formation of a nucleophile (OH À ) from water by deprotonation. 24A schematic representation of the catalytic process of RNH and IN is shown in Fig. 1A.
7][28][29][30][31] Recently, two series (1-benzyl-pyrrolyl diketo acid and ester derivatives) belonging to diketo derivatives were synthesized and the effect of substitution at the benzyl ring was investigated against recombinant reverse transcriptase associated RNH and IN function (Fig. 1B). 32The results demonstrated that the majority of the screened compounds showed good HIV-1 replication inhibitory activity (IC 50 ranges from >100 to 0.02 mM).Moreover, SAR studies suggest that molecules in the 1-benzyl-pyrrolyl diketo acid series (in here series 8) show good inhibition against IN and that the 1-benzyl-pyrrolyl diketo ester series (in here series 7) is more active as RNH inhibitors.From a modeling point of view, it is interesting to understand the structural features that discriminate or show the dual mechanism of these classes of compounds as the inhibition mechanism of both systems are quite similar.The aim of the present investigation is to understand the dual mechanism of this class of compounds using ligand and structure based modeling.

Dataset preparation
The dataset consists of 50 diketo derivatives collected from a recent publication by Costi et al. 32 (Table 1).Briey, two sets of diketo compounds were synthesized, the rst series consists of 1-benzyl-pyrrolyl diketo acids (series 8) and the other series consists of 1-benzyl-pyrrolyl diketo esters (series 7).Reported biological activity (IC 50 ) data was converted to pIC 50 values.Molecular structures of all the compounds were built in the Maestro module of the Schrödinger suite (v2013-2) 33 and saved in sdf format.Subsequently, these 2D structures were converted into 3D structures using the OMEGA tool (v2.0) employing the MMFF95S force eld. 34To predict possible tautomers for the diketo group, we used a test system (2,4-dioxohept-5-enoic acid/ ester), which mimics the compounds under investigation.For this, energies and geometries were calculated using DFT based on B3LYP/6-311++G** in combination with the PCM solvation model. 35All DFT calculations were performed using Gaussian 09 program. 36

Homology modeling
The model of the HIV-1 RT associated RNH domain was constructed from an X-ray crystal structure (resolution of 1.4 Å) obtained from the Protein Data Bank (PDB ID: 3QIO) 23 as shown in our previous study. 29In the present study, we modeled the full-length IN structure based on the PFV structure (resolution 2.65 Å, PDB ID: 3OYA) reported by Hare et al. 16 Briey, the HIV-1 IN sequence (288 amino acids, 1148-1435) was obtained from UniProt. 37This sequence was imported into Prime (version 3.4, Schrödinger, LLC, New York, NY, 2013), a homology modeling tool from Schrödinger, and the structure 3OYA was used as a template to build a homology model.The sequence of IN was aligned with the PFV structure as previously reported. 16,17In the secondary structure prediction, the bound ligand (RZL), magnesium ions and three water molecules, which lies close to the magnesium ions, were also included.Models were constructed using a knowledge based method (construct insertion and close the gaps based on the known structure).Subsequently the nal model was used for the optimization process in the Protein Preparation Wizard as implemented in Schrödinger. 38his protein structure optimization includes adding hydrogen atoms, assigning correct bond orders and building of di-sulde bonds.The protonation states of all the ionizable residues were predicted by PROPKA 39 provided in the Protein Preparation Wizard in the presence of the Mg 2+ ions at the active site.Finally, the optimized model was energy minimized (only hydrogen atoms) using the OPLS 2005 force eld.

Docking methodology
The docking experiments were performed using the grid based exhaustive search algorithm implemented in the Glide module of Schrödinger suite. 40Glide uses a series of hierarchical lters to nd possible ligand binding poses in the active site, and the program has the option to treat the ligand fully exible or rigid during the docking run.SP (standard precision) docking and scoring is oen recommended for prediction of binding poses, virtual screening and ranking due to its efficient and relative accuracy in pose prediction. 41,42The docking settings used in this study are described elsewhere 29 (see also ESI †).

Grid-based ngerprint for ligand and protein (FLAP) modeling
The soware FLAP 43 was used to build and validate ligand based models.FLAP uses ngerprints derived from GRID molecular interaction elds (MIFs) and GRID atom types are characterized as quadruplets of pharmacophoric features.The GRID approach is a well assessed concept for determining energetically favorable interaction sites in molecules with known structures using chemical probes e.g., H, O, N1, and DRY probes which describe the shape, hydrogen bond acceptor, hydrogen bond donor and hydrophobic interactions, respectively.The distance (i.e.spatial resolution) between two GRID points was set to 0.75 Å.

Protein ligand interaction ngerprint
In order to better understand the protein-ligand interaction patterns of the different binding modes of the diketo derivatives, it is of interest to analyze which residues in the protein and which type of interactions are involved in the binding of the ligands in RNH and IN.To do this, we performed a PLIF (Protein-Ligand Interaction Fingerprint) analysis as implemented in the MOE soware 44 (see also ESI †).

Evaluation of the models
Reproduction of the bound conformation of co-crystallized ligands based on docking experiments were evaluated in terms of their atom-positional root mean square deviation (RMSD) with respect to the bound conformation of the ligands for both systems.3D QSAR models were assessed through leave-one-out (LOO) cross-validated squared correlation coefficient (Q 2 ).

Results and discussion
Diketo derivatives bind to IN or RNH through a two-metal-ion chelation mechanism, meaning that either the acid or enolic groups of the ligand chelates with the magnesium ions.Recently, Liao et al. 45 have studied the tautomerism of IN inhibitors using density functional theory (DFT).Their study suggests that the carboxylic acid groups are deprotonated and that the enol tautomer is more stable compared to the keto tautomer.Moreover, at biological pH and in presence of magnesium ions, the enolic form may undergo deprotonation which facilitates the chelation formation.Here we have performed free-energy based tautomer stability calculations on a test system i.e. 2,4-dioxohept-5-enoic acid and ethyl 2,4-dioxohept-5-enolate using the keto and enolic forms.According to QM based energy optimizations (B3LYP/6-311++G** level, PCM solvation model) of the test systems, the enolic form 1 was found to be more stable than the enolic form 2 (Fig. 2).The keto form also exists to some extent.This observation is in good agreement with previously reported calculations. 45Therefore, the rest of this study was carried out with the enolic form 1 for both acidic and ester derivatives.

Homology model and validation
Due to the unavailability of full-length IN structural models, we have in this work built a full-length homology model based on the recently published PVF X-ray structure. 16In comparison to the PVF structure, the generated model shares very similar overall folds, which includes the catalytic domain, C-terminal domain (CTD) and N-terminal domain (NTD) (Fig. 3A).
Although the overall fold is very similar in both cases, the PVF structure has a unique N-terminal extension (NET) domain that is absent in HIV-1 IN (Fig. 3B).By comparison to the crystal structure of the CCD domain reported for HIV-1 (PDB ID: 3L3U, 3NF8, 4DMN and 1BIZ), we nd our model to be quite similar with RMSDs between 1.7 and 2.7 Å (a comparison of the catalytic domains of different structures is provided in the ESI, † Fig. 1), and also similar to the homology model reported by Johnson et al. 46 In order to validate the docking results and to characterize the binding site our homology model was built with the crystal bound ligand (RLZ).oxygen atoms of NHQD and the neighboring two oxygen atoms interact with the other magnesium (1) ion and bound water molecules.As seen from both validations, the rst ranked docking pose reproduces the bound conformation, thus, the rst ranked compounds of both series (series 7 and 8) were analyzed in order to study the dual mechanism.

Binding mode analysis
Initially, the binding poses of series 8 (8a-8y) and series 7 (7a-7y) derivatives were analyzed for RNH.In general, the carboxylate group in the molecules is oriented towards the two magnesium ions and water molecules (especially water no.17 and 24).The binding modes of all acid derivatives are very similar in terms of their interactions with the magnesium ions by the carboxylate group and the position of the hydrophobic tail of the ligands (Fig. 4).In contrast to series 8, series 7 compounds bind with magnesium ions through "three-oxygencoordinates", meaning that the enolic and keto oxygens are arranged in the same plan for chelation.The middle ketooxygen is slightly inserted between the two magnesium ions and the ethoxy group is pointing out of the active site.However, only the deprotonated enolic oxygen atom binds strongly with the magnesium ions.All series 7 compounds favorably bind with Arg557 which performs p-cation interactions with the pyrrole ring and salt bridge formation between the deprotonated enolic oxygen atom and magnesium ions (so called le-tail mode).Similar to Arg557, the His539 residue is also involved in p-p stacking with the pyrrole ring and hydrogen bonds with the enolic oxygen.Moreover, the majority of series 8 compounds chelate with the two magnesium ions through the terminal mono-oxygen atom, in case of the 8n, 8w and 8x compounds, both enolic and terminal oxygen atoms bind with the magnesium ions, which is also reected in the relatively high docking score of these compounds (>À8.0 kcal mol À1 ).This observation is also in good agreement with experiment as these compounds have quite strong RNH inhibition (3-28 mM) compared to the other compounds.Another common interaction of series 8 compounds with RNH is that all compounds are either exposed or involved in p-cation interactions with Lys540 and that the keto-oxygen next to the enolic group binds with water which in turn is coordinated with the conserved residue Asp498.
(Binding mode comparison of poor and good inhibitors is provided ESI, † Fig. 3A-F).Although the binding mode of series 8 compounds make chelation and hydrogen bonding with magnesium ions and waters, respectively, orientation of tail of these compounds are very different and majority of them are exposed to large number of hydrophobic residues such as Val536, Ala538 and Pro537 of the RNH, this called right-tail binding mode.On the other hand, both series 7 and 8 binding modes in IN show a quite different binding pattern compared to RNH, where series 7 and 8 compounds bind in a le (tail of the ligand is orientated between the residues E152 and P145) or right-tail (tail of the ligand is orientated between the residues F143 and P117) binding mode, respectively.For IN, the majority of the series 8 highly active compounds bind similar to the "right-tail" compounds in the RNH series 7, and the low active series 8 compounds show a "le-middle-right-tail" binding pattern.A comparison between series 7 and 8 compound binding poses in IN is shown in Fig. 5A and B (a binding mode comparison of poor and good inhibitors is provided Supplementary Information, Fig. 4A and B).In order to explain this concept better, the activity ranges were divided into three classes; highly active (pIC 50 $ 6 mM), moderately active (5 < pIC 50 > 6 mM) and low active (pIC 50 < 5 mM) compounds and the binding modes were analyzed according to this grouping.As shown in Fig. 6a large portion of the highly active compounds (19 out of 22) lies on the right-tail mode, surprisingly none of them are oriented to the le-tail mode and 3 compounds show middle-tail mode.9 out of 16 moderate active compounds bind in right-tail mode and only 2 compounds show le-tail binding.Furthermore, out of 13 low active compounds, 5 compounds show le tail and the remaining 5 compounds middle-tail binding mode.Noticeably, only a minor number of compounds (3) show righttail mode.This analysis suggest that based on the binding orientation of the tail, the compounds can be classied into either highly active or low active compounds.This pattern was not observed in the RNH binding poses, where all series 7 compounds show le-tail mode and series 8 compounds righttail binding mode, which is also reected by experiment i.e., series 7 compounds are favorable for RNH inhibition and series 8 compounds bind strongly to IN.Overall, based on the binding orientation of the tail of the molecules in both series, one could suggest that if the compound's tails orient to the le side of the active site, the probability of showing RNH inhibition is quite high and for IN it is a right-tail orientation.A common interaction pattern of series 8 compounds is that the majority of the compounds is found to be involved in p-p interaction with Phe143 and exposed to the Pro142 residue, in particularly the pyrrole ring.At least one of the 3 water molecules participate in hydrogen bonding network with the ligands in addition to the magnesium ions.In terms of chelation, series 8 compounds bind with the two magnesium ions through two reactive oxygen species; in case of series 7, only one oxygen (enol) atom is involved in the chelation with the two magnesium ions.This phenomena could also explain why series 7 compounds are low potent as compared to series 8 against IN.
In order to analyze the inuence of water molecules to the binding mode prediction, an additional docking experiment was performed without considering water molecules at the active site.As seen from Fig. 7A and B the binding pose of series 8 compounds did not change (reproduce the right-tail mode); however $50% of the compounds in series 7 follow le-tail mode and the rest of the compounds fall into right-tail mode.This observation indicates that the inclusion of water molecules in the docking experiments play a signicant role in the binding pose prediction as observed for the other proteins. 47

Validation of binding mode hypothesis
In the following we focus on classifying a small external dataset containing quinolinonyl diketo acid derivatives screened for RNH and IN.These compounds were also screened in the same experimental assay and under the same conditions as the diketo derivatives used for this study. 48In total, 17 compounds were tested against IN and 8 tested against RNH.All compounds were built and preprocessed as described in the method section.The compounds were docked into the RNH and IN models and the poses were subsequently analyzed.Out of 17 compounds docked into the IN binding site, only two active compounds (12b and 12i) misclassied into the le-tail mode as low active compounds and the rest of the compounds in the dataset followed the right-tail binding mode.For RNH, all compounds fall into the le-tail binding mode as proposed previously.Interestingly, compound 2 (IC 50 > 100 uM), which possesses a righttail binding mode, is correctly classied as an inactive compound in accordance to what was proposed earlier based on the large dataset (Fig. 8).

Protein-Ligand Interaction Fingerprints (PLIF)
PLIF is a highly valuable analysis tool that helps to examine residue interactions with ligands in a high-throughput screening mode.This method summarizes the different interactions e.g., hydrogen bonds (acceptor and donor), ionic interactions and surface contacts of ligand-protein complexes and could be useful in order to differentiate inhibition differences of a set of particular compounds with respect to the interaction  with the receptor (a summary of the PLIF result is provided in the ESI Table 2A and B †).
RNH.The studied molecules possess nearly 50 interactions with RNH.The major interactions are surface contact (C), side chain donor and/or acceptor (D/A), backbone donor and/or acceptor (d/a), ionic interactions (I), and solvent donor and/or acceptor (O) interactions.It is interesting to note that water molecules such as 17, 562 and 563 have 100% abundance for the solvent acceptor interaction with the hydrogen donor groups in the molecules.Molecules in both series 7 and series 8 bind with the two magnesium ions.However, compounds 8a-8y have a terminal COO À ion in addition to the enolic group and this makes them more strongly binding with magnesium ion compared to the series 7 compounds, which have interactions  with the metal ions primarily through the enolic oxygen atoms.Due to the substitution in the terminal carboxylic group with an ethyl group, an overall chelation formation is slightly different, meaning that all compounds are positioned slightly away from the magnesium ions compared to series 8 where the terminal carboxylate ion pointing towards the magnesium ions.Side chain acceptor and ionic interactions were observed to residue Arg557, but the signicant difference is that ionic interactions are only observed for compounds 8a-8y and side chain acceptor interactions are observed primarily for the 7a-7y compounds, i.e., the molecules that possess substitutions mainly on the meta (R 2 ) position on the phenyl ring and compounds with CN, F, OCH 3 , etc. in the ortho (R 1 ) position.Residues such as Asp443, Asn474 and Asp498 possess substantial interaction through side chain donor (Asp443 and Asp498) and surface contact (Asn474) with the compounds containing ester groups (7a-7y).Another important interaction observed for these compounds (7a-7y) is side chain donor interactions with Asp549 ($25%).The majority of the compounds in series 7 interact with His539 and Ala538 through side chain and backbone acceptor interactions as the binding pose is le-tail mode as shown in the previous section.
Integrase.Both series 7 and 8 possesses nearly 40 interactions with the amino acids, water and magnesium ions.More than 90% of the compounds in the dataset show interaction with water 400 and 534 either through solvent acceptor or donor.Water 401 also interacts signicantly with the compounds (67% abundance).In contrast to RNH, with IN, the magnesium ions have better interaction with the ethyl ester group (7a-7y) containing compounds than the compounds containing a carboxylic group (8a-8y).The magnesium ion (1) possesses better interaction with the carboxylic group than the magnesium ion (2).The majority of the compounds (>90%) in the dataset showed interaction with Pro145 and most of the compounds in series 8 have surface contact with Phe143.It is important to note that conserved residues such as Asp64 and Asp116 primarily interact with series 7 compounds compared to series 8. Gly118 makes surface contact with some of the series 8 compounds and this was nearly absent in series 7 compounds.Glu152 possesses signicant side chain donor interactions (>50%) with the molecules containing a carboxylic acid group.
In general, for both enzymes, the metal ions (Mg 2+ ) play an important role for the activity elicitations.The water molecules in the active sites have almost 100% abundance for the interaction through hydrogen bonding (either acceptor or donor).In integrase, the magnesium ion (1) has better coordination with the majority of the compounds compared to magnesium ion (2).In both systems, chelation was observed for all compounds.The carboxylate group containing compounds (8a-8y) have strong interaction with Arg557 in RNH and for series 7 compounds, only the side chain acceptor (a), donor (d) and surface interactions with Asp443, Asn474, Asp498 and Arg557 were observed.The compounds in series 7 and 8 are more exposed to the active site residues of IN compared to RNH, this is due to the tail-mode concept, e.g.right, middle, and le-tail mode.

FLAP models
The docking poses of series 7 and 8 for RNH and IN were used as initial structures in the FLAP modeling.For RNH and IN, we chose the highly active top 5 compounds for alignment and the aligned models were used to generate a pharmacophore psuedomolecule, which consist of pharmacophoric points.The obtained pharmacophore models were used to build 3D-QSAR (Partial Least Square with 5 latent variables) and Leave-one-out cross validation was used to assessed the quality of the nal model (ESI, † Table 2).In general, models obtained for RNH were very poor in terms of the cross-validation (Q 2 ), therefore we could not use it for interpretation of the dual mechanism.However, models derived for IN found to be moderate with correlation coefficient of 0.88 (Q 2 ¼ 0.42, SDEP ¼ 0.66) for latent variable 3. The MIF (molecular interaction elds) of the highly active compounds and poor inhibitory compounds were analyzed.It is noted from Fig. 9 that hydrogen bond acceptors in the molecule is essential in order to possess the IN and RNH inhibition.The compounds 8g (both RNH and IN active) and 7w (active against RNH and inactive for IN) were analyzed.The result suggests that the psuedomolecule for IN is well suited for series 8 compounds compared to series 7 as compounds in series 7 are slightly moved from the pharmacophoric atom position (Fig. 9A and B) although they share similar structural pattern other than the substituted ethoxyl group.On the other hand, we analyzed the psuedomolecule of RNH (Fig. 9C and D) and compared to the highly active compounds from both series (8g/7b).As expected both molecules aligned very nicely to the pharmacophoric atoms derived from the highly active RNH compounds.Overall this suggests that the series 8 compounds share similar pharmacophore alignment to both system, and series 7 compounds suite only to the RNH pharmacophore atoms.This observation is also in agreement with conclusion derived from experiment.

Conclusions
In the present study we have investigated the dual mechanism of diketo derivatives (compounds 7a-7y and compounds 8a-8y) against HIV-1 RNH and IN through computational methods.Although both series share a common scaffold and show RNH and IN activity, the pharmacophore pattern of both systems is quite different in terms of distance and position of hydrophobic sites to hydrogen bond acceptors which is essential for binding to the magnesium ions.As previously reported from experiment, our computational models also suggest that the ester derivatives (7a-7y) are relatively high active against RNH and the carboxylic acid group containing compounds (8a-8y) show better activity against IN.Although all compounds bind to the magnesium ions quite similarly, the most deciding factor for RNH or IN inhibition is how well the ligand's tail orients in both system, e.g., the binding mode or position of the tail is classied into "le, middle, right-tail" binding modes.The majority of series 7 compounds favor "le-tail" mode while the majority of series 8 compounds prefer "right-tail" mode.The orientation of the tail in these classes of compounds dictates a specic type of interaction with residues (Asp443, Asn474, Asp498, Lys540 and Arg557 for RNH and Asp64, Asp116, Gly118 and Glu152 for IN) in the catalytic site and this factor impacts the overall inhibitors activation against IN and RNase H.The presence of an additional formal negative charge on the compound has a signicant impact on the activity of IN compared to RNH.
Inclusion of negatively charged functional groups with exible bonds in the ligands can lead to favorable interactions with the active site metals and water molecules.The aromatic ring/ hetero-aromatic rings connected with the charged pharmacophoric group in the compounds can make p-p interaction or hydrogen bonding interactions with the His539 and Arg557 residue in the RNH and Phe143 or Pro145 residues in IN.The observations from this study could be useful in order to design selective or dual inhibition of RNH/IN for the development of effective ant-HIV agents in AIDS therapeutics.

Fig. 1 (
Fig. 1 (A) Schematic diagram of the catalytic mechanism of RNase H and integrase.Conserved residues for both systems are shown in red (integrase) and blue (RNase H) letters.(B) Common scaffolds for series 7 and series 8.

Fig. 3 (
Fig. 3 (A) The structure of the full-length HIV-1 integrase homology model is shown in cartoon representation with magnesium ions (green), water molecules (red) and important residues shown in stick representation.(B) Comparison of the homology model (surface) with the PFV structure (PDB ID: 3OYA).The active site is shown in yellow.

Fig. 4
Fig. 4 Comparison of binding poses of the compounds in series 7 and 8 at the RNase H binding site.(A) Overall view of HIV-1 reverse transcriptase and various sub-domains and ligand binding sites are highlighted, including RNase H domain (blue).Important residues are highlighted in yellow.

Fig. 5
Fig. 5 Comparison of series 7 (A) and 8 (B) binding mode in the integrase active site.

Fig. 6
Fig. 6 Venn diagram of classification of RNase H and integrase compounds according to the binding pose of the tail of the series 7 and 8 compounds.

Fig. 7 (
Fig. 7 (A) Comparison of binding mode of representative compounds in series 7 (green) and series 8 (cyan) at the binding site of RNase H from the docking experiment without water molecules.Compounds that showed right-tail mode are also shown (violet).Protein-ligand interaction diagram for series 8 compound (B) and series 7 compounds that showed left-tail mode (C) and right-tail mode (D).

Fig. 8
Fig. 8 Comparison of the binding mode of the external dataset compounds (quinolinonyl diketo acid derivatives) in IN (left) and RNH (right).Misclassified compounds in IN and RNH are shown in green and pink sticks, respectively.

Fig. 9
Fig. 9 Comparison of MIF derived from RNase H (A and B) and integrase (C and D) pharmacophore models.Here A, B, C, D denotes compound 8g, 7b, 8g, 7w respectively.N1 probe (blue surface with energy level 0.2 kcal mol À1 ), DRY probe (green surface with energy level À0.1 kcal mol À1 ) O probe (red surface with energy level À0.5 kcal mol À1 ).The pseudo-pharmacophore atoms are represented in blue and red spheres.

Table 1
Structure and activities of the studied compounds a Activity reported against HIV-1 RT-associated RNase H activity. b Activity reported against HIV-1 IN ST activity.