Valeria
Scardino
ab,
Mariela
Bollini
c and
Claudio N.
Cavasotto
*bde
aMeton AI, Inc., Wilmington, DE 19801, USA
bAustral Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, Buenos Aires, Argentina
cCentro de Investigaciones en BioNanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Ciudad de Buenos Aires, Argentina
dComputational Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones en Medicina Traslacional (IIMT), Universidad Austral-CONICET, Pilar, Buenos Aires, Argentina
eFacultad de Ciencias Biomédicas, and Facultad de Ingeniería, Universidad Austral, Pilar, Buenos Aires, Argentina. E-mail: CCavasotto@austral.edu.ar; cnc@cavasotto-lab.net
First published on 2nd November 2021
The use of high-throughput docking (HTD) in the drug discovery pipeline is today widely established. In spite of methodological improvements in docking accuracy (pose prediction), scoring power, ranking power, and screening power in HTD remain challenging. In fact, pose prediction is of critical importance in view of the pose-dependent scoring process, since incorrect poses will necessarily decrease the ranking power of scoring functions. The combination of results from different docking programs (consensus scoring) has been shown to improve the performance of HTD. Moreover, it has been also shown that a pose consensus approach might also result in database enrichment. We present a new methodology named Pose/Ranking Consensus (PRC) that combines both pose and ranking consensus approaches, to overcome the limitations of each stand-alone strategy. This approach has been developed using four docking programs (ICM, rDock, Auto Dock 4, and PLANTS; the first one is commercial, the other three are free). We undertook a thorough analysis for the best way of combining pose and rank strategies, and applied the PRC to a wide range of 34 targets sampling different protein families and binding site properties. Our approach exhibits an improved systematic performance in terms of enrichment factor and hit rate with respect to either pose consensus or consensus ranking alone strategies at a lower computational cost, while always ensuring the recovery of a suitable number of ligands. An analysis using four free docking programs (replacing ICM by Auto Dock Vina) displayed comparable results.
Among in silico methods in drug discovery, molecular docking has been widely used during the last three decades.4–6 In protein-molecule docking, the optimal position, orientation and conformation (pose) of the molecule within the binding site is assessed (“docking stage”), and an estimation of its binding energy calculated. High-throughput docking (HTD) allows the screening of large chemical libraries (from thousands to millions of molecules) to generate a hit-list enriched with potential binders, which will be then advanced for biochemical and biological evaluation. To be computationally efficient, HTD involves several approximations at different levels,7 and the binding free energy calculation is later replaced by a docking score, which is a measure of the probability that the molecule will bind to the target. Thus, the docking stage is followed in this case by the “scoring stage”.7,8
In spite of its undoubted success, HTD is not without challenges, since its performance depends on the energy representation of the system, the degree of target flexibility,4,9–11 and the consideration of water molecules within the binding site.4,12,13 A recent extensive comparison of docking programs showed that, in agreement with earlier works,14,15 they perform better in terms of docking accuracy (docking stage) than in terms of scoring power, ranking power, and screening power (scoring stage).16 We would like to stress that pose prediction is nevertheless of the utmost importance in molecular docking, since incorrect poses will result in meaningless scores, which would thus reduce the ranking capacity of scoring functions. The performance of HTD using different docking programs has been further evaluated on several systems,17–19 and many inconsistencies have been found, such as different performances across programs, also showing that the effectiveness of each scoring function is system dependent.18,20,21 Several efforts have been conducted to improve the reliability at the scoring stage, such as machine-learning-based scoring functions,22,23 and quantum mechanical-base scoring.24–29
The combination of several docking programs (consensus scoring) has been shown to improve the performance of HTD.20,30–33 In 2013, Houston and Walkinshaw proposed for the first time a consensus docking procedure that used several docking programs to increase the reliability of the predicted poses.33 Tuccinardi et al. later used ten docking protocols to evaluate pose consensus on database enrichment,32 and later extended their analysis to 36 benchmark targets of the DUD database.31 They obtained comparable results to Arciniega and Lange's Docking Data Feature Analysis (DDFA), an approach for carrying out virtual screening analysis based on artificial neural networks which was among the best performing methods at the time.34 To obtain good hit rates with their pose consensus strategy, molecules with at least seven matching poses between programs should be selected; in general, the best results were obtained with ten matching poses, which could represent a high computational cost. However, and more importantly, the number of ligands retrieved in most of those cases was very small, with the risk of being zero in some cases.31
It should be highlighted that in consensus scoring (or consensus ranking), for the sake of robustness, it would desirable that scores for a given molecule be combined only when the poses assessed by the different docking programs are similar. We thus present a new strategy that combines both pose and ranking consensus to overcome the limitations of each strategy when used in a stand-alone fashion, and thus increase the performance of HTD campaigns. This method, named Pose/Ranking Consensus (PRC) is consistent with theory in the sense that scores (or ranks) obtained with different programs are only combined when poses are coincident. Using four docking programs (ICM, rDock, Auto Dock 4, and PLANTS) we performed an exhaustive search to look for the best way of combining pose and rank requirements, and evaluated this new method over a wide range of targets that correspond to diverse protein families sampling different binding site properties. Our results show a consistent and improved performance compared to either pose consensus alone, or consensus scoring (ranking) alone strategies. This method is simple to use, simpler than machine learning consensus scoring methods, and displays an excellent performance also using free software programs.
Receptor | Receptor code | Receptor | Receptor code |
---|---|---|---|
Thymidine kinase | KITH | Tyrosine-protein kinase ABL | ABL1 |
Phospholipase A2 | PA2GA | Protein-tyrosine phosphatase 1B | PTN1 |
Coagulation factor VII | FA7 | Inhibitor of apoptosis protein 3 | XIAP |
Hexokinase type IV | HXK4 | Androgen receptor | ANDR |
Cyclin-dependent kinase 2 | CDK2 | Renin | RENI |
Cyclooxygenase-1 | COX1 | Glutamate receptor ionotropic, AMPA 2 | GRIA2 |
Fatty acid-binding protein 4 | FABP4 | Aldose reductase | ALDR |
Heat shock protein 90 alpha | HSP90a | Dihydrofolate reductase | DYR |
Estrogen receptor alpha | ESR1 | Dihydroorotate dehydrogenase | PYRD |
Neuraminidase | NRAM | 11-Beta-hydroxysteroid dehydrogenase 1 | DHI1 |
β2 Adrenergic receptor (agonist bound) | ADRB2 | Angiotensin-converting enzyme | ACE |
HMG-CoA reductase | HMDH | Progesterone receptor | PRGR |
Dopamine D3 receptor (antagonist bound) | DRD3 | Human immunodeficiency virus type 1 reverse transcriptase | HIVRT |
Histone deacetylase 2 | HDAC2 | Purine nucleoside phosphorylases | PNPH |
Leukocyte function associated antigen-1 | LFA1 | Protein kinase C beta | KPCB |
Leukotriene A4 hydrolase | LKHA4 | Insulin-like growth factor I receptor | IGF1R |
Urokinase-type plasminogen activator | UROK | Phosphodiesterase 5A | PDE5A |
Receptors were prepared with the ICM program35 (version 3.8-7c; MolSoft, San Diego, CA 2020), in a similar fashion as in other works.25 Missing residues and hydrogen atoms were added followed by a local energy minimization of the system. Polar and water hydrogens within the binding site were optimized using a Monte Carlo simulation in the torsional space. Glutamate and asparte side chains were assigned a −1 charge, and lysine and arginine were assigned a +1 charge. Asparagine and glutamine were inspected for possible flipping and adjusted if necessary. Histidine tautomers were assigned according to their most favorable hydrogen bonding pattern.
Auto Dock Tools utilities38 were used to prepare the input files for Auto Dock 4, where the Lamarckian genetic algorithm was used for a 20-run search for each compound using 1.75 million of energy evaluation. For PLANTS, the ChemPLP scoring function was used and speed 1 was set as search speed. For rDock, a radius of 8.0 Å ± 2.0 Å from a reference ligand binding mode was used to represent the cavity. For Vina, an exhaustiveness value of 8 was set. For ICM, a thoroughness of 2 was used for the search algorithm. All the other parameters for every software remained at their default values. On average, each program took between 13 and 130 seconds per core per molecule, with ICM being the fastest and Auto Dock 4 the slowest program.
(1) |
(2) |
The hit rate (HR) was calculated as
(3) |
As starting point, we calculated the Exponential Consensus Ranking (ECR).30 This consensus method combines results from several docking programs using an exponential distribution for each individual rank. In a previous work, it demonstrated a higher performance than other traditional consensus strategies and individual programs. In this work we extended the analysis of the ECR to 34 targets using four instead of the original six programs. Our results confirmed its better performance when compared to individual programs. On average, it showed at least a 1.4-fold increase for the enrichment (average ratio over all targets between the ECR EF1 and an individual program EF1) (cf.Table 2).
Average | ICM | rDock | Auto Dock 4 | PLANTS |
---|---|---|---|---|
a Average value calculated as , where N is the number of targets. b This value does not include XIAP and ANDR, which had EF = 0, and therefore make the average fold increase ≫ 100. c This value does not include these five targets: HXK4, NRAM, XIAP, DYR and PNPH, which had EF = 0, and therefore make the average fold increase ≫ 100. d This value does not include FABP4 and PYRD, which had EF = 0, and therefore make the average fold increase ≫ 100. | ||||
EF1 | 23.5 | 10.5 | 5.8 | 9.9 |
Fold increasea | 1.4 | 3.4b | 7.4c | 3.2d |
Receptor | 2 MPs | 3 MPs | 4 MPs |
---|---|---|---|
KITH | 1.4 | 2.6 | 4.7 |
PA2GA | 1.6 | 3.4 | 4.7 |
FA7 | 1.6 | 3.4 | 3.2 |
HXK4 | 1.3 | 1.4 | 1.7 |
CDK2 | 1.2 | 1.9 | 3.3 |
COX1 | 1.1 | 1.3 | 1.5 |
FABP4 | 1.2 | 1.5 | 1.5 |
HSP90a | 1.0 | 1.3 | 2.2 |
ESR1 | 1.2 | 1.9 | 4.1 |
NRAM | 1.8 | 4.7 | 5.6 |
ADRB2 | 1.2 | 1.2 | 0.4 |
HMDH | 2.3 | 5.2 | 7.1 |
DRD3 | 1.1 | 1.1 | 0.9 |
HDAC2 | 1.2 | 2.1 | 1.8 |
LFA1 | 0.9 | 1.4 | 2.8 |
LKHA4 | 1.5 | 1.9 | 1.8 |
UROK | 1.5 | 3.8 | 9.9 |
ABL1 | 1.4 | 1.6 | 2.8 |
PTN1 | 1.3 | 1.9 | 1.3 |
XIAP | 1.4 | 4.1 | 7.5 |
ANDR | 1.2 | 1.9 | 4.6 |
Renin | 1.8 | 9.3 | 28.9 |
GRIA2 | 1.4 | 3.5 | 7.7 |
ALDR | 1.2 | 2.0 | 4.0 |
DYR | 1.3 | 1.7 | 2.5 |
PYRD | 1.3 | 2.6 | 3.4 |
DHI1 | 1.1 | 1.7 | 2.7 |
ACE | 1.3 | 5.0 | 5.6 |
PRGR | 1.2 | 1.8 | 3.5 |
HIVRT | 1.4 | 1.8 | 3.8 |
PNPH | 1.2 | 2.3 | 5.6 |
KPCB | 1.3 | 2.6 | 6.6 |
IGF1R | 1.4 | 2.3 | 1.6 |
PDE5A | 1.9 | 4.8 | 14.2 |
Average | 1.4 | 2.7 | 4.8 |
Receptor | Option A | Option B | Option C | |||
---|---|---|---|---|---|---|
A/Sa | EF | A/Sa | EF | A/Sa | EF | |
a Number of actives and selected molecules for each target. b Average (A)/average (S). | ||||||
KITH | 24/38 | 14.3 | 3/4 | 17.0 | 1/3 | 7.6 |
PA2GA | 26/48 | 22.8 | 9/10 | 37.9 | 3/3 | 42.1 |
FA7 | 84/107 | 27.5 | 33/35 | 33.1 | 1/1 | 35.1 |
HXK4 | 17/72 | 9.2 | 0/10 | 0.0 | 0/2 | 0.0 |
CDK2 | 23/56 | 12.2 | 17/47 | 10.8 | 11/22 | 14.9 |
COX1 | 11/210 | 1.8 | 11/134 | 2.8 | 9/54 | 5.7 |
FABP4 | 22/65 | 17.3 | 14/30 | 23.8 | 5/7 | 36.5 |
HSP90a | 21/84 | 10.1 | 5/23 | 8.8 | 0/5 | 0.0 |
ESR1 | 44/136 | 17.0 | 30/70 | 22.5 | 19/24 | 41.7 |
NRAM | 20/58 | 10.0 | 10/12 | 24.2 | 0/1 | 0.0 |
ADRB2 | 63/147 | 17.2 | 9/38 | 9.5 | 1/10 | 4.0 |
HMDH | 30/69 | 22.8 | 6/12 | 26.2 | 3/3 | 52.4 |
DRD3 | 11/140 | 3.1 | 2/29 | 2.8 | 0/6 | 0.0 |
HDAC2 | 24/84 | 16.2 | 9/19 | 26.8 | 1/5 | 11.3 |
LFA1 | 15/121 | 7.7 | 11/46 | 14.9 | 3/11 | 17.0 |
LKHA4 | 36/166 | 12.2 | 12/43 | 15.7 | 1/10 | 5.6 |
UROK | 47/80 | 36.3 | 35/38 | 56.9 | 15/19 | 48.7 |
ABL1 | 42/164 | 15.4 | 20/58 | 20.7 | 2/18 | 6.7 |
PTN1 | 33/129 | 14.5 | 16/63 | 14.4 | 2/21 | 5.4 |
XIAP | 7/13 | 28.2 | 1/2 | 26.2 | 1/1 | 52.4 |
ANDR | 28/172 | 8.8 | 16/45 | 19.3 | 7/12 | 31.6 |
Renin | 15/21 | 48.2 | 8/9 | 60.0 | 3/3 | 67.5 |
GRIA2 | 34/129 | 20.0 | 15/37 | 30.8 | 11/12 | 69.6 |
ALDR | 68/188 | 20.8 | 46/97 | 27.3 | 26/48 | 31.2 |
DYR | 39/144 | 20.4 | 12/39 | 23.1 | 1/8 | 9.4 |
PYRD | 38/120 | 18.7 | 25/42 | 35.1 | 7/14 | 29.5 |
DHI1 | 31/334 | 5.5 | 21/128 | 9.8 | 8/60 | 7.9 |
ACE | 30/94 | 19.5 | 7/17 | 25.2 | 0/0 | 0.0 |
PRGR | 33/300 | 6.0 | 31/150 | 11.2 | 25/54 | 25.1 |
HIVRT | 47/214 | 12.5 | 17/76 | 12.7 | 5/22 | 13.0 |
PNPH | 44/133 | 22.7 | 29/62 | 32.0 | 9/22 | 28.0 |
KPCB | 47/74 | 41.5 | 25/33 | 49.5 | 15/16 | 61.3 |
IGF1R | 20/43 | 29.7 | 7/13 | 34.3 | 1/1 | 63.7 |
PDE5A | 61/201 | 21.3 | 27/52 | 36.4 | 15/22 | 47.8 |
Average | 33/122b | 18.0 | 16/45b | 23.6 | 6/15b | 25.7 |
Next, we considered a combination of the three options A, B, and C, in the following fashion: if a molecule had a maximum of two MPs, the corresponding ranks obtained with those two programs should be within the top 5%; with a maximum of three MPs, those corresponding three ranks should be within the top 10%; with four MPs, the four ranks ought to be in the top 20%. While this strategy (named option D) showed a slightly less average EF than options B and C (20.0 vs. 25.7), there were no cases where actual ligands could not be found. Therefore, it was preferred over each individual option. We explored other combinations of ranking thresholds which are presented in Table S5,† but 5%, 10% and 20% for two, three and four MPs, respectively, was the best choice (similar results were also obtained with values of 5%, 10% and 25%).
Receptor | Option D | PRC | |||
---|---|---|---|---|---|
A/Sa | EF | A/Sa | EF | HR | |
a Number of actives and selected molecules for each target. b Average (A)/average (S). | |||||
KITH | 15/23 | 14.8 | 13/15 | 19.7 | 0.87 |
PA2GA | 16/30 | 22.4 | 12/16 | 31.5 | 0.75 |
FA7 | 64/73 | 30.7 | 44/45 | 34.3 | 0.98 |
HXK4 | 15/50 | 11.6 | 9/23 | 15.2 | 0.39 |
CDK2 | 14/33 | 12.6 | 11/17 | 19.3 | 0.65 |
COX1 | 11/111 | 3.4 | 8/47 | 5.8 | 0.17 |
FABP4 | 20/37 | 27.6 | 20/25 | 40.9 | 0.80 |
HSP90a | 11/39 | 11.4 | 8/21 | 15.4 | 0.38 |
ESR1 | 33/80 | 21.7 | 33/53 | 32.8 | 0.62 |
NRAM | 14/29 | 14.0 | 9/19 | 13.8 | 0.47 |
ADRB2 | 53/101 | 21.1 | 35/60 | 23.4 | 0.58 |
HMDH | 25/55 | 23.8 | 14/30 | 24.5 | 0.47 |
DRD3 | 7/77 | 3.6 | 6/48 | 5.0 | 0.13 |
HDAC2 | 23/68 | 19.2 | 21/43 | 27.7 | 0.49 |
LFA1 | 9/59 | 9.5 | 8/43 | 11.6 | 0.19 |
LKHA4 | 29/130 | 12.6 | 18/69 | 14.7 | 0.26 |
UROK | 46/70 | 40.5 | 46/50 | 56.8 | 0.92 |
ABL1 | 36/125 | 17.3 | 33/75 | 26.4 | 0.44 |
PTN1 | 31/128 | 13.7 | 24/57 | 23.9 | 0.42 |
XIAP | 4/8 | 26.2 | 3/6 | 26.2 | 0.5 |
ANDR | 12/94 | 6.9 | 10/40 | 13.5 | 0.25 |
Renin | 15/20 | 50.6 | 14/17 | 55.6 | 0.82 |
GRIA2 | 25/83 | 22.9 | 20/52 | 29.2 | 0.38 |
ALDR | 55/135 | 23.5 | 51/81 | 36.3 | 0.63 |
DYR | 34/112 | 22.8 | 24/70 | 25.8 | 0.34 |
PYRD | 31/80 | 22.9 | 30/51 | 34.7 | 0.59 |
DHI1 | 23/245 | 5.6 | 21/136 | 9.2 | 0.15 |
ACE | 27/86 | 19.2 | 22/59 | 22.8 | 0.37 |
PRGR | 36/185 | 10.5 | 30/94 | 17.3 | 0.32 |
HIVRT | 36/169 | 13.5 | 28/97 | 16.5 | 0.29 |
PNPH | 29/87 | 22.8 | 26/51 | 34.9 | 0.51 |
KPCB | 43/65 | 43.2 | 42/51 | 53.8 | 0.82 |
IGF1R | 20/38 | 33.6 | 20/33 | 38.6 | 0.61 |
PDE5A | 46/153 | 21.1 | 41/98 | 29.3 | 0.42 |
Average | 27/85b | 20.0 | 22/50b | 26.1 | 0.50 |
Fig. 1 shows a schematic representation of this Pose/Ranking Consensus (PRC) pipeline. Starting from the binding poses and ranks obtained with the four docking programs, a pose/ranking filtering approach is carried out. For this, the maximum number of MPs (1–4) is assessed for each molecule, coupled with identifying those programs where the poses matched. Then, the ones with four MPs are identified and filtered according to the 20% rank threshold in the corresponding programs. The same is performed for three MPs (10% rank threshold), and two MPs (5% rank threshold). In parallel, the ECR method is calculated onto the whole database. The molecules that pass the pose/ranking filters are ordered by their corresponding ECR, previously calculated, and the ones in the top 1.5% are finally selected.
In Table 5, we show the performance of the PRC method in terms of EF and number of actual ligands (actives) retrieved for each target. EF values of option D selection strategy are also presented. The last column shows the hit rate (probability of finding an actual ligand within the selected pool of molecules) in the PRC selected compounds. It can be readily noticed from these results that both the pose/ranking filtering and ECR threshold requirements are important to achieve high EF values. The PRC showed the best performance in almost every target evaluated, with the exception of one case (NRAM) where the difference was minimal.
As can be seen from Table 5, our method results in very high enrichment values, with an appropriate number of ligands. The latter could be critical in a prospective scenario, where the number of actual ligands might be scarce. When viewed in terms of probability, an average hit rate of 50% is achieved on the subsets of molecules selected. The maximum value (98%) was obtained for FA7 where 44 out of 45 selected molecules were ligands. DRD3 showed the lowest hit rate value (13%) and the lowest number of ligands retrieved (6). In 2016, Tuccinardi et al. achieved an average hit rate of 45% (vs. 50% with PRC), which they required to be at the level of the best performing methods.31 We note, however, that the results they report correspond to the maximum hit rate that can be obtained for each target, which depends on the number of MPs used, and therefore is not directly applicable in a prospective analysis.
This novel method achieves very high EF values, greatly surpassing previous pose consensus techniques and ranking consensus techniques, including the ECR, as it is shown in the next section. The results are especially higher for those targets that have a poor performance in the four docking programs (and ECR), reaching EFs of more than triple the values of EF1 ECR (see below and Table 6).
Receptor | ECR EF1 | PRC EF | Fold increase |
---|---|---|---|
a Average of the fold increase values. | |||
KITH | 12.5 | 19.7 | 1.58 |
PA2GA | 25.4 | 31.5 | 1.24 |
FA7 | 34.5 | 34.3 | 0.99 |
HXK4 | 5.5 | 15.2 | 2.76 |
CDK2 | 18.5 | 19.3 | 1.04 |
COX1 | 3.4 | 5.8 | 1.71 |
FABP4 | 40.5 | 40.9 | 1.01 |
HSP90a | 4.9 | 15.4 | 3.14 |
ESR1 | 35.1 | 32.8 | 0.93 |
NRAM | 4.5 | 13.8 | 3.07 |
ADRB2 | 24.5 | 23.4 | 0.96 |
HMDH | 17.1 | 24.5 | 1.43 |
DRD3 | 3.2 | 5.0 | 1.56 |
HDAC2 | 13.6 | 27.7 | 2.04 |
LFA1 | 10.9 | 11.6 | 1.06 |
LKHA4 | 15.2 | 14.7 | 0.97 |
UROK | 44.5 | 56.8 | 1.28 |
ABL1 | 25.3 | 26.4 | 1.04 |
PTN1 | 29.5 | 23.9 | 0.81 |
XIAP | 20.2 | 26.2 | 1.30 |
ANDR | 9.0 | 13.5 | 1.50 |
Renin | 17.4 | 55.6 | 3.20 |
GRIA2 | 19.8 | 29.2 | 1.47 |
ALDR | 33.5 | 36.3 | 1.08 |
DYR | 26.1 | 25.8 | 0.99 |
PYRD | 26.3 | 34.7 | 1.32 |
DHI1 | 8.8 | 9.2 | 1.05 |
ACE | 14.3 | 22.8 | 1.59 |
PRGR | 9.2 | 17.3 | 1.88 |
HIVRT | 15.1 | 16.5 | 1.09 |
PNPH | 37.1 | 34.9 | 0.94 |
KPCB | 45.3 | 53.8 | 1.19 |
IGF1R | 18.3 | 38.6 | 2.11 |
PDE5A | 17.1 | 29.3 | 1.71 |
Average | 20.3 | 26.1 | 1.50a |
We also analyzed for each target the ECR EF when selecting the same number of molecules from the top as those returned by the PRC. It should be noted that this is not a measure of practical value in prospective HTD, as this threshold is never known beforehand. However, the PRC in this case also surpassed the ECR, showing, on average, a 1.33-fold increase; moreover, our method showed an eight times higher EF in HXK4, and still showed 3-fold increase values in the worst performing targets.
Taking into account that the ECR already represents an improvement of the results over previous consensus strategies and to individual programs, these results show that the PRC method allows for significantly higher hit rates and EF values, with a minimal computational cost, and can therefore reach better results in future prospective HTD campaigns.
Receptor | Option D | PRC | |||
---|---|---|---|---|---|
A/Sa | EF | A/Sa | EF | HR | |
a Number of actives and selected molecules for each target. b Average (A)/average (S). | |||||
KITH | 3/17 | 4 | 3/9 | 7.6 | 0.33 |
PA2GA | 11/16 | 28.9 | 10/13 | 32.4 | 0.77 |
FA7 | 27/40 | 23.7 | 22/29 | 26.6 | 0.76 |
HXK4 | 2/31 | 2.5 | 1/22 | 1.8 | 0.05 |
CDK2 | 17/34 | 14.9 | 12/22 | 16.3 | 0.55 |
COX1 | 11/236 | 1.6 | 5/91 | 1.9 | 0.05 |
FABP4 | 13/64 | 10.4 | 12/23 | 26.7 | 0.52 |
HSP90a | 2/48 | 1.7 | 0/31 | 0 | 0 |
ESR1 | 36/159 | 11.9 | 31/80 | 20.4 | 0.39 |
NRAM | 4/33 | 3.5 | 2/25 | 2.3 | 0.08 |
ADRB2 | 23/119 | 7.8 | 20/73 | 11.1 | 0.27 |
HMDH | 16/41 | 20.4 | 7/21 | 17.5 | 0.33 |
DRD3 | 9/151 | 2.4 | 6/70 | 3.4 | 0.09 |
HDAC2 | 22/72 | 17.3 | 16/38 | 23.9 | 0.42 |
LFA1 | 10/86 | 7.3 | 9/53 | 10.6 | 0.17 |
LKHA4 | 46/181 | 14.3 | 31/74 | 23.6 | 0.42 |
UROK | 46/89 | 31.9 | 43/62 | 42.8 | 0.69 |
ABL1 | 22/172 | 7.7 | 19/85 | 13.4 | 0.22 |
PTN1 | 28/80 | 19.8 | 23/43 | 30.3 | 0.53 |
XIAP | 2/11 | 9.5 | 2/9 | 11.7 | 0.22 |
ANDR | 14/164 | 4.6 | 13/97 | 7.3 | 0.13 |
Renin | 9/18 | 33.7 | 9/13 | 46.7 | 0.69 |
GRIA2 | 25/95 | 19.9 | 16/51 | 23.8 | 0.31 |
ALDR | 45/187 | 13.9 | 38/90 | 24.3 | 0.42 |
DYR | 24/179 | 10.1 | 20/104 | 14.5 | 0.19 |
PYRD | 28/88 | 18.8 | 24/50 | 28.3 | 0.48 |
DHI1 | 25/317 | 4.7 | 20/169 | 7.1 | 0.12 |
ACE | 16/87 | 11.3 | 11/49 | 13.7 | 0.22 |
PRGR | 36/313 | 6.2 | 21/149 | 7.6 | 0.14 |
HIVRT | 32/269 | 6.7 | 14/150 | 5.3 | 0.09 |
PNPH | 24/90 | 18.3 | 20/52 | 26.3 | 0.38 |
KPCB | 41/92 | 29.1 | 39/56 | 45.5 | 0.70 |
IGF1R | 20/61 | 20.9 | 20/42 | 30.4 | 0.48 |
PDE5A | 44/210 | 14.7 | 39/140 | 19.5 | 0.28 |
Average | 27/140b | 13.4 | 22/78b | 18.4 | 0.34 |
In Table 8 we compare the results of the PRC and ECR for free docking programs. Better results were obtained in 28 of the 34 targets with an average 1.62-fold increase of the PRC method over the ECR EF1. Of the remaining six, ANDR is the one that shows the highest decrease. In this target, the docking programs did not perform well, with rDock showing zero EF1. For HSP90a, neither the ECR nor the PRC exhibited good results. In option D it achieved a slightly better EF than the ECR EF1, and it may be a better selection strategy for cases where individual performances in terms of scoring are very poor. Regarding ESR1 and ABL1, while they still show acceptable EF values, they performed slightly worse than ECR. This was also the case for ESR1 with the previous procedure (Table 6). It should be noted, anyway, that the number of selected molecules for this target (80) is higher than 1% of its database (67).
Receptor | ECR EF1 | PRC EF | Fold increase |
---|---|---|---|
a Average of the fold increase values. | |||
KITH | 2.3 | 7.6 | 3.22 |
PA2GA | 16.7 | 32.4 | 1.94 |
FA7 | 24.1 | 26.6 | 1.1 |
HXK4 | 0.8 | 1.8 | 2.23 |
CDK2 | 12.8 | 16.3 | 1.27 |
COX1 | 1 | 1.9 | 1.95 |
FABP4 | 19.4 | 26.7 | 1.38 |
HSP90a | 0 | 0 | 1 |
ESR1 | 23.9 | 20.4 | 0.85 |
NRAM | 0.5 | 2.3 | 5.12 |
ADRB2 | 10.8 | 11.1 | 1.03 |
HMDH | 7.6 | 17.5 | 2.30 |
DRD3 | 3.1 | 3.4 | 1.11 |
HDAC2 | 16.3 | 23.9 | 1.46 |
LFA1 | 12.3 | 10.6 | 0.86 |
LKHA4 | 18.2 | 23.6 | 1.30 |
UROK | 30.9 | 42.8 | 1.39 |
ABL1 | 18.2 | 13.4 | 0.74 |
PTN1 | 33.4 | 30.3 | 0.91 |
XIAP | 6.1 | 11.7 | 1.92 |
ANDR | 10.5 | 7.3 | 0.70 |
Renin | 12.5 | 46.7 | 3.74 |
GRIA2 | 14.7 | 23.8 | 1.62 |
ALDR | 25.3 | 24.3 | 0.96 |
DYR | 10.4 | 14.5 | 1.39 |
PYRD | 20.0 | 28.3 | 1.42 |
DHI1 | 6.1 | 7.1 | 1.16 |
ACE | 5.0 | 13.7 | 2.74 |
PRGR | 4.1 | 7.6 | 1.85 |
HIVRT | 5.3 | 5.3 | 1 |
PNPH | 25.4 | 26.3 | 1.04 |
KPCB | 37.9 | 45.5 | 1.20 |
IGF1R | 17.6 | 30.4 | 1.73 |
PDE5A | 12.9 | 19.5 | 1.51 |
Average | 13.7 | 18.4 | 1.62a |
A very noticeable improvement of PRC over ECR can be seen for KITH, HXK4, NRAM, HMDH, renin and ACE, where EF values of more than double the ECR EF1 were obtained. The average fold increase was even higher than in the previous case (1.62 vs. 1.50), therefore confirming the applicability of PRC method even when only free docking programs were available.
In spite of the obvious success, we would like to point out two facts related to this methodology: (i) it is still dependent on the performance of the individual programs on the target. If no program managed to perform well, the PRC method would still improve the results obtained, but in a limited way; (ii) option D (cf.Table 5) is a good alternative in a prospective case when it is suspected that a little number of actual ligands might be present in the query database, or when the target belongs to a family of proteins that does not usually perform well in HTD campaigns, since it will likely retrieve more ligands. While (i) is a common limitation to all consensus strategies, PRC shows itself as a promising tool to by-pass it. In a follow-up contribution, we will evaluate the dependence of the method on the relationship between the number of ligands and decoys in the database for each target.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1ra05785e |
This journal is © The Royal Society of Chemistry 2021 |