 Open Access Article
 Open Access Article
      
        
          
            Evgeny 
            Gutkin‡
          
        
        
       a, 
      
        
          
            Filipp 
            Gusev‡
a, 
      
        
          
            Filipp 
            Gusev‡
          
        
       ab, 
      
        
          
            Francesco 
            Gentile‡
ab, 
      
        
          
            Francesco 
            Gentile‡
          
        
       de, 
      
        
          
            Fuqiang 
            Ban
          
        
      c, 
      
        
          
            S. Benjamin 
            Koby
de, 
      
        
          
            Fuqiang 
            Ban
          
        
      c, 
      
        
          
            S. Benjamin 
            Koby
          
        
       a, 
      
        
          
            Chamali 
            Narangoda
          
        
      a, 
      
        
          
            Olexandr 
            Isayev
a, 
      
        
          
            Chamali 
            Narangoda
          
        
      a, 
      
        
          
            Olexandr 
            Isayev
          
        
       *ab, 
      
        
          
            Artem 
            Cherkasov
*ab, 
      
        
          
            Artem 
            Cherkasov
          
        
       *c and 
      
        
          
            Maria G. 
            Kurnikova
*c and 
      
        
          
            Maria G. 
            Kurnikova
          
        
       *a
*a
      
aDepartment of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA. E-mail: olexandr@olexandrisayev.com; kurnikova@cmu.edu
      
bComputational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
      
cVancouver Prostate Centre, The University of British Columbia, Vancouver, BC, Canada. E-mail: acherkasov@prostatecentre.com
      
dDepartment of Chemistry and Biomolecular Sciences, University of Ottawa, Ottawa, ON, Canada
      
eOttawa Institute of Systems Biology, Ottawa, ON, Canada
    
First published on 11th April 2024
The Critical Assessment of Computational Hit-Finding Experiments (CACHE) Challenge series is focused on identifying small molecule inhibitors of protein targets using computational methods. Each challenge contains two phases, hit-finding and follow-up optimization, each of which is followed by experimental validation of the computational predictions. For the CACHE Challenge #1, the Leucine-Rich Repeat Kinase 2 (LRRK2) WD40 Repeat (WDR) domain was selected as the target for in silico hit-finding and optimization. Mutations in LRRK2 are the most common genetic cause of the familial form of Parkinson's disease. The LRRK2 WDR domain is an understudied drug target with no known molecular inhibitors. Herein we detail the first phase of our winning submission to the CACHE Challenge #1. We developed a framework for the high-throughput structure-based virtual screening of a chemically diverse small molecule space. Hit identification was performed using the large-scale Deep Docking (DD) protocol followed by absolute binding free energy (ABFE) simulations. ABFEs were computed using an automated molecular dynamics (MD)-based thermodynamic integration (TI) approach. 4.1 billion ligands from Enamine REAL were screened with DD followed by ABFEs computed by MD TI for 793 ligands. 76 ligands were prioritized for experimental validation, with 59 compounds successfully synthesized and 5 compounds identified as hits, yielding a 8.5% hit rate. Our results demonstrate the efficacy of the combined DD and ABFE approaches for hit identification for a target with no previously known hits. This approach is widely applicable for the efficient screening of ultra-large chemical libraries as well as rigorous protein–ligand binding affinity estimation leveraging modern computational resources.
The absence of experimentally validated ligands presents a major difficulty for drug discovery. Since docking typically requires only the structure of the protein binding pocket, SBVS is routinely used in cases where no binders are known for a given target protein. However, the recent exponential expansion of the sizes of the available chemical libraries poses a significant challenge for SBVS. Advancements in automated chemical synthesis and the surge of available chemicals have led to the emergence of ultra-large virtual databases that include billions of compounds. For instance, Enamine REAdily accessibLe (REAL) database14 currently contains over 38 billion compounds, with 5.5 billion drug-like compounds that can be synthesized at high success rate (>80%). It is noteworthy that the “make-on-demand” libraries are expanding without compromising chemical diversity, with novel scaffolds being consistently incorporated.11 At the same time, the current capabilities of docking software and the amount of available computational resources rarely allow for SBVS campaigns to surpass hundreds of millions of molecules threshold,15 with several exceptions.10,16,17 Therefore, much faster screening protocols are required to enable an efficient screening of currently available chemical space.
Recently, Gentile et al.18 developed the Deep Docking (DD) method to screen billions of molecules using a deep learning machine learning (ML) model capable of predicting docking scores. DD utilizes a multilayer perceptron (MLP) model which is trained using docking scores and circular chemical fingerprints. MLP model training uses active learning (AL). In this iterative approach, an ML model is first trained on the results of structural docking of a small subset of molecules and then used to predict docking scores for the rest of the chemical library. Next, a sample of ML predicted virtual hits is structurally docked and used for training set augmentation and ML model retraining. The latter step is repeated iteratively until the model converges or a pre-defined number of iterations is reached. AL iteratively improves the ML model performance and ability to identify top-scoring molecules. Screening an ultra-large database using DD was demonstrated to provide enrichment factors of up to 6000 for top-ranked hits with an approximate 50× speed acceleration compared to brute force docking. For example, the application of DD for the SBVS of ZINC15![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 19 database against the SARS-CoV-2 main protease yielded several novel experimentally validated inhibitors.18
19 database against the SARS-CoV-2 main protease yielded several novel experimentally validated inhibitors.18
The second major challenge for SBVS stems from the limited accuracy of modern scoring functions, resulting in a large share of false positives among top-scoring ligands and thus the limited success rate of virtual screening campaigns.13 While the assessment of docking results by expert medicinal chemists can reduce20 this deficiency, this approach is not feasible for large libraries; thus, automated ways of discarding false positives from docking hits are needed. One common approach to reduce the number of artifacts is to dock each molecule using multiple programs, then re-ranking the molecules based on a consensus docking score.21,22
The predictive power of SBVS can be further improved by rescoring docking hits using absolute binding free energy (ABFE) simulations based on classical molecular dynamics (MD) simulations.23–25 This method utilizes a thermodynamic pathway between the two end-point states: the protein–ligand complex and the free protein and ligand in solvent. To achieve the most accurate results, all-atom simulations in explicit solvent are required.24–26 For example, benchmarking of ABFE simulations on congeneric series of ligands binding to eight protein targets demonstrated an agreement between calculations and experiment with a root mean squared error of 1.1 kcal mol−1 and a weighted average R2 of 0.55.24
In this work, we combined DD and ABFE simulations for the in silico screening of Enamine REAL (4.1B compounds) for potential inhibitors of leucine rich repeat kinase 2 (LRRK2) targeting the WD40 repeat (WDR) domain. Mutations in LRRK2 are known as the most common genetic cause of the familial form of Parkinson's disease (PD).27 Therefore, inhibitors of LRKK2 are considered promising therapeutic agents for the treatment of PD. LRRK2 is a large (286 kDa), multi-functional protein which contains seven unique domains: three N-terminal repeat domains, the GTPase domain, the dimerization domain, the kinase domain, and the C-terminal WDR domain.28 Pathogenic mutations of LRRK2 were suggested to induce the kinase domain to adopt a closed active conformation which promotes the binding of LRRK2 to microtubules. In vitro motility assays demonstrated that LRKK2 mutants inhibit kinesin and dynein movement along microtubules, suggesting that these mutants act as roadblocks for the motors.28 Kinase inhibitors that stabilize the open conformation of LRRK2 were shown to reduce the formation of LRRK2 filaments and recover the motor motility. However, most kinase inhibitors stabilize the closed conformation of the kinase domain and do not inhibit the oligomerization of LRKK2 on microtubules.28
The WDR domain is in close proximity to the kinase domain and can play a role in modulating LRRK2 kinase activity. The WDR domain is a closed protein solenoid domain consisting of repeating units forming a circularized beta-propeller structure.29,30 The structural model of the microtubule bound LRRK2 filaments suggested that the WDR domain mediates protein dimerization and formation of the LRRK2 filaments.28 Therefore, targeting the WDR domain is a promising yet underutilized strategy to antagonize the pathogenic LRRK2 filament formation. Identifying small molecule compounds binding to the WDR domain of LRRK2 can thus open the pathway towards the design of efficient therapeutics for the treatment of PD associated with LRKK2 malfunction.
Our workflow for in silico hit identification included the following steps: first, DD was performed for the Enamine REAL database to the LRRK2 WDR domain. Two sets of ligands from top-scoring docked molecules for reranking with ABFE simulations were selected: (1) molecules with the best consensus docking score, and (2) molecules chosen based on visual inspection of docked poses. For both sets, 7 ns MD simulations were performed to obtain equilibrated ligand poses in the protein binding pocket. ABFE for these poses were then computed using an automated MD TI protocol developed in our previous work.9 The molecules were then re-ranked by computed ABFE and top-scoring molecules were submitted for experimental testing.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 132
132![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 679
679![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 525 molecules, in SMILES format.14 Structures of small molecules were prepared for docking to the protein by processing SMILES with OpenEye tautomers tool31 to calculate one dominant ionization state per compound followed by translating them into 3D low-energy structures with OpenEye omega tool.32 For docking with AutoDock-GPU,33 input files were generated with OpenBabel.34
525 molecules, in SMILES format.14 Structures of small molecules were prepared for docking to the protein by processing SMILES with OpenEye tautomers tool31 to calculate one dominant ionization state per compound followed by translating them into 3D low-energy structures with OpenEye omega tool.32 For docking with AutoDock-GPU,33 input files were generated with OpenBabel.34
        ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 39 within a 40 × 40 × 40 points grid with 0.375 Å spacing. Parameters for AutoDock-GPU were set as 5 million energy evaluations per Lamarckian genetic algorithm run for 10 independent runs using Solis-Wets local search, results were clustered based on a root-mean-square deviation (RMSD) threshold of 2 Å, and the best solution was chosen as the best scoring pose from the most populated cluster.40 ICM maps were generated with ICM's GUI. For ICM docking the thoroughness value was 3.
39 within a 40 × 40 × 40 points grid with 0.375 Å spacing. Parameters for AutoDock-GPU were set as 5 million energy evaluations per Lamarckian genetic algorithm run for 10 independent runs using Solis-Wets local search, results were clustered based on a root-mean-square deviation (RMSD) threshold of 2 Å, and the best solution was chosen as the best scoring pose from the most populated cluster.40 ICM maps were generated with ICM's GUI. For ICM docking the thoroughness value was 3.
        ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 molecules. The same process was then repeated using ICM with sample sizes of 120
000 molecules. The same process was then repeated using ICM with sample sizes of 120![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 molecules.
000 molecules.
        The simulations were performed using the GPU-accelerated pmemd.cuda module of AMBER20.45–48 A simulation time step of 2 fs was used with hydrogen atoms constrained via SHAKE. A Langevin thermostat and Berendsen barostat were used to control temperature and pressure, respectively. Long-range electrostatics were calculated using the Particle Mesh Ewald method with a cutoff distance of 10 Å.
Only the WDR domain monomer chain A (residues A2141 to A2496) and crystal water molecules were included in the simulation. The missing loop regions of the protein (residues 2159–2163, 2254–2258, 2306–2313, 2396–2407, 2463–2467, and 2478–2487) were modeled using the ModLoop web server.49,50 The remaining missing atoms and hydrogens were added using the tleap program of AMBERTools18. The protein was solvated with water extending 10 Å in all directions and Cl− ions were added to neutralize the system.
The system energy was minimized for 3000 steps and heated at constant volume from 0.1 K to 300 K in 250 ps MD. The protein Cα atoms were restrained using a harmonic potential with a force constant (k) of 20 kcal mol−1 Å−2 during the above stages. The system was then equilibrated for 40 ns in the NPT ensemble. The restraints on the protein were gradually released during equilibration and only the Cα atoms of the end residues (residues 2141–2142 and 2495–2496) were restrained with k = 1 kcal mol−1 Å−2 for the final 25 ns. A production run of 100 ns was then carried out in NVT ensemble with the Cα atoms of the end residues restrained.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 52 and ligand atomic charges were derived using the RESP51 method. GPU-accelerated MD simulations were performed using the pmemd.cuda module of AMBER20. The simulation protocol included the following steps: (1) 2000 steps of minimization with gradient descent method; (2) 100 ps of heating at constant volume from 1 K to 298 K (3) 300 ps of density equilibration in NPT ensemble; (4) 7 ns of production simulation in NVT. Harmonic RMSD restraints were imposed on heavy atoms of the protein and the ligand during minimization and heating and were gradually removed during density equilibration; no restraints were used during production simulations.
52 and ligand atomic charges were derived using the RESP51 method. GPU-accelerated MD simulations were performed using the pmemd.cuda module of AMBER20. The simulation protocol included the following steps: (1) 2000 steps of minimization with gradient descent method; (2) 100 ps of heating at constant volume from 1 K to 298 K (3) 300 ps of density equilibration in NPT ensemble; (4) 7 ns of production simulation in NVT. Harmonic RMSD restraints were imposed on heavy atoms of the protein and the ligand during minimization and heating and were gradually removed during density equilibration; no restraints were used during production simulations.
          To identify unstable ligands, the average RMSD of ligand heavy atoms with respect to the average ligand structure (RMSDavg) was calculated for each protein–ligand complex using the cpptraj package.53 The first 2 ns of the MD production simulation was discarded and the average structure was obtained from the last 5 ns of the simulation. The RMSDavg was computed for each trajectory frame and its arithmetic mean was obtained. The MD trajectories of a subset of ligands with different average RMSDavg values were visually inspected. Based on this visual inspection, ligands with the average RMSDavg over 3 Å were discarded due to the instability of binding mode. For each ligand with average RMSDavg in the equilibrium MD simulation not exceeding 3 Å, a trajectory frame with the minimum RMSDavg was selected as a representative structure and used in further analysis and TI simulations.
|  | (1) | 
| STD = 1 − STS | (2) | 
The matrix of STDs was used to perform clustering with the DBSCAN algorithm as implemented in the scikit-learn library.55 Ligands assigned to the same cluster were visually validated.
During TI simulations of the protein–ligand complex, the position and orientation of the ligand with respect to the protein was restrained using the virtual bond approach.56 The carbonyl carbons of Cys2302 and Leu2303, and the backbone oxygen of Leu2303 were used for the virtual bond restraints as the protein atoms P1, P2 and P3, respectively (Fig. S1†). The majority of ligands provided by DD and expert selection contained an amide group. The requisite ligand atoms for the virtual bond restraints were selected by the following algorithm:
1. The amide group of the ligand with the oxygen closest to the carbonyl carbon of residue Cys2302 (P1) was selected.
2. The following two angles were compared: P1–O–C versus P1–O–N, where O, C and N are oxygen, carbon and nitrogen atoms of the ligand amide group, respectively. The atom combination with an angle less than 150°, greater than 30° and closest to 90° were selected as L1 and L2.
3. If the L1 and L2 were O and C, then N was selected as L3. If L1 and L2 were O and N, then the alpha carbon of the ligand amide group (Cα) was selected as L3.
4. If both atom combinations had angles that were greater than 150° or less than 30°, then N, Cα, and O were selected as L1, L2 and L3.
For the majority of ligands, this algorithm was successful for the selection of atoms for the virtual bond; however, some ligands had no amide groups, had amide groups with unsuitable angles, or amide groups too far on the periphery of the ligand. In these cases, the representative structure was visually inspected and restraint atoms were selected such that they:
1. Included rigid bonds, ideally sp2 hybridized carbons or otherwise constrained species.
2. The angles P2–P1–L1, P1–L1–L2 and L1–L2–L3 were close to 90°.
3. Were located close to the center of mass of the ligand.
Average gradients were calculated from the last 4.5 ns of the production simulations using the alchemlib python library.57 Errors were estimated using the bootstrap method. The free energies for both the free ligand, protein, and restraints were obtained by the trapezoid rule. The free energy of adding virtual bond restraints for the non-interacting ligand was calculated using the Boresch formula.56 The final values of ABFE were obtained according to the following equation (see Fig. 1 for the representation of free energy terms):
| ΔG = ΔG0bind = ΔGwatint − ΔGprotint − ΔGprot+VB − ΔGprot−VB. | (3) | 
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) S) was predicted using an XGBoost58 model trained on alvaDesc molecular descriptors.59 Reference data was extracted from the literature,60 and the dataset was curated according to a well-known protocol.61,62 All models were trained within two nested five-fold cross validation loops and all splits were random. Final scoring was done by a simple averaging of five predictions from the external CV loop. The internal CV loop was used to perform hyperparameter search and variable selection for the corresponding fold using the protocol described in our previous work.62
S) was predicted using an XGBoost58 model trained on alvaDesc molecular descriptors.59 Reference data was extracted from the literature,60 and the dataset was curated according to a well-known protocol.61,62 All models were trained within two nested five-fold cross validation loops and all splits were random. Final scoring was done by a simple averaging of five predictions from the external CV loop. The internal CV loop was used to perform hyperparameter search and variable selection for the corresponding fold using the protocol described in our previous work.62
        Molecules with a predicted log![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) S lower than −5.5 were discarded from further selection. The remaining molecules were ranked based on ABFE, and the 150 molecules with the lowest ABFE were submitted to Enamine for price estimation and availability. Synthetic accessibility was verified for 127 molecules by the vendor. Considering the updated price estimate and the funding available from the CACHE Challenge, the final set of 76 molecules with the lowest ABFE was ordered for experimental validation.
S lower than −5.5 were discarded from further selection. The remaining molecules were ranked based on ABFE, and the 150 molecules with the lowest ABFE were submitted to Enamine for price estimation and availability. Synthetic accessibility was verified for 127 molecules by the vendor. Considering the updated price estimate and the funding available from the CACHE Challenge, the final set of 76 molecules with the lowest ABFE was ordered for experimental validation.
The docked structure of the 804 compounds selected as described above were equilibrated with molecular dynamics simulations in aqueous solution and standard thermodynamic conditions. All unstable protein–ligand complex structures were discarded as described below. Representative structures for each protein–ligand complex were extracted from the final 5 ns of the equilibrium trajectory and used for ABFE simulations. In total, 793 ABFE simulations were completed (see Fig. 2A). 77 molecules had a ΔG of less than −6.5 kcal mol−1. The top 76 compounds based on computed ΔG were purchased for experimental characterization, of which 59 compounds were successfully synthesized. The compounds not synthesized were weighted towards our top compounds, with 4 of our top 10 not synthesized while only 2 of our bottom 20 compounds failed synthesis. The mean ABFE of synthesized compounds was −7.70 kcal mol−1 while the mean ABFE of compounds that were not synthesized was −8.28 kcal mol−1.
Seven compounds were identified as hit molecules by the CACHE Challenge organizers based on their binding affinities measured via surface plasmon resonance (SPR); however, two of these compounds failed subsequent orthogonal validation via19F nuclear magnetic resonance (NMR) or isothermal titration calorimetry (ITC), resulting in a final list of five hit compounds. Thus, we achieved an overall hit rate of 8.5%. Four hit compounds were obtained from the Expert set while one came from Consensus set, which approximately matches their relative frequencies within the synthesized set of compounds (Expert set: 50 compounds; Consensus set: 9 compounds). Experimental validation performed by the CACHE Challenge organizers will be detailed in a follow up publication. Overall results of the CACHE Challenge #1 can be found here: https://cache-challenge.org/results-cache-challenge-1.
Hit 4 deviates in its binding pose when compared to the rest of the hits and has the highest computed ABFE among all hit compounds. This ligand adopts a U-shaped conformation; however, it is not within the U-shaped cluster (see “Binding pose analysis” section), as its binding pose is located closer to the periphery of the binding site. The conformation and orientation of Hit 4 are relatively stable in equilibrium MD simulations. Both the nitrogens of indole ring and amide group can form hydrogen bonds with the backbone of Lys2415. The indole ring also forms cation–π interactions with the amino group of the Lys2415 sidechain. The hydroxyl group connected to the indole ring forms a hydrogen bond with the backbone oxygen of Val2414, and the cyclohexane moiety forms hydrophobic interactions with Leu2200.
|  | ||
| Fig. 4 (A) The binding site of WDR domain of LRRK2 predicted by MOE Site finder. (B) Putatively important hydrophobic residues for ligand binding, in complex with one of the virtual hits. | ||
Docking grids centered on residues Ile2355 and Met2301 of the WDR domain were generated (Fig. 4B). The sequential Deep Docking process (Fig. 2B) reduced the size of Enamine REAL library from 4![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 132
132![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 679
679![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 525 to 17
525 to 17![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 885
885![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 310 molecules (231-fold reduction). In total, 23
310 molecules (231-fold reduction). In total, 23![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 088
088![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 153 molecules were docked to the LRRK2 target as part of the training (0.56% of the library). Consequently, the ∼17.9 million molecules predicted as hits by Deep Docking were docked to LRRK2 with Glide SP and AutoDock-GPU, and filtered based on the RMSD value between the poses generated by the programs. In total, 59
153 molecules were docked to the LRRK2 target as part of the training (0.56% of the library). Consequently, the ∼17.9 million molecules predicted as hits by Deep Docking were docked to LRRK2 with Glide SP and AutoDock-GPU, and filtered based on the RMSD value between the poses generated by the programs. In total, 59![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 187
187![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 165 molecules were structure docked to LRRK2 as both DD training and consensus docking procedures (1.4% of the Enamine REAL library).
165 molecules were structure docked to LRRK2 as both DD training and consensus docking procedures (1.4% of the Enamine REAL library).
Notably, applying consensus filtering drastically reduced the number of potential candidate molecules from Enamine REAL, 328![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 392 molecules passed the Consensus2 filter. Out of these, 100
392 molecules passed the Consensus2 filter. Out of these, 100![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 top scoring molecules were selected and the drug-likeness filter was applied followed by the best-first clustering of the top 5000 molecules. This reduced compound set contained 1891 molecules (Fig. 2B). The same protocol was applied to the Consensus3 molecular set resulting in 653 molecules. The top 100 molecules from each reduced compound set were selected and merged, resulting in 199 unique molecules considered for further evaluation (these molecules will be referred to as the Consensus set). In parallel, 605 unique molecules from the results of the three individual docking programs were selected via expert inspection of their predicted binding poses; compounds selected must have good hydrophobic interactions (see Fig. 4B) with Ile2355 and Met2301, and possibly good hydrophobic interactions with Val2457 and Met2459 (these molecules will be referred to as the Expert set). Thus, 804 molecules in total were sent for further evaluation by MD and ABFE simulations. Overall, the whole process took approximately 14 days in total, using 100 Tesla V100 GPUs for Autodock-GPU docking and machine learning and 1000 Intel® Xeon® Silver 4116 CPU@2.10 GHz cores for Glide and ICM docking and conformer generation. Brute-force screening of the entire library using the fastest screening tool (Autodock-GPU) and the same computational resources would have taken approximately 950 days just for the docking simulations.
000 top scoring molecules were selected and the drug-likeness filter was applied followed by the best-first clustering of the top 5000 molecules. This reduced compound set contained 1891 molecules (Fig. 2B). The same protocol was applied to the Consensus3 molecular set resulting in 653 molecules. The top 100 molecules from each reduced compound set were selected and merged, resulting in 199 unique molecules considered for further evaluation (these molecules will be referred to as the Consensus set). In parallel, 605 unique molecules from the results of the three individual docking programs were selected via expert inspection of their predicted binding poses; compounds selected must have good hydrophobic interactions (see Fig. 4B) with Ile2355 and Met2301, and possibly good hydrophobic interactions with Val2457 and Met2459 (these molecules will be referred to as the Expert set). Thus, 804 molecules in total were sent for further evaluation by MD and ABFE simulations. Overall, the whole process took approximately 14 days in total, using 100 Tesla V100 GPUs for Autodock-GPU docking and machine learning and 1000 Intel® Xeon® Silver 4116 CPU@2.10 GHz cores for Glide and ICM docking and conformer generation. Brute-force screening of the entire library using the fastest screening tool (Autodock-GPU) and the same computational resources would have taken approximately 950 days just for the docking simulations.
To further characterize protein and ligand stability during equilibrium MD simulations, we computed the RMSD of: (1) Cα atoms of protein residues that form the inner surface of the central cavity of WDR domain with respect to the minimized structure used for docking (protein RMSD), and (2) heavy atoms of the ligand from MD representative structure with respect to its docked structure (ligand RMSD). For all ligands selected for MD simulations, the average protein RMSD was below 1 Å, implying the structure of the protein binding site was stable during equilibrium simulations. In contrast, the stability of the ligand docked pose varied significantly for different ligands. The distribution of the ligand RMSD is depicted in Fig. 5A. Approximately 35% of ligands remained relatively stable with respect to the docked pose during MD (ligand RMSD ≤ 2 Å), while 45% of ligands had ligand RMSD from 2 Å to 4 Å, and 20% of ligands had ligand RMSD above 4 Å. On average, ligands from the Consensus set demonstrated smaller deviations from the docked pose compared to ligands from the Expert set. The ratio of ligands that deviated significantly from their docked pose was considerably larger for the Expert set than the Consensus set (Fig. 5A). For the Expert set, 47% and 23% of ligands had ligand RMSD from 2 Å to 4 Å and above 4 Å, respectively. For the Consensus set, these proportions were 31% and 10%, respectively. To estimate the conformational stability of ligands, we computed the best-fit RMSD of the MD representative pose of a ligand to its docked pose (conformational RMSD). For most ligands, conformations of docked poses remained relatively stable (see Fig. S2†). Only 6% ligands from the Expert set and 4% ligands from the Consensus set had conformational RMSD greater than 2 Å. Therefore, deviations from docked poses are mainly due to rigid-body translations and rotations of the ligand. Examples of molecules with ligand RMSD of less than 2 Å, ranging from 2 Å to 4 Å, and greater than 4 Å are shown in Fig. 5B–D.
While the ligand RMSD had no correlation with computed ABFE (Fig. 6C), the ligands with the lowest ABFE (ΔG of less than −10 kcal mol−1) had ligand RMSD from 2 to 4 Å for both sets. Interestingly, for the Expert set, the ligand RMSDs of ligands with the highest ABFE were also within this range. The spread of the ABFE distribution for ligands with ligand RMSD above 4 Å was smaller than for the rest of the ligands; however, the median and average ΔGs were quite similar.
Overall, system preparation, equilibrium MD simulations and MD TI simulations took approximately 40 days in total, using Intel Xeon E5-2620 CPUs for QM calculations for ligand charges and 100 Tesla V100 GPUs for MD simulations. The total computational cost of the QM calculations for all ligands was around 8400 CPU-hours. The total simulation time for equilibrium MD and MD TI simulations for all ligands was around 100 μs and the corresponding computational cost was around 13![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 GPU-hours.
000 GPU-hours.
The clustering resulted in ∼18% of ligands distributed between the four most populated clusters and ∼5% of ligands distributed between 17 clusters each of which included no more than five ligands. The rest of the molecules (∼77%) were identified as singletons. The clustering results are therefore indicative of significant diversity within the MD representative poses. Descriptive statistics of ligand ABFEs for the four most populated clusters is provided in Table 1. The name of each cluster corresponds to the MD representative pose ligand shape in the binding pocket. Remarkably, the ABFE distributions for L- and C1-shaped ligands are significantly shifted towards more negative ABFE values compared to the ABFE distribution for all ligands.
| Cluster | N | m(ΔG), kcal mol−1 | μ(ΔG) ± σ, kcal mol−1 | 
|---|---|---|---|
| L-shaped | 20 | −5.53 | −5.88 ± 3.02 | 
| C1-shaped | 14 | −2.94 | −3.70 ± 2.07 | 
| C2-shaped | 48 | −1.48 | −1.71 ± 3.35 | 
| U-shaped | 53 | −0.72 | −1.17 ± 3.51 | 
Molecules contained in the L-shaped cluster are composed of two sections. The first is long and rod-like and runs parallel to the interior of the protein. This section typically exhibits hydrogen bonds between a carbonyl oxygen and Thr2416 hydroxyl group, and an indole nitrogen and Lys2415 backbone oxygen. The second section runs perpendicular to the first section, forming the bend of the “L”. This section typically exhibits an oxamide group with hydrogen bonding between amide hydrogens and the backbone of both Met2155 and Leu2202. Using these hydrogen bonds as a reference, we identified an additional 16 ligands that exhibited hydrogen bonding to at least three of these residues in at least 40% of the frames of the equilibrium MD trajectory. These 16 ligands had an average ΔG of −2.84 ± 0.54 kcal mol−1. These results suggest that, while ligands exhibiting the L-shaped pose have significant similarity when considering hydrogen bonding interactions, these interactions by themselves are not enough to predict binding affinity. 8 out of 76 ligands submitted exhibit an L-shaped binding pose.
We virtually screened 4.1B compounds from the Enamine REAL library to identify binders of the central cavity of the WDR domain of LRRK2. Our computational workflow combined Deep Docking for the initial scoring of protein–ligand binding and ABFE simulations for the rescoring of docked poses. Deep Docking afforded the screening of this multi-billion compound chemical library within a reasonable timeframe, requiring only around 59 M compounds (1.4% of the library) to be docked with three programs (AutoDock-GPU, Glide SP, and ICM) while the rest of the compounds were deprioritized based on the ML-predicted docking score. Consensus filtering based on the deviation between docking poses obtained with different docking programs further reduced the number of compounds from 59 M to 238 K. The application of a drug-likeness filter followed by best-first clustering and the selection of the top compounds based on their docking scores resulted in the Consensus set of 199 compounds. In parallel, the Expert set of 605 molecules was selected based on the visual inspection of the docked poses.
Short equilibrium MD simulations were performed on each selected compound in complex with the protein to eliminate potential inaccuracies caused by docking. We found that a significant number of ligands changed their poses in the binding pocket as a result of MD simulations. On average, 45% of ligands had a ligand RMSD of 2–4 Å, and around 20% had more than 4 Å.
ABFE simulations utilizing MD TI were performed on the stable compounds with RMSDavg < 3 Å. ABFEs were computed for 793 molecules and then used to re-rank the molecules for the selection of the 76 compounds for the experimental testing. The distribution of Consensus and Expert set ABFEs were similar, with values ranging from approximately −13 to 8 kcal mol−1 and average ABFE of approximately −2 kcal mol−1. For both sets, a considerable ratio of ligands with favorable docking scores had weak predicted binding affinity. The ABFEs showed no correlation with either docking scores or the ligand RMSD. The poses of ligands with the lowest ABFEs for both sets significantly shifted during MD simulations (ligand RMSD > 2 Å).
Overall 76 compounds were submitted for experimental validation, with 59 compounds tested successfully and 5 compounds identified as hits, for an overall hit rate of 8.5%. Of the 23 participants in the CACHE Challenge #1, our submission was awarded 1st place. To the best of our knowledge, our method was the only one that utilized the combination of Deep Docking and ABFE simulations. Cumulatively these results suggested the importance of combining ultrahigh-throughput computational screening tools such as Deep Docking with advanced methods such as MD TI in structure-based virtual screening campaigns, especially for targets with no previously known ligands.
| Footnotes | 
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc06880c | 
| ‡ These authors contributed equally and share co-first authorship. | 
| This journal is © The Royal Society of Chemistry 2024 |