Christine N.
Morrison‡§
a,
Kathleen E.
Prosser‡
a,
Ryjul W.
Stokes‡
a,
Anna
Cordes
b,
Nils
Metzler-Nolte
b and
Seth M.
Cohen
*a
aDepartment of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA 92093, USA. E-mail: scohen@ucsd.edu
bLehrstuhl für Anorganische Chemie 1, Bioanorganische Chemie, Ruhr-Universität Bochum, Universitätsstraße 150, 44801 Bochum, Germany
First published on 12th December 2019
Fragment-based drug discovery (FBDD) is a powerful strategy for the identification of new bioactive molecules. FBDD relies on fragment libraries, generally of modest size, but of high chemical diversity. Although good chemical diversity in FBDD libraries has been achieved in many respects, achieving shape diversity – particularly fragments with three-dimensional (3D) structures – has remained challenging. A recent analysis revealed that >75% of all conventional, organic fragments are predominantly 1D or 2D in shape. However, 3D fragments are desired because molecular shape is one of the most important factors in molecular recognition by a biomolecule. To address this challenge, the use of inert metal complexes, so-called ‘metallofragments’ (mFs), to construct a 3D fragment library is introduced. A modest library of 71 compounds has been prepared with rich shape diversity as gauged by normalized principle moment of inertia (PMI) analysis. PMI analysis shows that these metallofragments occupy an area of fragment space that is unique and highly underrepresented when compared to conventional organic fragment libraries that are comprised of orders of magnitude more molecules. The potential value of this metallofragment library is demonstrated by screening against several different types of proteins, including an antiviral, an antibacterial, and an anticancer target. The suitability of the metallofragments for future hit-to-lead development was validated through the determination of IC50 and thermal shift values for select fragments against several proteins. These findings demonstrate the utility of metallofragment libraries as a means of accessing underutilized 3D fragment space for FBDD against a variety of protein targets.
Although FBDD has proven to be a successful method of drug discovery that achieves great chemical diversity, it remains a challenge to create the same degree of structural diversity in fragment libraries.8–12 Structural diversity is highly desired because molecular shape is among the most important factors dictating biological effects of molecules.8,9,13,14 Also, increased 3D shape can lead to greater aqueous solubility due to greater solvation and poorer solid-state crystal lattice packing, as well as improved ADMET properties (absorption, distribution, metabolism, excretion, and toxicity).13 As a result, increasing the 3D shape of molecules has been correlated to broader biological activity.8,14 It has been shown that molecular shape is more strongly dictated by the core compound scaffold rather than the shape or positioning of substituents decorating the core scaffold.8,14 Thus, fragment libraries consisting of a variety of 3D scaffolds are expected to display a wider range of biological activities compared to single scaffold libraries.8,9,12–14
Conventional organic fragments tend to be linear or flat molecules.15 For example, a previously reported analysis of 18534 organic fragments from the ZINC database (a collection of commercially available chemicals used for virtual screening) showed that the majority (∼75%) of conventional fragments have a linear (1D) or planar (2D) shape (Fig. 1).8 The ZINC database was analyzed using the method of Sauer and Schwartz,9 which employs the normalized principal moments of inertia (PMI) for each fragment and benchmarks them against three molecular standards: 2-butyne (intrinsically 1D), benzene (intrinsically 2D), and adamantane (intrinsically 3D). PMI is a measure of a molecule's resistance to angular acceleration around the principal axes (I1, I2, and I3); conventionally, I1 ≤ I2 ≤ I3. These values allow the comparison of molecular shapes by normalizing the PMI values and plotting the ratios (I1/I3, I2/I3) for each compound on a graph in which points occupy a triangular region (Fig. 1). In the resulting plots the PMIs of ZINC fragments overwhelmingly fall along the edge between 1D (2-butyne, top left corner) and 2D (benzene, bottom corner) shapes, with relatively few populating the 3D region of space.8,9
Fig. 1 Normalized PMI values of a molecule are plotted to assess molecular topology. Analysis of the ZINC database shows that ∼75% of conventional fragments have a linear/planar shape (fall in the white region of the plot),8 indicating that fragments with 3D topology (gray region of the plot) are vastly underexplored in FBDD. |
The lack of structural diversity in fragment libraries is in large part due to the challenge of producing small, organic molecules with inherent 3D shape.8 Efforts to create 3D organic fragments have included diversity-oriented synthesis,8,11,14 combinatorial libraries,9 incorporation of cubanes,16 and incorporation of chiral carbon atoms,10 all of which pose significant synthetic challenges. Herein, this issue is addressed by introducing the first metallofragment (mF) library composed entirely of small, inorganic complexes with inherent 3D topologies. This proof-of-concept library consists of 71 compounds divided into 13 different classes based on metal center and structural homology. The prospective value of our mF library is demonstrated by screening the library against three therapeutic targets, including an antiviral target, an antibacterial target, and an anticancer target. The specific proteins screened in this study are the PA N-terminal (PAN) endonuclease domain of the RNA-dependent RNA polymerase complex of the influenza A virus, New Delhi metallo-β-lactamase-1 (NDM-1), and the N-terminal domain of heat shock protein 90-α (Hsp90), respectively. As a demonstration, select fragments from one class were further characterized with thermal shift assays (TSA) as an orthogonal screening method, as well as dose response assays to determine IC50 values. Taken together, the findings presented here show that 3D mFs are an innovative and potentially useful new tool for FBDD that are capable of targeting topological space not readily accessible by conventional organic fragment libraries.
The heart of this work is a novel mF library, which consists of 13 classes of various sandwich, half-sandwich, and octahedral metal complexes (Fig. 2). Members within each class share the same metal and core geometry, but feature ligands with different functional groups and/or heterocycles. Approximately 15% of the library was purchased from commercial sources and used without further modification, while the remaining majority of complexes were prepared according to literature procedures (see ESI† for details). Of the prepared metal compounds, ∼30% (19/71) represent previously unreported chemical entities. The majority of these novel complexes are ruthenium arene derivatives and rhenium tricarbonyl complexes (Classes J and K). In addition to their described application in this work as mFs, these new compounds have potential applications in the development of new anticancer agents (Ru(II) arenes)27–30 and model agents for imaging applications (Re(I) tricarbonyls).31 The majority of ligands in Classes J and K are derived from previously reported metal-binding pharmacophores.32 The other synthesized compounds include metallocene and piano-stool derivatives with substituent modifications carried out on their respective aromatic rings (Classes A, B, D, E),33 and a variety of carbene and diimine Re(I) complexes that have been reported extensively elsewhere.34–37 One particular advantageous aspect of this library is that most complexes, both reported and novel, were prepared in one or two steps from commercially available starting materials. This presents the opportunity in this and future studies to rapidly expand the contents of the mF library, in part due to the intrinsically modular nature of ligand and metal complex syntheses. In addition to library expansion, the modularity of ligand synthesis will allow for fragment growth or linking, as is the practice in traditional FBDD campaigns. Indeed, the mFs could potentially be screened in tandem with organic fragment libraries, providing insight to the appropriate ligand substituents needed to achieve potent target binding. Every class of mFs contain positions for modification, via ligand modification, that will facilitate future hit-to-lead or fragment building efforts.
Fig. 2 Classes of compounds in the metallofragment library, separated into sub-groups defined by their overall geometry. |
To explore extensive chemical space, features such as charge (e.g., neutral Class Aversus cationic Class B), hydrophobicity, synthetic accessibility, structural diversity, aqueous stability, and rule-of-three compliance (see below) were used to guide this initial library design.4 Fragments were selected with the aim of having a kinetically and thermodynamically stable core, ideally posing no significant pharmacokinetic challenge beyond that found for conventional organic fragments. Each compound in the library presented herein consists of only one metal ion. Other studies have examined the utility of metal cluster compounds in drug discovery;26,38 however, these tend to be much larger compounds, which makes them less suitable for FBDD.
Within the metallofragment library there are three sub-groups described by their general structure (Fig. 2): sandwich or metallocene complexes (Classes A–C), half-sandwich or piano-stool complexes (Classes D, E and J), and six-coordinate octahedral complexes (Classes F–I and K–M). Class A contains ferrocene derivatives, which are one of the most common metal-containing scaffolds explored in medicinal bioinorganic chemistry due to their ease of functionalization, stability, and low cost.19,39 Ferrocene was first introduced as a bioisostere for aryl/heteroaryl rings, and ferrocene has been utilized to improve anticancer, antimalarial, and antibacterial properties of organic therapies.19,29,39 Classes B and C are comprised of cobaltocenes and bis(arene)rhenium scaffolds, which are structurally similar to Class A but possess a positive charge.
The half-sandwich compounds included in our library are Re(I) compounds (Class D) that have been used for biomedical imaging applications,40 Mn(I) complexes (Class E) that have been used as CO releasing agents, and Ru(II) agents (Class J) that have been extensively studied as potential therapeutics in their own right.19,24,28,29 Despite the large number of reports on the biological activity of the Ru(II) arene complexes, this work represents, to the best of our knowledge, the first attempt to use Ru(II) arene complexes as core structural scaffolds.
All the octahedral complexes presented in our mF library contain Re(I) with 3–4 carbonyl ligands (Classes F–I and K–M, Fig. 2). Complexes similar to those in Classes F and G with bidentate N,N donors have been investigated for their anticancer properties.22,36 Classes H and I are carbene complexes; most reported biologically active metal–carbene complexes are prepared with Ag(I) and Au(I) and have shown anticancer and antimicrobial properties.19,29 Class K molecules consist of O,O and S,O heterocyclic bidentate ligands, while Classes L and M are prepared with N,N,O (Class L) and N,N,N (Class M) tridentate donor ligands. Again, to the best of our knowledge, this work is the first time that the Re(I)-based compounds in Classes H, I, K, L, and M are being utilized in FBDD. It is worth noting that octahedral complexes in the library containing asymmetric bidentate ligands (Classes G and K) form enantiomeric mixtures due to different binding orientations of the bidentate ligand. Small molecule crystal structures of some Class K metallofragments (Fig. S14†) show that both enantiomers are present in the product. No attempt to separate the enantiomers was made as it is beyond the scope of this work. Use of enantiomeric mixtures in early stage drug discovery is commonplace and as such does not represent a significant shortcoming of the mF library. Future work dedicated to hit-to-lead development of Class G and K molecules will address purification of the enantiomers.
With this in mind, an analysis of representative mFs was performed to redefine the rule-of-three parameter for MW in terms of molecular size. This redefinition was validated by comparing the heavy atom count (HAC; the number of non-hydrogen atoms) and ‘apparent MW’ of mFs (where the atomic weight of the metal ion is substituted for a carbon atom) to that of conventional organic fragments. The MV of representative mFs was evaluated against their apparent MW and their HAC. The result of this analysis (Fig. S15†) shows that the MV of mFs varies in a manner that is indistinguishable from the MV of conventional organic fragments based on HAC and apparent MW. Thus, although the mFs have a greater MW compared to organic fragments, they are not proportionally greater in size, and hence should operate as suitable scaffolds for FBDD. Based on this analysis, in lieu of MW ≤ 300 Da, we propose a new rule-of-three for mFs: MV ≤ 300 Å3. As a representative example, the chemical structure, X-ray structure, and molecular surface of mF K5 is shown in Fig. 3; the molecular volume of K5 is 292 Å3 (calculated by Molecular Operating Environment, v. 2019.0101).43
Fig. 3 Representation of mF K5 (from left-to-right): chemical structure, X-ray structure, and molecular surface colored by lipophilicity. Hydrophilic and lipophilic regions are represented by pink and green surfaces, respectively. The molecular volume of K5 was determined to be 292 Å3 and only one enantiomer is shown from the X-ray structure (see ESI† for details). |
The topology of the mFs was also analyzed by complex type (sandwich, half-sandwich, and octahedral complexes). The normalized PMI plots (Fig. 4) show that mFs belonging to the same complex type tend to have similar topology. The sandwich mFs cluster near the linear region of the plot, half-sandwich mFs occupy the top of the plot in the linear to spherical region, and octahedral mFs are concentrated between the planar and spherical region. This analysis validates the notion that the core scaffold of a molecule contributes more to its overall shape than does its substituents.8,14 Knowledge of the general topology of various scaffolds may be useful in targeted FBDD.
For comparison to the mF library, the molecular topology of FDA-approved drugs was assessed (Fig. 4), using structures in the DrugBank (version 5.1.3 from 4 April 2019). Structures were downloaded as ‘3D’, meaning that the downloaded structure represents the lowest energy conformer of the free drug molecule. Normalized PMI calculations of the lowest energy conformer were performed on all structures, and the results are shown in Fig. 4. To the best of our knowledge, this 3D analysis of approved drugs has not previously been performed and it serves as an interesting insight to the current scope of therapeutic structural diversity. Of the approved drugs with structures in the DrugBank database, 23% (161/712) have 3D scores falling above 1.2 ((I1/I3) + (I2/I3) > 1.2). The largely 2D character of drug molecules has been previously described,13 but it is important to note that these energy-minimized structures may not reflect the protein-bound or solution confirmations of the drugs.13,44 These energy-minimized structures may also demonstrate some bias towards the left side of the PMI plot due to the steric interactions of side-chains, producing some artificial linearity/planarity. Even concerted efforts in FBDD campaigns to drive towards 3D diversity fail to achieve high 3-dimensionality (3D score > III).8 Of the 712 approved therapies examined here, only 5 (0.7%) compounds are considered highly 3D (3D score > III), while with only 71 entries, the mF platform places 2 complexes, ∼3% of the library, in this space. Given the 3D nature of these core fragment scaffolds, the mF library offers more direct access to molecules that occupy the previously underexplored 3D space at both the fragment and drug level.
To assess the utility of the mF library for FBDD, the library was screened against three therapeutically relevant targets (Fig. 5). The selected targets were: the polymerase acidic N-terminal (PAN) endonuclease domain from the H1N1 influenza A virus (antiviral target), New Delhi metallo-β-lactamase-1 (NDM-1; antibacterial target), and the N-terminal domain of heat shock protein 90-α (Hsp90; anticancer target).47,48 The role of these enzymes in their respective diseases and inhibitor development for each are presented elsewhere.47–49 Briefly, PAN endonuclease is one of three proteins in the RNA-dependent RNA polymerase complex of the influenza A virus, along with the polymerase basic protein 1 (PB1) and the polymerase basic protein 2 (PB2).50 PAN endonuclease contains a dinuclear metal active site, with two Mn2+ or Mg2+ cations that promote endonuclease activity.51 A functional RNA polymerase complex is essential to viral replication,50 and the first therapeutic targeting this protein, Baloxavir marboxil, has now gained FDA approval.52 Baloxavir, along with other leading drug discovery efforts, have targeted the metal centers in the large active site of PAN as a means of enzyme inhibition. NDM-1 is a protein found in both Gram-negative and Gram-positive bacteria that has been shown to hydrolyze clinically relevant β-lactam antibiotics.53 The active site of NDM-1 is largely hydrophobic, with fluctional conformations and two Zn2+ ions that participate in antibiotic hydrolysis.54 There has been substantial effort to develop NDM-1 inhibitors and the target is still considered promising,47 although NDM-1 inhibitors have yet to gain FDA approval. The final target, Hsp90, is a ubiquitous molecular chaperone with many diverse functions including the folding, stability, and activity of many proteins (‘clients’).55,56 Several Hsp90 clients have been identified as oncoproteins that are associated with cancer hallmarks.48,56 The binding site of the reported inhibitors is a 15 Å deep pocket capable of binding polypeptide chains.57 More than a dozen Hsp90 inhibitors have entered clinical trials, but none have received FDA approval.48,56
The mF library was screened at a fragment concentration of 200 μM against all three targets using established screening assays for each protein (Fig. 5 and S17†).51,58 Considering the library as an entirety, a hit rate (percent inhibition > 50%) of approximately ∼40% was achieved against PAN and NDM-1, while Hsp90 had a hit rate of ∼15% (Fig. S16†). Classic high-throughput screening (HTS) of drug-like molecules have reported hit rates between 0.001% and 0.2%, while organic fragment libraries are reported to have hit rates ranging from 3% to 30%.59,60 Thus, the mF library performs as well as or better than both traditional screening libraries, speaking to the promise of this screening platform.
The metallocene subgroup, particularly the Class A ferrocene derivatives, performed well against each of the targets (Fig. 5), achieving hits rates between 41–68% (Fig. S16†). While these compounds may present promising scaffolds for future work on fragment growth and elaboration, it is important to recognize their general lack of specificity. Excessive lipophilicity, along with properties such as redox activity and metal chelation are considered to be hallmarks of potential ligand promiscuity.61 The lipophilicity of ferrocene and ferrocene derivatives, discussed in terms of the partition coefficient logP, are typically between 2–5.62 These values fall on the higher end of what is generally considered acceptable for drug-like molecules (logP < 5), and it is likely that their lipophilicity would only increase with fragment growth. Given this, should a Class A mF be selected as a hit compound in future efforts, precautions are advised to ensure that the complex binds to the desired binding pocket rather than to non-specific hydrophobic patches on a protein target. While the class performed broadly well, suggesting promiscuity and non-specific interactions, some differences in their activity against the three targets was observed. For example, mF A5, ferrocenemethylamine (Fig. S1†), completely inhibited NDM-1 in the 200 μM screen but failed to inhibit PAN endonuclease and Hsp90. Such results demonstrate the promise of the metallocene subgroup to serve as fragment scaffolds in future FBDD campaigns.
In addition to validating the suitability of the mF platform for future FBDD campaigns, these screening results allow for the examination of the relationship between mF 3D topologies and their biological activity. By calculating the 3D score of each fragment ((I1/I3) + (I2/I3), Fig. 4), the 3D topologies of each mF can be plotted against the percent inhibition as determined through each assay (Fig. 6). Based on the PMI analysis presented above, the majority of mFs have 3D scores of II (1.2 ≤ (I1/I3) + (I2/I3) < 1.4) compared to the ZINC fragment library where ∼75% of fragments have (I1/I3) + (I2/I3) < 1.2, a 3D score of I.8 Within II there are mFs from each subgroup, and they all exhibit a broad range of inhibitory effects against the three targets. Of the most 3D fragments, those that fall into IV, none of them achieved percent inhibition values above 50%. There are only 2 mFs with this score, representing ∼3% of the library, and both are from the half-sandwich complex subgroup. It is challenging to draw any significant conclusions regarding their moderate biological activity given the limited scope. Nonetheless, both of these highly 3D fragments achieved percent inhibition values >20% at 200 μM against at least one of the targets, and as such could potentially be pursued for fragment growth and lead development. In future studies it will be important to populate these higher 3D scores so that more informative analyses of the relationship between 3-dimensionality and biological activity can be undertaken.
Fig. 6 Analysis of the inhibitory data and the 3D topologies of the mF library tested against PAN, NDM-1, and Hsp90 at fragment concentrations of 200 μM. |
mF | HAC | PAN endonuclease | NDM-1 | Hsp90 | ||||||
---|---|---|---|---|---|---|---|---|---|---|
IC50a (μM) | LEb | ΔTMc | IC50 (μM) | LE | ΔTM | IC50 (μM) | LE | ΔTM | ||
a IC50 values reported in μM with the 95% CI indicated. b Ligand efficiency (LE; kcal per mol per HAC). c Reported in °C. | ||||||||||
A4 | 14 | >500 | <0.33 | −1.1 ± 0.3 | 33 ± 17 | 0.45 | 0.7 ± 0.1 | 33 ± 2 | 0.45 | 0 ± 1 |
A7 | 14 | 80 ± 20 | 0.41 | −1.5 ± 0.1 | 33 ± 5 | 0.45 | 0.10 ± 0.04 | 24 ± 12 | 0.46 | 2 ± 0.6 |
A11 | 20 | 18 ± 8 | 0.33 | 0.8 ± 0.5 | 12 ± 5 | 0.34 | 0.12 ± 0.03 | 24 ± 4 | 0.32 | 2 ± 1 |
A12 | 19 | 55 ± 15 | 0.31 | −3.7 ± 0.4 | 50 ± 20 | 0.32 | 1.0 ± 0.1 | >500 | <0.24 | −2 ± 0.6 |
TSA experiments, which measure the change of the melting (unfolding) temperature (TM) of a protein, were carried out on the select Class A mFs. Inhibitors that stabilize a protein increase the melting temperature of the protein, while inhibitors that destabilize the protein decrease the melting temperature. The TSA data is provided in Table 1 and is reported as ΔTM (in °C), which refers to the difference in melting temperature of the inhibitor-bound protein compared to the native protein. While large ΔTM values for any protein with inhibitor were not observed, moderate ΔTM values suggest a range of different stabilizing and destabilizing interactions between the protein and the mF. Within the small subset of fragments examined here there are no immediate correlations between the ΔTM values and their determined IC50 values. However, the ability to measure and compare these different markers of the inhibitor–protein interaction will allow future FBDD campaigns to take full advantage of the topological diversity afforded by the mF library.
In an effort to elucidate the possible mechanism of inhibition, preliminary docking exercises were carried out on a representative mF, K6. Fragment K6 showed good inhibitory activity against PAN endonuclease, and also stood out as a potent inhibitor of Hsp90, providing a model mF for this docking study. The coordinates of an aquated version of K6 were determined from the X-ray crystal structure of K5 (see ESI Fig. S14 and Table S1†) and the complex was docked against a reported inhibitor-bound PAN endonuclease structure (6E3M) and an inhibitor-bound crystal structure of Hsp90 (1YET) using the Molecular Operating Environment program (version 2019.0101).64 The best scoring pose against each protein is shown in Fig. 7. While these poses may not indicate the true binding mode of K6 against these two targets, they serve as an initial starting point in the rational development of new inhibitors. In both docking studies, the proteins and mF are mapped from pink to green based on their lipophilicities (Fig. 7). Interestingly, many of the mF library entries do not have hydrogen bond donating or accepting entities, as is the case for K6. As such, an analysis of the molecular interactions shows that the docking of the mF in the pockets of PAN endonuclease and Hsp90 is driven primarily by the steric interactions, directly related to the fragment 3-dimensionality.
An analysis of the 3D topologies of ∼700 approved therapeutics demonstrates that a large majority fall under a linear/flat regime. Because of the limitations in the synthesis of 3D rich drug/fragment libraries, it is difficult to know if these flat/linear structures represent ideal geometries, or if they are simply a consequence of the tools used to prepare drug discovery libraries. The mFs presented here have comparatively high 3D topologies, but this space has been sufficiently challenging to access such that the potential of truly 3D scaffolds has yet to be determined. Overall, this work showcases the utility of a novel mF library of modest size with 3D diversity that exhibits a broad range of biological responses. Future efforts will build on these exciting proof-of-concept studies by expanding the mF library to include more highly 3D fragments with careful consideration of their kinetic and thermodynamic stability. With a second-generation library in development, these mFs can be further developed into lead-like molecules for addressing previously inaccessible or challenging targets.
Footnotes |
† Electronic supplementary information (ESI) available: Experimental and computational details, characterization, and detailed assay procedures. CCDC 1962326–1962330. For ESI and crystallographic data in CIF or other electronic format see DOI: 10.1039/c9sc05586j |
‡ These authors contributed equally to this work. |
§ Current address: Colorado School of Mines, 1500 Illinois Street, Golden, CO 80401, USA. |
This journal is © The Royal Society of Chemistry 2020 |