Daniel
Francis
ab,
Sannia
Farooque
ab,
Archie
Meager
ab,
Didi
Derks
a,
Abbie
Leggott
ab,
Stuart
Warriner
ab,
Alex J.
O'Neill
bc and
Adam
Nelson
*ab
aSchool of Chemistry, University of Leeds, Leeds, LS2 9JT, UK. E-mail: a.s.nelson@leeds.ac.uk
bAstbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
cSchool of Molecular and Cellular Biology, University of Leeds, Leeds, LS2 9JT, UK
First published on 30th November 2022
Activity-directed synthesis (ADS) is a structure-blind, function driven approach that can drive the discovery of bioactive small molecules. In ADS, arrays of reactions are designed and executed, and the crude product mixtures are then directly screened to identify reactions that yield bioactive products. The design of subsequent reaction arrays is then informed by the hit reactions that are discovered. In this study, algorithms for reaction array design were developed in which the reactions to be executed were selected from a large set of virtual reactions; the reactions were selected on the basis of similarity to reactions known to yield bioactive products. The algorithms were harnessed to design arrays of photoredox-catalysed alkylation reactions whose crude products were then screened for inhibition of growth of S. aureus ATCC29213. It was demonstrated that the approach enabled expansion of a series of antibacterial quinazolinones. It is envisaged that such algorithms could ultimately enable fully autonomous activity-directed molecular discovery.
We have previously noted1,8 that the integrated and parallel nature of ADS workflows may facilitate the realisation of fully autonomous molecular discovery. Such a workflow would require the design of reaction arrays to be algorithm-driven, and all experimental activities to be automated and integrated. In this study, we describe the application of algorithms to design reaction arrays on the basis of (dis)similarity to reactions that yield known bioactive molecules. It was envisaged that the reactions would be selected from a large virtual reaction space defined by all possible combinations of heteroaromatic substrates 4 (quinazolinones and related substrates) and aldehyde or ketone co-substrates 5 (Panel C). A bacterial growth inhibition assay would then be used to identify photocatalysed alkylations9 that yield products (e.g.6) with antibacterial activity. The photocatalysed alkylation9a of quinazolinones with both aldehydes and ketones is well precedented. It was therefore envisaged that, by using aryl-substituted co-substrates, this reaction may enable variation of the linker between the heterocyclic core and an appended aromatic ring.
![]() | ||
Fig. 1 Establishment of an activity-directed synthesis workflow. Panel A: substrates and aldehyde/ketone co-substrates used. Panel B: inhibition of growth of two independent cultures of S. aureus ATCC29213 (see Fig. 3 for colour scale) by reaction mixtures derived from specific substrate/co-substrate combinations (total product concentration: 10 μM). Reactions were performed in microscale vials and involved a substrate (S1 or S2; 10 mM), a co-substrate (7, 8 or 9; 30 eq.), TTMS (2 eq.), TFA (2 eq.) and Ir[dF(CF3)ppy]2(dtbpy)PF6. |
![]() | ||
Fig. 2 Overview of the definition of a large virtual reaction space, and the algorithm used to design the Round 1 reaction array based on the hit reaction (substrate S2 + co-substrate 8). |
![]() | ||
Fig. 3 Difference in growth inhibition by crude products of the designed reaction arrays, and the corresponding reactions in which the substrate was omitted (see ESI†). Reactions were performed in microscale vials and involved a substrate (S1 or S2; 10 mM; omitted in control reaction arrays), a co-substrate (7, 8 or 9; 30 eq.), TTMS (2 eq.), TFA (2 eq.) and Ir[dF(CF3)ppy]2(dtbpy)PF6. Crude products were assayed against three independent colonies of S. aureus ATCC29213 at 10 μM total product concentration to yield hit reactions (yellow). Reaction products that displayed activity even in the absence of a substrate (pink, see ESI†) and combinations that were not investigated due to co-substrate unavailability (grey) are indicated. Panel A: reaction array in Round 1 designed on the basis of the hit reaction S2/8 (see Fig. 1). Panel B: reaction array in Round 2 designed on the basis of hit reactions from Round 1. |
![]() | ||
Fig. 4 Structures of substrates and co-substrates used in reaction arrays based on hit reactions (see Fig. 3). |
Next, we subjected the 20 substrates (S1–S20) to our activity-directed synthesis workflow. Here, “mock” reactions were performed in the absence of any co-substrate, and the crude reaction mixtures were evaluated at 10 μM total product concentration; crucially,<2% bacterial growth inhibition was observed in all cases, which gave confidence that any observed activities would be dependent on the presence of the specific co-substrate used.
An algorithm was then harnessed to design the reaction array that was executed in Round 1. Substrate combinations that were similar (Tanimoto coefficient based the ECFP4 fingerprints14 for the substrate combinations >0.68) to the hit reaction (substrate S2 + co-substrate 8, see Fig. 1) were extracted from the large virtual reaction space. This yielded 357 reactions that could, in principle, be executed in Round 1. To identify 20 possible arrays of 30 reactions for execution, Pareto optimisation15 was undertaken with two (conflicting) objectives: (a) to minimise the number of unique co-substrates that would need to be purchased; and (b) to maximise the diversity of the substrate/co-substrate combinations. It was decided to execute the designed reaction array that had the fewest unique co-substrates (Fig. 3, panel A).
A liquid handling robot was used to assemble the required combinations of substrates and co-substrates in the designed reaction array. Here, the appropriate substrate (100 μl of a 30 mM solution in MeCN) and co-substrate (100 μl of a 0.90 M solution in MeCN) were combined. Subsequently, TFA (25 μl of a 0.24 M solution in MeCN), TTMS (25 μl of a 0.24 M solution in MeCN) and Ir[dF(CF3)ppy]2(dtbpy)PF6 (50 μl of a 0.6 mM solution in MeCN) were added. The final concentration of the components in each 300 μl reaction were therefore: substrate (10 mM), co-substrate (300 mM), TFA (20 mM), TTMS (20 mM) and catalyst (0.1 mM). In parallel, a control reaction array was also assembled in which the substrates (but not the co-substrates) were omitted. The sealed vials were irradiated at room temperature for 24 h, and the crude reaction mixtures were evaporated and redissolved in 300 μl DMSO to give stock solutions with 10 mM total product concentration. The crude reaction mixtures were assayed against three independent cultures of S. aureus ATCC29213 at 10 μM total product concentration in 99:
1 ISB–DMSO. The difference in growth inhibition between the crude products of the designed reactions, and the corresponding control reactions that lacked a substrate, is shown in Fig. 3, panel A.
The crude products of eight reactions resulted in >70% growth inhibition for all three cultures. However, for six of these reactions (that involved the co-substrates C13, C494, C504, C661, C673 and C933), growth inhibition was also observed in the corresponding control reactions that lacked a substrate. On this basis, two hit reactions were taken forward: the reactions between either S2 and S3 and C981 (phenyl acetaldehyde, also labelled 8). Notably, the combination of S2 and phenyl acetaldehyde had previously been identified as a hit combination during the establishment of the assay.
In parallel, we harnessed an algorithm to design a diverse array of reactions that were dissimilar to the combination of S2 and 8 (= C981). These reactions were chosen from a virtual set of 1300 virtual reactions based on all possible combinations of the 20 substrates and 65 cheap, yet diverse, co-substrates (see ESI†). Unfortunately, no validated hit combinations were identified from this array of diverse reactions (see ESI†).
![]() | ||
Fig. 5 Overview of the definition of a large virtual reaction space, and the algorithm used to design the Round 2 reaction array based on the hit reactions from Round 1. |
The same algorithm was then harnessed to design the reaction array that was executed in Round 2. Substrate combinations that were similar (Tanimoto coefficient for the substrate combinations >0.8) to either of the hit reactions from Round 1 (S2/C981 and S3/C981; see panel A, Fig. 3) were extracted from the virtual set of 31340 reactions. This yielded 127 reactions that could, in principle, be executed in Round 2. We then identified 10 possible arrays of 30 reactions for execution through Pareto optimisation with the following objectives: (a) maximisation of the diversity of the substrate/co-substrate combinations; (b) minimisation of the number of unique co-substrates; (c) having the same number of reactions that were most similar to both of the hit reactions from Round 1; and (d) minimising similarity to the non-hit reactions that had been performed in Round 1. It was decided to execute the reaction array with the fewest unique co-substrates.
Unfortunately, two of the required co-substrates (C10900 and C12019) could not be readily prepared by oxidation of the corresponding primary alcohol, meaning that only 27 of the 30 designed reactions could actually be executed. Following execution of the reaction array, the crude reaction mixtures were assayed against three independent cultures of S. aureus ATCC29213 at 10 μM total product concentration in 99:
1 ISB–DMSO. The difference in growth inhibition between the crude products of the designed reactions, and the corresponding control reactions that lacked a substrate, is shown in Fig. 3, panel B. The crude products of six reactions resulted in >70% growth inhibition for all three independent cultures. However, for two of these reactions, involving the same co-substrate C10957, growth inhibition was also observed in the corresponding reactions in the control array that lacked a quinazolinone substrate. Thus, four new hit reactions were identified: the reaction between the quinazolinone S2 and aldehydes C10257, C10258 or C11371; and the reaction between the quinazolinone S3 and the aldehyde C10257. A total of six hit reactions was thus identified across both rounds of activity-directed synthesis.
![]() | ||
Fig. 6 Structures of compounds for which MICs were determined (see Table 1). |
Compound | Hit? | MIC (μg ml−1) | ||
---|---|---|---|---|
ATCC29213 | USA300 JE2 | SH1000 | ||
a Prepared by scale-up of the photoredox-catalysed alkylation reaction (see ESI†). b Prepared by an independent synthetic route (see ESI†). c The substrate and co-substrate were exploited in other hit reactions, but this specific combination was not explored in either array. | ||||
3a | √ (R1) (S2, C981) | 0.016 | 0.016 | 1–2 |
3b | √ (R2) (S2, C10257) | 0.5–1 | 0.5–1 | 4 |
3c | √ (R2) (S2, C10258) | 4 | 4 | 8 |
3d | √ (R2) (S2, C11371) | 0.016 | 0.016 | 1–2 |
3e | √ (R2) (S3, C10257) | 1 | 1 | 4 |
3f | c (S3, C10258) | 8 | 8 | 16 |
3g | √ (R1) (S3, C981) | 0.5–1 | 0.5–1 | 4 |
3h | x (R2) (S2, C11313) | >256 | >256 | >256 |
3i | x (R2) (S3, C11313) | >256 | >256 | >256 |
The quinazolinone products that corresponded to identified hit combinations had MIC values against S. aureus ATCC29213 ranging from 0.016 μg ml−1 (for 3a and 3d) to 4 μg ml−1 (for 3c). Reassuringly, 3h and 3i, which corresponded to investigated combinations that were not identified as hits, did not inhibit bacterial growth, even at 256 μg ml−1. 3f, which corresponded to a substrate/co-substrate combination that was not explored (but, instead was inspired by three other hit combinations, see above) had an MIC value of 8–16 μg ml−1. All of the analogues 3a–3i were also screened against the methicillin-resistant USA300 JE2 strain11b and the laboratory strain11c,d SH1000 (see Table 1 and Discussion). They were also evaluated against yeast (Candida albicans Ca6) and were found to be inactive at 16 μg ml−1.
We have also shown that it was feasible for a photoredox-catalysed reaction to underpin activity-directed synthesis, a reaction class that has potential to explore diverse regions of chemical space.17 As with other activity-directed synthesis workflows,3–6 it was critical that appropriate control reactions were performed to give confidence that observed activities stemmed from the specific substrate/co-substrate combination used. In this study, the use of control arrays enabled identification of co-substrates whose reactions yielded apparently antibacterial products even in the absence of a substrate. In addition, it was, of course, also important to purify, elucidate and characterise the products of identified hit reactions.
Our approach enabled expansion of a series7 of antibacterial quinazolinones. These compounds had comparable activity against S. aureus ATCC29213 and the methicillin-resistant USA300 JE2 strain, but were generally significantly less active against the laboratory strain SH1000. It was demonstrated that limited variation of the meta-substituted phenyl ring of 3a5 was possible: the corresponding m-acetamidophenyl-substituted analogue (3g) retained significant antibacterial activity. However, none of the reactions involving substrates other than S2 or S3 – for example, quinazolinones with replaced/substituted benzo rings or other phenyl substituents – resulted in any growth inhibition at the concentration tested (total product concentration: 10 μM). Some variation of the substitution of the phenyl ring of the 2-phenylethyl group of 3a was, however, possible: the quinzolinones 3c and 3f (with a 4-tert-butylphenyl group); 3b and 3e (with a 3,4-difluorophenyl group); and 3d (with a 2,4,6-trimethylphenyl group) displayed significant antibacterial activity. Indeed, the activity of 3d (with its 2,4,6-trimethylphenyl group) (MIC against ATCC 29213
:
0.016 μg ml−1) was comparable with that of 3a. It was notable that no reactions that involved any of the other seven substituted phenyl acetaldehydes used, nor the homologated aldehyde C11313, resulted in the formation of antibacterial products. Although two reactions of the homologated aldehyde C11313 were shown to shown to be productive, yielding the corresponding homologated products (3h and 3i), neither of these products displayed detectable antibacterial activity (MIC against S. aureus ATCC 29213
:
>256 μg ml−1). It was therefore concluded that the length of the 2-phenylethyl side chain was critical, and that only limited range of substituents on the phenyl ring of this side chain was tolerated.
A summary of the structure–activity-relationships of the antibacterial quinazolinones is provided in Fig. 7. Whilst the approach was certainly valuable for the rapid generation of these relationships, it is, perhaps, disappointing that it was not possible to expand the series of antibacterial quinaozolinones7 more extensively, perhaps because this series has already been largely optimised. Within an ADS workflow, the choice of screening concentration(s) is critical. Here the crude products were screened at 10 μM total product concentration.‡ This choice of screening concentration meant that the least active product identified was 3c (whose MIC against S. aureus ATCC29213 is 4 μg ml−1i.e. ∼8 μM). At this screening concentration, less active products would not have been detected, even if formed in high yield. In retrospect, it may have been helpful to have screened at higher total product concentration(s) in Round 1. This may have enabled the identification of more diverse, yet synthetically accessible, products with significant (albeit lower) antibacterial activity. These hit reactions could then have informed the design of a more diverse reaction array in Round 2.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ob01404a |
‡ We did also investigate screening at 0.1 and 1 (M total product concentration, but no new hits were detected at these lower concentrations. |
This journal is © The Royal Society of Chemistry 2022 |