Elena
Cressina‡
a,
Liuhong
Chen‡
b,
Chris
Abell
b,
Finian J.
Leeper
b and
Alison G.
Smith
*a
aDepartment of Plant Sciences, University of Cambridge, Downing Street, Cambridge, UK CB2 3EA. E-mail: as25@cam.ac.uk; Fax: (+44) 1223 333953
bUniversity Chemical Laboratory, University of Cambridge, Lensfield Road, Cambridge, UK CB2 1EW
First published on 4th October 2010
Riboswitches are regions of mRNA that bind selectively certain metabolites, thereby regulating gene expression. We have developed a method, which uses a combination of biophysical techniques (equilibrium dialysis, waterLOGSY and T2 relaxation-edited NMR spectroscopy and isothermal titration calorimetry) that allows screening and identification of novel ligands for riboswitches. By this method a library of 1300 fragments was screened on the E. colithiamine pyrophosphate (TPP) riboswitchthiM, resulting in the identification of 17 hits. The compounds showed KD values from 22 to 670 μM, with four compounds having KD values <100 μM. The fragments are structurally diverse and their ligand efficiency indicates good prospects for structure-guided elaboration. In addition, a riboswitch functional assay based on in vitrotranscription translation (IVTT) of the reporter gene luciferase was developed. This assay system is an alternative to in vivo assays and constitutes a valuable tool to determine the effect of small molecules on riboswitch-regulated gene expression.
Riboswitches responsive to thiamine pyrophosphate (TPP) (Fig. 1a) are found in virtually all bacterial genomes,14 usually in the 5′-untranslated regions (UTR) of mRNA. They are the only riboswitches known to be present in eukaryotes including fungi,15 algae16 and higher plants.17 The Escherichia coliTPP riboswitchthiM (thiM-RS) is one of the most extensively studied riboswitches from both biochemical and biophysical standpoints. It modulates the expression of the thiMDoperon, encoding two enzymes involved in the thiamine biosynthetic pathway.14 The thiM-RS comprises a 78-nucleotide aptamer domain (Fig. 1b), which is defined by the minimal sequence able to bind the ligand, and a 90-nucleotide expression platform that causes translation attenuation by sequestering the ribosome binding site (RBS) when TPP is bound.18 Although TPP is the preferred ligand (KD = 200 nM), thiM-RS can also bind thiamine monophosphate (TMP) and thiamine (Fig. 1a), with reduced affinity.4,19
Fig. 1 (a) Structures of thiamine, TMP, TPP and pyrithiamine pyrophosphate, all E. coli thiMriboswitch ligands. (b) Primary sequence and secondary structure of the E. coli thiMriboswitch aptamer.4Nucleotides in filled bold type form binding interactions with the pyrimidine moiety of thiamine; circled nucleotides in bold type are involved in pyrophosphate binding.20 |
Thiamine analogues such as pyrithiamine, benfotiamine, and amprolium, also bind to the TPP aptamer, with a reported KD for pyrithiamine of 6 μM.4,19 In addition, it has been suggested that the antimicrobial action of pyrithiamine is via its effect on TPP riboswitches.19 The crystal structure of the thiM-RS bound to TPP shows that this RNA acts like a pincer, wrapping the cofactor between two helical arms.20 The aminopyrimidine moiety of TPP makes contact with one helical arm by hydrogen bonding and stacking interactions, while the pyrophosphate binds the other arm through two Mg2+ ions. The thiazole ring appears to make only a minor contribution to binding. The structure of a TPP riboswitch found in the mRNA of the thiCgene in the plant Arabidopsis thaliana reveals a very similar folding motif.21
We are interested in discovering and developing novel ligands for the thiM-RS, which could potentially interfere with the riboswitch mechanism. In order to diverge from thiamine analogues and to obtain structurally diverse binders, a fragment-based approach22 was employed.23
Fragment-based methods22 have proven very successful for the discovery of enzyme inhibitors.24 They rely in part on the concept that screening small molecules (MW ≤ 300 Da, fragments) allows a more effective exploration of chemical space, and therefore increases the probability of finding efficient ligands. As a consequence, it is sufficient to screen libraries of a few hundreds of fragments (in contrast to millions of bigger “lead-like” compounds) to obtain a selection of structurally diverse ligands. Fragment ligands generally display KD values in the lower millimolar to higher micromolar range, but subsequent structure-guided elaboration allows the generation of more potent compounds.24
In contrast to proteins, only a few examples of the application of fragment-based approaches to RNAs of interest are encountered in the literature.23,25,26NMR spectroscopy25 and mass spectrometry26 have been used individually as screening methods. However a combination of different techniques would allow the examination of different aspects of RNA–ligand binding, furnishing a more complete picture of the binding event. We have previously published a communication outlining fragment screening on the TPP riboswitch.23 Here we describe in greater detail our investigation of different biophysical techniques, aimed at developing a general method to screen compounds against riboswitches and the complete results of our screening. From a library of approximately 1300 compounds, we have identified a number of fragment ligands for the thiM-RS, and explored their binding properties. In addition, we describe the development of a riboswitch functional assay, based on in vitrotranscription translation of a reporter gene.
When thiamine binds to the thiM-RS, the RNA undergoes a conformational change that stabilises its tertiary structure.23 Such stabilisation could be measured by monitoring changes in thiM-RS Tm in the presence of fragments that alter the RNA structure. We attempted to follow ligand-induced changes in Tm by adding a fluorescent dye to thiM-RS-ligand solutions, hoping to achieve a sensitive and high-throughput assay, without requiring covalent labelling of either the nucleic acid or the substrate.30
In preliminary experiments, we screened the fluorescence of seven commercially available DNA and RNA dyes (EvaGreen®, PicoGreen®, Sybr Green I®, LC Green®, Sybr Gold®, acridine orange and proflavin) between 37 and 70 °C, in a solution containing thiM-RS ± TPP. Only in the presence of the dye EvaGreen®,31 did the thiM-RS melting curves show a non-linear shape, as might be expected for a cooperative transition like the melting of an ordered RNA structure. The presence of 50 μM TPP in solution caused a more marked inflection of the melting curve, indicating an effect of the ligand on the overall structure of the RNA (Fig. S1, ESI†). However, it was difficult to determine a Tm from the curve, and experiments conducted with decreasing amounts of TPP did not show a direct correlation of the effect with the ligand concentration. In addition, although the same effect was observed with thiamine and TMP, it was much less marked, despite these molecules being good ligands. For these reasons, we concluded that this method was not adequate to screen our fragment library.
Fragment binding can be evaluated by performing competition experiments, in which the ability of the candidate ligand to affect [3H]thiamine binding (KD = 1.5 μM) is monitored. In our format, fragment(s) and [3H]thiamine were placed in one chamber and RNA in the other. If the fragment disrupted thiamine–RNA interactions significantly, then the concentration of thiamine in each chamber would be very similar or, in the best cases, equal. This is expressed in practice by the ratio of [3H]thiamine present in the RNA-containing chamber and the total amount of [3H]thiamine in the system. This is then converted into the percentage of [3H]thiamine displaced by normalising against a control with no additional ligand (0%) and a control with saturating amounts of TPP (100%).23 The practical choice of using commercially available [3H]thiamine as tracer, rather than radiolabelled TPP, is supported by the consideration that a weaker ligand would be displaced more easily by weak fragment binders, as the assay is more sensitive around the KD of the displaced ligand. This method has several advantages; it discriminates between generic binders and those compounds that disrupt thiamine–RNA interactions; it is quantitative; it does not require large amounts of radioligand (100 nM for 60 μL volume competition experiments) or RNA (10 μM); and it is inexpensive and easy to perform.
To increase throughput, the fragments in our library were screened in cocktails of five (each at a final concentration of 1 mM). Those that displaced ≥30% of the [3H]thiamine were considered as hits and subsequently deconvoluted by testing the individual components separately at 1 mM. Fig. 2 shows the displacement percentage of five representative cocktail hits (C1–C5) and their deconvolution. It is clear that cocktails containing compounds that bind to the RNA can be identified. This method is in principle applicable to any RNA, provided that a radiolabelled ligand is available. In addition, the displacement percentage threshold, above which a fragment is considered a hit, can be evaluated from case to case, in order to select only the most potent compounds.
Fig. 2 Percentages of thiamine displacement obtained by equilibrium dialysis with five representative cocktails containing five fragments each (top panel, black bars) and following deconvolution of the individual components at 1 mM (bottom panel, open bars). Cocktail 1 (C1, 80%) contained two hits (C1–3 (73%) and C1–5 (76%), subsequently numbered 4 and 3 in Table 1); cocktail 2 (C2, 99%) contained the hit 1 (99%); cocktail 3 (C3, 35%) contained the hit 5 (72%); cocktail 4 (C4, 68%) contained the hit 6 (70%). Cocktail C5 showed 99% displacement but the individual components only gave 12, 1.5, 21, 11 and 7% displacement, respectively. This might be due to synergistic effects of the fragments on the RNA structure. |
WaterLOGSY (Ligand Observed via Gradient SpectroscopY) is based on the magnetization transfer from bulk watervia the RNA–ligand complex to the free ligand.32 As a result, the peaks related to the ligand appear negative in spectra acquired in the absence of the macromolecule, and positive in its presence. We previously showed that this method offers efficient detection of small molecules binding on the riboswitch and that it can be used to validate hits and to screen fragments in cocktails.23
T 2 Relaxation-edited NMR spectroscopy relies on a different physical principle to waterLOGSY, exploiting the different rate of transverse T2 relaxation between free and macromolecule-bound ligand.33 In the absence of binding events, the spectrum appears similar to a standard 1H NMR spectrum, with sharp and well-resolved peaks. However, when a binding macromolecule is present, the peak intensity is reduced, often to the baseline. This method proved as efficient as waterLOGSY in displaying weak small-molecule/RNA interactions both in cocktails and as individuals.
Fragment cocktails for NMR experiments can be designed in order to minimize peak overlap; Fig. 3 shows an example of waterLOGSY (panel A) and T2 relaxation-edited NMR experiments (panel B) for a cocktail of five selected fragments. Two compounds (4 and 20) show positive peaks in the waterLOGSY spectrum and no signal in the T2 relaxation edited spectrum when RNA is present, indicating binding. Two other fragments (18 and 21) display negative peaks in the waterLOGSY and sharp peaks in the T2spectra, showing no binding. The fifth compound, 19, is not visible most likely due to poor solubility.
Fig. 3 (a) WaterLOGSY NMR spectra of a cocktail of five fragments (0.5 mM each) in the presence (top) and absence (bottom) of 15 μM thiM-RS. The aromatic region of the spectra shows peaks for all fragments, with the exception of 19. Fragments 4 (star) and 20 (circle) show binding while 18 (diamonds) and 21 (square) do not. (b) Representative T2 relaxation edited spectra of a cocktail of five fragments (1 mM each) in the presence (top) and absence (bottom) of 15 μM thiM-RS. Fragments are indicated in the same way as for A. 18 and 21 do not bind while 4 and 20 bind. (c) Structures of the fragments present in the cocktail. |
Both waterLOGSY and T2 relaxation-edited NMR can be used independently or in conjunction to screen libraries of fragments on RNA targets and to validate hits.24T2 relaxation-edited NMR has the advantage of requiring fewer scans than waterLOGSY, allowing faster experiments and higher throughput. These NMR methods are very sensitive (detecting weakly binding fragments), however, they do not give any information about the binding affinity of the ligand or its ability to disrupt the riboswitch functionality. For example, compound 20 (Fig. 3) binds to thiM-RS, according to the NMR spectra, but it showed only 5% displacement of thiamine by equilibrium dialysis. Equilibrium dialysis was chosen as our primary screening method, because it addresses this problem.
Fig. 4 (a) Isothermal titration thermogram obtained by titrating fragment 4 (5 mM) into 100 μM thiM-RS at 25 °C. The enthalpy of binding ΔH was −5.90 kcal mol−1 and the entropy of binding ΔS was −0.53 cal K−1 mol−1. (b) Structures of fragments 2 and 22–26.43 Compound 2 shows the pyrimidine numbering system. |
Hit | Structure | K D/μM | Thiamine displ. (%) | LE | Lys displ. (%) |
---|---|---|---|---|---|
1 | 22 ± 2 | 91 | 0.58 | 90 | |
2 | 49 ± 4 | 53 | 0.49 | 0 | |
3 | 56 ± 6 | 76 | 0.53 | 1 | |
4 | 60 ± 5 | 73 | 0.48 | 6 | |
5 | 103 ± 6 | 70 | 0.42 | 9 | |
6 | 202 ± 13 | 71 | 0.39 | 10 | |
7 | 234 ± 23 | 72 | 0.27 | 4 | |
8 | 280 ± 20 | 55 | 0.69 | 0 | |
9 | 280 ± 30 | 35 | 0.48 | 14 | |
10 | 320 ± 40 | 58 | 0.40 | 15 | |
11 | 325 ± 50 | 41 | 0.34 | 9 | |
12 | 330 ± 50 | 27 | 0.43 | 0 | |
13 | 380 ± 30 | 83 | 0.31 | 71 | |
14 | 400 ± 250 | 42 | 0.46 | 3 | |
15 | 500 ± 100 | 63 | 0.26 | 31 | |
16 | 550 ± 40 | 37 | 0.32 | 80 | |
17 | 670 ± 50 | 49 | 0.43 | 0 |
The ligand efficiency (LE, defined as −ΔG/number of heavy atoms) is a parameter used to estimate the effectiveness of a binder in relation to its size.37 The LE of the hits range from 0.69 for 8 to 0.26 for 15. The fragments with higher ligand efficiency are good candidates for further elaboration into larger and more potent ligands.38
All hits were counter-screened by equilibrium dialysis against the structurally different lysine riboswitch, present in the Bacillus subtilis lysC5′ UTR,8 using L-[3H]lysine (KD = 1 μM) as the tracer ligand. Ten hits out of seventeen were found to displace <10% lysine and so they can be considered selective for the thiM-RS (Table 1). This result is encouraging as fragments might have been expected to be more promiscuous ligands.
The seventeen hits are structurally diverse, showing a variety of shapes and functional groups. Some of the fragment hits are structurally related to common nucleic acid binding motifs. The fragments 1, 6 and 15, consisting of fused benzene and pyrimidine or pyrazine rings and an exocyclic amino group, resemble well known intercalators such as acridine orange, proflavin or ethidium.39 This could indicate that they bind by stacking into double stranded regions of the RNA, disrupting its secondary fold, and might also explain why they also bound to the lysC-RS.
The compounds 5, 7, 11, 13 and 16 contain one heteroaromatic ring directly linked to a phenyl ring and are substituted with alkyl amino groups, which are likely to be protonated at physiological pH. This motif has been found in known DNA minor-groove binders such as Hoescht 33285 and in intercalators such as DAPI and diphenylfuran amidines.40,41 The binding in this case is likely to be due a combination of electrostatic interactions and/or partial intercalation.
The compounds 3, 4 and 9 share a common motif consisting of one five-membered sulfur-containing ring fused with a six-membered aromatic ring. This motif is encountered in thiazole orange, a dye able to bind non-specifically to many nucleic acids.42 It is interesting to note that 3, despite not having an amino group that could interact electrostatically with the thiM-RS, is the third most potent compound. Compounds 3 and 4, both with KD < 100 μM, also show very good selectivity towards the thiM-RS.
The fragments 8, 14 and 17 are likely to interact with the RNA structure by hydrogen bonds with exposed bases in bulges and loops. Compounds 14 and 17 contain an aminopyrimidine and a pyrimidinone, respectively, which could bind on the pyridimidine-sensing helix of thiM-RS.20 All three compounds are also selective for thiM-RS.
Compound 10 stands out for being the only fragment with no heteroaromatic ring and for bearing a methyl ester functionality, while 12 presents a very electron-deficient nitropyridine ring.
Interestingly, the azide 2, prepared from thiamine itself,43 shows good potency (KD = 49 μM) and selectivity, indicating that it might be recognised by the pyrimidine binding motif of the thiM-RS. Encouraged by this result, we tested the analogues 22, 23, 24, 25,43 that contain a modified C-5 substituent, to assess its contribution to binding (Fig. 4b). Neither the aminomethyl pyrimidine 22 or the hydroxymethyl pyrimidine 23 (which is a biosynthetic intermediate of thiamine)16 showed any thiamine displacement by equilibrium dialysis or binding by ITC. In contrast, the cyano pyrimidine 24 and methylester pyrimidine 25 had KD of 44 μM and 67 μM, respectively (Table 2 and ESI†) and displaced 80 and 98% of thiamine, respectively, in equilibrium dialysis experiments (Table 2). Assuming that these compounds bind to the pyrimidine pocket of thiM-RS,20 the difference in potency may be due to subtle differences in the electronic features of the aromatic ring. The effect of the C-5 substituents on the ring was evaluated by comparing the 13C NMR chemical shift of C-5 and the 1H NMR of H-6 of the compounds 2, 22–25 with those of thiamine (Table 2).43 In compounds 24 and 25spectra, the H-6 signals is shifted downfield and the C-5 upfield, indicating that the cyano and ester group make the pyrimidine ring more electron-deficient than that of thiamine, by both inductive and resonance effects. The chemical shifts of C-5 and H-6 of compound 2 are very similar to those of thiamine, indicating very similar electronic features of the pyrimidine ring. The spectra of the inactive compounds 22 and 23, show instead downfield shifting of the C-5 signals and to a lesser extent, upfield shifting of the H-6 signals compared to thiamine, indicative of an electron-donating effect on the ring. These observations imply that electron-deficient pyrimidines bind preferably to thiM-RS. This information could be useful for future elaboration/modification of these fragments, in the absence of more conclusive structural data, and for the development of novel compounds that target the pyrimidine binding pocket.
We also tested the thiazole 26 (also a biosynthetic intermediate of thiamine),16 which did not show any thiamine displacement by equilibrium dialysis and no binding by ITC. In contrast, WaterLOGSY spectra showed binding and subsequent partial displacement by TPP (ESI†). The most likely explanation for this behaviour is that 26 binds only very weakly into the thiamine pocket and could be displaced when a stronger ligand (TPP or thiamine) is present. This observation is consistent with the published crystal structure of the thiM-RS bound to thiamine, in which the thiazole moiety has very little interaction with the RNA backbone.20
We created a DNA construct in which the thiM-RS aptamer and expression platform were fused upstream of the Renilla reniformisluciferase geneluc (ESI†).16,44 This plasmid DNA was used as template in IVTT reactions with cell-free E. coli T7/S30 extracts (containing ribosomes, T7 RNA polymerase and all the necessary components for transcription-translation) to generate the luciferase protein; the enzyme activity was then determined by adding the substrate coelenterazine and measuring the light generated upon its conversion into coelenteramide. The luminescence signal was linearly proportional to the amount of IVTT reaction used (Fig. S2, ESI†), demonstrating that the readout reflects the amount of luciferase produced and thereby the translation efficiency. The amount of luciferase produced from an IVTT reaction was then monitored over 2 h (Fig. 5a). When 100 μM TPP was included in the IVTT reaction mix, we observed up to 60% reduction of the luminescence signal. The repression factor (defined as the ratio of luminescence obtained without TPP to that obtained with TPP) was approximately 2.2 after 15 min of incubation and remained constant for a further 1.5 h during IVTT (Fig. 5b). This effect was not observed when a DNA construct without the riboswitch aptamer was used as template for IVTT (Fig. S3, ESI†). This clearly indicates that, in this system, the riboswitch can modulate gene expression.
Fig. 5 In vitro transcription translation assays of a DNA template containing the Renilla reniformisluciferase under control of the thiM-RS. (a) Time courses of IVTT reactions without TPP (◆) and with 100 μM TPP (□). The relative luminescence units are plotted vs. the IVTT incubation time. (b) Repression factor (defined as the ratio between the luminescence obtained with 100 μM TPP and without TPP) vs. the IVTT incubation time. |
Subsequently, the most potent fragments 1–5, and compounds 22–25 were tested using this system, both in the presence or absence of TPP. None of them showed a significant effect on gene expression, in either condition (±TPP) (Fig. S4, ESI†). The most probable explanation for this is that fragments only bind to part of the thiM-RS aptamer TPP binding site and so might not induce the conformational change required for the switching.30 Also, it has been observed that the binding affinity of a molecule for its aptamer tends to decrease when the aptamer is included in a longer RNA strand.4,6 Therefore, our fragments might bind less tightly to the longer transcripts present in the IVTT mix.
From a library of 1300 fragments, 17 hits have been identified, with KD values ranging from 22 to 670 μM. The most potent compounds have very high ligand efficiencies and four compounds (1–4, 24, 25) have KD < 100 μM. The fragments are structurally diverse and some show similarity to known nucleic acid ligands. The majority of them are selective for the thiM-RS over the lysC RS. This has positive implications for the possibility of finding artificial RNA ligands that are specific. We believe our hits are excellent candidates for X-ray crystallographic studies; their binding mode is currently being investigated and will provide necessary information to develop these hits further into more potent compounds.
The biophysical methods employed in the screening detect the binding of small molecules to the riboswitch. To complement this information with a functional assay, we also developed a novel IVTT reporter gene assay, which allows determination of whether a small molecule affects riboswitch-regulated gene expression. The use of the luciferase from Renilla reniformis allows for sensitive detection of small amounts of protein, which we believe is important while studying riboswitch-regulated gene expression. Using our system we did not detect complete suppression of luciferase production in the presence of the ligand TPP; this indicates that the extent of riboswitch-regulated gene expression might be affected by factors such as the length and sequence of the whole UTR or the type of RNA polymerase used (T7, in this case). Nonetheless, this assay will be a valuable tool to evaluate the effect of small molecules on riboswitch function, especially for those compounds whose in vivo behaviour is not known.
A 256 nt RNA containing the 176 nt Bacillus subtilislysC RS aptamer8 was prepared following the same procedures.
Footnotes |
† Electronic supplementary information (ESI) available: RNA and DNA constructs used in this work and NMR and ITC data for all the fragments and data from the IVTT experiments. See DOI: 10.1039/c0sc00406e |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2011 |