Matthew H.
Todd
*a and
Chris
Abell
b
aDepartment of Chemistry, Queen Mary, University of London, Mile End Road, London, UK E1 4NS. E-mail: M.H.Todd@qmul.ac.uk; Fax: 020 7882 7794; Tel: 020 7882 7696
bUniversity Chemical Laboratory, Lensfield Road, Cambridge, UK CB2 1EW. E-mail: ca26@cam.ac.uk; Fax: 01223 336362; Tel: 01223 336405
First published on 18th October 2002
The use of tagging in combinatorial chemistry permits tracking of the solid phase as it is taken through iterative split and mix cycles. Several analytical approaches to the identification of tags (and hence the chemical history of the support) have been described. We describe herein a novel chemical tagging strategy for combinatorial solid phase chemistry. The identities of the tags attached to a single bead are discovered by the high resolution, accurate mass technique of Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS).
Matthew H. Todd | Matthew Todd has been Lecturer in Organic Chemistry at Queen Mary, London since September 2001. He completed his PhD with Chris Abell in 1998, then spent two years as a postdoc with Paul Barlett at the University of California, Berkeley, before returning to Cambridge as Fellow at New Hall College. His research interests are in the areas of organic synthesis, medicinal chemistry and asymmetric catalysis. |
In the second, similar, case9 an addressable radiofrequency semiconductor unit encased in glass was embedded in polymer beads, surrounded by an inert porous wall. Such devices are passive and acquire their energy from the encoding radiofrequency pulse. A library is made again under the control of the appropriate hard- and software, and using standard split-and-mix techniques. The storage capacity of such units is large, and can record not only the reagents at each step, but also the reaction time and temperature used (the device contains a temperature sensor). These microreactors are capable of withstanding a temperature range of −78 °C to 150 °C, and are inert to most solvents. This technology has been used in the synthesis of taxoid10 and epothilone11 and other libraries,12 and has been commercialised.13
These reactors are quite popular for small to medium sized libraries and are relatively inexpensive. Problems with these strategies in general hinge around the large size of the units (and the corresponding limit on the size of a library that can be made using them). These devices must be at least as large as the wavelength of the radiation used to encode them.
Scheme 1 General method of chemical encoding. TX = tag, RX = library component/synthetic step. |
Arguably the most important of these approaches is that developed by Still. This methodology has been developed commercially and has been repeatedly shown to be effective in chemically tagging a combinatorial library. The method was first used to encode a peptide library,19 and has been used to probe binding affinities between a receptor and a support-bound library of over 50000 compounds.20 The tags were inert haloaromatic alcohols, where variability in the tag was achieved by variation in the position and nature of the halogen substituents, and the length of the alcohol chain (Fig. 1a). The tags were attached to a small proportion (∼1%) of the available sites of amino-functionalised polystyrene resin by means of a carbonate linkage to a photocleavable linker via its carboxylic acid. The tags were detected as their trimethylsilyl derivatives at less than femtomolar levels by electron capture gas chromatography (ECGC), which is very sensitive to such electrophoric molecules, and formed a binary code by their elution order (i.e. the presence of a tag is a ‘1’ and its absence is a ‘0’, see below). The original scheme allowed for 40 tags differing in chain length and number/position of halide substituents, technically allowing for the encoding of a huge library (240 components). A single ECGC apparatus could at the time decode at least 50 structures per day, which is satisfactory given that one tends to be interested only in the active compounds from a library.
Fig. 1 Still’s encoding method. |
A limitation of the initial tagging approach was its reliance on the presence of amino functionality on the resin for tag attachment. The scheme was generalised21 to allow for the encoding of virtually any library by attaching the tag via a catechol ether linker introduced by indiscriminate rhodium-catalysed carbene insertion into the solid support (at a level of about 1 pmol per bead, compared to roughly 100 pmol of the library compound, Fig. 1b). The tag is detached by ceric ammonium nitrate oxidative cleavage and detected as before. The uses of Still’s system have been reviewed.22
An approach developed by Affymax shares several characteristics with that devised by Still, but instead uses secondary amines as tags.18 The tags are attached to the resin sequentially to form a tag oligomer using chemistry orthogonal to that of the library synthesis (e.g. the alloc protecting group was used to mediate the tagging chemistry, while the Fmoc group was used in the construction of the library). The tags are detached using 6 M HCl. Identification of the tags involves initial derivatisation to form the dansylates, followed by separation using HPLC on a standard column with fluorescent detection.
We felt at the outset of our work that FTICR-MS was an ideal analytical technique in a chemical tagging method. We shall now describe our overall approach.
To achieve this, the specific analytical issues we had to address were:
The mass spectral behaviour of a number of tag candidates, and the detection of such species at picomole levels.
The attachment of these tags to the solid support followed by their release and reliable detection from single beads, either as the tag or some derivative which is trivial to synthesise.
The detection of cleaved library and tag compounds from single beads of a chemical library.
Further, the parallel synthetic achievements were:
The development of a novel set of inert chemical tags suitable for detection at low levels by mass spectrometry.
The attachment of these tags to a solid support in a manner which allowed iterative attachment of the tags.
The use of this orthogonal tagging system to code for a chemical library, requiring the selective release of the library in the presence of the tags, followed by the release of tags from the solid support on demand.
We were able to fulfil all these requirements for the development of our tagging strategy, which is described in full elsewhere.26 We present here a summary of the salient results.
Scheme 2 Outline of the tagging strategy. L = Wang linker (4-hydroxybenzyl alcohol), A = alkyl group, R = library element, D = tag derivative. |
A split-and-mix step is then performed, followed by attachment of the second element of diversity, or some other chemical step. This is coded for by the second tagging step. This consists in our strategy of an organometallic reaction of the support-bound aryl iodide with a 1,3-diboronate containing tag 2 linked to it via a benzyl ether linkage (2). (The palladium-catalysed reaction between a boronate and a halide to form a carbon–carbon bond is known as a Suzuki reaction.) These diboronates are the key molecules to the whole strategy. After coupling to the aryl iodide, to generate an inert biaryl linkage, the second boronate group is left intact. A second split-and-mix step is performed, after which tag 3 is introduced by a second Suzuki reaction between the pendant boronate and an aryl iodide (3). This iodide is of the same form as the diboronates used previously, in that they contain the tag attached via a benzyl ether bond. (In theory diiodides could be used, setting up a pendant aryl iodide which could be used to couple with another boronate in a further round of tagging.) This third tag codes for the third library step, which is performed subsequently, as the last step in the scheme. In this way a library with three elements of diversity can be constructed, coded for by three sets of tags. The first tag resides in the resin at the outset of the synthesis, the second is introduced pre-attached to a diboronate, and the third is introduced pre-attached to an aryl iodide.
The linker chemistry employed allows for library molecules to be cleaved from the solid support under mild conditions and screened; those beads yielding compounds deemed important may then have the tags removed from them under harsher conditions, followed by detection of the tags by mass spectrometry.
There are a number of attractive features of this strategy:
The first tag is loaded when the linker is added to the bead, and the second and third tags are added ‘back to back’ using the sequential reaction of the bisboronate, with an intervening split-and-mix operation, but without work-up of the Suzuki reactions. In effect the second and third tags are introduced in the same reaction, and overall this is the only additional step involved in the library synthesis. No protecting group chemistry is employed.
The Suzuki chemistry employed in the tagging steps is performed under mild conditions and can tolerate a wide variety of chemical functionalities. There is ample precedence of Suzuki coupling on solid support.27
The biaryl and benzylic ether linkages used for the ultimate attachment of the tags to the support are robust, and allow for the facile selective removal of the library. Harsher conditions can then be used to remove the tags.
It is important to understand the power of binary coding in such a strategy. Suppose that the first element of diversity in the library sequence (R1 in Scheme 2) may take one of three forms (e.g. R1 may be one of three amino acids). We require only two chemical tags to code for this. If we translate the presence of a tag in the decoding spectrum as a ‘1’ and its absence as a ‘0’ (we met this idea earlier in the description of the Still encoding method) then we have at our disposal three situations: presence of one tag (‘10’) corresponding to the first amino acid, presence of alternative tag (‘01’) corresponding to the second amino acid, or presence of both tags (‘11’) corresponding to the third amino acid. Absence of both (‘00’) is not employed in the coding owing to the inherent ambiguity of such a result. This method may be applied to each step of library construction, as long as new tags are employed to avoid confusion with earlier steps. If we are able to employ one of three amino acids at each step of our library synthesis, then our complete three-step library as shown in Scheme 2 consists of a 3 × 3 × 3 library, i.e. 27 tripeptides. Yet as we have seen, each stage of library construction required only two tags, allowing this 27-member library to be completely coded for by only 6 tags. The size of library it is possible to encode in this way increases exponentially for a linear increase in the number of tags.
Fig. 2 Alkyl phenol tags employed in the tagging strategy. |
Electrospray ionisation is used with the FTICR mass spectrometer. For the tags to become charged well they require a protonatable site, or one that can chelate a metal ion. However for the tags to be inert during solid phase chemistry such a functional group ‘handle’ would be undesirable. Consequently the tags are attached to the resin via a benzyl ether linkage during library synthesis, and then derivatised in situ after cleavage to introduce the mass spectral handle on the exposed phenolic oxygen. This derivatisation step is an extra level of complication in the procedure and so had to be as simple as possible. Other tagging techniques use a derivatisation step. In Still’s technique,22 for example, cleaved tags are derivatised with bis-trimethylsilylacetamide prior to analysis, and in the secondary amine method,18 dansylation is used.
We developed a facile in situ dansylation procedure for the derivatisation step by modification and simplification of a literature procedure.29 The conversion of phenols to their dansylates is an attractive strategy because the dansyl group contains a protonatable nitrogen atom which assists in their detection by positive ion ES-MS, and the added mass of the dansyl group moves the region of interest in the ES mass spectrum to one of improved signal-to-noise ratio.
The phenols are attached to the solid support via a benzyl ether linkage. Removal of the tags prior to mass spectral analysis requires cleavage of this bond. This can be achieved very effectively using thioanisole, in combination with trifluoroacetic acid (TFA) and a Lewis acid such as trimethylsilyl trifluoromethanesulfonate.30 TFA-based reagent mixtures are amenable to dilution with dichloromethane (DCM), providing a solvent system that is frequently used for solid phase work. Dichloromethane ‘swells’ the resin effectively, allowing penetration of reagents into the interior of the polystyrene bead.
It is usual in peptide chemistry to cleave the Wang linker (and hence remove the library compound from the resin) with neat TFA containing traces of triisopropylamine and water. We have shown that the tags remain on the resin under these conditions. Indeed, lower concentrations of TFA in DCM are sufficient to cleave the Wang linker.
To illustrate facile removal of tags using thioanisole after cleavage of the Wang linker, the comparison reaction shown in Scheme 3 was performed. Thus treatment of support-bound phenol 5 with a 10% TFA–DCM solution produced no cleavage of phenol as before, whilst the same reaction mixture containing a drop each of thioanisole and TMS–triflate liberated the phenol within 2 min at room temperature. This method of cleavage is very simple practically. Cleavage of library compounds is therefore carried out in a 10% TFA–DCM solution, to which is then added thioanisole and TMS–triflate for the release of the tags.
Scheme 3 Solid phase loading and cleavage of phenols. |
A further advantage of this cleavage system is that all the reagents are volatile. In traditional peptide synthesis, cleavage reactions done with TFA involve the simple removal of excess TFA under a stream of nitrogen. The same method may be used with the thioanisole cleavage mixture, prior to injection of the cleaved tags into the mass spectrometer.
This cleavage–derivatisation procedure is capable of detecting phenol from a single resin bead. For example, a single bead of the tert-amylphenol loaded resin 5 was subjected (Scheme 4) to the thioanisole-based cleavage for 5 min at room temperature, and the supernatant concentrated in vacuo. The phenol was then derivatised with dansyl chloride and analysed by FTICR-MS. The resultant mass spectrum from this single bead experiment is shown in Fig. 3.30 The spectrum has an excellent signal-to-noise ratio, and the mass of the tag peak is only 5 ppm different to the calculated mass. Similar results are seen when mixtures of phenols were attached to the support, and cleaved and analysed in the same way, confirming that the phenol dansylation strategy with mass spectrometric detection is suitable for use in a tagging experiment.
Scheme 4 Single-bead cleavage experiment. |
Fig. 3 FTICR mass spectrum from single cleaved amylphenol bead. Reprinted from ref. 30, Copyright 2000, with permission from Elsevier Science. |
Fig. 4 Tagging constructs used in the strategy. |
The synthetic route to all the tag constructs has been described in full elsewhere.26 In summary, the benzyl ether bonds in the four molecules 6a–7b were constructed either with Mitsunobu chemistry or via conversion of the benzyl alcohol of the aryl iodide to its trichloroacetimidate derivative 10 (Scheme 5).
Scheme 5 Mitsunobu and trichloroacetimidate chemistry used in the synthesis of the tag constructs. |
Synthesis of the diboronates is from the corresponding diiodides via a palladium-catalysed coupling with pinacolatodiboronate (Scheme 6). This is the first demonstration of aromatic diboronate synthesis with this strategy.
Scheme 6 Synthesis of diboronate tag constructs. |
Scheme 7 Solid phase synthesis of encoded tripeptide. |
The tripeptide leucine–glycine–tyrosine was coded for with nonylphenol–amylphenol–butylphenol. Chloromethyl polystyrene resin was loaded with low levels of nonylphenol (<15% loading) and iodophenol (<25% loading) to give 12. This resin was loaded with the Wang linker to produce 13. The first amino acid was attached to this resin using the standard 2,6-dichlorobenzoyl chloride/pyridine method31 to give 14. The second amino acid was attached to the first via standard peptide coupling chemistry to give 15. This second amino acid was coded for by the attachment of the amylphenol-based diboronate. The final coding step was performed by palladium-catalysed attachment of the butylphenol-based iodide to give 17. Approximately two to three equivalents of diboronate and iodide tags were found to be necessary for these coupling reactions. This final tag coded for tyrosine, which was then attached in the final step to yield 18.
The peptide was readily identifiable with low resolution ES-MS after cleavage of the Wang linker with 10% TFA–DCM at room temperature for 10 min. Cleavage with this mixture can be used to verify that each step in a peptide synthesis has gone to completion.
After peptide removal, single beads were treated using the thioanisole–dansyl chloride cleavage–derivatisation sequence. The FTICR mass spectrum revealed clearly the three expected tags (Fig. 5). The masses of the molecular ions observed for the dansylated tags were all correct to within 4 ppm. While it would be desirable to improve the signal-to-noise ratio observed in these spectra, the encoding strategy is exemplified by this result since the three expected tags are unequivocally identified (at high resolution) whilst those tags not included are absent (asterisks indicate where they would have appeared). No peak on the mass spectrum was within 10 ppm of the expected masses of these tags. This provides the appropriate binary code for this peptide. The other peaks present we could only ascribe to noise that is typically present in a sample of this high dilution. Further, no chromatography was required prior to injection of this mixture into the mass spectrometer.
Fig. 5 FTICR mass spectrum showing the three required tags obtained from treatment of a single bead of resin 18, after removal of peptide, according to the cleavage–derivatisation method (Scheme 4). Asterisks indicate absence of tag peaks for heptyl (expected at m/z 426.210293), octyl (440.225943) and dodecyl (496.288543). Reprinted with permission from ref. 26. Copyright 2001 American Chemical Society. |
It is likely that large split-and-mix libraries will continue to feature in combinatorial chemistry due to their power to produce so many compounds so rapidly. There will therefore continue to be great analytical demands on the identification of compounds shown to be active from the screening of such libraries, and we look forward to future developments in this field.
This journal is © The Royal Society of Chemistry 2002 |