Mohit
Dhiman
,
Rafel
Cabot
and
Christopher A.
Hunter
*
Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK. E-mail: herchelsmith.orgchem@ch.cam.ac.uk
First published on 27th March 2024
Recognition-encoded melamine oligomers (REMO) are synthetic polymers with an alternating 1,3,5-triazine-piperazine backbone and side chains equipped with either a phenol or phosphine oxide recognition unit. Here, we describe an automated method for highly efficient solid-phase synthesis (SPS) of REMO of any specified length and sequence. These SPS protocols are amongst the most robust reported to date, as demonstrated by the synthesis of a mixed-sequence 42-mer, which was obtained in excellent crude purity on a 100 mg scale. Starting from loaded Wang resin and dichlorotriazine monomer building blocks, the SPS methods were automated and optimised on a commercial peptide synthesiser. Major side products were identified using LCMS, and the undesired side reactions were suppressed by the choice of resin, solvent and coupling conditions. REMO have been shown to form high-fidelity length- and sequence-selective H-bonded duplexes, analogous to nucleic acids, and automated synthesis will facilitate exploration of related functional properties, such as molecular replication and programmable self-assembly.
Fig. 1 Recognition-Encoded Melamine Oligomers (REMO) synthesised from piperazine and dichlorotriazines equipped with complementary recognition units. |
Solid-phase synthesis (SPS) is well established as a method for efficiently making sequence-defined polymers. High yielding amide and phosphoramidite coupling reactions were originally developed for SPS of peptides5 and oligonucleotides,6 and this chemistry, as well as other coupling reactions, have been adapted for the synthesis of non-natural sequence-defined oligomers.7–17 The efficiency of the SNAr coupling chemistry used for REMO synthesis makes these oligomers an attractive target for the development of an automated SPS method. When carried out under microwave conditions, SNAr reactions of chlorotriazines are fast and quantitative and require no reagents other than a base to neutralise the hydrochloric acid by-product.1 In addition, the fact that triazines ionise well in ESI-MS provides a powerful tool for the detection of side products and optimisation of reaction conditions using LCMS.
There are a number of parameters that determine the success of an SPS route. Clearly the coupling yields must be very high (i.e. >99%), and suppressing side reactions will make the final purification easier. A frequently encountered issue in peptide SPS is folding or aggregation on resin, which can hinder accessibility of reactive sites for coupling steps.18 Reaction time, temperature, solvent, and resin are all variables that can be optimised to maximise the efficiency of the protocol. Here we describe a two-step iterative SPS route for REMO that has been automated using a CEM Liberty Blue Peptide Synthesiser. Side products were identified using LCMS and eliminated by changing the reaction conditions to obtain an extremely efficient protocol, which was used to synthesise REMO up to 42 monomer units long in remarkably high yield and purity.
Two different H-bond acceptor building blocks bearing either isobutyl or 2-ethylhexyl solubilising groups were also synthesised on a multi-decagram scale (Scheme 2). Di-tert-butylchlorophosphine was heated with formaldehyde in aqueous acid to yield phosphine oxide 5. Mesylation of the hydroxyl group followed by microwave reaction with either isobutylamine or 2-ethylhexylamine gave amines 6 and 8. Reaction of 6 and 8 with cyanuric chloride at −10 °C yielded the dichlorotriazine building blocks 7 and 9, respectively.
Scheme 3 shows the synthesis of an azide-functionalised dichlorotriazine 10 from 1-Boc-4-bromopiperidine. Nucleophilic substitution with sodium azide, followed by deprotection of the Boc group with TFA gave the piperidine. Subsequent reaction with cyanuric chloride at −78 °C yielded 10. The alkyne-functionalised dichlorotriazine 11 was similarly synthesised by deprotection of 1-Boc-4-ethylnylpiperidine with TFA, followed by reaction with cyanuric chloride at −78 °C (Scheme 3).
Scheme 5 Automated SPS protocol for REMO synthesis via a two-step coupling cycle. Coupling reactions were carried out in DMF under microwave conditions. R = 2-ethylhexyl. |
Fig. 2 (a) Automated SPS of D2DD using the route shown in Scheme 5. Step 1 conditions: 0.1 M dichlorotriazine at 50 °C for 10 min. Step 2 conditions: 0.1 M piperazine at 50 °C for 10 min. (b) UPLC trace of the crude product. Cartoon representations of the products identified using ESI-MS are shown. UPLC Conditions: C4 column at 40 °C using a 5–100% gradient of MeCN/formic acid (0.1%) in water/formic acid (0.1%) over 2 minutes, then 100% MeCN/formic acid (0.1%) over 1 minute. |
The other five side products in Fig. 2b have D2 motifs at both ends of the oligomer. These compounds arise from coupling of two different oligomer chains on the same resin bead, as outlined in Fig. 3. During step 2 of the SPS coupling cycle, each growing oligomer chain should react with one equivalent of piperazine (Fig. 3a). However, if there are two oligomers in close proximity on a resin bead, then reaction of both of the monochlorotriazine end groups with the same molecule of piperazine will lead to cross-products (Fig. 3b). There are cross-products containing both odd and even numbers of recognition units due to the incomplete coupling. It should be possible to suppress the side products due to resin cross-reaction by increasing the concentration of piperazine in step 2 of the coupling cycle, and some success was achieved by using 0.7 M instead of 0.1 M piperazine.
Fig. 4 (a) Automated SPS of D2A using the route shown in Scheme 5. Step 1 conditions: 0.1 M dichlorotriazine at 90 °C for 10 min. Step 2 conditions: 0.7 M piperazine at 90 °C for 10 min. (b) UPLC trace of the crude product. Cartoon representations of the products identified using ESI-MS are shown. UPLC Conditions: C4 column at 40 °C using a 5–100% gradient of MeCN/formic acid (0.1%) in water/formic acid (0.1%) over 2 minutes, then 100% MeCN/formic acid (0.1%) over 1 minute. |
Scheme 6 Functionalisation of TentaGel Wang resin with either R = isobutyl (loading = 0.14 mmol g−1) or R = 2-ethylhexyl (loading = 0.10 mmol g−1). |
Scheme 7 shows the optimised, automated SPS route to REMO bearing terminal alkyne and azide groups. Functionalised TentaGel Wang resin was alternatingly reacted with a dichlorotriazine, then piperazine, under microwave conditions. The first coupling cycle utilised 10 to introduce a terminal azide group at one end of the oligomer. Subsequent coupling cycles with 2, 4 or 7 introduced the sequence of recognition units into the oligomer. In the final coupling step, the oligomer was capped with 4-ethynylpiperidine to introduce a terminal alkyne. Removal of silyl protecting groups was carried out on-resin before cleavage of the oligomer from the resin with TFA.
The scope of the automated SPS method was investigated by synthesising REMO of increasing length. Fig. 5 illustrates four REMO sequences, which were all synthesised on a 50 μmol scale. The crude UPLC traces show exceptionally high crude purity, at 90% for a 4-mer, 88% for a 13-mer, and 81% for a 42-mer. None of the impurities present corresponded to incomplete coupling, or any of the side products discussed above, even after 84 sequential SNAr reactions on the resin, showing the robustness of the methodology. Preparative HPLC was used to isolate all four oligomers with greater than 99% purity (Fig. 5) and with isolated yields of 22–49% based on the initial resin loading (see ESI†). The ESI-MS mass envelopes for each oligomer were used in conjunction with HRMS, 1H and 31P NMR spectroscopy to characterise the products (see ESI† for details).
Fig. 5 UPLC traces of the crude and purified products obtained using the automated SPS route shown in Scheme 7 for a 4-mer, 12-mer, 13-mer and 42-mer REMO. The sequences of the oligomers are described using upper case letters for the recognition units (D for phenol and A for phosphine oxide) and lower case letters for the end groups (z for azide, and y for alkyne). The crude purities are shown next to the major UPLC peaks, and the ESI-MS of the purified products are shown (see ESI† for calculated ESI+ masses). UPLC Conditions: C4 column at 40 °C using a 30–100% gradient of THF/formic acid (0.1%) in water/formic acid (0.1%) over 4 minutes, then 100% THF/formic acid (0.1%) over 2 minutes. |
Fig. 6 shows the full chemical structure of the 42-mer REMO, along with the MALDI-TOF mass spectrum recorded using a DCTB matrix. Peaks attributable to the [M+H]+, [M+2H]2+, and [M+3H]3+ species can be clearly identified. This compound represents one of the longest non-natural, sequence-defined oligomers to be reported, and with a mass of almost 16 kDa, it is equivalent in size to a 130 amino acid protein.
Fig. 6 (a) Chemical structure and (b) MALDI-TOF mass spectrum of the 42-mer REMO, zDDDDDDADDDDDADDDADDADAAAAAADAAAAADAAADAADAy. Calculated masses: 15876.2 [M+H]+, 7940.1 [M+2H]2+, 5293.4 [M+3H]3+. |
REMO have already been shown to form sequence-selective duplexes in a similar fashion to nucleic acids, and the ability to rapidly access REMO of any desired length and sequence will allow exploration of whether this new class of synthetic information molecule might show some of the other properties of nucleic acids, such as molecular replication or programmable self-assembly. The REMO sequences described in this work include a self-complementary 4-mer that could form a duplex with four base-pairs, and mutually complementary 12- and 13-mers that could form a duplex with twelve base-pairs and an overhanging unpaired base. Equipping these REMO with terminal azide and alkyne groups also opens opportunities for incorporation of new functionality through copper-catalysed alkyne–azide cycloaddition reactions, chain extension by ligation, and covalent trapping of supramolecular assemblies. The SPS methods reported here therefore pave the way for the development of a range of new tools for exploring the supramolecular properties of programmable synthetic macromolecules.
Footnote |
† Electronic supplementary information (ESI) available: Materials and methods, detailed synthetic procedures, characterisation including NMR spectra and HRMS of all compounds, and HPLC traces of oligomers. See DOI: https://doi.org/10.1039/d4sc00973h |
This journal is © The Royal Society of Chemistry 2024 |