Open Access Article
Peihan
Xu
a,
Sabrina
Piazza‡
a,
Michael R.
Reynolds‡
a,
Ryan
Gaudet
a,
Jonathan
Chiaramonte§
a,
Fabiola A.
Chapa-Villarreal
a,
John J.
Hayward
ab and
John F.
Trant
*abcd
aDepartment of Chemistry and Biochemistry, University of Windsor, 401 Sunset Ave., Windsor, ON N9B 3P4, Canada. E-mail: jtrant@uwindsor.ca
bDepartment of Biomedical Sciences, University of Windsor, 401 Sunset Ave., Windsor, ON N9B 3P4, Canada
cBinary Star Research Services, LaSalle ON, N9J 38, Canada
dWE-Spark Research Institute, 401 Sunset Ave., Windsor, ON N9B 3P4, Canada
First published on 24th October 2025
In 2021, our team reported a concise synthesis of Thomsen-Nouveau (Tn) antigen, a tumour-associated O-linked mucin glycopeptide. We have since realized that our characterization of the reported glycoside was mistaken. Instead of the intended ether-bonded α anomer, the β-anomer containing an ester glycosidic bond was formed using palladium catalysis and characterised incorrectly as the spectra are remarkably similar. We demonstrated this conclusively be repeating Danishefky's synthesis, with some required modifications of the protocol, of Fmoc-Tn for confirmation. The error is too significant for a correction as it does affect the conclusions of the work, and the original paper has been retracted, though its work is included in this manuscript. In this replacement we report a successful synthesis of the Tn antigen using adjusted glycosylation conditions: a different glycosyl acceptor, N-Fmoc serine benzyl ester, using TMSOTf as the catalyst. This remains, to the best of our knowledge, the shortest Tn antigen synthesis reported from galactose, although it provides the Tn antigen now. Furthermore, this route gives ready access to an essential Tn antigen building block that can be used for large-scale solid phase peptide synthesis.
For many purposes, cancer can be defined as uncontrolled cell growth that has evaded the immune system, and will eventually progress to the death of the host:8 the vast majority of precancerous clusters are hypothesized to be cleared by the immune system long before they can develop into a tumor.9,10 When looking to activate the immune systems, vaccines are particularly attractive;11,12 however, they require the identification of biomarkers correlated with cancer. Unfortunately, many oncotargets are simply upregulated in cancer, rather than unique to it. As they are also present on healthy cells, they are inappropriate vaccine targets as stimulation would initiate a systemic and very dangerous immune response. Tumor-associated carbohydrate antigens (TACAs) are different as they are not found on healthy adult tissue, yet are found on over 90% of biopsied carcinomas (e.g. breast, ovarian, colon, lung, and prostate cancers).13 This discrepancy likely arises because these TACAs are found at the reducing-end of surface glycans, and are anchored to the amino acids or lipids of the membranes. In healthy tissue they are always further elaborated into larger glycostructures; however, in cancer the misregulation of glycosyl transferases responsible for this elaboration ensures that these TACA scaffolds remain naked, missing the additional residues. Their precise role in oncogenesis is unclear, but overexpression is generally correlated with a poor prognosis.8,14 These carbohydrate antigens have, when incorporated into glycopeptides or other immunogenic scaffolds, formed the basis of the development of anti-tumor immunotherapies through the induction of a specific immune response against cancer cells.15–18 However, ready access to these materials remains a significant challenge, and a simple scalable synthesis is required. As part of an extensive carbohydrate vaccine effort, we have needed a route to produce the Tn antigen (1)—an α-2-deoxy-2-N-acetylamino-D-galactose (GalNAc) O-linked to a serine residue. The simplest derivative of this appropriate for solid phase peptide synthesis (SPPS), for which we require it on multigram—and up to 100 g scale—is the Fmoc-protected, peracetylated building block (1A). Although this molecule has been prepared in the past,19 we found many of the simple published routes were inconsistent batch-to-batch and couldn’t always be reproduced if the chemist doing the reactions changed.20–24 Furthermore, the published routes often required complex protecting group strategies that increased the step and purification count, decreasing yield (Fig. 1). Finally, the NMR data provided, despite generally considered to be a critical piece of characterization data, was often reported without detailed assignment but instead with references to data that was obtained decades ago and lacking actual images of the spectra. We wish to report the route we are using for our immediate needs, as well as up-to-date characterization data of all important products and byproducts.
![]() | ||
| Fig. 1 Key previous syntheses of Tn building blocks for Fmoc protected SPPS. (a) Tn antigen 1 and SPPS building block 1A; (b) Paulsen and Hölck's synthesis of the Cbz-protected peracetylated Tn antigen;25 (c) Danishefsky's synthesis of the Fmoc-protected Tn antigen.20 | ||
To introduce a nitrogen at C2, we found that Lemieux's ceric ammonium nitrate (CAN)-mediated azidonitration was highly effective and robust, providing crude 5;27 the anomeric nitrate can be readily hydrolysed to provide reducing sugar 6 as a mixture of two anomers. This hemiacetal was converted to a number of different activating groups (see references above) but yields for introducing the activating group were highly variable and the subsequent glycosylations inevitably inconsistent; Schmidt's trichloroacetimidates (TCA) proved the most promising.28,29 General protocols for the synthesis of TCAs call for the use of trichloroacetonitrile and potassium carbonate; however, we found that the more soluble caesium carbonate reduced reaction times from several hours to mere minutes with a concomitant reduction in product decomposition, leading to quantitative yields of the crude TCAs in a 10
:
1 mixture of α (7a) and β (7b) anomers. These compounds were used immediately without purification beyond simple isolation by filtration, to minimize decomposition.
TACAs are generally made to be incorporated into peptides. This means that the resulting product should have an Fmoc-protected amino group and a free acid. Fmoc amino acids are notoriously challenging to manipulate compared to Boc-protected analogues, but they are required for modern SPSS. The reason for this difference is not immediately clear, as Fmoc groups are stable to standard acid, base, and mild hydrogenation conditions, but any examination of the literature will always find many examples of a transformation being accomplished with Boc- or Cbz-protected amines, with many fewer examples, employing Fmoc.¶ This is often because the yields of reactions using Fmoc are simply lower, driven by a combination of factors including partial deprotection and often poor solubility of the fluorenyl group that might inhibit any desired reaction. However, Fmoc installation is often accompanied by the generation of hard-to-separate impurities—e.g. Fmoc-β-alanine-OH from the Lossen rearrangement of the Fmoc N-hydroxysuccinimide reagent, or the formation of activated esters during installation leading to undesired dipeptides (when using Fmoc-Cl).30 Consequently, despite the potential difficulties with the use of such a substrate, we decided we needed to develop conditions appropriate for the Fmoc-protected amino acid. Finally, to shorten the number of steps further, we wanted to use an approach that avoided protection of the carboxylic acid at any point in the synthesis under the assumption that a protonated carboxylic acid is not a competitive nucleophile with the serine alcohol.
However, this approach proved to be more problematic than we expected. We tried various Lewis acids as catalysts, finally settling on the unusual palladium(II) catalyst, [Pd(MeCN)4][BF4]2 used by Nguyen,31–33 as by far the most efficacious. While it was known that the [PdLn]2+ can complex to the oxygen of an appropriately oriented C-2 ether or the TCA nitrogen (Scheme 2a), we anticipated that it would not coordinate to the C-2 azide and so the reaction would pass through a standard oxocarbenium intermediate (Scheme 2b). This way the anomer of the initial TCA would not matter as the reaction would proceed through a planar intermediate (pseudo-SN1 mechanism), and the anomeric effect would ensure the formation of the desired α anomer. At first, the reaction was thought to be successful after comparing our NMR data with that of a similar compound from the literature (Compound 20, Fig. 2).20 Compound 20 has an acetonide on C-3 and C-4, and a TIPS on C-6, instead of the acetate groups, but the NMR data of what we took to be the anomeric and amino acid signals largely agreed with each other. Combined with other characterization data such as a precise match by HRMS and a carbon spectrum that was consistent with the desired product, we believed that we had successfully made the target compound 8. We published this result in the now retracted article.34 As implied by that sentence, we were incorrect (Fig. 2).
![]() | ||
| Scheme 2 Proposed mechanism for Nguyen's Palladium catalyzed glycosylation, and our proposed application of it to a C2 azido glycosyl donor. | ||
![]() | ||
| Fig. 2 Comparison of our intermediate with the most relevant comparator and structural revision. (A) The structure of our proposed intermediate 8 and the equivalent structure from Danishefsky and colleagues’ 1998 synthesis.20 (B) Expected NOE (blue) and HMBC (pink) correlations in compound 8 and observed correlations in compound 9b. | ||
As noted, this study was conducted precisely so that we could generate large amounts of the Tn antigen to explore comparative TACA vaccine design. The yields of the glycosylation, although barely sufficient, were not acceptable for our process route, and we worked extensively on improving the reaction (including moving away from using large amounts of palladium as a Lewis acid), and continued to stock-pile material at what we believed was the stable azide 8. While doing this a new student attempted to assign all resonances in both the 13C and 1H spectra, as an academic exercise, and observed some interactions in the 2D NMR that were not consistent with the anticipated structure. This led to the proposal that instead of compound 8, we had prepared the β anomer of the serine ester glycoside 9b. The evidence was that the proton signal at 5.57 ppm showed a large J-coupling of 8.5 Hz in the 1H NMR spectrum. We had initially assigned this as the amide N–H proton from α-galactosamine, as we observed coupling to the C2-proton. However, it became apparent that this was not the case, and it was in fact the C1 anomeric proton. An α-anomer should have 3J1,2 coupling of around 3–4 Hz, whereas the 8.5 Hz observed is far more consistent with a β anomer. Further NMR experiments were conducted, and 1D NOE spectra showed interactions between this anomeric proton Ha and both Hc and He (Fig. 2). This interaction is unlikely to occur in α galactoside 8 as the anomeric proton is equatorial, therefore out of range of NOE effect from Hc and He, but is very reasonable in β galactoside 9b because of the 1,3-diaxial interactions present between these three protons. Furthermore, careful inspection of the 1H–13C HMBC spectrum revealed an interaction between the anomeric proton and a carbonyl carbon, which would be highly unlikely had the galactose been connected to serine through the hydroxyl oxygen. However, if the glycosidic bond is made through the carboxylic oxygen to form an ester, this unexpected correlation would be observed. Comparison of our 1H NMR data on compounds 5, 6, and 8 showed that the anomeric proton's chemical shift in compound 8 (5.57 ppm) is closer to that of the β anomer of compound 5 (5.58 ppm), where it is next to an electron-withdrawing nitrate group, than to that of the β anomer of compound 6 (4.71 ppm), where it is next to an alcohol (Fig. 2). All these observations were only consistent with the formation of compound 9b. In order to confirm this assignment, we wished to prepare the α-anomer 9a as well as Danishefsky et al.'s compound, 20 (Scheme 4).
We investigated other conditions (a full table can be found in the SI, experimental procedure of 9b) and found that BEt3 and LiBF4 in THF gave slightly higher yields of 9b, 58% and 46% respectively, than [Pd(MeCN)4][BF4]2 in DCM (up to 38%), but α-anomer was only observed in trace amount and could not be separated cleanly. The key determinant for the α/β selectivity seems to be the choice of base in the reaction of 6 to 7, as potassium carbonate favoured the formation of 7b (7a
:
7b = 1
:
4–1
:
9), whereas caesium carbonate favoured 7a (10
:
1). We were thus able to synthesise compound 9a, with only a trace amount of compound 9b, by using K2CO3 to form the acetimidate mixture and using BEt3 in THF for the glycosylation (Scheme 3).
The inversion of stereochemistry suggests that this process is dominated by an SN2 mechanism instead of going through an oxocarbenium intermediate. The fact that with the palladium Lewis acid we only obtained 9b with (retrospectively) only trace evidence of 9a is likely due to the attenuated reactivity of this Lewis Acid (passing through the SN2 rather than oxocarbenium pathway), and a combination of the relative underactivity and limited amount of the β-TCA under the reaction conditions. This need for an SN2-capable nucleophile makes the engagement of a supposedly protonated carboxylic acid more surprising.35
One of the factors leading to our mistakenly identifying 9b as 8 was that the NMR data of similar compounds—8 itself is novel and has no exact precedent—is from the older literature; it therefore lacks spectral support (i.e. a reproduction of the spectrum), the data is unassigned, and was collected on lower resolution spectrometers. This meant that it was easier for us to convince ourselves that the interpretation of our own data was for our desired compound; we saw what we wanted to see. We also did not consider that ester formation could be competitive; after all the product 9b has never been reported. We thus wished to reproduce Danishefsky's synthesis to confirm their data—and our own—and provide the spectra to ourselves and others, while providing a comparison of their approach to this glycoside to our own (Scheme 4).
The reported synthesis started with compound 16, but its preparation was not described.36 Thus we prepared 16 from galactal 4, following the most obvious synthesis (likely the same route used by Danishefsky). Deacetylation of compound 4 using NaOMe gave compound 14. It is worth noting that our standard protonation quench of Zemplén conditions employing DOWEX (always regenerated at least once)37 caused the enol ether in the product to undergo various side reactions and eventually led to full decomposition, as the crude NMR did not show the presence of the desired compound but instead a complex mixture of what we took to be oligomers. We could successfully obtain 14 using 50% acetic acid instead of DOWEX during the quench, titrating the acid until the reaction mixture reached a neutral pH (monitored using litmus paper). C6-selective TIPS protection of compound 14 was conducted at room temperature according to literature protocols,38 and the product, 15, could be used without purification for the next step, although over-protection was observed. We note that modifying the conditions by conducting this reaction at a lower temperature (0 °C) significantly improved the yield of 15 as it effectively prevents the addition of a second TIPS group. The acetonide protection to fully-protected 16 was straightforward following the reported procedure.39 The last three reactions were conducted as described by Danishefsky,20 except for a solvent change to DMF in the reaction from compound 16 to 17 after attempts with the reported MeCN failed. With these modifications, we were able to access Danishefsky's glycoconjugate intermediate 20. The data acquired is an excellent match for what was published (with improved resolution arising from our higher field magnet), and we provide full assignment for the convenience of future readers (see either the SI or the deposited NMR raw data). Re-synthesis and assignment of compound 20 helped confirm our characterization of compounds 9a and 9b; however, the overall yield using this approach is unsatisfactory, partially arising from the use of acid-sensitive protecting groups in a silica-chromatography intense sequence. In fairness, Danishefsky's protecting group strategy was designed precisely so that 20 could be selectively deprotected at C-6 (while keeping C-3 and C-4 protected) for F1α antigen synthesis; however, for Tn antigen synthesis we do not need to discriminate between these positions and thus we can use a more unified protection strategy.
We thus needed to develop an alternative route to the Tn-SPPS building block 1A. We began with an analysis of what had gone wrong: a protonated carboxylic acid (the reaction occurs in the presence of a Lewis Acid without exogenous Brønsted base) outcompeting an alcohol in gross nucleophilicity was surprising. However, a potential explanation for this comes from a consideration of the relative acidities of the species involved. While we were unable to find a value for trichloroacetamide itself, in DMSO the pKa of the related trifluoroacetamide is 17.2, whereas an aliphatic carboxylic acid is significantly lower (AcOH = 12.6).35 We thus hypothesize that the trichloroacetamide anion acts as a base to deprotonate the carboxylic acid moiety of the serine. Consequently, we had unwittingly produced the carboxylate and that was competitive with the alcohol.
To avoid this carboxylate and the concomitant formation of ester glycoside, we reluctantly protected the offending functional group as a benzyl ester (Scheme 5). The benzyl ester is well precedented, easy to install and the deprotection conditions (Pd/C and H2) would not affect other protecting groups in the compound.20,24 We kept the trichloroacetimidate strategy as it showed promising reactivity and started the new route with compound 11 and LiBF4, but no reaction was observed. This agrees with our hypothesis that Cl3CCONH− may have been acting as a base in the presence of carboxylic acid, generating carboxylate as the nucleophile attacking trichloroacetimidate in SN2 fashion—the neutral alcohol is simply not competitive and with no possible acid, no reaction can occur (other than the TCA rearrangement to the more stable N-glycoside40). We then changed the Lewis acid to TMSOTf (0.3 equiv.) and it resulted in satisfactory yield. Both anomers were isolated from the crude product (a 10
:
1—α
:
β ratio). Compound 12a’s anomeric proton is at 4.87 ppm, J = 3.6 Hz, and shows no interaction with either the C3 or C5-H in the NOE experiment. The characterization data is in line with literature data and what is expected for α-galactoside. Full spectral data and assignments are provided in the SI. Compound 12a was then subjected to reduction and acetylation to give 13 in high yield. The benzyl ester can be reduced to the acid under H2 and Pd/C to give compound 1A, which is ideal for SPPS. Under the same conditions but with a higher amount of Pd/C (1 equiv.) present, the Fmoc group can also be cleaved off to give compound 22, which can be easily deacetylated to give the parent Tn antigen 1 should that be desired.
Additionally, we have deposited all of the raw data obtained directly from the spectrometer organized by compound as presented in the supplementary information as a.zip archive in the Borealis Repository at: https://doi.org/10.5683/SP3/XZA7PC.
Footnotes |
| † In memory of Jonathan Chiaramonte, friend and colleague. |
| ‡ These authors contributed equally to this work and the order is due to the arbitrary vagaries arising from the combination of the Latin alphabet and family name alone. Both authors, and those citing this work, may invert the order of the first two authors for any and all professional purposes. The preprint of this article inverts the names with the same statement to emphasize this point. |
| § Deceased. |
¶ A SciFinder search of reactions of carbamate-protected α-amino esters (excluding amide formation) found a total of 59 495 reactions; only 11% used Fmoc-protected substrates (6726 reactions). Conversely, 58% used Boc (34 656 reactions) and 23% used Cbz (13 967 reactions) – even though the most common use of such products (i.e. SPSS) requires Fmoc protection on the nitrogen. This is borne out by the fact that it is much more common for carbamate-protection of α-amino acids to be switched to Fmoc (Boc → Fmoc, 3305 reactions; Cbz → Fmoc, 1485 reactions) rather than from Fmoc (Fmoc → Boc, 284 reactions; Fmoc → Cbz, 105 reactions). |
| This journal is © The Royal Society of Chemistry and the Centre National de la Recherche Scientifique 2025 |