Open Access Article
Sıla
Arsın
a,
Maija
Pollari
ac,
Endrews
Delbaje
b,
Jouni
Jokela
a,
Matti
Wahlsten
a,
Perttu
Permi
de and
David
Fewer
*a
aDepartment of Microbiology, Faculty of Agriculture and Forestry, University of Helsinki, 00014 Helsinki, Finland. E-mail: david.fewer@helsinki.fi
bFaculdade de Ciências Farmacêuticas de Ribeirão Preto, Universidade de São Paulo, Ribeirão Preto, Brazil
cDepartment of Agricultural Sciences, Faculty of Agriculture and Forestry, University of Helsinki, 00014 Helsinki, Finland
dDepartment of Chemistry, University of Jyväskylä, 40014 Jyväskylä, Finland
eDepartment of Biological and Environmental Science, Nanoscience Center, University of Jyväskylä, 40014 Jyväskylä, Finland
First published on 20th August 2024
Mycosporine-like amino acids (MAAs) are a family of water-soluble and colorless secondary metabolites, with high extinction coefficients, that function as microbial sunscreens. MAAs share a cyclohexinimine chromophore that is diversified through amino acid substitutions and attachment of sugar moieties. The genetic and enzymatic bases for the chemical diversity of MAAs remain largely unexplored. Here we report a series of structurally distinct MAAs and evidence for an unusual branched biosynthetic pathway from a cyanobacterium isolated from lake sediment. We used a combination of high-resolution liquid chromatography-mass spectrometry (HR-LCMS) analysis and nuclear magnetic resonance (NMR) spectroscopy to identify diglycosylated-palythine-Ser (C22H36N2O15) as the dominant chemical variant in a series of MAAs from Nostoc sp. UHCC 0302 that contained either Ser or Thr. We obtained a complete 9.9 Mb genome sequence to gain insights into the genetic basis for the biosynthesis of these structurally distinct MAAs. We identified MAA biosynthetic genes encoded at two locations on the circular chromosome. Surprisingly, direct pathway cloning and heterologous expression of the complete mysABCJ1D1G1H biosynthetic gene cluster in Escherichia coli (E. coli) led to the production of 450 Da monoglycosylated-palythine-Thr (C18H30N2O11). We reconstructed combinations of the two distant biosynthetic gene clusters in refactored synthetic pathways and expressed them in the heterologous host. These results demonstrated that the MysD1 and MysD2 enzymes displayed a preference for Thr and Ser, respectively. Furthermore, one of the four glycosyltransferases identified, MysG1, was active in E. coli and catalysed the attachment of a hexose moiety to the palythine-Thr intermediate. Together these results provide the first insights into the enzymatic basis for glycosylation of MAAs and demonstrates how paralogous copies of the MysD enzymes allow the simultaneous biosynthesis of specific chemical variants to increase the structural variation in this family of microbial sunscreens.
000 M cm−1, effectively absorbing UVA/UVB radiation.2 MAAs can dissipate 98% of the absorbed radiation as heat without generating radical oxygen species and some MAA variants were even shown to possess potent radical scavenging properties.3–6 Given their non-toxic nature, efficient UV absorption capabilities, antioxidant potency and potential other anti-inflammatory and anti-aging activities MAAs have garnered significant attention in the scientific community over the past decade, showcasing their potential applications in the cosmetic and pharmaceutical industries.7–11
MAAs are small, colorless, water-soluble compounds composed of either a cyclohexenone or cyclohexenimine chromophore, with amino acid substituents attached to the first carbon (C1) or third carbon (C3).1,12 The structural diversity of MAAs can be further expanded through modifications in the amino acid residues and glycosylation.13–15 Shinorine (absorption maximum (λmax) = 333 nm, 333 Da) stands out as one of the most prevalent MAA variants in cyanobacteria, and its biosynthetic pathway involving the mysABCD/E gene cluster was the first to be elucidated.16–18 Cyanobacterial MAA biosynthesis seems to be able to either involve both the shikimate and pentose phosphate pathways or to be strictly dependent on one pathway to supply the precursor sugar–phosphate.19–22 In the established biosynthetic pathway, MysA functions as 2-epi-5-epi-valiolone synthase (EEVS), which acts on sedoheptulose 7-phosphate from the pentose phosphate pathway forming 2-epi-5-epi-valiolone intermediate that gets methylated by a O-methyltransferase enzyme (MysB) to make the 4-deoxygadusol core.16 Following this step, MysC, an ATP-grasp ligase, catalyzes the addition of Gly onto the C1 position, resulting in the formation of mycosporine-glycine.16,17 The addition of Ser onto the C3 position is carried out either by MysE, a non-ribosomal peptide synthetase (NRPS), or by MysD, recently renamed as mycosporine-glycine amine-ligase (MGA ligase) as recently proposed.16,23 MysD was shown to exhibit a level of substrate promiscuity with higher affinities towards Ser or Thr.24 The phytanoyl-CoA dioxygenases, MysH, have also recently been identified as important in the biosynthesis of palythine (λmax = 320 nm, 244 Da) variants.24 In our previous work we have also noted the presence of more complex MAA biosynthetic pathways and presented additional potential substrates for MysE enzymes as well as a role for methyltransferase enzymes (MysF) in aplysiapalythine-type MAA synthesis.25
Numerous glycosylated MAA structural variants have also been reported in cyanobacteria, particularly in the genus Nostoc. However, the glycosyltransferase enzymes responsible for the addition of sugar moieties remained unidentified.13,14,25,26 Here, we investigated the MAA biosynthetic pathway of the benthic Nostoc sp. UHCC 0302, maintained at the University of Helsinki Culture Collection (UHCC). Nostoc sp. UHCC 0302 constitutively produces 568 Da diglycosylated; hexosyl-pentosyl-palythine-Ser as the major structural variant. We provide a complete genome for Nostoc sp. UHCC 0302 that contains a type of discontiguous MAA biosynthetic gene cluster which includes a pair of two distinct glycosyltransferase enzymes as well as a pair of MysD homologues. Through heterologous expression experiments involving reconstructed MAA biosynthetic gene clusters in E. coli, we have demonstrated potential involvement of two different glycosyltransferase enzymes in MAA glycosylation. Additionally, our results provide further insights into the distribution of various MysD homologs involved in MAA synthesis, each exhibiting distinct substrate specificities that contribute to the structural diversity of MAAs.
![]() | ||
| Fig. 1 MAA chemical variants of Nostoc sp. UHCC 0302 detected by HR-LCMS. The predicted chemical structures of the MAA variants are shown corresponding to their peak of absorbance at 320 nm. The dominant, 568 Da, 11-(β-D-galactopyranosyl)-7-(α-D-xylopyranosyl)-palythine-Ser chemical structure is highlighted in light green background and its chemical structure is based on the NMR analysis (Tables S2, S3 and Fig. S2–S19, ESI†). | ||
We purified the major MAA chemical variant (no. 6) with absorbtion maximum at 323 nm, C22H36N2O15 (m/z 569.21945 for [M + H]+, Δ 1.1 ppm) for further characterization with NMR analysis (Fig. 1 and Fig. S11, Table S1, ESI†). The sample was dissolved to D2O and 1H, 1H–1H DQF–COSY, 1H–1H TOCSY, 1H–13C HSQC–TOCSY, CHn multiplicity edited 1H–13C HSQC and 1H–13C HMBC spectra were measured. Numerical data is presented in Table S3 (ESI†) and spectra in Fig. S3–S8 (ESI†). All δH and δC signals and COSY/HMBC correlations for atom positions 1–11 were typical for the palythine-Ser (PS) structure (Table S4 and Fig. S11, ESI†). Also, resonances from one pentose (Pent) and one hexose (Hex) were recognized. HMBC correlation between Hex anomeric H-1 and PS C-11 was present which means that Hex is connected to the PS unit 11-OH (Fig. 1 and Fig. S9, ESI†). Hex had coupling constant 3JH1,H2 of 8.0 Hz meaning that both H1 and H2 are axial which is the situation in β-D-glucose, -galactose, -gulose and -allose. A correlation from Pent anomeric H-1 and PS C-7 was present which means that Pent is connected to the PS unit 7-OH (Fig. 1 and Fig. S9, ESI†). All δH and δC signals and vicinal coupling constant 3JH1,H2 of 3.7 Hz match best to D-pentoses with α-D-xylopyranose structure. Data from proton spectrum of hydrolyzed 568 Da MAA confirmed that subunit monosaccharides were galactose (Gal) and xylose (Xyl) (Fig. S10, ESI†). Both Gal and Xyl C-1 signals appear at region typical for O-linked glycosides (Agrawal Phytochem 1992). The PS unit C-7 δC signal had shifted 5 ppm downfield compared to average data from –(C-7) H2OH from La Barre and colleagues which also shows that C-7 OH is not free but bonded.27
859
258 bp and comprised of an 8
388
664 bp circular chromosome and 8 plasmids. Based on the known MAA biosynthetic enzyme sequences in the literature, we screened the genome of Nostoc sp. UHCC 0302 and identified the potential MAA biosynthetic gene clusters. As anticipated, we found an 8.3 kb gene cluster which included the well-established four MAA biosynthetic genes (mysABCD) and an additional three, designated as mysABCJ1D1G1H (Fig. 2(A)). MysH is homologous to 2-oxoglutarate-dependend dioxygenase, recently shown to be involved in palythine synthesis.24 MysJ1 and MysG1 are glycosyltransferase enzymes homologous to beta-1-1,6-N-acetylglucosaminyltransferase and glycosyltransferase family 4 proteins respectively (Fig. 2(A)). Additionally, we identified and annotated a distant and incomplete, mysD2J2G2 enzyme cluster for their potential involvement in MAA biosynthesis in Nostoc sp. UHCC 0302 (Fig. 2(A)).
![]() | ||
| Fig. 2 MAA biosynthetic gene clusters detected in the complete genome of Nostoc sp. UHCC 0302 (A). MS-ES+ chromatograms of the detected MAA variants from E. coli BL21 (DE3) transformants expressing the main MAA biosynthetic gene cluster with and without mysG in pET28a+ system (B) with the simplified structures of the variants detected, labelled as palythine-Thr (a), porphyra-334 (b) and hexosyl-palythine-Thr (7-(α-D-glucopyranosyl)-palythine-Thr) based on NMR data (Tables S5, S6 and Fig. S11–S17, ESI†) (C). | ||
We conducted heterologous expression experiments to elucidate the roles of enzymes involved in the biosynthesis of MAA structural variants detected in Nostoc sp. UHCC 0302. Initially, we designed a construct containing only the main (complete) MAA biosynthetic gene cluster mysABCJ1D1G1H in the pET28a+ vector for heterologous expression in E. coli. MAA structural variants produced by these clones were analysed by HR-LCMS screening of their methanol extracts (Fig. S18, ESI†). Here, we detected m/z 451 [M + H]+ hexosyl-palythine-Thr as the main MAA product which is direct match to the intermediate no 4 detected in Nostoc sp. UHCC 0302 (Fig. 1 and 2(B), (C)). Following this, we confirmed the structure of the purified hexosyl-palythine-Thr as 7-(α-D-glucopyranosyl)-palythine-Thr using NMR (Fig. S11–S17, ESI†). The intermediates of porphyra-334 and palythine–threonine were still detected in comparable amounts alongside the main hexose bound variant (Fig. 1 and 2(B)). This could be due to various reasons such as suboptimal protein expression and folding, differing substrate availabilities, pathway flux and/or potential metabolic interferences in E. coli.28
Next, we wanted to determine which glycosyltransferase is responsible for the hexose addition, for this we omitted the mysG1 gene and tested the expression of construct mysABCJ1D1::pET28a+ (Fig. 2(B) and Fig. S18, ESI†). The heterologous expression of the constructs including only MysJ1, resulted in agluconic structural variants of palythine-Thr and porphyra-334 (Fig. 2(B) and Fig. S18, ESI†). This revealed that, mysG1 annotated as glycosyltransferase family 4, is responsible for the attachment of a hexose onto the MAA variants (Fig. 2(B)). BlastP screening of both MysG1/2 amino acid sequences also show similarity with WcaC-type enzymes which are involved in addition of sugar units in colanic acid biosynthesis.29 Colanic acid is composed of repeating hexasaccharide chains as part of the protective exopolysaccharide (EPS) layer.30 We think MysG1/2 may also function in a similar way in EPS synthesis and also MAA glycosylation. Presence of MAAs in analysed EPS extracts and synergistical upregulation of both EPS and MAAs pathways in response to UV and desiccation stress could also be further evidence for localisation of glycosylated MAAs in EPS layers.13,14,31,32 This incorporation may help these organisms develop an effective outer shield against UV exposure and dessication.
Conversely, we observed no activity for MysJ1 (a beta-1,6-N-acetylglucosaminyltransferase enzyme) or for the other glycosyltransferases encoded in the distant mysD2J2G2 cluster in our expression systems (Fig. 2(B) and Fig. S18, ESI†). In Nostoc sp. UHCC 0302, MysJ1 is likely responsible for adding a pentose sugar, resulting in a variant corresponding to the 568 Da diglycosylated palythine Ser (no. 6) (Fig. 1 and 2). It is probable that the glycosylation of MAAs for potential incorporation into EPS layer involves a complex series of reactions by several glycosyltransferase enzymes, an activity likely also not supported in our E. coli system.33 Our additional homology-based protein model and binding prediction revealed that both MysJ1/2 and MysG1/2 type enzymes are not well characterized as the best hits with highest sequence identity provided minimum to no information (Table S11, ESI†). Interestingly for MysJ1/2 enzymes, the second best hit with around 30% sequence identity is a xylosyltransferase 1 enzyme found in human genomes with established activity in catalysing xylose addition onto Ser and Thr residues (Table S11, ESI†).34 Similarly the second best hit for MysG1/2 were of low sequence identity (∼21.5%), but a well characterized sucrose–phosphate enzyme from Thermosynechococcus elongatus with suggested glucose transfer activity onto a fructose-6-phospahate (Table S11, ESI†).35
We then investigated the distribution of these glycosyltransferase enzymes identified in Nostoc sp. UHCC 0302, in 336 complete publicly available cyanobacterial genomes. Among these genomes which included a type of an MAA biosynthetic gene cluster, 52 strains contained glycosyltransferase enzymes homologous to MysG1/2 and MysJ1/2 based on 40% minimum alignment identity and coverage. While the majority (87%) of these genomes encoded glycosyltransferases homologous to MysG, only 7 (13%) genomes encoded enzymes with homology to MysJ, forming a single cluster (Fig. S21, ESI†). These were Nostoc sp. ATCC 53789, Nostoc sp. Lobaria pulmonaria 5183 cyanobiont, Nostoc sp. NIES-4103, Nostoc linckia NIES-25, Nostoc sp. C052 and our Nostoc sp. UHCC 0302 (Fig. S21, ESI†). It is noteworthy that all MysJ encoding genomes were exclusively found within the Nostoc genus and co-encoded a MysG enzyme (Fig. S21, ESI†). This observation suggests a potential interdependence between MysJ and MysG, with MysJ involvement being rarer and specific to Nostoc species (Fig. S21, ESI†). In addition, Nostoc sp. UHCC 0302 seems the be unique among the studied strains as it encodes as MysJ1/2 and MysG1/2 (Fig. S21, ESI†). Cyanobacterial genomes encode multiple glycosyltransferases which are generally thought to be mainly involved in the synthesis of complex EPS layers, yet the activities of these enzymes are scarcely understood.36,37 To confirm the exact activities of MysJ1/2 and MysG1/2 further work involving structural characterisation and activity assays are necessary. Our analysis thus far can only predict that both sets of enzymes might have a role in MAA glycosylation based on the identified MAA variants in Nostoc sp. UHCC 0302, experiments and bioinformatic analysis.
![]() | ||
| Fig. 4 MAA biosynthetic gene clusters identified in the chromosome of Nostoc sp. UHCC 0302 and the proposed biosynthetic scheme based on heterologous expression results for MAA chemical variants and intermediates detected by HR-LCMS. Detected MAA chemical structures are numbered to match the HR-LCMS structures on Table S1 (ESI†). (Galactose: Gal, glucose: Glc, xylose: Xyl). | ||
We observed a scattered phylogenetic distribution amongst MysD enzymes identified from 27 cyanobacterial genomes encoding multiple copies of MysD enzymes near MAA biosynthetic gene clusters (Fig. 3(B)). Notably, in the case of Nostoc sp. UHCC 0302, MysD2 appears to represent an earlier evolutionary iteration compared to MysD1, grouping within a clade predominantly consisting of Nostoc sp., some of which are known to produce tricore MAAs (Fig. 3(B)).25 Promiscuity is regarded as part of enzymatic evolution and such substrate flexibility in secondary metabolite synthesis tends to be favoured as it enhances structural diversity of the metabolites contributing to fitness of the organisms in a changing environment.39–41 The phenomenon is widespread in nature and these shifts between promiscuity and specificity can occur by mere point mutation, leading to cases such as this one.40,42 In Nostoc sp. UHCC 0302 the presence of different two distinct and active MysD homologs with varying substrate specificities is likely due to series of horizontal gene transfer events. Interestingly, the distant MysD2 is the dominant active enzyme in Nostoc sp. UHCC 0302 as the major MAA variant is the Ser bound, galactosyl-xylosyl-palythine-Ser (no. 6) (Fig. 1). The cooperation of enzymes from discontiguous biosynthetic gene clusters is reported amongst MAA biosynthetic pathways25 and such cooperation might be one of the key mechanisms resulting in structural MAA variants with a seemingly simple and conserved MAA biosynthetic gene cluster.
This study reveals novel insights for the genetic and molecular mechanisms underlying the structural diversity of MAAs in cyanobacteria. Despite the apparent simplicity of MAA biosynthetic gene clusters, our findings suggest these gene cluster organisations diversified by constant reconfiguration, leading to the observed structural variation. In essence, our work not only advances our understanding of the genetic and molecular mechanisms governing the biosynthesis of different MAA variants but also underscores the dynamic nature of these pathways.
000g using an Eppendorf centrifuge 5415D. 100 μl of the supernatant was taken up and filtered using Injekt-F 1 ml syringe (B-Braun) with 0.2 μm Fisherbrand PTFE syringe filter tip (Fisher Scientific) into a short thread sample vial (VWR) for HR-LCMS analysis.
An initial method of screening was performed using UPLC-QTOF (Acquity I-Class UPLC-SynaptG2-Si, Waters Corp., Milford, MA, USA) with the ACQUITY UPLC BEH Amide Column (2.1 mm × 100 mm, 1.7 μm, 130 Å, Waters Corp., Milford, MA) with solvent A: 0.2% ammonium formate and solvent B: acetonitrile with a flow rate of 0.300 ml min-1. The initial percentages of solvents were: 10% solvent A and 90% solvent B, which changed linearly to 40% solvent A and 60% solvent B by 9.00 minutes. The sample was injected 0.5 μl at a time. The target sample temperature was 5.0 °C and column temperature: 40.0 °C. Samples were run at ES+ polarity with the capillary voltage at 2.5 kV. The sampling cone was set to 20 V with a source temperature of 120 °C and desolvation temperature of 600 °C. The cone gas flow was set to 50 L h−1 and desolvation gas flow to 1000.0 L h−1 with a nebuliser gas flow of 6.0 bar. Photodiode array detector recorded between 210 to 800 nm.
000g and centrifuged at 5000g for 10 minutes for three times. Supernatants were collected into a rotor evaporator flask and mixed with 10 ml of 100–200 mesh Chromatorex chromatography silica gel (Fuji-Davison Chemical). Methanol was evaporated from the sample using Vacuum Controller V-800 Rotavapor R-200 (Buchi) at 130 mbar at 30 °C. MAAs were then purified using a Strata Silica, Florisil, NH2, CN Normal Phase column (Strata) for polar retention mechanisms primed with 100% isopropanol and heptane. Dried cell material bound to silica gel was loaded as 5 ml at a time at the top of the column and eluted with 10 ml of 100% heptane, ethyl acetate, dichloromethane, acetone and methanol.
The methanol eluate was centrifuged and diluted in 100 × methanol to confirm the presence of mycosporines by measuring the samples in a UV-1800 Ordior spectrophotometer (Shimadzu) with the spectrum range of 190 to 400 nm. Methanol samples were dried using TurboVap LV Evaporator (Zymark) at 30 °C with pressure set to 0.4 bar, re-dissolved in 1 ml ultra-pure water, homogenized by vortexing for 10 seconds and sonicating with Sonorex Super 10P (Bandelin) for 10 seconds. Samples were then centrifuged at 10
000 g for 5 minutes and supernatants were collected to be fractioned using the high-performance liquid chromatography (HPLC) method. For each run, 50 μl samples were injected into the XSelect HSS T3 column (10 × 150 mm, 5 μm, Waters) in the HP Agilent 1100 series liquid chromatograph (Hewlett Packard) for reverse phase HPLC. The flow rate of 4.5 ml min−1 was set with 100% of 0.1% formic acid. Fractions were then detected using the UV-Vis diode array detector at the wavelength range of 200 to 330 nm.
In addition to one-dimensional 1H spectra, the structural characterization was accomplished using a panoply of two-dimensional (2D) homo- and heteronuclear NMR experiments. 2D 1H, 1H TOCSY (total correlation spectroscopy) experiment, using DIPSI-2 sequence with a mixing time (tm) of 90 ms was supplemented with 2D 1H,1H DQF–COSY (double quantum filtered correlation spectroscopy) for the spin-system identification. 2D 1H,13C HSQC and CHn-multiplicity edited 1H, 13C HSQC (heteronuclear single quantum coherence) and 1H, 13C HMBC (heteronuclear multiple bond correlation) were employed to establish one- and multiple-bond 1H–13C connectivities, respectively. Transfer delay for the long-range 1H, 13C correlations was set to 62 ms, based on nJC,H couplings of 8 Hz. 2D 1H,13C HSQC–TOCSY experiment (tm = 90 ms) was utilized to lift degeneracy in 1H chemical shifts when necessary. 2D ROESY (rotating frame Overhauser effect spectroscopy) with a spinlock time of 200 ms was employed to confirm long-range correlations in 13C-HMBC spectrum and establish through-space connectivities between protons. 1H spectrum of hydrolyzed MAAs in 2 M D2SO4 was measured using 30-degree flip angle and recycle delay of 14 seconds.
Pacbio Sequel II instrument was used for the sequencing reactions and the initial assemblies were done according to the Pacbio's SMRTlink version 9 microbial assembly user guide at the University of Helsinki sequencing centre. De novo genome assemblies were obtained with Flye v2.9.44 The assembled scaffolds were classified with Kaiju v1.7.245 at the phylum level and separated using in-house scripts to obtain only cyanobacterial scaffolds. The circularity of sequences was checked with Bandage v0.8.146 and the completeness and contamination of the genomes were assessed with CheckM v1.0.13.47
To confirm whether the transformed E. coli could produce the MAA structural variants anticipated we set up a small-scale induced expression culture experiment. The E. coli BL21 (DE3) colonies were grown in 5 ml Luria Bertani (LB) broth overnight at 37 °C degrees with constant shaking at 185 rpm (shaker) to be re-inoculated to a fresh 5 ml LB medium and induced with 0.1 mM IPTG the next day. After the induction the cultures were placed in 18 °C with constant shaking at 185 rpm for another 24 hours. Cultures were then harvested by centrifuging at 8000 g using Eppendorf centrifuge 5804 R (Germany) and cell pellets were collected for lyophilisation and methanol extraction as described before. The extract samples were then analysed with HR-LCMS.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cb00128a |
| This journal is © The Royal Society of Chemistry 2024 |