Martin
Baunach
ab,
Arthur
Guljamow
a,
María
Miguel-Gordo
a and
Elke
Dittmann
*a
aUniversity of Potsdam, Institute of Biochemistry and Biology, Karl-Liebknecht-Str. 24/25, 14476 Potsdam, Germany. E-mail: editt@uni-potsdam.de
bUniversity of Bonn, Institute of Pharmaceutical Biology, Nußallee 6, 53115 Bonn, Germany
First published on 13th December 2023
Covering: 2000 to 2023
Cyanobacteria produce a variety of bioactive natural products that can pose a threat to humans and animals as environmental toxins, but also have potential for or inspire pharmaceutical use. As oxygenic phototrophs, cyanobacteria furthermore hold great promise for sustainable biotechnology. Yet, the necessary tools for exploiting their biotechnological potential have so far been established only for a few model strains of cyanobacteria, while large untapped biosynthetic resources are hidden in slow-growing cyanobacterial genera that are difficult to access by genetic techniques. In recent years, several approaches have been developed to circumvent the bottlenecks in cyanobacterial natural product research. Here, we summarize current progress that has been made in unlocking or characterizing cryptic metabolic pathways using integrated omics techniques, orphan gene cluster activation, use of genetic approaches in original producers, heterologous expression and chemo-enzymatic techniques. We are mainly highlighting genomic mining concepts and strategies towards high-titer production of cyanobacterial natural products from the last 10 years and discuss the need for further research developments in this field.
Widespread interest in cyanobacteria has increasingly revealed the structural uniqueness of cyanobacterial natural products and their versatile bioactive potential.9–11 Notably, cyanobacteria produce a number of highly active cytotoxic compounds that have attracted pharmaceutical interest such as dolastatins (3) and cryptophycins (4).12,13 There are also cyanobacterial natural products with antibiotic properties, such as the hapalindole family of compounds (5).14 The significant bioactive potential of cyanobacteria has been confirmed by genome sequencing programs over the past two decades, and furthermore, a large untapped biosynthetic potential has been revealed.15,16 In particular, cyanobacterial genomes frequently contain biosynthetic gene clusters (BGCs) encoding nonribosomal peptide synthetases (NRPS), polyketide synthases (PKS) or hybrids thereof, as well as ribosomally produced and posttranslationally modified peptide (RiPPs) and terpene biosynthetic pathways. Despite all progress made in assigning of BGCs to known families of cyanobacterial natural products (detailed in previous reviews9–11,17), to date 80% or more of BGCs are still orphan.
Besides being a rich source of unique natural products; cyanobacteria also hold great promise for sustainable production of chemical compounds.18 Cyanobacterial biotechnology has long outgrown its infancy. An increasing number of companies are cultivating cyanobacteria at large scale as food supplements, cosmetics or fertilizers or develop them as chassis for the production of biopolymers, biofuels and pigments.19 The autotrophic metabolism of cyanobacteria enables the production of organic chemicals using only sunlight, CO2 and minerals. Production through green biotechnology is CO2 neutral or even CO2 negative and meets the aim of establishing a climate-neutral circular bioeconomy.19 Accordingly, there are extensive efforts to further develop methods of synthetic biology for cyanobacteria and to expand the product range. Currently, synthetic biology and biotechnology of cyanobacteria is focused on unicellular model strains of the genera Synechococcus and Synechocystis and on the edible genus Spirulina.20 Considering both the richness in untapped BGCs with great pharmaceutical and biosynthetic value and the possibility for exploitation by sustainable biotechnology, cyanobacteria could become a mainstay of microbial natural product research.
Yet, both the assignment of unknown BGCs to unknown metabolites and the high-titer production of complex specialized metabolites of cyanobacterial origin in cyanobacterial hosts are not well advanced. From our microbiological perspective, similar obstacles are currently limiting both the genomic mining of new compound families in cyanobacteria and their high-titer production in cyanobacteria. Understanding the bottlenecks that limit growth of cyanobacteria, production of specialized metabolites as well as amenability to genetic manipulation can potentially advance both the field of natural product discovery as well as provide avenues for the production of specialized compounds or new-to-nature compounds in cyanobacteria. In the present review, we have therefore set ourselves the goal not only to compile the current state of knowledge on method development of natural product research in cyanobacteria, but also to identify gaps that should be addressed for efficient natural product exploitation.
Following the CyanoGEBA initiative that aimed to improve the sequence coverage by sequencing diverse axenic strains of the well curated Pasteur Culture Collection of Cyanobacteria (PCC),15 Gugger and coworkers carried out a phylum-wide analysis of the presence of NRPS and PKS gene clusters. This analysis revealed a burst of these genes in late branching lineages of cyanobacteria which are dominated by multicellular genera of marine, freshwater and terrestrial origin. Moreover, the study indicated that 80% or more of the NRPS and PKS BGCs were still not assigned to their products.16 A comprehensive analysis covering all types of BGCs identified by AntiSMASH (in particular NRPS, PKS, RiPPs and terpenes)22 was recently presented by Ziemert, Medema and coworkers, who created a compendium for the potential to produce specialized metabolites for bacterial genomes and metagenomes.23 This involved adapting the BiG-SLiCE software tool,24 which subdivides unique BGCs into distinguishable gene cluster families (GCFs) that were adapted to the hierarchical clustering of compounds in the NPAtlas database.25 The study revealed a significant dissimilarity of GCFs at the genus level rather than species level as well a correlation with the habitats.23 To reliably rank the biosynthetic diversity among phyla, the study applied a common phylogenetic metric, relative evolutionary divergence (RED) groups, which were roughly consistent with the genus range across phyla.26 For cyanobacteria, 536 genomes were included in the comparative part of the study and classified into 142 RED groups. For these genomes, 1.867 actual unique GCFs were predicted ranking cyanobacteria fifth behind actinobacteria, proteobacteria, firmicutes and bacteriodota.23 Since the necessary sequence depth to cover all cyanobacterial GCFs is far from being reached, the authors also estimated the diversity potential of GCFs that could be reached if the sequencing was saturated. Thereby, a diversity of approximately 5.000 unique GCFs was indicated for cyanobacteria in total. Both the actual and the estimated GCF diversity are distributed very differently among phyla. Fig. 2 zooms into the 30 richest RED groups of cyanobacteria, which are responsible for 87% of the cyanobacterial actual GCF diversity. Seventeen of the groups are assigned to the order Nostocales comprising a diversity of different genera including amongst others Nostoc, Fischerella, Calothrix, Scytonema, Nodularia and Cylindrospermopsis while in the orders Chroococcales, Oscillatoriales and Synechococcales rather single genera such as Microcystis, Moorena, Planktothrix and Leptolyngbya, stand out, respectively (Fig. 2). However, there is a bias with regard to the number of genomes representing different RED groups. Thus, at the present time, one can read a trend rather than make a ranking among phyla. Nevertheless, it is possible to draw valuable conclusions from the analysis. (1) The known prolific natural product producing genera such as Nostoc, Microcystis and Moorena are indeed among the bacterial groups with the greatest actual and estimated genomic potential. (2) The current 144 entries of characterized BGCs of cyanobacteria included in the MIBiG database (ref. 27, as of August 2023) occupy only a small fraction of the actual 1.867 GCFs identified for cyanobacteria. Although it must be assumed that the MIBiG database does not cover all characterized BGCs, there is a large untapped potential for cyanobacteria to produce natural products which can be estimated at about 90%. The order Nostocales shows by far the largest hidden potential to produce natural products. (3) For some genera or RED groups, a significant number of GCFs is matched by only a few entries in the NPAtlas database.25 This is particularly true for the genus Pleurocapsa solely representing the poorly investigated order Pleurocapsales for which there is only one entry in NPAtlas. The analysis by Ziemert, Medema and coworkers also revealed a low sequence coverage for this genus.23 There are other rich genera for which there are few entries in the NPAtlas database (<15).25 These include, for example, the genus Calothrix. This is particularly interesting because three of the five largest cyanobacterial genomes sequenced to date were assigned to the genus Calothrix.21 Further notable genera include Cylindrospermopsis and Crocosphera which are not intensively investigated yet. (4) Members of the orders Synechococcales and Spirulinales have only a small number of predicted GCFs. Although six Synechococcales RED groups made it to the list of most productive groups, they represent only a small subgroup of a much larger phylum; the majority of all Prochlorococcus and Synechococcus strains barely have predictable GCFs.23 Also, for the genus Synechocystis, to which the model strain Synechocystis sp. PCC 6803 belongs, a total of only 8 GCFs are predicted. From this analysis, one can conclude that unicellular cyanobacteria rarely have a great potential for natural product biosynthesis (except colony formers such as Microcystis or Crocosphera). The situation is similar for the filamentous order Spirulinales, for which there is also only one entry in NPAtlas.25 One can therefore also state that the strains currently used for cyanobacterial synthetic biology and biotechnology generally have a very low natural product biosynthetic potential. We will discuss this fact again later when we talk about available tools and growth of cyanobacteria.
![]() | ||
Fig. 2 Number of distinct biosynthetic gene cluster families (GCFs) in different phylogenetic groups of cyanobacteria and number of cyanobacterial entries in the Natural Product Atlas. (A) Number of GCFs in standardized relative evolutionary divergence (RED) groups as defined in Gavriilidou et al.23 RED groups correspond to single genera of cyanobacteria or may include strains of multiple genera of cyanobacteria (as indicated in parentheses). The number of genomes used for the analysis in individual RED groups is given in parentheses next to the GCF number. Data were adapted from Gavrilidou et al.23 (B) Number of cyanobacterial chemical entities in different orders and genera of cyanobacteria according to Natural Product Atlas (ref. 25 as of August 2023). The color coding corresponds to the assignment of the genera or RED groups to the orders Nostocales (orange), Chroococcales (blue) Oscillatoriales (yellow), Synechococcales (green) and Pleurocapsales (purple). |
While the integration of different databases such as the MiBIG database and NPAtlas is very useful,25,27 some of the conclusions that can be drawn when comparing the GCF analysis and the NPAtlas database must be viewed with caution. Many of the entries in NPAtlas go back to older studies. Cyanobacterial taxonomy has undergone many changes since then, and many genus names were created only in the course of more detailed 16S rRNA analysis and taxon refinement.28 It is therefore unsurprising that certain genus names with high GCF potential rarely appear in NPAtlas. These include, for example, the genera Aulosira, Trichormus or Dolichospermum, which were more commonly used recently and previously classified under Anabaena or Nostoc.28–30 Well-known examples of name changes in the field of natural products also include the change of the species name Lyngbya majuscula, first in Moorea producens and later to Moorena producens31,32 and Oscillatoria agardhii to Planktothrix agardhii.33 While taxon refinement is a general trend in modern microbiology, the situation is even more complicated in cyanobacteria. Longer than for other bacterial phyla, cyanobacterial taxonomy was largely based on morphology and reference strains were described according to the rules of the International Code of Botanical Nomenclature (ICBN) rather than the International Code of Nomenclature of Prokaryotes (ICNP).34 The problem of the morphology-based phylogeny is at least two-fold. Not all morphological features used for classification correspond to the molecular phylogenetic classifications and the morphological classifications require a specific expertise which is not always available. Therefore, one must also assume false annotations and genus names may occur several times in different polyphyletic groups.28 Even if the genus and species name chaos is gradually being resolved by careful polyphasic analyses that consider both morphology and molecular phylogeny and increasing sequencing depth, the problem is still reflected in the databases. The genus name is therefore often not a useful identifier in the cyanobacterial phylum.
The growth rates of cyanobacteria differ considerably. While some model strains achieve doubling times between 2–6 hours under ideal conditions, many cyanobacteria divide only about once a day even under optimal conditions.35 The first group mainly includes model strains of the genera Synechococcus and Synechocystis, with the strain Synechococcus elongatus UTEX 2973 being the frontrunner with a doubling time of 2 h under autotrophic conditions.40 The latter group, on the other hand, includes genera with high natural product potential such as Nostoc or Microcystis.41 Thus, all work with these non-model genera is automatically associated with a high time expenditure. The growth of cyanobacteria depends mainly on photosynthetic rates and CO2 availability.35 Photosynthetic rates, in turn, depend on light intensities, whereby cyanobacteria stand in their own way because they attenuate light through shading at higher cell densities. Higher light intensities again trigger photoinhibition, so there is limited scope for cyanobacteria to grow.35 CO2 concentrations cannot be increased arbitrarily either, because this decreases the pH in the medium. A simple way to increase the biomass production of cyanobacteria, which has proven particularly useful for natural product producers, is the so-called high-density cultivation (HDC). The technology was developed by the CellDEG company in Berlin and enables bubble-free gas exchange and turbulent mixing on a vibrating membrane. HDC minimizes photobleaching and photoinhibition leading to a high quantum yield even at high light intensities.42 For the model strain Nostoc punctiforme PCC 73102, not only much higher cell densities and biomass accumulation is achieved by HDC, but growth rates are also increased by a factor of 2–3.43 Some of the natural product-rich genera can accumulate biomass very efficiently in nature. This is especially true for bloom-forming genera such as Microcystis, Planktothrix, Nodularia or Dolichospermum,4 but also for mat-formers such as Moorena and Symploca.31,44 A considerable number of cyanobacterial natural products have been isolated directly from field material, such as from Microcystis45 or Symploca,46 which are relatively easy to harvest. However, many downstream applications and a detailed BGC characterization are more difficult without cultivation.
Looking at the success of cyanobacteria in nature, there is obviously quite some potential for optimizing growth rates even for genera like Microcystis or Nostoc. However, this also requires an understanding of the bottlenecks that currently impede rapid growth in the laboratory. The heterotrophic microbiome of bacteria seems to play a key role in this context. For Microcystis in particular, there are studies showing that the relationship with heterotrophic bacteria is predominantly growth-promoting for the cyanobacteria, although the reasons are not well understood.47 As mentioned above, heterotrophic bacteria are an obstacle to reproducible research and biotechnology. Yet, one could certainly harness the contribution of heterotrophic bacteria to cyanobacterial growth by supplementing axenic bacterial isolates with selected heterotrophic partners in synthetic communities. This is an emerging trend in the synthetic biology of cyanobacteria,48 which could also have a lasting impact on the development of model production strains in natural product research.
Not all questions to be addressed in cyanobacterial natural product research necessarily require axenic isolates. The assignment of BGCs to natural products or vice versa does not necessarily have to be carried out with pure strains. The analysis of metagenomes has now progressed to the point where assignment to cyanobacterial genomes is reliably possible.37 The assignment of biosynthetic pathways is in any case usually supported by further analyses, such as feeding studies or in vitro characterizations of enzymes. Despite the progress in purifying strains some genera of cyanobacteria are only accessible as xenic isolates. However, this does not diminish the value of axenic cultures, especially for non-model genera such as Microcystis or Nostoc. Physiological and molecular biological experiments are dependent on axenic strains. Genetic manipulations are currently limited to a few strains. This problem has changed little in recent years. However, there has been a breakthrough in the biotechnologically important genus Spirulina, which is being cultivated at large scale but was not amenable to genetic manipulation. Recently, Spirulina has been efficiently established as a chassis for high-titer expression of therapeutic proteins. It was shown that Spirulina could only be manipulated in coculture with selected heterotrophic bacteria.49 These findings may point the way forward for cyanobacterial natural product producers. Again, synthetic consortia could provide a solution. Doudna and colleagues have recently also demonstrated the possibility of species- and site-specific genome editing in complex bacterial communities via an RNA-guided CRISPR-Cas transposase (DART) system.50 Although this technique has not been applied to cyanobacteria yet, it opens up new possibilities for the analysis of cyanobacterial consortia. We will revisit the issue of genetic manipulation later when we discuss in detail the use of genetic methods in the natural products field.
In addition to an appropriate dereplication strategy for the known natural product classes, it is central to this effort to understand the reasons for the lack of assignment of compounds to orphan BGCs. In the search for causes, the continuously improved bioinformatic analysis of BGCs, especially by the AntiSMASH platform,22 is a very good basis, not only because BGCs are reliably predicted, but also because structural prediction is often possible through increasing insights into the role and specificity of enzymes.
A first step into developing a strategy for the discovery of new natural products is often transcriptional analysis.52 This analysis can reveal whether cyanobacterial BGCs are silent and whether production of the corresponding metabolites can be expected under the given cultivation conditions. These findings can be paired with metabolomic studies. For the model strain N. punctiforme PCC 73102, it was shown that the majority of cryptic BGCs are actively transcribed, but only at very low levels.53 Moreover, a transcriptional reporter analysis revealed that expression is often restricted to a few cells within the multicellular consortium.53 Low levels of transcription in turn can be assumed to lead to low metabolite production levels. Based on these findings, one can consider different methodological options and develop workflows depending on the purpose of the analysis and properties of the investigated strains (Fig. 3). (1) Can transcription of BGCs and metabolite production be stimulated in the cyanobacterial strains? In principle, different abiotic and biotic conditions can be tested randomly. This approach is commonly designated as OSMAC approach (one strain – many compounds).54 However, a more rational approach can also be taken, for example if there are regulatory factors or transport proteins that have already been characterized in other bacteria in the vicinity of BGCs and when a functional hypothesis can be deduced. (2) Is the strain amenable to genetic manipulation? Accessibility to genetic engineering offers many possibilities such as overexpression of regulatory components and characterization of biosynthetic intermediates and tailoring enzymes. (3) Is the purpose of the analysis primarily to characterize a BGC and develop a production strategy? Then, heterologous expression is an increasingly successful choice for cyanobacterial BGCs.55 It depends on the ultimate goal which production hosts are suitable, with the focus being especially on E. coli and well manipulable model strains of cyanobacteria. (4) Are biochemical reactions and precursors for the assembly of compounds well predictable? Then in vitro reconstitution offers an elegant way to characterize BGCs or even to produce bioactive compounds. All methodological approaches are associated with individual challenges. Very often, different methodological options are being combined. Combining alternative technologies is particularly important because of the named bottlenecks in cyanobacterial natural product research, their slow growth, frequent contamination with heterotrophic bacteria, and poor accessibility for genetic manipulation. Hereafter, we will dissect the progress in the different methodological fields.
In this context, a number of studies on cyanobacteria have already implemented advanced genomics-metabolomics strategies to dig into the genomes, prioritise and connect the biosynthetic machineries with their products.11,17 Genome-guided methodologies can expand chemical families by interrogating the genome in different strains and searching for variants of known BGCs or enzymes, which coupled with mass spectrometry (MS)-based strategies, allows the assignment of BGCs to products. For example, new cyanobactin linear peptides were found after genome mining of 126 cyanobacterial strains. Using BLAST searches for the signature proteases PatA and PatG, 31 putative cyanobactin BGCs were identified. A closer inspection of these sequences predicted the presence of cyanobactins containing not only the characteristic thiazoles but also methylation(s) and prenylation(s) as further posttranslational modifications in Microcystis aeruginosa PCC 9432 and Oscillatoria nigro-viridis PCC7112. The liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis identified peptides with these key features and guided the discovery of aeruginosamide B (Fig. 4, 6) and C and viridisamide A.65 The pairing between particular enzymatic domains within BGCs and MS or NMR signatures is an effective tool that can work in both ways, finding the characteristic metabolites using BGC prediction in complex extracts or deorphanizing BGCs with known distinctive compounds. For instance, halogenases can be linked with the MS isotopic patterns of halogens as exemplified in the discovery of columbamides A, B and C in Moorena bouillonii PNG. A paired genomic and metabolomic comparison of three Moorena strains correlated a novel BGC in M. bouillonii that encoded cryptic halogenase domains with two clusters of halogenated compounds found exclusively in M. bouillonii using Global Natural Products Social Molecular Networking (GNPS) analysis.66,67 Compounds of these clusters were selected for isolation leading to the structure elucidation of the columbamides and allowing assignment of the previously unknown col BGC.68
The capacity to identify the isotopic patterns can be further exploited by stable isotope-labelled supplementation experiments based on the BGC features. Applying a lipid-version of the genomisotopic approach, new fatty acids (FA)-derived compounds were discovered using supplementation experiments with deuterated FA and comparative metabolomics. After proving the absence of a beta-oxidation pathway in cyanobacteria, which supports the prevalence of FA-derived compounds, supplementation experiments were designed to detect labelled metabolites in different cyanobacterial strains containing FA-incorporating enzymes. Comparative metabolomics between supplemented and non-supplemented cultures was performed by LC-HRESIMS and combined with data processing in MZmine,69 whose modules allow filtering the peak list, for example by adducts. Feature-based molecular networking using GNPS and MS-guided isolation focusing on unknown features led to the identification of new hapalosin analogues in Fischerella sp. and the nocuolactylates A (Fig. 4, 7) and B in Nodularia sp.70 In another experiment, labelling with stable 15N was performed in Nostoc sp. UIC 10630 to correlate BGCs with their metabolites by pairing the prediction of nitrogen atoms according to the amino acid specificity of adenylation domains with the number of nitrogen atoms identified by comparative metabolomics between labelled and unlabelled experiments. This approach was able to deorphanize three of the six BGCs predicted to contain nitrogen in their products, a new anabaenopeptin analogue, a new compound named nostopyrrolidonamide and also the known aeruginosin 865 whose cluster was formerly unidentified.71
As can be seen, retro-biosynthetic strategies can be useful to connect known compounds with their BGCs provided that the order of BGC modules shows collinearity with the order of building blocks in the compounds, as is common in type I PKS, NRPS and hybrid PKS–NRPS pathways. This tactic was applied to assign the biosynthetic pathway of aeruginoguanidines (AGDs), and then the microguanidines (MGDs) that were co-assigned to the same BGC in Microcystis strains.51 According to the structural features of AGD the predicted pathway was expected to involve an NRPS with specificity for L-arginine and tailoring enzymes such as a prenyltransferase and a sulfatase/sulfotransferase. A candidate BGC fulfilling the expected features was found in the AGD-producing strain M. aeruginosa NIES-98 and screened in other public Microcystis genomes. Subsequent analysis of metabolomes using LC-MS/MS revealed the presence of AGD in eleven strains of Microcystis. Molecular networking analysis expanded the AGD compound family with new variants AGD-98A (Fig. 4, 8), AGD-98B and AGD-98D, but unexpectedly also revealed the presence of another cluster in the same strains, comprising shorter MGD variants. Isolation and structure elucidation of the MGD metabolites, revealed new variants and new intermediates, the microguanidines amides MGA-771, and MGA-787. Analysis of the presence/absence of the compounds and genes in the different strains by a phylogenetic approach uncovered the unprecedented biosynthetic versatility of the AGD/MGD/MGA pathway and showed that the production was mutually exclusive with microcystin.51
(Meta)-genomics has boosted the exploration of cyanobionts with several successful examples linking biosynthesis and chemistry, as it can be seen in the recent review of D'Agostino.11 The development of these technologies also facilitates the study of collections of microorganisms, rather than single organisms, and computational networking approaches enable the visualization and data analysis of large datasets. Following a phylogenetics-guided multi-omics study, the genome and metabolome of 24 tropical filamentous cyanobacteria were compared to evaluate the natural product potential and to prioritise strains.72 The sequence similarity networking of BGCs was created from the meta-genomic assemblies of the environmental cyanobacteria using antiSMASH,22 BIG-SCAPE73 and the MIBIG27 database to assign the GCFs. The BGC similarity network was integrated with a classical GNPS molecular networking generated using LC-MS/MS and in silico annotation tools such as Network Annotation Propagation,74 DEREPLICATOR+,75 and MolNetEnhancer76 for the chemical classification. Interestingly, the information of both networks highlighted, aside from the metabolic potential of the strains, some of the challenges in the assignment of cyanobacterial BGCs, as despite the fact that the majority of the molecular families had a spectral match, many BGCs were still orphans. Specifically, a variety of peptides were predicted in the BGC analysis, while in the metabolomics analysis, they were underrepresented compared to lipidic molecules. This pointed out the need for advanced genomic engineering approaches to enhance the production of the cryptic peptides or to develop strategies to facilitate their detection.
The power of genome and metabolome mining strategies can be complemented with further omics technologies. Peptidogenomics and de novo peptide sequencing are emerging as new approaches linking peptides with MS data through automated processes in tools like NRPquest77 for NRPS or MetaMiner78 and DeepRipp79 for RiPPs. Although proteomics has been less integrated within the workflows, it can be useful to leverage the information of the enzymes involved in the biosynthesis of the secondary metabolites. Available metadata about the samples can inspire experiments and guide the prioritisation. This can include taxonomy and phylogenetics, as we have seen, but also bioactivity, growth conditions, and phenotypes of mutant strains. As shown, the majority of studies on cyanobacteria follow the pattern-based approach, focusing on the presence/absence of metabolites and BGCs across strains. However, other methods can further explore cyanobacteria, such as the correlation-based approach, which adds metrics to score a given BGC-metabolite connection (metabologenomics),80 and the feature-based approach, supported by in silico tools and metabolite prediction, as the multi-omics study previously described.72 The above mentioned peptidogenomics approach also falls into this category. Another emerging method is the combination of feature and correlation-based approaches like applied by the NPLinker software framework.81 Through many advances in omics technologies, bioinformatics and machine learning, the opportunities for mining the untapped biosynthetic potential of cyanobacteria natural products are steadily increasing.
A group of metabolites in cyanobacteria that can be induced by changing the macro elemental composition in the growth medium are siderophores that are stimulated by iron deprivation. Until recently, very few cyanobacterial siderophores were known such as the hydroxamate siderophores schizokinen and synechobactin and the catecholate siderophore anachelin.83 A recent study that has used a genome-based approach led to the discovery of cyanochelins, a family of NRPS-derived siderophores containing β-hydroxyaspartate moieties (Fig. 5, 9).84 Further, changing the trace elemental composition can elicit production of specialized metabolites. An impressive example was recently shown for the eagle-killing toxin aetokthonotoxin (AETX).85 While the toxin was detected in environmental samples growing on the invasive water plant Hydrilla verticillata, it could not be identified in isolated laboratory strains. The addition of potassium bromide to the medium resulted in stimulation of AETX production.85 However, the reasons for the stimulating influence are different for the siderophore example and the AETX example. While the former are transcriptionally stimulated by iron deficiency, since the corresponding BGCs are typically controlled by specific transcription factors of the Fur family,83 bromide is a direct precursor of AETX, without which production of the toxin is not possible.85
![]() | ||
Fig. 5 Examples of cultivation-based discovery of cyanobacterial metabolites following BGC activation by iron limiting cultivation conditions, UV induction or high-density cultivation. |
Changing physical parameters during growth can also induce cyanobacterial specialized metabolites. Well-known examples of such inducible metabolites are the sunscreen compounds scytonemin (Fig. 5, 10) and mycosporic acids (MAAs, Fig. 5, 11).86 While MAAs are also known from other phyla such as fungi, microalgae, macroalgae and heterotrophic bacteria, scytonemin is exclusively synthesized by cyanobacteria.86 Scytonemin responds rather specifically to induction by UV-A and B,87 while MAAs are induced by various stress conditions including UV stress, salt stress and desiccation,88 MAAs are widespread among cyanobacteria and have been described in numerous variants. The inducibility of sunscreen compounds has been investigated both using laboratory strains and in field studies and is generally linked to the induction of transcription.89,90 Production of scytonemin, in particular, seems to be highly relevant for the adaptation to the extreme light conditions in deserts and is able to filter out damaging UV light portions. Scytonemin production was also linked to localized warming in soil crust communities.90 In addition to their ecological significance, scytonemin and MAAs are also among the cyanobacterial metabolites with potential for application in cosmetics and medicine.86
A combination of chemical and physical parameters again is changed by the above-mentioned HD cultivation. In this cultivation apparatus, both high light irradiation and higher amounts of the carbon source CO2 are being used. In addition, cyanobacteria grow as a dense biofilm. The use of this cultivation technology in N. punctiforme PCC 73102 and other Nostoc strains led not only to efficient growth but also reprogramming of specialized metabolism, both qualitatively and quantitatively.43,53 Transcriptional analysis of BGCs revealed that at least 50% of BGCs were strongly stimulated under these conditions and that a combination of high light conditions, high CO2 availability and medium factors are triggering both transcription and production of compounds.53 As proof of principle, HD cultivation of the strain Nostoc sp. KVJ2 led to the discovery and structural elucidation of three new anabaenopeptin variants, KVJ827 (Fig. 5, 12), KVJ841 and KVJ811 which showed allelopathic activity against other Nostoc strains from the same habitat.43
The influence of biotic interactions on the production of specialized compounds in cyanobacteria is generally poorly studied. The fact that many talented producers still have accompanying heterotrophic bacteria and that the effects of specific interactions are thus difficult to dissect certainly plays a role here. A comparison of axenic toxic and nontoxic Microcystis strains in mono- and coculture revealed an intraspecific quantitative variation of major peptide classes and led to the discovery of new cyanopeptolin analogues.91 Co-cultivation of Microcystis with a heterotrophic bacterial community again resulted in negligible effects on the intracellular specialized metabolome. Remarkably, no extracellular levels of the known peptide classes in Microcystis were detected when the cyanobacterium was accompanied by a heterotrophic consortium suggesting an ability of the community to degrade structurally different compounds or a physiological suppression of their secretion.92 Hence, while intra- and interspecific biotic interactions certainly have an impact on the intra- and extracellular metabolome of Microcystis, they did not lead to the genome-based discovery of entirely new classes of compounds. A comparison of the metabolome of the symbiotic cyanobacterium N. punctiforme in the free-living state and in physical association with the plant host Gunnera manicata using MALDI-imaging showed major differences in the metabolite profiles with nostopeptolides being downregulated and a number of novel metabolites being upregulated in planta.93 Since only limited amounts of symbiotic tissue were available the study did not lead to the description of a new metabolite family either. Yet, it gives hints that biotic interactions with eukaryotic partners may lead to a more pronounced impact on the specialized metabolome than interaction with other prokaryotes.
In summary, cultivation-induced biodiscovery is a promising approach for cyanobacteria. However, there are only incidental studies with a focus on single factors. To generalize conclusions regarding the impact of abiotic and biotic factors on the specialized metabolome high throughput approaches are needed using a highly parallelized approach. Development of such approaches requires minimization of culture volumes and ideally includes an automated control of cultivation conditions.82 One methodology that is now commonly used for other microorganisms is the microfluidics technology.82 Although this technology requires optimization for filamentous cyanobacteria, it could provide a means for semi-automated analysis of cyanobacteria in a miniaturized format. Thereby, both the impact of abiotic factors as well as biotic factors could be tested in a larger format. There are certainly additional ways to rationalize metabolite induction. The AntiSMASH platform already gives some hints on the possible specificity of transcription factors and their regulation.22 There is also much evidence that environmental cues commonly present in the habitat of the given organisms may play a particular role in the induction of specialized metabolites. Ultimately, it depends on the range of biological and metabolic properties of individual strains, which conditions could lead to an altered special metabolite profile. Since variations in cultivation conditions are easily feasible for cyanobacteria of all genera and habitats, the general potential of cultivation-based approaches is far from exhausted.
One of the few strains with great potential for natural products production that is accessible to genetic manipulation is the symbiotic strain N. punctiforme PCC 73102. For this strain, a library of transcriptional reporter mutants could be generated, in each case fusing the 5′UTR region containing the putative promoter region of BGCs of the NRPS, PKS, or RiPP types with the CFP reporting gene.53 As mentioned above, this reporter library was used to test the transcriptional response to HD cultivation in high throughput format. Because several RiPP reporting strains showed a pronounced upregulation after HD cultivation, the study was combined with a targeted metabolomic search for the hypothetical RiPP products. Thereby, the new microviridin N1–N9 variants were discovered (Fig. 6, 13).53 Microviridins had previously been described primarily in Microcystis and Planktothrix, but microviridins of N. punctiforme are distinguished by a variable chain length at the N-terminus and were overlooked prior to the reporter-guided study. While this is an example how transcriptional reporters can facilitate genomic mining of novel compounds there are certainly further ways to use the library in the future, such as BGC elicitor screening or screening of biotic interactions with either prokaryotic or eukaryotic organisms.
Strain N. punctiforme PCC 73102 (alternatively designated as N. punctiforme ATCC 29133) was also the only strain in which a cyanobacterial natural product BGC could be assigned using a forward genetic approach. The biosynthetic pathway for the sunscreen compound scytonemin was originally discovered using random transposon mutagenesis.87 Besides, several BGCs could be inactivated in non-model cyanobacteria using a directed reverse genetic approach. First, the microcystin biosynthetic pathway was knocked-out in the genus Microcystis.97 In the same genus, the BGCs for aeruginosin,98 micropeptin,99 and MAA88 were also successfully manipulated. Notably, mutagenesis has only been successful in two Microcystis strains: M. aeruginosa PCC 7806 and M. viridis S-70.99 Other cyanobacterial BGCs that could be inactivated include the anabaenopeptilide BGC in Anabaena sp.91,100 the microcystin and aeruginoside BGCs in Planktothrix NIVA-Cya126 (ref. 101 and 102) and the pks2 BGC in N. punctiforme PCC 73102.103 In all cases, mutagenesis contributed to the assignment of natural products and their BGCs. Only in the case of the cryptic pks2 BGC of N. punctiforme PCC 73102, no product could be assigned so far, probably due to the low expression level of the BGC.53 In some cases, mutagenesis has contributed essential insights into the biosynthetic mechanism. For example, mutagenesis of the aerD, E, and F genes and subsequent feeding experiments demonstrated their involvement in the biosynthesis of the characteristic Choi moiety of aeruginosins in P. agardhii NIVA-Cya 126 (Fig. 6, 14).102 Mutagenesis of the tailoring enzyme McyJ in P. agardhii NIVA-Cya126 allowed isolation of microcystins lacking the O-methylation at the ADDA moiety (Fig. 6, 15), thereby identifying McyJ as responsible O-methyl transferase.101
As mentioned earlier, a positive side aspect of biosynthetic mutants is their potential for studies of biological function. In particular, the microcystin-free ΔmcyB mutant has been intensively used for years for research on microcystin function. In this context, a close connection of microcystin with primary carbon exchange could be demonstrated. In cyanobacteria, the toxin binds to key proteins of the Calvin–Benson–Bassham cycle including the CO2-fixing enzyme RubisCO.104 MAA mutants were also used for functional analyses. It was shown that the loss of MAA did not affect growth under UV stress, but did affect the structure of the EPS layer of strain M. aeruginosa PCC 7806. This observation strengthens the hypothesis that MAA is not produced primarily for UV protection and that the sunscreen effect may be a useful side effect of the compound family.88 The pks2-mutant was used as an indirect tool to explore the role of nostopeptolide in N. punctiforme PCC 73102. Since nostopeptolide was greatly reduced in the mutant, it was shown that nostopeptolide represses or stimulates the development of motile hormogonia filaments in a concentration-dependent manner, thereby having a significant impact on the symbiotic interaction of N. punctiforme with the host Gunnera manicata.93 Loss of constitutively produced cyanopeptides often leads to increased production of other cyanopeptides, apparently because unused resources become available. For example, loss of microcystin in M. aeruginosa PCC 7806 leads to increased production of cyanopeptolin and microcyclamides,91 and loss of anabaenopeptilide in strain Anabaena 90 leads to increased production of anabaenopeptin.105 These observations may also contribute to the optimization of production strains in the future.
The strain N. punctiforme PCC 73102 is also suitable for overexpression of transcriptional regulators. There are shuttle plasmids that replicate in both E. coli and N. punctiforme. Applying a shuttle plasmid technology, it has recently been possible to overexpress an AraC type positive regulator for the previously cryptic pks1 pathway.106 Upregulation of the pks1 pathway, in turn, led to the global stimulation of the specialized metabolism in N. punctiforme, in particular under high density cultivation conditions. To identify the responsible signals, a reporter mutant was used to evaluate the response of the pks1-dependent cryptic lanthipeptide pathway ripp4. The bioactivity-targeted strategy led to the discovery and high titer production of two signal molecules: nostovalerolactone (Fig. 6, 16), representing a new family of compounds derived from a tetronate-like pathway in N. punctiforme, and nostoclide(s) N1 and N2, representing new members of the cyanobacterin family (Fig. 6, 17).106
Although only a few strains are amenable to genetic manipulation, insights gained through genetic analyses can be very helpful in making screening of cyanobacterial extracts more efficient and in developing individual strains for synthetic biology applications. However, further research and development is needed to catch up to known microbial model systems such as in Streptomyces or Aspergillus strains.94,95 For the further development of individual strains as production strains, knowledge on the regulation of secondary metabolism must be further deepened. The power of engineering strategies that exploit underlying regulatory mechanisms for the targeted overproduction of individual substances has been demonstrated, for example, for proteobacteria of the genera Xenorhabdus and Pseudomonas. Once it was known that BGC expression was primarily controlled post-transcriptionally, knocking out the RNA chaperone Hfq was able to abolish global production of specialized compounds. Against this background, individual BGCs can be specifically activated and optimized.107 Similar strategies are also conceivable for individual cyanobacterial strains in the future and can greatly advance the biotechnological potential of cyanobacteria for the production of bioactive natural products.
Unsurprisingly, early attempts at heterologous expression of cyanobacterial natural products have focused on RiPPs as their BGCs are comparably small and as such easily handled by established molecular biological techniques. Moreover, RiPPs solely rely on the presence of proteinogenic amino acids and the ribosomal translational machinery,110 their BGCs normally encode all enzymes necessary for post-translational product modification and they do not require activation by off-site factors. Consequently, RiPPs make up a large proportion of cyanobacterial natural products that were reported to be produced heterologously.111–118 In these pioneering works, E. coli was the obvious choice of expression host as the BGCs were introduced on established vectors with little or no modifications necessary to the genetic sequences or the host metabolism. This situation is drastically different when BGCs of the NRPS and/or PKS type are to be analyzed. Not only do these gene clusters tend to be much larger than those of RiPPs, which makes cloning more difficult,110 but there is also a certain degree of incompatibility between elements of transcriptional regulation, most prominently promoters, from cyanobacteria and E. coli.110,119–121 The most important impediment, however, is that the metabolism of E. coli needs to be primed for the expression of NRPS/PKS type of BGCs, primarily because they require the activity of a suitable phosphopantetheinyl transferase (PPTase) to activate their cognate carrier proteins.122,123 In connection to that, the introduction of additional factors is often needed to meet the substrate and co-factor requirements of each individual BGC as they tend to rely on the availability of rare non-proteinogenic amino acids or their precursors. In a case-dependent manner, these problems can be solved either by supplying the necessary factors with the growth medium or by genetically engineering the host metabolism to provide off-site auxiliary enzymes. Despite this, E. coli has undeniable advantages that still uphold its attractiveness as a host for BGC heterologous expression. The most convincing arguments for E. coli are the plethora of well-established genetic tools, the straightforward cultivation and the short doubling times. Accordingly, many attempts have been made to adapt E. coli to the expression of large BGCs and to rationalize workflows. Recent advances in DNA assembly, for instance, have facilitated the handling of large genetic constructs. A method based on Gibson assembly termed “direct pathway cloning” (DiPaC,124–126 and the transformation-associated recombination (TAR) approach)127,128 have successfully been employed to capture large BGCs. The necessity to provide a constitutive PPTase has also been addressed by the generation of two widely used E. coli strains, BAP1 (ref. 129) and GB05-MtaA,120 which express promiscuous Sfp-type PPTases from Bacillus subtilis and the myxobacterium Stigmatella aurantiaca, respectively. With these tools in hand, a number of NRPS/PKS derived cyanobacterial compounds have been successfully produced in E. coli. Among those are the dermatotoxin lyngbyatoxin (Fig. 7, 18 (ref. 130)), the infamous hepatotoxin microcystin-LR (1),120,131 the depsipeptide hapalosin (Fig. 7, 19 (ref. 132)) and, most recently, the natural herbicide cyanobacterin124 and the protease inhibitor microginin.125 Frequently, the products detected in E. coli did not fully reflect the compound spectrum found in the natural producers, however, these expression systems still can provide very valuable mechanistic insight (e.g. with cyanobacterin124) or can serve in establishing an accessible platform for biocombinatorial production of novel congeners (e.g. with microcystin-LR131). Therefore, E. coli will doubtlessly continue to maintain its status of a major heterologous expression system, as evidenced by the high number of current research efforts in that field.
![]() | ||
Fig. 7 Examples of cyanobacterial metabolites heterologously produced in either E. coli strains or cyanobacterial model strains. |
Despite the accessibility and adaptability of the E. coli system, there have been many examples where the production of cyanobacterial natural product proved to be very troublesome or downright impossible. The resulting demand of alternative expression systems is best illustrated with the case of lyngbyatoxin A, an NRPS-type compound and promising drug lead.133 While its heterologous production was altogether unsuccessful in Streptomyces coelicolor134 and required the exchange of the cyanobacterial native promoters in E. coli,130 it was easily achieved in the filamentous cyanobacterium Anabaena sp. strain PCC7120 (Anabaena 7120,128). This follows the widely accepted notion that heterologous expression of complex metabolites is best carried out in hosts more closely related to the original producer. As a number of cyanobacterial model strains with fairly advanced genetic tools have been available for more than two decades, scientists have started to investigate these strains for their suitability as expression hosts for natural product BGCs. The three most widely used strains of cyanobacteria are Synechocystis sp. PCC 6803 (Synechocystis), Synechococcus elongatus PCC 7942 (S. elongatus) and Anabaena 7120 and indeed, heterologous expression of cyanobacterial BGCs has been attempted in all of them (Fig. 7). Gauging the success of these approaches, however, Anabaena 7120 has emerged as the clear favorite. While there are two reported cases where Synechocystis and S. elongatus were employed for the successful production of small or partial BGCs such as the MAA shinorine (Fig. 7, 20 (ref. 135)) and a sponge-derived polybrominated diphenyl ether (Fig. 7, 21 (ref. 136)), in recent years the products of complex, full-size NRPS/PKS-type BGCs were isolated with comparably high yields from Anabaena 7120. In retrospect, this does not seem very surprising, because the genus Anabaena displays a number of traits that appear to be beneficial for the production of NRPS/PKS type compounds and that are absent from the two other strains. First and foremost, while Anabaena encodes several NRPS/PKS BGCs15 together with an active, very promiscuous Sfp-type PPTase,123 it does not seem to produce any bioactive compounds that may potentially disturb expression of foreign BGCs.128 Additionally, many cyanobacterial promoters seem to work well in Anabaena 7120 and its genetic toolbox is constantly being expanded.128,137–140 With their groundbreaking work, two groups in particular have paved the way in establishing Anabaena 7120 as a promising host for the heterologous expression of cyanobacterial natural products. Implementing their lyngbyatoxin expression platform, Philmus and coworkers were not only the first to report the successful heterologous production of a complex NRPS-type compound in Anabaena 7120, they were also able to take their system a step further, employing biocombinatorics to produce additional indolactam-V derivatives by swapping in tailoring enzymes from different genera.127,128 Most recently, the group reported the successful production of the meroterpenoid tolypodiol and some of its analogs from a 21 kb BGC.141 While in these cases the BGCs were expressed from replicative plasmids, Golden and colleagues have successfully produced cyanobacterial compounds from BGCs that were integrated into a neutral site of the chromosome of Anabaena 7120. In their first reported case, they initially attempted to have S. elongatus produce the putative anti-infective compound cryptomaldamide (Fig. 7, 22) from a 28.7 kb NRPS/PKS megasynthase BGC.142 Despite many efforts to induce biosynthesis, they were only successful once they changed the heterologous host to Anabaena 7120. More recently, the group utilized their expression system for the production of columbamides (Fig. 7, 23), a family of chlorinated acyl amides, from a 28.5 kb NRPS/PKS hybrid BGC.143 What makes this study stand out is the fact that apart from the discovery of new high potential compound analogs, the BGC in question was merely proposed, at the time, to code for the columbamide pathway and was thus experimentally confirmed by heterologous expression.
Evidently, substantial progress has been made in recent years, with both the E. coli and the cyanobacterial system. Both offer certain advantages over the other, with E. coli being better established and more accessible and cyanobacteria holding more promise with regard to enzyme fidelity and product biochemistry. Both systems suffer from a lack of predictability, as there are still no clear indicators available from which to infer host compatibility or to estimate production levels. Indeed, so far neither system has proven to be advantageous in this respect, as examples for very high and very low yields have been reported for both.55 The most convincing argument in favor of cyanobacteria as heterologous host, not only for natural product research, is the great potential for sustainable, green biotechnology. From the progress that has been made in recent years and with cyanobacterial expression systems being now in the focus of many efforts to optimize their handling and performance, the development of potentially carbon-negative production strains that can offset the advantages that E. coli still holds, seems to be only a matter of time. This could also go a long way towards fully unlocking the cyanobacterial natural product potential by finally enabling the systematic introduction of orphan BGCs into heterologous hosts and the identification of their cognate natural products.
Recent elegant examples of in vitro reconstitution of cyanobacterial pathways are the aforementioned eagle-killing toxin aetokthonotoxin (Fig. 8, 24),146 the core of the bioherbicide cyanobacterin (Fig. 8, 25),124 or the potent anticholinesterase neurotoxin guanitoxin (Fig. 8, 26).147 For the biosynthesis of aetokthonotoxin (AETX) Adak and Lukowski et al. showed that the freshwater cyanobacterium Aetokthonos hydrillicola uses an efficient, five-enzyme biosynthetic pathway to convert two molecules of tryptophan to yield this structurally unique pentabrominated biindole nitrile by assembling two functionalized indole monomers that are then linked by biaryl coupling.146 Of the five biosynthetic enzymes, two stand out for their potential as biocatalysts, namely the unique single-component, flavin-dependent tryptophan halogenase AetF, which does not require an accompanying reductase partner, and the iron-dependent nitrile synthase AetD, which builds a nitrile functional group via unprecedented rearrangement chemistry. In addition to revealing novel chemistry, the identification of the AETX BGC may help to identify related BGCs and thus help to discover new environmental toxins.
When the potent photosynthesis inhibitor cyanobacterin was isolated from Scytonema hofmanni in 1982 it was the first chlorinated natural product identified from freshwater cyanobacteria.148 Four decades later D'Agostino et al. took advantage of this property and identified the cyanobacterin (cyb) biosynthetic gene cluster by targeted bioinformatic screening for halogenase-encoding genes.124 After validation of the BGC by heterologous expression in E. coli, the authors were able to break down the biosynthesis of the furanolide core structure to four enzymes by gene knockouts and the development of a one-pot biocatalytic in vitro synthesis. At the heart of the reaction sequence the furanolide synthase CybF fuses two molecular building blocks, which are provided by CybB, CybC and CybE to form the furanolide core. Since members of the furanolide natural product family such as nostoclides106,149 or maculalactones150 are more widespread in cyanobacteria, the elucidation of furanolide biosynthesis paves the way for their targeted discovery, biosynthetic engineering and enzymatic synthesis.
Guanitoxin and cyanobacterin have in common, that their BGCs have remained elusive for decades. To identify candidate guanitoxin BGCs in Sphaerospermopsis torquesreginae ITEP-024, Lima et al. focused on genes associated with the known guanitoxin precursors (S)-4-hydroxy-L-arginine and L-enduracididine.147 This led to the identification of a candidate gnt BGC whose involvement in guanitoxin biosynthesis was confirmed by the impressive in vitro reconstitution of a nine-step metabolic pathway starting from L-arginine. The elucidation of the genetic basis of guanitoxin biosynthesis now allows environmental biosynthetic gene monitoring of this lethal neurotoxin, which shares an identical mechanism of action with the synthetic chemical warfare agent sarin and is one of the most potent cyanotoxins known today.
These three studies are prime examples of how in vitro reconstitution can help to elucidate biosynthetic pathways. However, the in vitro reconstitution of individual enzymatic steps or biosynthetic sequences also offers the possibility of exploiting these pathways chemo-enzymatically. Among the diverse compound classes found in cyanobacteria, RiPPs are particularly well suited for this approach due to the synthetic accessibility of precursor and core peptides by solid-phase peptide synthesis (SPPS), the hypervariability of the peptides, and the relaxed substrate specificity and modularity reported for many RiPP-modifying enzymes.
A recent study perfectly exemplifies the great potential of combining hypervariable precursor peptides with highly promiscuous RiPP-modifying enzymes in vitro. By incubating 120 peptide substrates from a synthetic positional scanning library with the macrocyclase PagG and the prenyltransferase PagF from the cyanobactin prenylagaramide biosynthetic pathway Sarkar et al. generated a library of more than 100 prenylagaramide-like cyclic peptides and their prenylated derivatives.151 Interestingly, the promiscuity of cyanobactin prenyltransferases towards the prenyl donor can also be exploited chemo-enzymatically. For this, a synthetic alkyl pyrophosphate analogue was used to introduce a reactive moiety into a tryptophan-containing cyclic peptide. This reaction was catalysed by the N1-tryptophan prenyltransferase AcyF from the anacyclamide A8P pathway. After the conversion, click chemistry was used to fluorescently label the enzymatically modified peptide.152
Another cyanobacterial RiPP family that has been the subject of multiple chemo-enzymatic studies are the graspetides. These compounds, which include microviridins, are defined by the presence of ester or amide side-chain linkages that are installed by ATP-grasp ligases, resulting in peptide macrocycles.153 While the ATP-grasp ligases in microviridin biosynthesis catalyse cyclisation reactions in strict order and with stringent ring size requirements, they tolerate variations in the amino acid composition of the core peptide.154 This property was exploited by Reyna-González et al. in the synthesis of a microviridin-based library of protease inhibitors.155 In one-pot chemo-enzymatic reactions synthetic microviridin K and microviridin B core peptides, varied by all 20 proteinogenic amino acids at the position responsible for protease specificity, were transformed by two ATP-grasp ligases that were constitutively activated by covalently attached leader peptides, and a GNAT-type N-acetyltransferase. This leader peptide-free approach, which was inspired by a study from lantibiotic biosynthesis154 is highly efficient, since the relatively short core peptides can be synthesized much easier and at reduced costs compared to full-length RiPP precursor peptides. Later, the chemo-enzymatic synthesis platform was used to synthesize four microviridins encoded by the cyanobacterium Cyanothece sp. PCC 7822, providing a powerful example of culture-independent genome mining.156 In addition, Scholz et al. used the platform to introduce functional tags into different microviridin variants with the help of modified core peptides, yielding biotinylated, dansylated or propargylated congeners as tool compounds to further investigate the biology of microviridins.157 As a proof of concept, dansylated microviridin B (Fig. 8, 27) was used as a diagnostic tool to selectively label elastase in protease mixtures.
The examples given in this section illustrate how in vitro enzyme reconstitution facilitates the study and exploitation of cyanobacterial metabolic pathways. However, cyanobacteria also exhibit amazing enzymology that could be exploited beyond single pathways. A remarkable example is the discovery of non-canonical protein splicing by the radical S-adenosylmethionine (rSAM) enzyme PlpX from Pleurocapsa sp. PCC 7319. In a recent study Morinaka et al. showed that PlpX, together with the helper protein PlpY, catalyzes backbone carbon–carbon bond cleavage and the net excision of tyramine, resulting in the formation of α-keto-β-amino acids (Fig. 8).158 The reaction has been used to incorporate diverse and multiple β-amino acids into various RiPP precursors and recombinant proteins in E. coli.158,159 In addition to expanding the set of basic amino acid building blocks in peptidic natural products and proteins, tyramine splicing yields keto functions that can be readily exploited as orthogonal reaction sites for chemical diversification.158,160
Another impressive example of unprecedented enzyme catalysis in the biosynthesis of secondary metabolites comes from Cylindrospermum licheniforme ATCC 29412. Nakamura et al. were able to show that in cylindrocyclophane biosynthesis, chlorination of an unactivated carbon center by the halogenase CylC sets the stage for an enzymatic dimerization reaction catalyzed by CylK.161 This reaction features sequential, stereospecific alkylations of resorcinol aromatic rings by the alkyl halide electrophiles.161,162 Notably, alkylation of aromatic rings with alkyl halides is an important transformation in organic synthesis. However, an enzymatic equivalent was unknown. Interestingly, homologs of CylC and CylK cluster together with a number of additional enzymes in other unrelated cyanobacterial gene clusters, suggesting that related C–C bond formation strategies involving cryptic chlorination may be more widespread in the biosynthesis of cyanobacterial secondary metabolites.161 This, together with the prominent role of aryl–alkyl bonds in pharmaceuticals and industrial chemicals, makes CylC and CylK promising targets for future efforts in biocatalysis and metabolic engineering.
The selected enzymatic studies underline the extraordinary biosynthetic potential of cyanobacteria. As emphasized already, in vitro reconstitution is independent of cultivation and therefore applicable to the broad mass of cyanobacterial natural product producers. If the transformation of substrates succeeds in vitro, a wide range of possibilities for diversification and functionalization of metabolites arise. However, the amounts of substrates and their conversion cannot be arbitrarily upscaled. Individual chemical reactions, for example in the chemical synthesis of peptide precursors by solid-phase peptide synthesis, are also not very environmentally friendly. Therefore, in vitro (bio)synthesis is not always the appropriate method of choice. However, its importance for the study of biosynthetic mechanisms is central. In particular, linkage with structural biology analyses of enzymes can rationalize the production of new-to-nature libraries of metabolites. Enzymes are at the core of cyanobacterial natural product research and provide a hub for further exploitation.
This discrepancy is due to the strong focus in engineering tool development on a few unicellular cyanobacterial model strains. Future cyanobacterial natural products research needs to drive method development for appropriate filamentous strains themselves. This path is already being followed for the strain Nostoc sp. PCC 7120 and N. punctiforme PCC 73102. So far, these attempts are hardly on par with heterologous expression of cyanobacterial BGCs in E. coli. However, studies on production strains in e.g. Actinobacteria and Proteobacteria have shown that there is still much room for improvement in the optimization of production titers in original producers. This effort is definitely worthwhile because it can give new impulses to the whole cyanobacterial natural product research and biotechnology. To this end, the spectrum of suitable strains should be expanded and insights into the regulation of BGCs should be further gained and explored. These efforts also depend on understanding microbial physiology and biotic interactions among strains. In this way, microbiology potentially plays a key role in future cyanobacterial natural products research.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3np00045a |
This journal is © The Royal Society of Chemistry 2024 |