Christine E. Salomona, Nathan A. Magarveya and David H. Sherman*b
aDepartment of Microbiology, University of Minnesota, Minneapolis, MN 55455, USA. E-mail: csalomon@umn.edu; magar001@umn.edu
bDepartment of Medicinal Chemistry, University of Michigan, Ann Arbor, MI 48109, USA. E-mail: davidhs@umich.edu
First published on 15th December 2003
Covering: 1999–2003
Marine invertebrates and a growing number of marine bacteria are the sources of novel, bioactive secondary metabolites. Structurally, many of these compounds appear to be biosynthesized by polyketide synthases (PKS) and/or nonribosomal peptide synthetases (NRPS) that have also been found in terrestrial microbes. This review highlights scientific advances from 1999–2003 in the emerging field of molecular genetics of polyketide and nonribosomal peptide natural products isolated from marine organisms. The implications of this research towards the development of marine secondary metabolites as a sustainable source of new drugs are discussed.
![]() Christine E. Salomon | Christine E. Salomon, born in 1972 in Morristown, NJ, received her B.S. with honors in Marine Biology from Southampton College, Long Island University in 1995. After an undergraduate internship in the laboratory of the late Professor D. John Faulkner at Scripps Institution of Oceanography, University of California, she joined his research group to study the cellular localization of natural products in marine invertebrates and received her PhD in 2001. She is currently a MinnCResT NIH postdoctoral fellow in the laboratory of Professor David H. Sherman at the University of Minnesota. Her research interests include the molecular genetics of natural products, the location of their biosynthetic genes in complex microbial/invertebrate assemblages and their roles in the ecology and biology of marine organisms. |
![]() Nathan A. Magarvey | Nathan A. Magarvey was born in Annapolis Royal, Nova Scotia (Canada) in 1974. He studied biochemistry at Dalhousie University (Canada), working as a research assistant under Professor Leo C. Vining, studying the shikimic acid pathway derived antibiotic, chloramphenicol. Following this he moved to Pearl River, NY (USA) to work in the Natural Products Research section at Wyeth Research, studying the genetics and biochemistry of the lipoglycopeptide antibiotic, manno-peptimycin. In September 2001 he moved to the University of Minnesota (USA) for PhD studies under Professor David H. Sherman. In D. H. Sherman's laboratory, his research focuses on the chemistry and biology of marine actinomycetes and the biosynthesis of the cyanobacterial polyketide/peptide tubulin-binding agent, cryptophycin. |
![]() David H. Sherman | David H. Sherman obtained his undergraduate degree in Chemistry at the University of California, Santa Cruz, and completed his PhD in the Department of Chemistry at Columbia University with Professor Gilbert Stork. After postdoctoral studies at the Yale University School of Medicine and MIT, Dr Sherman spent three years at Biogen Research Corporation, and in 1987 moved to the John Innes Institute, Norwich, England to work with Professor Sir David Hopwood in microbial genetics. He began his academic career at the University of Minnesota in 1990, and in 2003 moved to the University of Michigan, Ann Arbor where he is the John G. Searle Jr. Professor of Medicinal Chemistry. |
This molecular perspective has focused on some of the most pharmaceutically useful and structurally interesting terrestrial microbial metabolites belonging to the biosynthetic classes of polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs). Many of the complex, bioactive compounds isolated from marine organisms appear to arise from similar biosynthetic pathways, and may be approached using our knowledge of analogous terrestrial enzyme systems. Classical feeding studies using isotope labeled precursors continue to be a useful tool to elucidate the mechanisms of biosynthesis of marine metabolites from cultured organisms, but there is a growing trend towards the use of molecular genetics to identify biosynthetic pathways and novel enzymes. The ability to clone and heterologously express gene clusters encoding valuable metabolites may help to alleviate the critical supply problems associated with most marine natural products and provide contributions towards the “molecular toolbox” used in the genetic engineering of new drugs.
Currently, there are only three examples of published gene clusters encoding PKS and NRPS natural products from marine organisms, and these are exclusively from cultured marine bacteria.5–7 However, recent research suggests that the adaptation of metagenomic analysis, microbial genetic and culturing techniques to the unique systems in the marine environment will allow this number to increase dramatically in the coming years.8–10
Here we review some of the molecular approaches taken to understand the biosynthesis of metabolites encoded by PKS and NRPS gene clusters in marine organisms and present strategies for the discovery of additional pathways. Particular emphasis will focus on the structural parallels of terrestrial bacterial secondary metabolites and certain classes and types of marine natural products. The molecular underpinnings of these relationships will be addressed in two major areas: 1) natural product assembly line levels (“terrestrial” versus “marine” natural product biosynthesis), and 2) phylogenetic conservation of biosynthetic pathways. Stemming from this analysis we will discuss what role marine microbiology and bacterial genetics will play in the future of marine natural product drug discovery. In particular, how the power of these approaches will aid in realizing the potential of marine systems as a rich, tractable and sustainable source for drug discovery.
Compound | Probable biosynthetic route or compound type | Source organism | Disease target | Clinical phase status |
---|---|---|---|---|
Ziconotide (Prialt™) | Ribosomally produced peptide | Conus magnus (mollusc) | Neuropathic pain | III |
Ecteinascidin 743 (Yondelis™) | NRPS | Ecteinascidia turbinata (tunicate) | Cancer | II/III |
Bryostatin 1 | Mixed PKS/NRPS | Bugula neritina (bryozoan) | Cancer | II |
Dolastatin 10 | NRPS | Dolabella auricularia (mollusc, and later found in cyanobacteria) | Cancer | II |
Dehydrodidemnin B (Aplidine™) | NRPS | Aplidium albicans (tunicate) | Cancer | II |
Kahalalide F | NRPS | Elysia rufescens/Bryopsis sp. (mollusc/green alga) | Cancer | II |
Squalamine | aminosteroid | Squalus acanthias (shark) | Cancer | II |
IPL-576092 (HRM-4011A) | steroid | Petrosia contignata (sponge) | Psoriasis | II |
IPL-512602 | steroid | Petrosia contignata (sponge) | Asthma/inflammation | II |
IPL-550260 | steroid | Petrosia contignata (sponge) | Inflammation | I |
Halichondrin B derivative (synthetic, E7389) | PKS | Lissodendoryx sp. (sponge) | Cancer | I |
Discodermolide | PKS | Discodermia dissoluta (sponge) | Cancer | I |
Hemiasterlin derivative (HTI-286) | NRPS | Cymbastella sp. (sponge) | Cancer | I |
Bengamide derivative (LAF389) | Mixed PKS/NRPS | Jaspis sp. (sponge) | Cancer | I |
Agelasphin derivative (KRN-7000) | glycosphingolipid | Agelas mauritianus (sponge) | Cancer | I |
Laulimalide | PKS | Cacospongia mycofijiensis (sponge) | Cancer | Preclinical |
Vitilevuamide | NRPS | Didemnum cucliferum and Polysyncraton lithrostrotum (tunicates) | Cancer | Preclinical |
Diazonamide | NRPS | Diazona angulata (tunicate) | Cancer | Preclinical |
Eleutherobin | diterpene glycoside | Eleutherobia sp. and Erythropodium caribaeorum (soft corals) | Cancer | Preclinical |
Sarcodictyin | terpene | Sarcodictyon roseum (sponge) | Cancer | Preclinical |
Peloruside A | PKS | Mycale hentscheli (sponge) | Cancer | Preclinical |
Salicylhalimides A and B | PKS | Haliclona sp. (sponge) | Cancer | Preclinical |
ES-285 (Spisulosine) | alkyl amino alcohol | Spisula polynyma (mollusc) | Cancer | Preclinical |
Thiocoraline | NRPS | Micromonospora marina (bacteria) | Cancer | Preclinical |
The supply of marine metabolites being tested in the clinic is currently provided by several means: open aquaculture of the invertebrates (ET-743 and bryostatin), total synthesis (ziconotide, discodermolide, dolastatin 10, dehydrodidemnin, hemiasterlins), semi-synthesis (halichondrin B derivative, ET-743), and fermentation of producing microbes (thiocoraline).17 In general, fermentation has been the most viable method for production of natural products (penicillin, clavulanic acid, erythromycin, etc.), especially if the compounds proceed through clinical trials and are needed on a commercial scale (> kg).18 Analysis of the structural types present in Table 1 suggests that more than half are polyketides and/or peptide metabolites, with predicted origins stemming from modular PKSs and NRPSs. Our rapidly growing knowledge of modular biosynthetic systems may allow these gene clusters to be cloned and overexpressed in more amenable bacterial hosts. This approach could potentially provide a virtually unlimited supply of compounds and alleviate the need for culturing of the source macroorganisms.
Although many marine natural products appear to arise from multi-functional enzymes that are also present in terrestrial systems, many also possess a myriad of functional groups not previously described from terrestrial metabolites. These unique structural differences are inevitably reflected in novel biosynthetic pathways with genes encoding completely new enzymatic activities.6,7,19 Despite these differences, the presence of conserved regions of “terrestrial” biosynthetic genes can be used for probing analogous pathways in marine systems both in vivo through molecular cloning and in silico by analyzing sequenced genes and genomes for secondary metabolic enzymes and pathways.
The actions of the essential “core” domains present in types I and II PKS systems are presented schematically in Fig. 1. The initiation module begins with the actions of the acyl transferase domain (AT) selection of an activated acyl-CoA monomer (usually malonyl or methyl malonyl CoA). Following this selection step, the AT domain transfers the acyl-CoA to the adjacent “swinging arm” of the post-translationally (phosphopantethiene) modified acyl carrier protein (ACP). The chain is transferred from the ACP to the upstream ketosynthase domain (KS) required for catalyzing the decarboxylation of the carboxylic acid and subsequent Claisen condensation between the growing chain (or starter unit) and the downstream ACP tethered extender unit.
![]() | ||
Fig. 1 Modular biosynthetic pathway systems. (a) Polyketide synthase (PKS) organization and mechanisms. (b) Nonribosomal peptide synthetase (NRPS) organization and mechanisms. (KS: ketosynthase, AT: acyl transferase, ACP: acyl carrier protein, DH: dehydratase, KR: ketoreductase, ER: enoyl reductase, TE: thioesterase, MT: methyl transferase, C: condensation, A: adenylation, PCP: peptidyl carrier protein, E: epimerase, Cy: cyclization, Ox: oxidoreductase) (Figure adapted from Schwarzer et al., 2003).23 |
In type I PKSs, structural diversity is introduced in part by optional reductive domains acting upon the newly formed β-carbonyl after each extension reaction. The “reductive loop” can consist of one or more of the following activities: ketoreductase (KR) reduces the carbonyl to a hydroxyl group, a dehydratase (DH) dehydrates the alcohol to form a double bond and an enoyl reductase (ER) is responsible for the reduction of the double bond to a fully saturated methylene. The incorporation of unusual starter molecules (amino acids, fatty acid derivatives, etc.) and extension units also plays an important part in generating immense diversity in polyketide structures.28 Additional “auxiliary” domains consist of methyl transferases (MT) or other embedded activities.29 The complete elongated and functionalized chain is often transferred to a final thioesterase domain (TE) capable of catalyzing the hydrolytic release of a linear compound or a hydrolytic release coupled with cyclization to generate a macrolactone structure.
Like the Type I PKS systems, NRPSs are large, multifunctional enzyme complexes that build growing chains from individually selected building blocks. The NRPS is also organized into modules, each of which is responsible for one cycle of elongation by the incorporation of a single amino acid into the growing peptide chain. Each elongation module contains three essential domains: adenylation, thiolation and condensation. The adenylation domain (A) selects a specific amino acid and activates it as an amino acyl adenylate. The activated amino acid is then transferred to the phosphopantethiene group (“swing arm”) of the post-translationally modified peptidyl carrier protein (PCP) or thiolation domain (T). The condensation domain (C) catalyzes the peptide bond formation between amino acids in adjacent modules. The chain is elongated successively and released at the end of the “assembly-line” by the action of an integrated thioesterase (TE) domain or a separate TE generating either a linear or cyclic peptide. Additional structural diversity is introduced by modification of the growing chain catalyzed by a variety of embedded auxiliary domains such as epimerization (E), N-methylation (MT), cyclization (Cy), oxidoreductase (Ox), N-formylation (F) and reductase (R) domains.33 These domains provide further methods to deviate from the nonribosomal code, providing the incorporation of diverse amino acid functionality such as thiazoles, oxazoles, thiazolidines, oxazolidones, as well as a full array of N-methylated and D-amino acids not found in any other system throughout nature.33
The three complete gene clusters (PKS and NRPS) that have been reported from marine organisms were isolated from cultured actinomycetes and cyanobacteria, two microbial groups that are especially well known for their ability to produce diverse natural products. All of the clusters were identified from cosmid libraries by developing probes designed from conserved regions of biosynthetic domains from terrestrial microbial pathways. Although the quantity of published marine gene clusters is relatively small, we expect that there will be more examples in the near future. Work in our lab, in collaboration with William Gerwick's lab (Oregon State University), is currently involved with the identification of several new marine clusters for the microbial metabolites curacin A, carmabin and barbamide. In addition to complete clusters, there is an increasing number of examples of individual biosynthetic genes and putative sections of clusters that have been identified in several organisms, including dinoflagellates and a potential microbial symbiont of an invertebrate.9,10
The following marine bacterial gene clusters demonstrate that while PKS and NRPS biosynthetic routes are conserved between terrestrial and marine systems, in some cases there are additional, novel catalytic enzymes responsible for the unique functional groups found solely in marine natural products.6,7,39
The enc cluster consists of 20 ORFs spanning 21.3 kb and was overexpressed in Streptomyceslividans. The core structure of enterocin is synthesized by an uncommon benzoate starter unit and extended by 7 malonate molecules (Fig. 2). At least five ORFs present in the enc cluster are involved in the synthesis (EncP,H,I,J) and incorporation (EncL) of the benzoic acid unit which is derived from L-phenylalanine via a plant-like oxidative pathway.46,47 The biosynthesis of the unusual benzoyl-CoA starter unit is reviewed by Moore and Hertweck in an earlier issue of this journal and illustrated in Fig. 2.28 An expected dehydrogenase for oxidation of the β-keto group on β-ketophenylpropionyl-CoA was not found and cyclases which are typical of all other type II PKS systems were also absent.
![]() | ||
Fig. 2 Proposed biosynthetic pathway for enterocin and the benzoyl-CoA starter unit. EncP: phenylalanine ammonia-lyase, EncH: cinnamate-CoA ligase, EncI: cinnamoyl-CoA hydratase, EncJ: 3-keto-3-phenylpropionyl-CoA thiolase, EncABCD: minimal PKS, EncL: aromatic acyl transferase, EncM: oxygenase, EncK: methyl transferase, EncQ: ferredoxin, EncR: cytochrome P-450 hydroxylase. |
Earlier precursor feeding studies by Seto et al. suggested that enterocin is derived from a linear precursor which undergoes an unusual Favorskii-like carbon rearrangement.48 This rare carbon rearrangement reaction is proposed to be involved in the biosynthesis of several natural products, including the toxin okadaic acid isolated from dinoflagellates.49 It is presumed that such a rearrangement prevents the successive aldol condensations of reactive intermediates that typically lead to an aromatic structure in all other type II PKS systems. The identity of the putative “favorskiiase”, EncM, was shown by gene knock out studies and the successive isolation of wailupemycin D 5 (previously isolated from a natural strain) and novel non-rearranged analogues wailupemycins E–G 6–8.42 The encoded methyltransferase, EncK, was proposed to be responsible for O-methylation of the α-pyrone ring and is possibly linked to the EncM mediated rearrangement leading eventually to enterocin, 5-deoxyenterocin 9 and 3-epi-5-deoxyenterocin 10.19 The enterocin cluster is the first example of a “derailed” aromatic polyketide synthase which encodes natural chemical diversity by the versatility of associated tailoring enzymes and pathway branch points.
The 34 kb grh type II PKS cluster was localized to a single cosmid and contained 33 ORFs. The loading molecule is an acetate which is elongated by 12 malonyl-CoA molecules via the core PKS genes (grhABC) (Fig. 3). The ACP gene grhC is unusual due to its location 26 kb upstream of the ketosynthase domains grhA and grhB. The extended chain is then putatively cyclized by products of two different aromatase genes, grhE and grhQ, and the associated genes grhSTL to yield a pradimicin-like precursor. An unprecedented number of genes encoding a variety of oxidoreductases (11) were identified as responsible for the many oxidative steps required to produce the epoxyspiroketal and naphthoquinone ring systems. The authors hypothesize that one possible oxidative degradation pathway may involve a retro aldol reaction before two Baeyer–Villiger oxygen insertions leading to the excision of a C2 unit and subsequent cyclization to form the spiroketal. A putative P450 monooxygenase is suggested to catalyze the epoxidation of the C19 double bond. The identification of the cluster was confirmed by overexpression of the cosmid clone in S. lividans and production of griseorhodin A 11.
![]() | ||
Fig. 3 Proposed biosynthetic pathway for griseorhodin A. |
In order to gain understanding of the origin of the unusual trichloro-amino acid and biosynthesis of this group of compounds, Sitachitta and coworkers conducted several biosynthetic studies with a Curaçao strain of L. majuscula that produced the lipopeptide barbamide 14 in culture.61 Stable isotope labeled precursor feeding experiments established that the trichloromethyl moiety was derived from the pro-R methyl group of L-leucine.61 Similar experiments with labeled [2-13C] 5,5,5-trichloroleucine demonstrated high incorporation rates, suggesting its role as a direct intermediate. Other feeding studies showed the additional incorporation of L-cysteine, L-phenylalanine, acetate and S-adenosylmethionine, signifying a mixed PKS/NRPS biosynthetic origin.61
An interest in the unusual biosynthesis and bioactivity of barbamide led Sherman and coworkers to identify and clone the barbamide gene cluster.7 One of the challenges associated with the cluster was the variety of compounds that were produced by the same strain and were presumably also the products of mixed PKS/NRPS systems including curacin A, carmabin, antillatoxin and malyngamide. The number of clones positive to both the PKS and NRPS probes was narrowed down by focusing on adenylation sequences specific for cysteine, presumed to be involved in thiazole ring formation.
The bar gene cluster was found to span 40 kb and was isolated as two cosmid clones. Twelve ORFs, designated barA–barH, were identified as the putative core genes involved in barbamide production (Fig. 4). Leucine is presumably selected and activated by the BarD adenylation domain and loaded onto the BarA PCP. The leucine is then possibly chlorinated by the actions of BarB1, BarB2 and BarC and transferred to the NRPS/PKS hybrid loading module, BarE. While the trichloroleucine is tethered to the PCP of BarE, BarJ may oxidatively deaminate the amino acid. A rare one-carbon truncation is postulated to then occur by decarboxylation of the α-keto intermediate. The resulting trichloroisovaleryl group may be extended by a malonyl unit forming a diketide. An O-methylation domain in BarF could subsequently catalyze formation of the enol methyl ether to stabilize the enol form of the β-carbonyl. The role of BarF in the formation of the energetically unfavorable C4–C5 E double bond is not yet understood. The chain is further extended by phenylalanine which is presumably N-methylated by the actions of the NRPS module BarG and then condensed with a final cysteine residue which is cyclized to form the thiazoline ring. According to this putative biosynthetic plan, the molecule must undergo a final decarboxylation to yield barbamide, which may be catalyzed by a potential oxidative domain (BarJ). In the absence of gene disruption or heterologous expression, the identification of the cluster was supported by expression of the NRPS adenylation domains and subsequent assay for ATP-[32P]PPi exchange activity. The amino acid specificity of each A domain (leucine, trichloroleucine and cysteine) correlated with the putative biosynthetic assembly of barbamide.
![]() | ||
Fig. 4 Proposed biosynthetic pathway for the cyanobacterial lipopeptide barbamide. Reprinted from Chang et al., 2002 with permission from Elsevier.7 |
The gene clusters shown above clearly reveal that at least some marine bacteria utilize the same modular biosynthetic pathways that are used by terrestrial bacteria. This realization demonstrates that the growing number of novel, cultured marine bacteria may provide a completely new, tractable resource for molecular genetic studies. Additionally, there is tantalizing evidence that many compounds from marine invertebrates may also be produced by bacteria (see below): such pathways could potentially be obtained using similar molecular techniques to harness an even greater wealth of chemical and enzyme diversity.
The “taxonomic chemistry tree” in Fig. 5 illustrates the resemblance between marine invertebrate and microbial compounds. The tree, organized phylogenetically by source, shows a variety of natural products that are presumably produced by PKS and NRPS pathways. The tree illustrates natural products from a variety of marine organisms, highlighting compounds isolated from marine macroorganisms versus terrestrial analogues produced entirely by microorganisms. In some cases the compounds are identical, such as westiellamide6717 and cycloxazoline6818 found in a terrestrial cyanobacterium and marine ascidian, respectively. More commonly, there are specific functional groups or skeletal systems that are similar, for example: the enediyne “warhead” moiety in calicheamicin6919 and namenamicin7020, the isoquinoline ring system of saframycin B7121 and ecteinascidin-74372,7322, and the overall structural analogies between nodularin7423 and motuporin7524.
![]() | ||
Fig. 5 Marine natural products and structural analogues from terrestrial microbes arranged in a “chemo-taxonomic tree”. Compounds isolated from prokaryotes (bacteria) are shown in the lower half and metabolites from sponges and ascidians (multicellular eukaryotes) are shown in the upper half of the diagram. |
Unson and coworkers isolated the poly-brominated biphenyl ether 25 in the cyanobacterial symbiont Oscillatoria spongeliae separated from the sponge Dysidea herbacea.82 The same group also localized the chlorinated amino acid derivative 13-demethylisodysidolide 12 from O. spongeliae separated from another specimen of D. herbacea.58 The cyclic peptide theopalauamide 26 was isolated from filamentous δ-proteobacteria separated from the sponge Theonella swinhoei while the cytotoxic polyketide swinholide A 27 was isolated from the fraction containing a mixture of unicellular bacteria from the same sponge.63,81 The polyketide latrunculin B 28 was localized in situ within sponge cells of Negombata magnifica using gold-labeled anti-latrunculin antibodies.76
While the above examples suggest that the compounds are associated with specific bacteria or invertebrate cells, the possibility that the compounds were synthesized in another cell type and translocated to those storage cells cannot be ruled out. Theoretically, if a sponge or ascidian has evolved to harbor a microbial symbiont that produces a useful metabolite, it could be advantageous for the invertebrate to adsorb the compound onto its cell surfaces to prevent fouling or uptake it into specific cells for storage and protection from predation, for example.83 Virtually nothing is known about the transport of natural products or their precursors between invertebrate cells and bacteria. Terrestrial bacteria are especially well known for their ability to excrete natural products into the environment as exemplified by self resistance export mechanisms and the extra-cellular occurrence of many antibiotics produced in fermentation such as penicillin, streptomycin and mitomycin C.84,85
It is absolutely relevant to study where and how secondary metabolites are stored to understand their roles in the ecology and biology of the invertebrates and/or their associated microbes. However, the only ways to determine the true source of secondary metabolites are to culture the symbiont and demonstrate compound production or utilize the molecular genetic tools that are now available to identify and locate the biosynthetic clusters.
The major steps involved in developing an integrated biosynthetic study to obtain the gene cluster and source information are shown in Fig. 6. The first step in a search for any specific gene cluster is the prediction of the biosynthetic pathway based on our growing knowledge of other modular systems. This analysis is critical to identify specific, key enzymes that may be used as molecular probes to screen genomic libraries.
![]() | ||
Fig. 6 Overview of a molecular genetic/cloning strategy for marine compounds. |
Terrestrial compound | Microbial source(s) | Biosynthetic pathway | Related marine compound | Microbial/invertebrate sources |
---|---|---|---|---|
staurosporine8631 | Streptomyces staurosporeus | Shikimic acid87 | staurosporine8831 11-hydroxystaurosporine8932 | marine actinomycete ascidian (Eudistoma sp.) |
saframycin Mx190,9130 saframycin B9221 | Myxococcus xanthus Streptomyceslavendulae | NRPS93 | renieramycin E9429 ET-74372,7322 | sponge Reniera sp. ascidian Ecteinascidia turbinata |
pederin9533 | Endosymbiont of the Paederus blister beetle, most similar to Pseudomonas aeruginosa | Mixed type I PKS/NRPS96 | mycalamide A9734 onnamide A9835 theopederin9936 | sponge (Mycale sp.) sponge (Theonella sp.) sponge (Theonella sp.) |
calicheamicin6919 | Micromonospora echinospora | Iterative type I PKS100,101 | namenamicin7020 shishijimicin B10237 | ascidian (Polysyncraton lithostrotum) ascidian (Didemnum proliferum) |
nodularin7423 | Microcystis aeruginosa Planktothrix agardhii | Mixed type I PKS/NRPS103,104 | motuporin7524 | sponge Theonella swinhoei |
One example that deserves further discussion is the potent anti-tumor alkaloid ET-743 22. The ecteinascidins (ETs) are a group of bioactive tetrahydroisoquinoline metabolites that were isolated from the Caribbean ascidian Ecteinascidia turbinata in low yields.72,73 ET-743 is the most bioactive analogue and is one of the most clinically advanced marine natural products. The complex core structure bears remarkable similarity to several sponge compounds (i.e. renieramycin 29) and terrestrial microbial metabolites such as saframycin B 21 and saframycin Mx1 30 isolated from Streptomyceslavendulae92 and Myxococcus xanthus, respectively.90,91 Currently the compound is supplied by a relatively efficient semi-synthetic strategy starting with the microbially produced cyanosafracin B obtained from Pseudomonas fluorescens.108
Despite the synthetic accessibility of ET-743, there is interest in identifying and isolating a presumed microbial symbiont from the ascidian that produces the compound in culture. A study by Moss et al. utilized 16S rDNA libraries, in situ hybridization and microscopy to demonstrate the consistent association of the ascidian larvae with a γ-proteobacterium denoted ‘Candidatus Endoecteinascidia frumentensis’.109 Additional circumstantial support for microbial production of ET-743 in the ascidian is preliminary work showing the consistent presence of both the ecteinascidin compounds and a single major strain of bacteria in the larval tissue.110
Biosynthetic feeding studies of the saframycins/ecteinascidin compounds have shown that the core structure is formed by the condensation of two tyrosine derivatives via a diketopiperazine intermediate and the incorporation of glycine and alanine.111,112 Additionally, the methyl groups are known to be derived from S-adenosyl methionine. Cysteine was also found to be incorporated into the ecteinascidins.112 The saframycin Mx1 gene cluster is in good agreement with the precursor studies, encoding two NRPS proteins that incorporate the successive condensation of AlaGlyTyrTyr as specified by the 4 adenylation domains. An O-methyl transferase was also found to be necessary for saframycin Mx1 production. Although the ecteinascidins possess additional functional groups (exocyclic isoquinoline and sulfur bridge), the genetic structure of the core ET pathway provides the sequence information necessary to probe the ascidian metagenome to potentially identify the biosynthetic source of the ecteinascidin compounds.93
Marine compound | Microbial/invertebrate sources | Proposed biosynthetic pathway | Terrestrial compound | Microbial source(s) |
---|---|---|---|---|
arenastatin11338 | sponge (Dysidea arenaria) | Mixed type I PKS/NRPS | cryptophycin-2411439 | Nostoc sp. |
alteramide11540 discodermide11641 | bacterium (Alteromonas sp.) sponge (Discodermia dissoluta) | Mixed type I PKS/NRPS | ikarugamycin11742 | Streptomyces sp. |
jaspamide11843 | sponge (Jaspis splendins) | Mixed type I PKS/NRPS | chondramide11944 | Chondromyces crocatus |
sphinxolide12045 | sponge (Neosiphonia superstes) | PKS | scytophycin12146 | Scytonema pseudohofmanni |
sphinxolide12045 | sponge (Neosiphonia superstes) | PKS | rhizopodin12247 | Myxococcus stipitatus |
One model system in this category includes the Dysidea sponge compound arenastatin11338, identical to cryptophycin-2411439 produced as a minor compound by terrestrial Nostoc cyanobacteria. Interestingly, one of the strains that produces the cryptophycins, Nostoc sp. 53789, was isolated as the symbiont of a lichen.123 The Nostoc cyanobiont in this case is much more tractable because it is available in pure culture, can be easily grown in the laboratory for unlimited sources of DNA, and may be amenable to genetic manipulations. The sponge, however, must be obtained by scuba, cannot be grown in a lab, and contains diverse populations of bacteria.124 Identification of the cryptophycin gene cluster first could provide specific probes to localize the presumably similar cluster in the sponge. Additionally, the similarity in structures suggests that the sponge symbiont responsible for arenastatin production is probably also a cyanobacterium and this information is currently directing efforts in our lab to culture the producing organism.125
The proposed PKS gene cluster for the biosynthesis of discodermolide 50 and analogues 52–56 is shown in Fig. 7. The predicted starter unit is acetyl-CoA which is presumably elongated by 4 malonate and 7 methyl malonate subunits. A unique feature of the biosynthesis is that it includes three trans–cis isomerizations that may occur during the reductive loop of the β-carbonyl through the stereochemically specific reduction by the KR and subsequent dehydration or possibly by a separate isomerase. Another unprecedented component of the proposed biosynthetic pathway is the predicted formation of the 8,9 cis double bond by dehydration in the direction of the growing chain, rather than towards the “starting” end (indicated by * in Fig. 7). Presumably the final linear chain undergoes an intramolecular cyclization to form the terminal 6-membered lactone ring and is simultaneously cleaved from the protein complex by a terminal thioesterase (TE) domain. Tailoring enzymes, including a C-methyltransferase and carbamoyl transferase may then elaborate the core structure to add a methyl at C-3 and carbamate moiety to the hydroxyl at C-19, respectively. A minor component, 9(13)-cyclodiscodermolide 56, may be formed by an intramolecular C–C bond formation by a cation–π cyclization (typical in terpene cyclases) to form the new 5-membered ring.
![]() | ||
Fig. 7 Predicted biosynthetic pathway for discodermolide and derivatives. (The cluster probably contains several genes but the organisation/number of modules cannot be predicted and it is therefore depicted as a single gene for simplicity) |
One of the technical challenges associated with a search for this putative cluster is that the sponge is presumably associated with many different microorganisms, both dietary and symbiotic, and may require an intractable number of cosmid clones to adequately cover all of the possible genomes present (including that of the eukaryotic sponge, estimated to be 1.7 Gb in some species based on cellular DNA content).132 One approach to reduce the size of a potentially unmanageable genomic library for suspected sponge symbionts is to separate the major populations of microbes from the sponge, shown to be feasible in the previously discussed cellular localization studies. Bacterial enrichment can be accomplished with dissociated cells by fluorescence activated cell sorting (FACS, for photosynthetic organisms),58,82 centrifugation and/or filtration,81 and density gradients.133 An alternative metagenomic approach was recently used successfully by Piel to isolate the PKS gene cluster encoding the toxin pederin 33 from a mixture of terrestrial beetle tissue and one major species of symbiotic bacteria.96 A large cosmid library was constructed (80,000 clones) and a rapid PCR based screen using a “pooling” method was used to quickly isolate the positive clones.96 Although the beetle was previously shown to possess only a single symbiont, far fewer than any marine sponge, such a metagenomic strategy may be successful if probes are carefully designed to be specific for the pathway of interest. In this example for discodermolide, a combination of ketosynthase domains, C-methyl transferase genes and the carbamoyl transferase could be used to target the cluster. The carbamoyl transferase would be a particularly good handle because it produces a rare functional group, present only in a few microbial natural products such as novobiocin,134a bleomycin,134b cephamycin C134c and mitomycin C.134d
The proposed NRPS gene cluster for the biosynthesis of didemnin B/dehydrodidemnin B is shown in Fig. 8. This retro-biosynthetic analysis takes into consideration known progenitors, which point to the suggestion that N-methyl leucine serves as the starter unit for the NRPS assembly-line. Didemnin A, found in the highest yield from the Caribbean tunicate, contains only the core peptolide (no side-chain), suggesting that this side-chain is not integral to the main didemnin B assembly line. From this start point, a series of extensions of proteinogenic (L-leucine, L-proline and threonine), nonproteinogenic amino acids (isostatine, N-methyl tyrosine, N-methyl leucine) and a fatty acid component (hydroxylvaleryl isopropionate) are predicted to occur. Following the build-up of the assembly line a thioesterase enzyme is predicted to catalyze the release of the enzyme thioester-linked heptadepsipeptide coupled with a macrocyclization reaction. This reaction would generate the didemnin A structure that may serve as a substrate for post core-assembly enzymes with varying specificities for side-chain groups. From this proposed molecular assembly sequence, the known diversity of the didemnin family of marine natural products could conceivably arise from one core biosynthetic system (Fig. 8).
![]() | ||
Fig. 8 Predicted biosynthetic pathway for didemnin B and derivatives. (NMe-D-Leu: N-methyl-D-Leucine, Ist: Isostatine, Hip: Hydroxylvaleryl isopropionyl, O-Me-L-Tyr: O-methyl-L-Tyrosine) |
The recent advances in nonribosomal peptide biosynthesis and methodologies to generate modifications of these pathways make biologically active marine depsipeptides like the didemnins particularly attractive targets. A guided search for the didemnin B genes could involve utilizing the inherent co-linearity rules for nonribosomal peptide biosynthesis coupled with analysis of the nonribosomal code.137,138 An example of a feasible approach would follow a random genetic amplification and in silico study of A (adenylation) domains obtained using metagenomic template DNA from the tunicate. Such a strategy would afford a rapid snap-shot of all possible NRPS “gatekeeper” domains (A domains) from the tunicate. More importantly, this approach would allow for honing in on particular chemical functional groups present in the natural didemnins. One particular chemical handle is the N-methyl leucine and its relative position in these marine natural products. From the pool of A domains, an in silico candidate would be selected on the basis of having a nonribosomal code specific for leucine (Aleu). Following this, libraries containing material enriched for bacterial DNA would be screened using this specific domain. Once positive clones were identified using the predicted Aleu sequences, a genetic walk (DNA sequencing) immediately upstream and downstream could provide additional cluster information. If the “downstream” sequences revealed an embedded N-methyltransferase domain, it is plausible that the cloned DNA relates to didemnin B biosynthesis. Information gathered from peptide antibiotic clusters has established that well known N-methylations of amide bonds in peptide antibiotics occur by such embedded domains.137 To complete the cluster, the entire cosmid DNA would be sequenced and the resulting sequences checked against a pathway prediction such as the one shown in Fig. 8.
The paradigm of biodiversity translating to chemodiversity is also clearly evident from explorations of natural products from organisms in the marine environment.35 Soft bodied invertebrates, in particular, have yielded a wide variety of biologically active structures. From an ecological standpoint, such organisms may have evolved the ability to harbor or produce chemicals (such as defensive or allelopathic compounds) that promote their overall fitness. The similarity of many “invertebrate” compounds to those isolated from terrestrial microbes (Fig. 5) led to the hypothesis that associated microorganisms might be responsible for their production. Sessile invertebrates such as sponges and ascidians are especially known to associate with diverse populations of bacteria. In cases where there are clear parallels between marine and terrestrial compounds (i.e. arenastatin and cryptophycin) it is logical to infer that the marine metabolite is produced by an associated symbiont. In the majority of other peptide and polyketide compounds with no known terrestrial analogues, it is possible to invoke a feasible modular assembly system. These complex biosynthetic enzymes (PKS and NRPS) are only known to be encoded by the genomes of microorganisms.
Studies have shown that marine invertebrates house extremely high levels of microbial diversity and are a logical source for culturing and molecular genetic studies.36,115,143–146 Current research suggests that unique microorganisms are also abundant in all areas of the ocean, and two new actinomycete genera have been recently suggested.37,125 More importantly, these new organisms are being investigated for their production of new compounds, and at least one published study of compounds isolated from the newly proposed genus Salinospora has led to the elucidation of novel bioactive metabolites.147 The number of unique natural products from cultured marine microbes with promising biological activity is growing and may translate to a new resource for drug discovery.40
We expect that the combined knowledge of microbial biodiversity and the ability to target specific biosynthetic genes will provide the necessary tools to identify and culture microbes that produce novel, potent pharmaceuticals on an economically viable scale. The decreasing costs associated with whole genome sequencing will also allow for a much greater number of bacterial and possibly invertebrate genomes to be sequenced, providing a genetic basis to identify and clone new pathways. Thus, the synergistic efforts of both biological and chemical approaches will provide insight into the biosynthesis and eventual production of some of the most structurally complex, bioactive marine natural products.
One thing that we have learned from terrestrial systems is that the search for specific genes in an organism or complex assemblage not only sets the stage for exploring natural diversity but provides a framework for understanding biosynthesis at its most fundamental level. Natural products discovery has greatly benefited from the investigations of the marine environment and the understanding of the roles that secondary metabolites may play in the ecology of marine organisms. Moving forward and tapping into the diversity and metabolic talents of the vastly under-explored world of marine microorganisms will undoubtedly provide a barrage of new natural products and should lead to high quality bioactive drug leads. In a greater sense, this type of investigative work could revitalize drug discovery from nature and provide us with the genetic storehouse of PKS and NRPS enzymes for understanding the roles and true potential of secondary metabolites from marine ecosystems.
This journal is © The Royal Society of Chemistry 2004 |