Open Access Article
Teresa M.
Dreckmann
a,
Lisa
Fritz
b,
Christian F.
Kaiser
a,
Sarah M.
Bouhired
a,
Daniel A.
Wirtz
a,
Marvin
Rausch
b,
Anna
Müller
b,
Tanja
Schneider
bc,
Gabriele M.
König
a and
Max
Crüsemann
*a
aInstitute of Pharmaceutical Biology, University of Bonn, Nussallee 6, 53115 Bonn, Germany. E-mail: cruesemann@uni-bonn.de
bInstitute for Pharmaceutical Microbiology, University Hospital Bonn, University of Bonn, Meckenheimer Allee 168, 53115 Bonn, Germany
cGerman Center for Infection Research (DZIF), Partner Site Bonn-Cologne, Bonn, Germany
First published on 16th August 2024
Corallorazines are cyclic lipodipeptide natural products produced by the myxobacterium Corallococcus coralloides B035. To decipher the basis of corallorazine biosynthesis, the corallorazine nonribosomal peptide synthetase (NRPS) biosynthetic gene cluster crz was identified and analyzed in detail. Here, we present a model of corallorazine biosynthesis, supported by bioinformatic analyses and in vitro investigations on the bimodular NRPS synthesizing the corallorazine core. Corallorazine biosynthesis shows several distinct features, such as the presence of a dehydrating condensation domain, and a unique split adenylation domain on two open reading frames. Using an alternative fatty acyl starter unit, the first steps of corallorazine biosynthesis were characterized in vitro, supporting our biosynthetic model. The dehydrating condensation domain was bioinformatically analyzed in detail and compared to other modifying C domains, revealing unreported specific sequence motives for this domain subfamily. Using global bioinformatics analyses, we show that the crz gene cluster family is widespread among bacteria and encodes notable chemical diversity. Corallorazine A displays moderate antimicrobial activity against selected Gram-positive and Gram-negative bacteria. Mode of action studies comprising whole cell analysis and in vitro test systems revealed that corallorazine A inhibits bacterial transcription by targeting the DNA-dependent RNA polymerase.
The myxobacterial strain Corallococcus coralloides B035, isolated from a Belgian soil sample, is known to synthesize the polyketide/nonribosomal peptide hybrid molecule corallopyronin A,4 which is a promising antibiotic in the fight against filariasis.5,6 During investigations on the corallopyronins, a novel group of secondary metabolites named corallorazines was isolated from C. coralloides B035 (Fig. 1).7 The structure of the main metabolite corallorazine A was identified as a dipeptide core, composed of N-methylglycine and dehydroalanine (Dha) cyclized via a hemiaminal forming a piperazine ring. The Dha residue is additionally acylated with the rare branched fatty acid (2E,4Z)-iso-octa-2,4-dienoic acid. The less complex derivatives corallorazine B and C were identified as probable intermediates of corallorazine biosynthesis, as both are lacking the glycine residue and the latter compound also the N-methyl group. A first investigation into the biosynthetic origin of corallorazine A was carried out by employing feeding experiments with 1-13C labeled glycine, leading to a significant enhancement of the respective glycine carbon signal as well as that of the Dha residue in the 13C NMR spectrum. The authors thus postulated the initial incorporation of serine, which is linked to glycine biosynthesis by the serine-hydroxymethyl transferase, and a subsequent dehydration event to form the final Dha building block. More recently, we fully sequenced C. coralloides B035, allowing us to mine its genome for corallorazine biosynthesis genes.8
![]() | ||
| Fig. 1 (A) The crz BGC. (B) Model of corallorazine A–C biosynthesis. The double bond resulting from dehydration of L-serine is marked in red, the N-methyl group resulting from SAM-dependent methylation of glycine is marked in blue. Abbreviations are explained in the box. A more detailed model of corallorazine biosynthesis is depicted in Fig. S1 (ESI†). | ||
Here, we report the identification and detailed analysis of the corallorazine BGC. We provide a biosynthetic model and investigate corallorazine biosynthesis by in vitro experiments on purified non-ribosomal peptide synthetase (NRPS) multidomain proteins. We perform detailed bioinformatic analyses of the dehydrating C domain Crz1C2 and compare it with other modifying C domains. We analyze the distribution of the corallorazine gene cluster family and its encoded chemical diversity. Furthermore, we re-evaluate the antibacterial activity of corallorazine A, revealing activity towards Staphylococcus aureus strains and identify RNA polymerase as antibiotic target.
The supercluster containing the candidate NRPS genes comprises 38 open reading frames (ORFs), spanning over 87,792 kb. We assigned eight of these, namely crz1–crz8, to be putatively involved in the biosynthesis of corallorazine A (Fig. 1(A) and Table S1, ESI†). The core biosynthetic genes crz1 and crz2 encode for a bimodular NRPS system (Fig. 1(B)). The encoded megaenzyme Crz1 has the domain sequence C1-A1-T1-C2-A2-nMT, suggesting that the second module is split across two ORFs, as its thiolation (T) domain is absent. A preliminary condensation (C) domain analysis with representative sequences retrieved from NaPDoS10 revealed that C1 clades with CStarter domains, catalyzing N-acylation of the adjacent T domain-bound amino acids, whereas C2 is classified as a CmodAA domain. Members of this domain subfamily have been long postulated to catalyze dehydration of carrier protein-bound serine and threonine to yield Dha and dehydrobutyrine, respectively.10 This hypothesis was recently experimentally confirmed for the biosyntheses of albopeptide11 and the antibiotic 2-amino-4-methoxy-trans-3-butenoic acid,12 in the latter case followed by structural characterization of the dehydrating domain. The proposed substrate specificities of the Crz A domains A1 and A2 were L-Ser and Gly. The A2 domain additionally harbours a type I SAM-dependent N-methyltransferase (nMT) domain.13 The much smaller ORF crz2 only encodes the T domain of the second module, as well as a terminating thioreductase (TD) domain, which is known to catalyse the reductive release and cyclisation of nonribosomal peptides at the expense of NAD(P)H.14 Taken together, all these features align well with formation of the dipeptidic Dha-N-methyl-Gly core of corallorazine A.
The genes crz3, crz4, crz5, and crz6 encode enzymes that are likely involved in the formation of the unusual acyl residue (2E,4Z)-iso-octa-2,4-dienoic acid. This residue shows significant similarity to branched, dehydrated fatty acid residues found in several lipopeptide antibiotics, such as cadaside, ramoplanin, friulimicin, laspartomycin, and malacidin.15,16 The substrates for its biosynthesis likely originate from fatty acid metabolism through the condensation of isovaleryl-CoA with two malonyl-CoA units. The genes crz4 and crz5 encode acyl-CoA dehydrogenases, predicted to introduce double bonds in the acyl precursor. Crz4 is predicted to use isovaleryl-CoA as a substrate, while no specific substrate could be predicted for Crz5. In the next step, the acyl-CoA ligase Crz3 is expected to transfer the CoA-bound acyl chain to the acyl carrier protein Crz6, which would ultimately serve as a substrate for Crz1 C1 (Fig. 1(B) and Fig. S1, ESI†). crz7 and crz8 encode a cytochrome P450 monooxygenase and the corresponding flavine mononucleotide reductase. P450 monooxygenases are known to catalyse different reactions in NRP biosynthesis, such as β-hydroxylation17 and C–C or C–O–C cyclization.18 In this case, however, no distinct function could be assigned for Crz7, as the predicted functions for Crz1–6 are sufficient to pose a comprehensive scheme for corallorazine biosynthesis, which is fully depicted in Fig. S1 (ESI†).
:
1 molar ratio and then assessed for adenylating activity with glycine as substrate. Strikingly, in this setup, we observed an almost quantitative substrate turnover in the γ18O4-ATP exchange assay. To our knowledge, this is the first experimental report of a split, yet functional A domain, encoded on two different genes. This unusual architecture might be the consequence of an integration of the nMT domain or an A subdomain recombination event during evolution, which both were shown to predominantly occur between the motifs A8 and A9.20
Next, we conducted a CStarter domain assay, that had previously been established in our laboratory,21 to test the activity of Crz1-C1. Due to the unavailability of its proposed substrate, (2E,4Z)-iso-octa-2,4-dienoic acid, which is predicted to be bound to acyl carrier protein Crz6, we used octanoyl-CoA as alternative substrate together with L-Ser, and ATP for Crz1.1. Indeed, the activated fatty acid was accepted and we could subsequently detect the formation of octanoyl-serine in the assay (Fig. 2(B)), confirming the proposed function of the Crz-C1 starter domain and offering the possibility for the targeted generation of corallorazine analogues with altered fatty acid tails.
CmodAA domains can further be distinguished by the final modification of their substrate amino acid in addition to simple dehydration. In a previous study, 27 domains from the CmodAA subtype were identified and analysed with regard to their type of modification.12 We aimed to get a more comprehensive picture and expanded this set with 23 additional sequences of potential CmodAA domains, including Crz1-C2, that were identified by a combination of literature and MIBiG23 analyses (Table S2, ESI†). The resulting phylogenetic tree (Fig. 3) shows that domains catalyzing conjugate addition instead of peptide bond formation and pyrimidine formation each formed their own clades. The pyrrolizidine formation group, which was now extended to six domains, also formed its own clade, while Crz1-C2 was repeatedly grouped with AmbE C1.
![]() | ||
| Fig. 3 Phylogenetic analysis of CmodAA domains. A maximum likelihood tree of 50 CmodAA domains was generated using one randomly selected dual E/C domain as outgroup. A total of 100 bootstrap replicates were performed and bootstrap values above 75 are given on corresponding branches. Domains were assigned colours according to the taxonomic classification of the source organism at phylum level, and by final modification of the amino acid they act on. Conserved residues detected in this study are depicted next to each domain. Visualisation was done with iTol v6.24 | ||
In order to identify conserved amino acid residues and motifs within CmodAA domains, potentially offering clues to their mode of action, the alignment (Fig. S5 and S6, ESI†) was further investigated and also compared to DCL and dual E/C domain sequences, related subclasses within the D-clade (Fig. S7, ESI†). Previously, several well conserved residues located near the active centre or the donor side tunnel of AmbE-C1 were identified.12 For example, mutagenesis of active site residues R1605 and H1632 significantly reduced the activity of AmbE-C2, indicating their importance for the function of CmodAA domains. In our alignment, R1605 (K1325 in Crz1-C2), that was predicted to take part in substrate positioning,12 shows functional conservation in all analysed CmodAA domains (78% K, 22% R). Several other residues were found to be highly to moderately conserved among all CmodAA domains in the alignment (Fig. 3). Noticeably, some of these were located in inter-motif regions, and partly arranged in motif-like sections that were not present in this form in any of the other representatives of the D-clade (Fig. S7, ESI†). Particularly striking was a section directly located N-terminal of the C5 motif, which extends over approx. 8 amino acids and, in a structural model generated with AlphaFold, is part of a helix on the donor side of the substrate tunnel (Fig. S8 and S9, ESI†).
This helix contains the highly conserved K/R mentioned above, but also a noteworthy histidine (H1332), which is present in 48 of the 50 analyzed CmodAA domains. In our structural model, the side chain of H1332 is oriented in the same direction as that of K1325 (Fig. S9, ESI†). Together, K1325 and H1332 seem to delineate the beginning and end of a motif-like region characterized by the consensus sequence [K/R]SVLLAAH, which is, in the model structure, in close proximity to conserved T (T1347, 82%), E (E1393, 88%), and Y (Y1489, 90%) side chains.
The HHxxxDG motif in the active site is crucial for the activity of canonical C domains.25 Our alignment does show strict conservation of the DG motif in CmodAA domains, but not of both histidines. In seven cases, the first His, which is thought be crucial for structural integrity, is missing, while in three other cases the second His is mutated. In FclJ-C2 and Zmn17-C2, both catalyzing side chain conjugation, the first His is exchanged by a Cys. Given that Cys can also form hydrogen bonds, it might substitute for His in maintaining structural integrity under certain conditions. Interestingly, TnaC-C3, the third domain in this group, retains both histidines but features a cysteine directly downstream of the second His. Domains involved in conjugate addition (BlmVI-C2, TlmVI-C2, and ZbmVI-C2) lack both histidines, and feature Phe and Gly or Ala instead. Here too, the presence of Phe could induce an active site conformation which promotes addition to the side chain. The absence of the second His, which is attributed a central role in peptide bond formation, aligns with the altered functions of these domains.
Further specific residues for CmodAA domains were located in close proximity to the donor substrate tunnel: these include an R (96%) at the beginning of the C6 motif, followed by a less conserved R (78%), corresponding to R1400/R1401 in Crz1, and R1679/R1680 in AmbE, respectively. The model shows both arginine side chains to be located near the substrate tunnel entrance and are facing the side chain of a conserved E (E1365, 88%) located directly opposite. Mutating R1679 or R1680 to Ala had no significant effect on its activity.12 However, it is feasible that one Arg might substitute for the other. In fact, each of the CmodAA domains contains either both or at least one of the arginines, which supports their predicted importance for CmodAA domain functionality.
![]() | ||
| Fig. 4 Cytoscape network visualization of homologous corallorazine BGCs. Nodes correspond to BGCs containing at least one gene that is homologous to a gene in the corallorazine BGC (A1, red circle outline). Information on host strain and presence of crz1–8 homologues is displayed in Table S3 (ESI†). Classification in clade I, II or III is indicated by colored areas. Examples for characteristic clusters from these clades are shown in this figure and in Fig. S11 (ESI†). Length of the connections between nodes correlates with the degree of relationship according to BiG-SCAPE analysis. | ||
Clade I consists of BGCs harbouring homologues of all eight crz genes, with crz7 duplicated in most Streptomyces species. The most intriguing additional feature is an additional monooxygenase domain (MOx) within A2, detected in 38 pathways, notably all from members of the Gammaproteobacteria genera Serratia and Pseudomonas. This MOx domain was previously reported from myxothiazol and melithiazol biosynthesis, where it is also encoded within glycine-recruiting A domains.27 In the respective pathways, the MOx domain hydroxylates Cα, leading to the spontaneous dissimilation of an amine due to the unstable N–Cα bond. The remaining T domain-bound glyoxylate is subsequently released by the terminating TE domain. The presence of additional MOx domains in crz-like BGCs suggests the formation of derivatives with truncated, linear peptide cores by these pathways, similar to the shunt products corallorazine B/C.
Clade II of the network analysis contains BGCs that encode homologues of crz1 and crz2, but mostly lack genes corresponding to crz3–6, suggesting production of molecules with a cyclic dipeptide core, but lacking the acyl side chain. Interestingly, genes encoding for a P450 monooxygenase and the associated FMN reductase, corresponding to crz7 and crz8, respectively, are present in almost all BGCs from clade II. In five cases, e.g. cluster 6 from Streptomyces buecherae (Fig. 4), the P450 is even fused to the NRPS. This constellation implies an important function of the P450 for biosynthesis of this compound class. However, our current biosynthetic model does not involve steps catalysed by Crz7 and Crz8. Instead, these enzymes might also play a role in side chain biosynthesis or self-resistance.28 The BGCs of clade III harbour genes corresponding to crz3–6 that are located in close proximity to genes encoding various NRPSs- and/or polyketide synthases (PKS). This indicates a conserved interplay of the encoded enzymes and the collective horizontal transfer of the corresponding genes, yielding structurally diverse compounds with branched, unsaturated fatty acid side chains, such as the calcium-dependent antibiotics of cadasides, malacidins, and taromycins.16,29
| Strain | MIC [μg mL−1] |
|---|---|
| Staphylococcus aureus SG511 (MSSA) | 16 |
| Staphylococcus aureus RN4220 (MSSA) | 32 |
| Staphylococcus aureus HG003 (MSSA) | 32 |
| Staphylococcus aureus Mu50 (VISA, rifampinR) | 8 |
| Staphylococcus aureus COL (MRSA) | 4–8 |
| Enterococcus faecalis BM4223 (VRE) | 16 |
| Bacillus subtilis 168 | 128 |
| Escherichia coli I-112768 | >128 |
| Escherichia coli MB5746 | 16 |
To determine the target pathway, pathway-selective β-galactosidase bioreporter strains were treated with corallorazine A. Expression of LacZ was specifically induced in B. subtilis PyvgS-lacZ indicating interference with RNA biosynthesis while DNA, protein and cell wall biosyntheses remained unaffected (Fig. 5(A)). In a next step, we used conditional antisense RNA expression to reduce expression of putative target proteins involved in transcription and translation to confirm RNA biosynthesis pathway as a target for corallorazine A (Fig. 5(B)). Reduction of target protein results in hypersensitivity towards the inhibitor. While the reduction of expression levels of RNA polymerase Sigma factor SigA (sigA) and translation initiation factor IF-1 (infA) had no effect, increased susceptibility towards corallorazine A was observed when the RNA polymerase beta subunit (rpoC) and the 30S ribosomal protein S5 (rpsE) were down-regulated, indicating the latter being molecular targets. The direct effect of corallorazine A on the RNA polymerase from E. coli was examined in an in vitro transcription assay (Fig. 5(C)). The results show that corallorazine A inhibits transcription in a concentration-dependent manner, suggesting direct interaction with RNA polymerase.
The minor, linear derivatives corallorazine B and C appear to be biosynthetic shunt products of the assembly line, possibly caused by this unusual split, as both are lacking the glycine residue. In addition, our detailed bioinformatic analyses of the dehydrating Crz1-C2 and related CmodAA domains revealed previously not recognized conserved sequence features of this group of modifying C domains, including modifications of the highly conserved active site HHxxxDG motif for certain domains.
Our network analysis unveiled the crz BGC and closely related homologues thereof to be surprisingly widely distributed among bacteria from different phyla and allowed a comparative global analysis of the corallorazine-like GCF. In several cases, the presence of an additional MOx domain within the glycine-incorporating A domain indicates the production of oxidized or smaller, linear corallorazine derivatives. The widespread occurrence of BGCs for the assembly of corallorazine-like compounds suggests specialized bioactivities and ecological relevance of the encoded molecules, which may also be further investigated in view of their pharmaceutical potential. Finally, unexpected frequent co-occurrence of genes encoding a P450 monooxygenase and its corresponding FMN reductase along with the corallorazine NRPS genes in homologous BGCs give reason for further investigation of their potential impact in corallorazine biosynthesis.
mL of LB medium, supplemented with 50
ng mL−1 kanamycin for plasmid selection and incubated at 37 °C, 220
rpm overnight. This densely-grown preculture was used to inoculate 250
mL TB in a baffled flask, supplemented with kanamycin. After roughly 2
h incubation (37 °C, 220
rpm), OD600 reached 0.7 and the culture was chilled on ice. Protein expression was induced by addition of 0.4
mM isopropyl-β-D-thiogalactopyranoside (IPTG). The culture was then incubated for additional 16
h (16 °C, 200 rpm). For extraction of the proteins, cells were harvested via centrifugation (10
000 × g, 4 °C, 2 min). The cells were resuspended in 5
mL lysis buffer (50
mM NaH2PO4, 10
mM imidazole, 300
mM NaCl, pH 8) per g pellet. The cells were lysed via sonification and the lysate centrifuged (12
000 × g, 4 °C), until lysate was clear. The supernatant, containing the hexahistidine-tagged protein was incubated on ice with 1
mL of Ni-NTA-agarose suspension (Qiagen) for 1
h at 100
rpm. The suspension was finally filtered with a propylene column (Qiagen), washed with 4
mL wash buffer (50
mM NaH2PO4, 20
mM imidazole, 300
mM NaCl, pH 8) and eluted in 2.2
mL elution buffer (50
mM NaH2PO4, 300
mM imidazole, 300
mM NaCl, pH 8). The buffer system of the elution fraction was exchanged with PD-10 columns (Cytiva), following the manufacturers protocol. The desired concentration as set by centrifugation with Amicon membrane filter (Merck KGaA) with a MWCO of 30
kDa or 10
kDa.
mM tris pH 7.5, 5% glycerol) was mixed with 2 μL solution 1 (3
mM amino acid, 15 mM pyrophosphate, 20
mM tris pH 7.5) and 2 μL solution 2 (3 mM γ18O4-ATP, 15
mM MgCl2, 20 mM tris pH 7.5) and incubated at 22 °C for 1
h. The reaction was quenched with 6 μL 9-aminoacridine in acetone (5 mg mL−1). Precipitated proteins were removed via centrifugation. For sample analysis, 1 μL of the sample were spotted on a carrier plate of ground steel and analyzed with a Bruker AutoFlex III MALDI-TOF-MS in negative mode. Absolute substrate conversion in [%] was calculated by dividing the peak area at m/z 506 through the combined peak areas at m/z 508, 510, 512, and 514, divided by 83.33 for the molar ratio of labelled against unlabeled pyrophosphate in the assay.19
mM MgCl2) using PD-10 columns, following the manufacturers protocol. The assay was performed in a 500 μL one-pot-reaction: 12.5 μM Crz1.1 were incubated with 1 mM ATP, 1
mM L-Ser, and 0.5 mM octanoyl-CoA (CoALA) for 3
h at 22 °C. For the negative control, the enzyme was inactivated beforehand at 80 °C for 10
min. After incubation, proteins were precipitated with 10% trichloroacetic acid (final concentration) and incubated for 30
min on ice. Subsequently, the acid was removed by 3 washing steps with ddH2O. The precipitated proteins were then resuspended in 0.1
M KOH and incubated at 70 °C for 20
min to cleave the thioester-T domain bonds upon alkaline hydrolysis. Samples were dehydrated via lyophilization and dissolved in a minimal volume of MeOH for LC–MS analysis using a Waters Alliance e2695 Separation Module, equipped with a RP-18 column (Waters XBridge BEH Shield RP18 Column; 3.5 μm, 2.1 × 100 mm). A binary solvent mixture of A: 90% acetonitrile + 10% ammonium acetate (5 mM) pH 7.4 and B: 10% acetonitrile + 90% ammonium acetate (5 mM) pH 7.4 was used, with a flow rate of 0.3 mL min−1, starting at minute 0–5 with isocratic elution of 10% B, a gradient 10% B → 100% at minute 5–25, and again isocratic conditions of 100% B at minute 25–30. Injection volume of MeOH sample was 5 μL. Signals were detected in negative ion mode for total ion current (TIC) and for selected ion recording (SIR) at m/z = 230.1 for octanoyl-serine.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4cb00157e |
| This journal is © The Royal Society of Chemistry 2024 |