Open Access Article
Jing
Shi‡
a,
Xiang
Xu‡
a,
Pei Yi
Liu
a,
Yi Ling
Hu
a,
Bo
Zhang
a,
Rui Hua
Jiao
a,
Ghader
Bashiri
b,
Ren Xiang
Tan
*a and
Hui Ming
Ge
*a
aState Key Laboratory of Pharmaceutical Biotechnology, School of Life Sciences, Chemistry and Biomedicine Innovation Center (ChemBIC), Institute of Artificial Intelligence Biomedicine, Nanjing University, Nanjing 210023, China. E-mail: rxtan@nju.edu.cn; hmge@nju.edu.cn
bLaboratory of Structural Biology, Maurice Wilkins Centre for Molecular Biodiscovery, School of Biological Sciences, University of Auckland, Auckland 1010, New Zealand
First published on 5th January 2021
Nonribosomal peptide synthetases (NRPSs) are modular enzymes that use a thiotemplate mechanism to assemble the peptide backbones of structurally diverse and biologically active natural products in bacteria and fungi. Unlike these canonical multi-modular NRPSs, single-module NRPS-like enzymes, which lack the key condensation (C) domain, are rare in bacteria, and have been largely unexplored to date. Here, we report the discovery of a gene cluster (gup) encoding a NRPS-like megasynthetase through genome mining. Heterologous expression of the gup cluster led to the production of two unprecedented alkaloids, guanipiperazines A and B. The NRPS-like enzyme activates two L-tyrosine molecules, reduces them to the corresponding amino aldehydes, and forms an unstable imine product. The subsequent enzymatic reduction affords piperazine, which can be morphed by a P450 monooxygenase into a highly strained compound through C–O bond formation. Further intermolecular oxidative coupling forming the C–C or C–O bond is catalyzed by another P450 enzyme. This work reveals the huge potential of NRPS-like biosynthetic gene clusters in the discovery of novel natural products.
The carboxylic acid reductases (CARs) belong to the single-module NRPS-like enzymes, and comprise an N-terminal adenylation domain, which is fused to a C-terminal reductase (R) domain via a T domain.11 Mechanistically, the T domain delivers the acyl thioester to the R domain for reduction by NADPH to give an aldehyde product. For example, Lys2 can reduce α-aminoadipate at C-6 to the semialdehyde in the biosynthesis of lysine in Saccharomyces cerevisiae;12 CmlP is involved in the biosynthesis of the antibiotic chloramphenicol (Scheme 1) in Streptomyces;13 PvfC is responsible for the synthesis of a family of pyrazine N-oxides in Pseudomonas virulence;14,15 and HamD is related to valdiazen biosynthesis in Burkholderia cenocepacia (Scheme 1).16 CARs have also recently attracted interest as green biocatalysts due to their abilities to convert a broad range of carboxylic acid substrates to the corresponding aldehydes.17,18 CARs alone or combined with other enzymes have been applied to generate pharmaceuticals, biofuels, and other valuable products.17,19,20
Recent advances of microbial genomics have led to the rapid accumulation of uncharacterized biosynthetic gene clusters (BGCs) in genome database. The diverse functions of CARs and their biocatalytic potential encouraged us to identify new members through genome mining. Here, we report the discovery of a unique four-gene operon encoding an F420H2-dependent reductase, a CAR, and two P450 proteins in Streptomyces. We reconstituted four enzymes that catalyse the production of two unprecedented alkaloids (1 and 2). We demonstrated the detailed biosynthetic steps towards the formation of 1 and 2.
000 bacterial BGCs. When the R domain was used as the search query, 1230 NRPS BGCs containing a terminal R domain were retrieved. After manually removing the multi-modular NRPSs, we were left with 608 single-module NRPS-like BGCs.
From these 608 hits, we generated a sequence similarity network (SSN) based on their corresponding R domain sequences.22 We showed that CARs from different organisms may evolve independently as many of them are related to their taxonomies (Scheme 1).23 Nonetheless, the known CAR enzymes with similar functions are clustered together. The largest clade of proteins contains homologs of NiCARs and MmCARs,24 suggesting this clade is most likely involved in the reduction of aromatic or aliphatic carboxylic acids to their corresponding aldehydes. We also excluded homologous enzymes of PvfC, HamD, and CmlP from further investigation (Scheme 1).
We hypothesize that other tailoring enzymes in BGCs are important for the structural complexity of the corresponding natural products, as exemplified by the chloramphenicol BGC where a series of reductase, halogenase, amidase, and oxygenase encoding genes are present.13 Thus, we further submitted the CAR-gene containing loci to antiSMASH for gene cluster analysis.21 We identified an orphan BGC (gup) in S. chrestomyceticus consisting of a four-gene operon encoding a putative F420H2-dependent reductase, a CAR, and two P450s (Fig. 1). A subsequent BLAST homology search in the NCBI and JGI genome databases revealed that many other Streptomyces strains harbour this homologous, yet cryptic, gene cluster (Fig. S1†).
![]() | ||
| Fig. 1 Biosynthesis of guanipiperazines A and B (1 and 2). (A) Biosynthetic gene cluster of gup; (B) the proposed biosynthetic pathway of 1 and 2. | ||
![]() | ||
| Fig. 2 HPLC analysis (monitored at 280 nm) of the heterologous expression of the gup biosynthetic gene cluster in Streptomyces lividans TK24. | ||
GupA is annotated as an F420H2-dependent reductase. Co-expression of the gupB and gupA genes yielded a piperazine-type compound 4 (Fig. 1, Table S7†). Feeding 4 to the S. lividans TK24/gupACD strain restored the production of 1 and 2 (Fig. S2†), indicating 4 is an on-pathway intermediate. Hence, the CAR GupB enzyme may successively activate two tyrosine molecules and reduce them to the corresponding amino aldehydes. Condensation between two amino aldehydes followed by dehydration led to the formation of a proposed intermediate 5. In the absence of GupA, 5 is spontaneously overoxidized by air to give 3; alternatively, 5 is reduced to form intermediate 4 in the presence of GupA.
GupA is a small protein (162 amino acids) that belongs to the RV1155-F420 superfamily, and contains a conserved F420-binding domain. However, the functions of members of this protein superfamily are unclear. We tested whether GupA can reduce the proposed intermediate 5. We purified the cofactor F420 from a genetically-engineered strain of Mycobacterium smegmatis.29 F420-dependent glucose-6-phosphate dehydrogenase (FGD) from M. tuberculosis was used for in situ regeneration of the reduced form of F420 (F420H2) (Fig. S5†).30 We reconstituted the holo-GupB by incubating apo-GupB, CoA, and Sfp (a phosphopantetheinyl transferase in Bacillus subtilis).31 Further supplementation of ATP and L-tyrosine in the above reaction system led to the production of 3 (Fig. 4), indicating the reconstituted GupB protein is functional. When we incubated GupA, F420, and the F420H2 regeneration system into the abovementioned enzymatic reaction, production of 4 was observed.
As cofactor F420 is structurally similar to the flavins (FMN or FAD), we evaluated if GupA could utilize FMNH2/FADH2 as reducing agents. FMNH2/FADH2 can be regenerated by NADPH, FMN/FAD, and flavin reductase (Fre) from E. coli.32 However, the production of 4 was not detected upon the introduction of FMNH2/FADH2 (Fig. S6†). These data reveal that GupB activates and reduces two L-tyrosine molecules and forms the condensed intermediate 5, while GupA is required for the formation of 4. To the best of our knowledge, GupA is the first member of the RV1155-F420 superfamily to be functionally characterized.
The gupC and gupD genes were deleted individually from the gup BGC through in-frame deletion. LC-MS analysis showed that S. lividans TK24/gupABD did not produce 1 and 2, but accumulated 4 as the dominant product (Fig. 2). Meanwhile, the S. lividans TK24/gupABC strain generated a new compound 6 with m/z 297.2 [M + H]+ (Fig. 2), which is 2 Da less than that of 4. The structure of 6 is the expected strained piperazine compound (Fig. 1, Table S8†). Our in vivo heterologous expression data indicated that GupC catalyzes the intramolecular C–O bond formation, while GupD couples 6 and a guanine to form 1 and 2 through C–C and C–O linkage.
To verify functions of the two P450s in vitro, GupC and GupD were overexpressed in E. coli BL21(DE3) cells and purified to near homogeneity as (His)6-fusion proteins. In addition, P450 enzyme activity requires the supply of electrons to the iron center via the electron transport chain. The bacterial P450 enzyme utilizes a two-component system consisting of ferredoxin (Fdx) and ferredoxin reductase (Fdr). Thus, we overexpressed and purified the native Fdx and Fdr enzymes from S. chrestomyceticus (Fig. S3†). Incubation of GupC and 4 in the presence of the electron transport system and NADPH led to the consumption of 4 with the concomitant appearance of 6. These findings indicate GupC catalyzes the intramolecular C–O bond formation.
When GupD was incubated with 6 and GTP in the presence of the electron transport system, we observed the formation of 1 and 2 (Fig. 5). This indicated that GupD acts as a guaninyl transferase that catalyzes the attachment of a guanine moiety to the piperazine core via a C–C or C–O bond. This function of GupD is reminiscent of the P450 enzyme PcmD (with 37% identity to GupD),36 which transfers a guaninyl moiety to a cyclodipeptide in the biosynthesis of purincyclamide. PcmD uses a guanine as the substrate instead of GTP, whereas GupD only accepts GTP as the substrate, and is inactive to ATP, UTP, and guanine (Fig. S7†).
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d0sc06135b |
| ‡ Contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2021 |