Open Access Article
This Open Access Article is licensed under a
Creative Commons Attribution 3.0 Unported Licence

Rapid discovery of terpene tailoring enzymes for total biosynthesis

Yunlong Sun a, Jennifer Gerke a, Kevin Becker a, Eric Kuhnert a, Bart Verwaaijen b, Daniel Wibberg b, Jörn Kalinowski b, Marc Stadler c and Russell J. Cox *a
aInstitute for Organic Chemistry, Leibniz Universität Hannover, Germany. E-mail: russell.cox@oci.uni-hannover.de
bCeBiTec, University of Bielefeld, Germany
cHelmholtz-Zentrum für Infektionsforschung, Braunschweig, Germany

Received 10th August 2023 , Accepted 2nd November 2023

First published on 8th November 2023


Abstract

Twenty oxygenated aristolochene congeners were rapidly synthesised by combining genes from four different fungal pathways in the fungal host organism Aspergillus oryzae. Compounds produced in a single step include the natural product hypoxylan A and an epimer of guignaderemophilane C. A new fungal aromatase was discovered that produces phenols by oxidative demethylation.


Advances in heterologous expression methodology in plants, fungi and bacteria mean that biosynthetic gene clusters (BGCs) encoding the biosynthesis of specialised metabolites can be rapidly assembled and expressed.1 In turn, many natural products can be produced by total biosynthesis. For example the modern antibacterial pleuromutilin has been produced in a single process by expression in the fungal host Aspergillus oryzae.2 Such processes compete very favourably with total chemical synthesis where the best route to pleuromutilin currently involves a minimum of 16 chemical steps and many reagents and solvents.3 However, total chemical synthesis is much more flexible than total biosynthesis because it can be used for the creation of many different structural variations. Natural biosynthetic processes normally involve the production of a specific skeleton, followed by tailoring processes. Current total biosynthesis strategies, such as that used during the total biosynthesis of pleuromutilin, follow the natural pathway exactly and rational structural variation is currently difficult.

Many biosynthetic pathways start from the same core carbon skeleton and then diverge by the function of different tailoring enzymes. It is likely that biosynthetic pathways evolve by the gain and loss of genes that encode these enzymes.4 A clear example of this comes from the four compounds sporogen AO-1 1a (A. oryzae5 and A. flavus6), hypoxylan A 2 (Hypoxylon rickii),7 eremoxylarin D 3 (Xylaria hypoxylon),8 and PR-toxin 4 (Penicillium roquefortii)9 that are all built from the aristolochene skeleton via different tailoring reactions. Sporogen AO-2 1b has been isolated from a plant source, but may be fungal in origin.10 Aristolochene 5 is produced in a single step by the terpene cyclase aristolochene synthase from farnesyl-diphosphate (FPP).11 Remarkably, even among these four aristolochene-derived compounds, twelve out of the fifteen carbons derived from FPP are oxidatively modified.

We hypothesised that it should be possible to mix-and-match tailoring enzymes from these related biosynthetic pathways to rapidly access differently oxidised aristolochenes. For this to be possible it would require some flexibility in substrate selectivity of the tailoring enzymes. Specifically, we hypothesised that pathways may be more web-like than exclusively linear (Scheme 1B). We rationalised that this could be likely since natural processes involving evolution by gain or loss of biosynthetic genes between related pathways also require flexibility in substrate selectivity. For example, we have shown that web-like routes are responsible for later oxidative tailoring steps during cytochalasan biosynthesis.12–14


image file: d3sc04172g-s1.tif
Scheme 1 (A) Biosynthesis of aristolochene terpenoids in fungi; (B) comparison of linear and web-like biosynthetic routes.

We therefore set out to explore the catalytic potential of the oxidative tailoring enzymes of these metabolites using heterologous expression as our synthetic tool.

Results

The fully-sequenced genomes of A. oryzae and P. roquefortii that produce sporogen AO-1 1a and PR-toxin 4 respectively are publicly available. We fully sequenced the genomes of Hypoxylon rickii and Xylaria hypoxylon that produce 2 and 3 respectively using combined Illumina and Oxford nanopore technologies.

Assembled genomes were automatically annotated, and where necessary, manual annotation was performed to remove ambiguities using a previously described informatic pipeline.15,16 The biosynthetic gene cluster (BGC) encoding the biosynthesis of PR toxin 4 in P. roquefortii (prL1prL9) is already known.17 Using this BGC we searched the genomes of the other three organisms using Cblaster.18 This revealed a single BGC in each organism likely to be responsible for the biosynthesis of each of the compounds. BGC-comparison was performed using CLinker (Fig. 1).15 This revealed that each BGC encodes the expected terpene cyclase, plus differing numbers of redox tailoring enzymes. General gene occurrence patterns supported our hypothesis that the BGCs do, in fact, encode the biosynthesis of each compound. For example, sporogen AO-1 1a is the simplest compound and its putative BGC (aoTcaoL4) consists of the fewest genes. In the case of the PR-toxin BGC (prR1prL9), the presence of an acetyl transferase corresponds with the presence of the 3-O-acetyl group. The putative eremoxylarin BGC (xhR3xhPKS) encodes a polyketide synthase (PKS), most likely responsible for the range of polyketides esterified at O-1, while the proposed hypoxylan A 2 BGC (hrTchrL8) encodes an additional SDR (HrL5) that could be associated with its more reduced sidechain. All BGCs also encode transcription factors and transporters that show low sequence homologies, likely reflecting their different hosts and biological contexts.


image file: d3sc04172g-f1.tif
Fig. 1 Clinker15 analysis of the four BGCs responsible for the biosynthesis of modified aristolochenes. PKS, polyketide synthase; AT, acyl transferase.

All compounds 1–4 contain a ketone (or phenolic oxygen) at C-8. This could be introduced via initial oxygenation to form an allylic alcohol, followed by oxidation by a short-chain dehydrogenase-reductase (SDR). Consistent with this idea, all four BGCs encode a common P450 monooxygenase (AoL4, HrL7, PrL9 and XhL7) and a common SDR (AoL1, HrL8, PrL2 and XhL6) that may catalyse these steps.

A. oryzae, our standard expression host, is a domesticated derivative of A. flavus in which most specialized metabolite BGCs are inactive.19 This is convenient for heterologous expression experiments because newly introduced biosynthetic pathways do not have to compete for precursors with endogenous pathways, and newly created compounds are easier to detect against a clean metabolic background. Under our standard fermentation conditions Aspergillus oryzae NSAR1 does not produce sporogen AO-1 1a. This may be because the terpene cyclase is damaged or not expressed. However RT-PCR analysis showed that aoL4 (P450) and aoL1 (SDR) are transcriptionally active. We therefore expressed the H. rickii terpene cyclase gene (hrTc) under the control of the inducible A. oryzae amyB promoter (PamyB) in A. oryzae NSAR1.

LCMS analysis of transformants grown under inducing conditions (Fig. 2) showed the production of two new compounds that were purified, fully characterised by NMR, and shown to be 2,8-dihydroxy aristolochene 6 (5.0 mg L−1)20 and 2-hydroxy, 8-oxo-aristolochene 7 (7.2 mg L−1, Scheme 2). In all cases the absolute stereochemistry was assumed to match 5 itself, and relative stereochemistry was determined via J and nOe correlations.


image file: d3sc04172g-f2.tif
Fig. 2 ELSD chromatograms obtained from extracts of A. oryzae NSAR1 expressing the indicated proteins. * = unrelated compound. Colour coding as Scheme 2. Grey proteins appear inactive (e.g. HrL5) or redundant (e.g. P450 PrL9 = P450 AoL4 that is already expressed in the host).

image file: d3sc04172g-s2.tif
Scheme 2 Compounds isolated and fully characterized after biosynthetic gene expression in A. oryzae showing expressed proteins and putative shunt steps.

PR-toxin 4 from Penicillium roquefortii, (pr) is oxidised at C-1 and C-3. Coexpression of hrtc and prL4 resulted in production of the 1-hydroxylated congeners 8 (4.2 mg L−1), 9 (6.0 mg L−1) and 10 (4.7 mg L−1). Further addition of prL7 to this system provided the 3-hydroxylated congeners 11 (6.0 mg L−1) and 12 (3.5 mg L−1). In contrast, hypoxylan A 2 from Hypoxylon rickii (hr) is not oxidised at positions 1-3, but features a phenolic ring and loss of C-15. In order to investigate the reactions leading to this skeleton, we coexpressed all of the biosynthetic genes from the hr BGC. This provided 2 itself (8.5 mg L−1), plus the corresponding carboxylic acid 13 (15.0 mg L−1). Omission of the FAD-dependent dehydrogenase encoded by hrL4 led to the diol 14 (4.0 mg L−1) and no more highly oxidised analogs. Compound 15 (5.0 mg L−1), in which the phenol is present, was formed by coexpression of hrTc and hrL1 alone. This was accompanied by C1-alcohol epimers 16 and 17 (3.0 mg L−1 combined). Coexpression of hrTc with hrL3 and hrL4 created the carboxylic acid 18 (5.6 mg L−1).

Finally, coexpression of xhR1 with hrTc provided six new aristolochene congeners 19 (4.0 mg L−1), 20 + 21 (3.0 mg L−1), 22 (4.0 mg L−1) and 23 + 24 (10.0 mg L−1) respectively. These are predominantly oxidised at C-13, but unoxidised at C-8, suggesting that early oxidation at C-13 may prevent the A. oryzae C-8 oxidases from recognising these as substrates.

Discussion

The strategy of mining four BGCs that all process the same terpene skeleton for functional tools for future total biosynthesis projects proved highly effective. The host organism A. oryzae itself apparently already expresses catalysts (AoL1 and AoL4) that form the 8-hydroxyl and 8-oxo functionalities.

This is in common with the other three BGCs that must all also form the 8-ketone 25 in their respective biosynthetic pathways, as early common steps. Compound 25 (Scheme 2) has already been shown by Dickschat and coworkers to be an intermediate in the pathway to PR-toxin 4.14 However, this compound is reported to be highly unstable, and we did not directly observe 25 in our experiments.

Oxidative enzymes from the PR-toxin (pr) BGC can hydroxylate at the 1- and 3-positions, while enzymes from the H. rickii (hr) hypoxylan A 2 pathway catalyse redox processes on the isopropenyl side-chain carbons 11-13.

Remarkably, cytochrome P450 monooxygenase HrL1 behaves as an aromatase (Scheme 3), catalysing the oxidative elimination of the C-15 methyl to create the distinctive phenolic nucleus of hypoxylan A 2. Aromatase is an important enzyme involved in the biosynthesis of estrogens in vertebrates, but it is not known in other organisms, and to the best of our knowledge this is the first fungal example.


image file: d3sc04172g-s3.tif
Scheme 3 Combination of A. oryzae sporogen and shunt enzymes and H. rickii aromatase activity to reach 15.

These experiments show that achieving total synthesis using the easily accessible genetic resources of fungi is an attractive prospect. Here we report the first total synthesis of hypoxylan A 2 in a single process. More complex compounds were also produced. For example, triol 12 is a C-1 epimer of the known fungal natural product guignaderemophilane C 26 (ref. 21) that has been accessed by total chemical synthesis in eleven steps.22 In this case, combination of genes from H. rickii, P. roquefortii and the host organism A. oryzae allowed the synthesis of the epimer 12 in a single process.

A. oryzae is a widely-used host organism for pathway expression and engineering.23 Normally it is regarded as a clean host that does not interfere with engineered pathways. However in this project A. oryzae becomes a rather active host. It is clear that AoL4 (P450) and AoL1 (SDR) are active. This is beneficial since the 8-oxo functionality is required by all pathways to 1–4. However, observation of 2-hydroxylation was not anticipated and many of the observed products feature this substitution. Other shunts are exemplified by the putative route to 15 that involves unplanned hydroxylation at C-2 and C-13 in addition to the aromatase chemistry catalysed by HrL1.

Although it is currently impossible to predict the substrate selectivity or mode of reaction of any given tailoring enzyme from its sequence alone, such predictability will be required if total biosynthesis is to approach the flexibility of total chemical synthesis. However, in the absence of such predictions, our results show that harvesting and screening tailoring genes from pathways that utilise the same carbon skeleton is an effective way of finding catalysts for use in total biosynthesis. Twenty compounds were derived from manipulation of only 6 genes. It is likely that many other compounds could be synthesised by the same methodology, since only a fraction of available genes and combinations were coexpressed. Given that aristolochene-type terpenes are among the most common in fungi24 it is also likely that very many other BGCs could be used to expand the diversity yet further. The same strategy should also be effective for other classes of specialized metabolites, and as more BGCs are discovered and linked to particular compounds it will become increasingly easy to design and build effective total biosynthetic routes to target compounds. It is also likely that tailoring enzymes with inherently flexible substrate selectivities obtained from biosynthetic webs could be engineered by directed evolution25 more easily than enzymes with high selectivies from exclusively linear pathways. Such engineered enzymes already find important roles in shortening total chemical synthesis routes to natural products.26 Finally, these results lend significant support to ideas that gain and loss of biosynthetic genes contributes to fungal specialized metabolite diversity and evolution.

Data availability

Annotated genome sequence files are available at: https://doi.org/10.6084/m9.figshare.23683644.v1 (Xylaria hypoxylon); and https://doi.org/10.6084/m9.figshare.23652825.v1 (Hypoxylon rickii).

Author contributions

YS and KB did all experimental work. JG, EK, BV, DW and JK sequenced and annotated genomes and provided data analysis. MS provided organisms and RJC, EK and MS obtained funds. RJC drafted the MS and led the project. All authors were involved in polishing the draft MS.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

YS thanks the China Scholarship Council for a scholarship (201908360165). This work was funded by the Deutsche Forschungsgemeinschaft priority program “Taxon-Omics: New Approaches for Discovering and Naming Biodiversity” (SPP 1991), specifically CO 1328/4-2 and CO 1328/4-1. The bioinformatics support of the BMBF-funded project ‘Bielefeld-Gießen Center for Microbial Bioinformatics; BiGi (Grant number 031A533)’ within the German Network for Bioinformatics Infrastructure (de.NBI) is gratefully acknowledged.

References

  1. X. Meng, Y. Fang, M. Ding, Y. Zhang, K. Jia, Z. Li, J. Collemare and W. Liu, Biotechnol. Adv., 2021, 54, 107866 CrossRef PubMed.
  2. A. M. Bailey, F. Alberti, S. Kilaru, C. M. Collins, K. de Mattos-Shipley, A. J. Hartley, P. Hayes, A. Griffin, C. M. Lazarus, R. J. Cox, C. L. Willis, K. O'Dwyer, D. W. Spence and G. D. Foster, Sci. Rep., 2016, 6, 25202 CrossRef CAS PubMed.
  3. N. J. Foy and S. V. Pronin, J. Am. Chem. Soc., 2022, 144, 10174 CrossRef CAS PubMed.
  4. A. B. Chase, D. Sweeney, M. N. Muskat, D. G. Guillén-Matus and P. R. Jensen, mBio, 2021, 12, e02700–e02721 CrossRef CAS PubMed.
  5. S. Tanaka, K. Wada, S. Marumo and H. Hattori, Tetrahedron Lett., 1984, 25, 5907–5910 CrossRef CAS.
  6. Z. Liu, J.-Y. Zhao, S.-F. Sun, Y. Li, J. Qu, H.-T. Liu and Y. Liu, J. Nat. Prod., 2019, 82, 1063–1071 CrossRef CAS PubMed.
  7. E. Kuhnert, F. Surup, V. Wiebach, S. Bernecker and M. Stadler, Phytochemistry, 2015, 117, 116–122 CrossRef CAS PubMed.
  8. A. Miral, S. Ferron, I. Rouaud, D. Slyambayev, L. Bousarghin, C. Camuzet, S. Belouzard, K. Séron, P. L. Pogam, S. Tranchimand and S. Tomasi, J. Nat. Prod., 2023, 86, 730 CrossRef CAS PubMed.
  9. A. A. Chalmers, A. E. de Jesus, C. P. Gorst-Allman and P. S. Steyn, J. Chem. Soc., Perkin Trans., 1981, 1, 2899–2903 RSC.
  10. Y.-B. Zeng, S.-S. Li, W.-L. Mei, W.-H. Dong, K.-M. Li and H.-F. Dai, J. Asian Nat. Prod. Res., 2015, 17, 280–284 CrossRef CAS PubMed.
  11. B. Felicetti and D. E. Cane, J. Am. Chem. Soc., 2004, 126, 7212–7221 CrossRef CAS PubMed.
  12. C. Wang, V. Hantke, R. J. Cox and E. Skellam, Org. Lett., 2019, 21, 4163–4167 CrossRef CAS PubMed.
  13. C. Wang, K. Becker, S. Pfütze, E. Kuhnert, M. Stadler, R. J. Cox and E. Skellam, Org. Lett., 2019, 21, 8756–8760 CrossRef CAS PubMed.
  14. C. Wang, C. Lambert, M. Hauser, A. Deuschmann, C. Zeilinger, K. Rottner, T. E. B. Stradal, M. Stadler, E. J. Skellam and R. J. Cox, Chem. – Eur. J., 2020, 26, 13578–13583 CrossRef CAS PubMed.
  15. D. Wibberg, M. Stadler, C. Lambert, B. Bunk, C. Spröer, C. Rückert, J. Kalinowski, R. J. Cox and E. Kuhnert, Fungal Diversity, 2021, 106, 7–28 CrossRef CAS.
  16. E. Kuhnert, J. C. Navarro-Muñoz, K. Becker, M. Stadler, J. Collemare and R. J. Cox, Stud. Mycol., 2021, 99, 100118 CrossRef CAS PubMed.
  17. R. Riclea and J. S. Dickschat, Angew. Chem., Int. Ed., 2015, 54, 12167–12170 CrossRef CAS PubMed.
  18. M. van den Belt, C. Gilchrist, T. J. Booth, Y.-H. Chooi, M. H. Medema and M. Alanjary, BMC Bioinf., 2023, 24, 181 CrossRef CAS PubMed.
  19. G. A. Payne, W. C. Nierman, J. R. Wortman, B. L. Pritchard, D. Brown, R. A. Dean, D. Bhatnagar, T. E. Cleveland, M. Machida and J. Yu, Med. Mycol., 2006, 44, 9–11 CrossRef PubMed.
  20. Unoptimised isolated and purified yields. See ESI for all characterisation data.
  21. Y. Liu, Y. Li, J. Qu, S. Ma, C. Zang, Y. Zhang and S. Yu, J. Nat. Prod., 2015, 78, 2149–2154 CrossRef CAS PubMed.
  22. A. Ilazi, R. Liffert and K. Gademann, Helv. Chim. Acta, 2018, 101, e1800011 CrossRef.
  23. L. Kahlert, C. Schotte and R. J. Cox, Synthesis, 2021, 53, 2381–2394 CrossRef CAS.
  24. K. T. Yuyama, D. Fortkamp and W.-R. Abraham, Biol. Chem., 2017, 399, 13–28 Search PubMed.
  25. F. Li, H. Deng and H. Renata, J. Am. Chem. Soc., 2022, 144, 7616–7621 CrossRef CAS PubMed.
  26. C. R. Zwick and H. Renata, Nat. Prod. Rep., 2020, 37, 1065–1079 RSC.

Footnote

Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3sc04172g

This journal is © The Royal Society of Chemistry 2023