Chenglong
Zhang
ab,
Jia
Wang
ab,
Yi
Shi
ab,
Nan
Wu
ab,
Xia
Li
ab,
Ying
Wang
ab,
Bingzhi
Li
ab,
Wenhai
Xiao
*abcd,
Mingdong
Yao
*ab and
Yingjin
Yuan
ab
aFrontier Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China. E-mail: mingdong.yao@tju.edu.cn
bFrontier Research Institute for Synthetic Biology, Tianjin University, China
cSchool of Life Sciences, Faculty of Medicine, Tianjin University, China
dGeorgia Tech Shenzhen Institute, Tianjin University, Shenzhen 518071, China
First published on 13th September 2024
Taxadiene is an important precursor of paclitaxel. However, its low yield in eukaryotic systems limits its biosynthesis. This study found that the low yield of taxadiene is likely due to the degradation of expressed taxadiene synthase (TS) in Saccharomyces cerevisiae. The TS expression degradation problem was improved by the knockout of protease PRB1, and the yield of taxadiene was increased by 97% from 19.8 mg L−1 to 39.2 mg L−1. Furthermore, multi-copy integration of the TS gene and enhancement of the geranylgeranyl diphosphate (GGPP) precursor pathway increased the taxadiene titer to 282.4 mg L−1 in a shake flask. Interestingly, enhancing TS expression also decreased the competitive synthesis of geranylgeraniol (GGOH). Finally, the taxadiene titer reached 878.5 mg L−1 after optimising the fed-batch fermentation, which is the highest taxadiene titer reported for eukaryotic microbes. This study alleviated the problems associated with the expression and degradation of heterologous proteins and provided an efficient and green strategy to produce complex natural compounds.
In recent years, the utilisation of microbes to produce high-value plant compounds has become a promising alternative approach owing to rapid advances in synthetic biology and metabolic engineering.7,8 Furthermore, compared to the production processes of natural product chemical synthesis, microbial synthesis utilizes renewable biomass to produce the target compounds via green and sustainable means.9 Taxadiene is the key intermediate in the paclitaxel biosynthetic pathway and high-yield taxadiene production is an essential prerequisite for efficient paclitaxel biosynthesis.10 Taxadiene is formed by the cyclisation of geranylgeranyl diphosphate (GGPP) by taxadiene synthase (TS). Ajikumar et al. used a multivariate modular pathway engineering approach for dividing taxadiene synthesis into an upstream native IPP synthesis module and downstream GGPP and taxadiene synthesis modules. By combining these two modules with different promoters and gene copy numbers, they identified an optimal combination that increased taxadiene production in a 2 L bioreactor by 15000-fold to 1 g L−1.11 In the paclitaxel biosynthetic pathway, the intermediate taxadiene undergoes a series of complex reactions, including hydroxylation, acetylation, and epoxidation.12 Compared to prokaryotic systems, the eukaryotic system Saccharomyces cerevisiae (S. cerevisiae) possesses complete membrane systems and post-translational protein modification machinery, giving it an advantage for expressing P450 enzymes.13 Therefore, S. cerevisiae have tremendous potential as a platform for the green bio-synthesis of paclitaxel.
Although taxadiene production in Escherichia coli (E. coli) has made significant progress, its yield in S. cerevisiae remains very low. Compared to other terpene synthases, TS not only has lower expression levels but also relatively lower activity, with a turnover rate 70-fold lower than that of plant diterpene synthases with higher sequence homology.14 Engels et al. introduced GGPP synthase (SaGGPPS) from Sulfolobus acidocaldarius and TS from Taxus into S. cerevisiae and overexpressed tHMG1. Through TS codon optimisation, the final taxadiene titer reached 8.7 mg L−1.15 According to previous studies, the expression of soluble TS in S. cerevisiae is poor. Nowrouzi et al. increased the expression of soluble TS by utilising solubility enhancement tags, thereby increasing the taxadiene titer in S. cerevisiae to 129 mg L−1.16 TS protein expression was enhanced by the fusion of GGPPS and TS in our previous study.17 Although previous studies have proposed various engineering strategies to improve TS protein expression and activity, including N-terminal truncation, solubility-enhancing tags, subcellular relocalization, and copy number amplification, taxadiene production has not yet reached the expected level in S. cerevisiae. Progress has been made regarding the downstream modification pathways of taxadiene to paclitaxel and the latest studies have revealed the key missing enzymes in the biological pathway of paclitaxel, clarified a new mechanism of oxidative rearrangement in paclitaxel oxetane formation, and discovered multiple related genes in the heterogeneous synthesis pathway of paclitaxel.18 The exploration of taxadiene and its downstream pathways in eukaryotic systems has garnered considerable attention.19 Therefore, finding new strategies and tools to improve the expression in S. cerevisiae and increase the taxadiene yield is worthy of further investigation.
Many studies have addressed the problem of low heterologous protein expression in S. cerevisiae. To increase the yield of haemoglobin in S. cerevisiae, Ishchuk et al. knocked out the VPS1 gene encoding the vacuolar protein sorting receptor and the PEP4 gene encoding vacuolar protease A to reduce the degradation of haemoglobin.20 Moreover, disrupting the PAH1 encoding phospholipid phosphatase by CRISPR/Cas9 caused endoplasmic reticulum stress (ERS), thereby stimulating the production of recombinant triterpene biosynthetic enzymes and ultimately promoting the accumulation of triterpene compounds and triterpene saponins.21 Yap et al. knocked out PRB1 and PEP4 genes in S. cerevisiae BJ5464, improved the expression of sesquiterpene synthetase derived from tiger milk mushroom Lignosus rhinocerotis, and increased the yield of α-cadinol.22 Tang et al. enhanced the expression of monacolin-j acid (MJA) biosynthesis genes and increased the MJA titer to 75 mg L−1 by deleting PYC2, PRB1, and PEP4 proteases.23 The productivity of heterologous proteins in S. cerevisiae can be effectively improved by inhibiting the activity of related proteases to prevent their degradation.24
In previous studies, we had successfully constructed a high-producing GGPP strain by screening metabolic pathway genes and controlling key gene promoters, thereby providing an excellent basis for taxadiene biosynthesis.25 In this study, we aimed to verify the abnormal protein expression of TS and the phenomenon of TS protein cleavage in the expression system of S. cerevisiae. Subsequently, we knocked out the endogenous protease PRB1 of S. cerevisiae and improved the expression level of the intact TS protein, which resulted in a significant increase in the production of taxadiene. Then, we further increased the titer of taxadiene to 282.4 mg L−1 in a shaking bottle by enhancing the GGPP precursor pathway and integrating the multi-copy TS gene into the TY1 site. Finally, the titer of taxadiene was increased to 878.5 mg L−1 in a 5 L bioreactor under optimised fed-batch fermentation conditions, which is the highest taxadiene yield reported in eukaryotes (Fig. 1).
Strains | Host strain | Description | Source |
---|---|---|---|
CEN.PK2-1C | MAT a; ura3-52, trp1-289, leu2-3,112, his3Δ1, MAL2-8C, SUC2 | This lab | |
yZCL078 | CEN.PK2-1C | MAT a; ura3-52, trp1-289, leu2-3,112, his3Δ1, MAL2-8C, SUC2; Delta22::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t; ΔYL1062W; Δgal80 | This study |
yZCL080 | yZCL078 | pZCL106 | This study |
yZCL081 | yZCL078 | ΔCYM1::LEU2 | This study |
yZCL082 | yZCL078 | ΔPEP4::LEU2 | This study |
yZCL083 | yZCL078 | ΔPRB1::LEU2 | This study |
yZCL084 | yZCL078 | ΔYAP3::LEU2 | This study |
yZCL089 | yZCL078 | pZCL096 | This study |
yZCL090 | yZCL081 | pZCL096 | This study |
yZCL091 | yZCL082 | pZCL096 | This study |
yZCL092 | yZCL083 | pZCL096 | This study |
yZCL093 | yZCL084 | pZCL096 | This study |
yZCL099 | yZCL081 | pZCL106 | This study |
yZCL100 | yZCL082 | pZCL106 | This study |
yZCL101 | yZCL083 | pZCL106 | This study |
yZCL102 | yZCL084 | pZCL106 | This study |
yZCL107 | yZCL078 | pZCL116 | This study |
yZCL108 | yZCL081 | pZCL116 | This study |
yZCL109 | yZCL082 | pZCL116 | This study |
yZCL110 | yZCL083 | pZCL116 | This study |
yZCL111 | yZCL084 | pZCL116 | This study |
yZCL115 | yZCL083 | ΔPRB1::LEU2; G418-100 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL117 | yZCL083 | ΔPRB1::LEU2; G418-200 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL118 | yZCL083 | ΔPRB1::LEU2; G418-400 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL119 | yZCL083 | ΔPRB1::LEU2; G418-400 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL124 | yZCL083 | ΔPRB1::LEU2; G418-800 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL126 | yZCL083 | ΔPRB1::LEU2; G418-1600 mg; TY1::PKK1t-GAL1p-2× His-TS-GPM1t-PYC1p-KanMX-TEF1p | This study |
yZCL136 | yZCL124 | ΔPRB1::LEU2; G418-800 mg; Delta22::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t; HO::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t; | This study |
yZCL137 | yZCL136 | ΔPRB1::LEU2; G418-800 mg; Delta22::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t; HO::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t; TKL2::CYC1t-TDH1p-BST1-ERG20-TEF1t-PYC1p-tHMG1-HXT7t | This study |
The E. coli top 10 strain was cultured in LB medium (0.5% yeast extract, 1% tryptone, and 1% NaCl) supplemented with either 50 μg mL−1 kanamycin or 100 μg mL−1 ampicillin at a temperature of 37 °C for the propagation of recombinant plasmids. S. cerevisiae strains devoid of plasmids were cultured in yeast extract-peptone-dextrose (YPD) medium. The selected plasmids carrying nutritional screening tags were introduced into S. cerevisiae cells and cultured in a synthetic complete (SC) medium lacking the corresponding amino acids. All medium formulations were the same as those used in a previous study.17
After diluting the total protein concentrations to the same level across different experimental groups, 40 μL of protein sample was taken and mixed thoroughly with 10 μL of 5× SDS-PAGE protein sample buffer. The protein sample was heated in a boiling water bath for 10 minutes to denature the protein completely, then the sample was placed on ice to cool to room temperature and centrifuged for 5 minutes. Equal volumes of each sample containing the same amount of total protein were loaded into 15% SurePAGE gel. Protein electrophoresis was divided into two stages. In the first step, a constant voltage of 80 V was applied for 20 min and in the second step, a constant voltage of 120 V was applied for 60–70 minutes. For transfer (blotting), the conditions were: a constant voltage of 110 V for approximately 70 minutes. After sealing the PVDF membrane with 5% BSA solution at room temperature for 4 h, a mixture of 20 μL of the primary antibody (anti-His mouse monoclonal antibody for the target protein and anti-GAPDH mouse monoclonal antibody for the reference gene GAPDH) at a ratio of 1:
2000 was added into the same 5% BSA solution and incubated overnight at 4 °C. The PVDF membrane was washed three times with 40 mL of pre-cooled TBST buffer. Subsequently, the PVDF membrane was incubated with goat anti-mouse IgG(H + L) and HRP conjugate (diluted at 1
:
3000) in TBST buffer at room temperature for 3 hours. After incubation with the secondary antibody, the membrane was washed three times with buffer C. The PVDF membranes were washed three times with pre-cooled TBST buffer (Solarbio, China) and incubated for 50 s using the EasySee western-blotting kit (TransGen, China). Then protein expression was detected using an Azure Biosystems C280 chemiluminescent blot imaging system (USA). The gray values of protein bands were quantitatively analyzed using ImageJ.
All samples were sent to Novozymes (Tianjin, China) for data-independent acquisition (DIA) quantitative proteomics analysis. Significantly differentially expressed proteins were screened according to the criteria of an expression fold change of more than 1.2 times (up-regulated more than 1.2 times or down-regulated less than 0.83 times) and P-value <0.05. Proteomic DIA sequencing data were obtained from three biological replicates. Raw data files were searched against the Baker's yeast (S. cerevisiae ATCC 204508/S288c) reference proteome database downloaded from UniProt (UP000002311, 6060 sequences, 20211018) and supplemented with QJX58297.1(NCBI), which is a frequently observed contaminant. At least two peptides were required for protein identification, of which at least one peptide was required to be unique in the database. Identified proteins were quantified with MaxQuant's LFQ algorithm. Proteomic samples were tested using Novo, and the experimental results were processed and analysed using Novogene (Novogene, Tianjin, China).
The feeding batch fermentation conditions were the same as those used for shaking bed fermentation, with a seed inoculation amount of 10% (v/v) in the fermentation tank. Subsequently, 200 mL of the seed culture was transferred to 1.8 L of YPD medium containing 2% (w/v) glucose, 1% (w/v) yeast extract, and 2% (w/v) peptone and inoculated at an OD600 of 1. The initial culture conditions were set at 300 rpm with a stirring-dissolved oxygen linkage mode to maintain the dissolved oxygen at 30%. The temperature was maintained at 30 °C. Oxygen was supplied to the fermentation tank by supplying 2 vvm of gas, and the pH was adjusted to 6.0 using 5 M NaOH. In addition, adding nitrogen sources to batches could increase the fermentation biomass and late accumulation of taxadiene. Glucose was used as the carbon source, and forty-four hours before fermentation, 500 g L−1 glucose was added to the fermentation tank to maintain its concentration below 5.0 g L−1. A nitrogen source containing 1% (w/v) yeast extract was added every 8 h. After 44 hours of fermentation, the temperature was lowered to 20 °C and 20% (v/v) n-dodecane was added for two-phase extraction fermentation, switching the carbon source from glucose to ethanol. Cell growth and glucose and ethanol concentrations were continuously monitored throughout fermentation. The organic layer was selected for taxadiene analysis. After centrifugation at 12000 rpm for 10 min, an appropriate amount of anhydrous sodium sulphate was added to remove the water. After filtering through an organic 0.22 μm filter to remove impurities, the sample could be directly used for GC-MS analysis.
The split ratio for the GC-TOF/MS analysis of taxadiene and GGOH was set to 50:
1. The column temperature was initially held constant at 70 °C for 1 min, followed by a ramping rate of 30 °C min−1 to reach 180 °C, which was maintained for an additional minute. Subsequently, the temperature was increased to 265 °C at a rate of 10 °C min−1 and held steady for a duration of 4 minutes. The total run time encompassed a period of precisely 19.95 minutes. Taxadiene identification relied on mass fragments with m/z values of 109 and 122, with peak elution occurring at approximately 12.24 min. In contrast, GGOH detection involved mass fragments with m/z values of 69, 93, and 119 (Fig. S1†). The peak retention time corresponded to approximately 13.46 min.25 The standard curve of GGOH (Solarbio) was constructed as shown in Fig. S2.†
After fermentation, the organic phase containing taxadiene products was collected by centrifugation. The taxadiene sample in the organic phase was purified using modified silica gel flash chromatography, followed by concentration under reduced pressure at room temperature to obtain taxadiene. After 1H NMR and 13C NMR testing, comparison with literature carbon spectra confirmed the isolated compound as taxadiene (Fig. S3, S4 and Table S4†). And the purity of the prepared taxadiene was up to 98.45%, which was confirmed by gas chromatography according to the peak area normalization (Fig. S5†). Finally, the purified taxadiene was regarded as the standard to construct a standard curve to determine the taxadiene yield (Fig. S6†).
Meanwhile, the collected yeast cells of strain yZCL080 were washed, cleaved, purified by nickel affinity, and identified by SDS-PAGE. The control group included the expression and purification of the TS gene in E. coli BL21(DE3), and the SDS-PAGE results showed an obvious intact band of TS, which was consistent with the theoretical molecular weight (MW) of TS (MW: 93KD) (Fig. 2D). While in S. cerevisiae, SDS-PAGE results for TS protein showed three distinct cleaved bands (MW: 53KD, 35KD and 15KD) besides the intact bands of TS (Fig. 2D). Therefore, it is speculated that the TS protein expressed in S. cerevisiae (TS-S.C) may be degraded. The four protein bands of the TS-S.C sample were analyzed by protein mass spectrometry, and the residue sequence of each protein band sample matched the continuous specific sequence of the target protein TS within a certain range, which suggested that these protein samples should contain fragments of TS sequences. However, the low coverage scores of the residue sequence of the four protein band samples of TS-S.C were obtained by matching the full-length sequence of the TS protein, and the coverage scores were only 39.4%, 15.5%, 7.5% and 8.7% respectively (Fig. S7†). This result may be due to the unsatisfactory purity of SDS-PAGE protein bands for the TS-S.C sample. Therefore, further validation by western blotting (WB) is required, and the results of WB confirmed the phenomenon of TS protein cleavage in the expression system of S. cerevisiae. In Fig. 2E, four distinct protein bands are observed, including an intact TS band and three cleaved bands, and the proportion of the intact TS band is only 13.7% (intact TS/TS-S.C). In our opinion, the abnormal expression of TS will limit the synthesis of taxadiene in S. cerevisiae which may be the key to the low production of taxadiene in S. cerevisiae.
Compared to prokaryotic expression systems (e.g. E. coli), S. cerevisiae has more abundant post-protein modification and degradation systems.31 Moreover, TS is a heterologous complex protein with fused αβγ multi-domain structures and a class I terpene cyclase catalytic motif (DXDD), which is even more unfavourable for its correct folding and expression in S. cerevisiae.32,33 For the above reasons, the heterozyme TS introduced in S. cerevisiae may be easily degraded by proteases.
To investigate the impact of the four proteases (CYM1, PEP4, PRB1, and YAP3) on the expression of taxadiene synthase and taxadiene synthesis, single-deletion strains were generated by disrupting the endogenous proteases CYM1, PEP4, PRB1, and YAP3 individually using homologous fragments containing a LEU2 nutritional marker (Fig. 3B). As shown in Fig. 3C, the taxadiene yield was increased to 39.2 mg L−1 in the PRB1 protease knockout (ΔPRB1) strain yZCL101, with approximately 97% higher than that of the control strain (yZCL080). Subsequently, we also inserted the t60TS gene into the multi-copy plasmid PRS426, then introduced it into the PRB1 protease knockout chassis and constructed the ΔPRB1 strain yZCL110. As shown in Fig. 3D, the taxadiene yield was further increased to 78.4 mg L−1 in the ΔPRB1 strain yZCL110, which was approximately 74% higher than that of the control strain (yZCL107) with the multi-copy plasmid of the t60TS gene. In addition, we also evaluated the expression levels of the TS protein in the ΔPRB1 strain yZCL110 and the control strain yZCL107 by western blotting assay (Fig. 3E). The expression level of the intact TS protein was significantly improved by knocking out the endogenous protease PRB1 of S. cerevisiae, even though the cleavage of TS was not completely eliminated (Fig. 3E). And the relative intensity analysis showed that the expression of the intact TS protein increased by 371% in PRB1 protease single knockout chassis, compared with that in the control strains (Fig. 3F), which resulted in a significant increase in the production of taxadiene. Moreover, the quantification of the expression of TS by RFP is consistent with the above results (Fig. S8†). Therefore, PRB1 protease knockout will promote the stable expression of taxadiene synthase and increase the yield of taxadiene in S. cerevisiae.
To further characterise the effects of PRB1 protease knockout on taxadiene biosynthesis, we performed proteomic analysis of the expression of key enzymes in the MVA pathway and taxadiene synthase in the ΔPRB1 strain yZCL110 and the control strain yZCL107. We first conducted differential analysis of proteins closely related to product synthesis pathways in S. cerevisiae (Fig. S9†). The introduction of a heterologous terpenoid synthesis pathway may lead to acetyl-CoA deficiency. Moreover, acetyl-CoA may be involved in cofactor balance and redox balance in S. cerevisiae.37 GO enrichment analysis (Fig. 4A) revealed significant changes in important biological processes. Enhanced carbohydrate metabolism generates energy and provides intermediates for biosynthesis. Additionally, enhanced redox processes are likely related to an increased acetyl-CoA supply, achieving a balance between anabolism and catabolism (Fig. S10†). The heatmap (Fig. 4B) shows that enzymes related to glycolysis were upregulated in the PRB1 protease-knockout strain, consistent with acetyl-CoA being mainly derived from glycolysis and taxadiene biosynthesis, potentially draining metabolic flux from glycolysis. Compared to the control, tHMG1, ERG20 and BST1 were endogenous proteins of S. cerevisiae, and their expression was relatively stable without significant improvement. The abnormal expression of the TS protein may be more closely related to the PRB1 protease. TS expression was enhanced in the PRB1 knockout strain, alleviating the key problem of low TS expression in S. cerevisiae (Fig. 4C).
Although PRB1 protease knockout can enhance the expression of genes related to the taxadiene synthesis pathway and taxadiene synthase, the excess taxadiene precursor GGPP still accumulates, producing large amounts of the by-product GGOH. Therefore, increasing the TS protein expression in the PRB1 knockout strain to reduce the accumulation of GGOH may have a positive effect on the taxadiene yield. Endogenous proteases are essential for the maintenance of normal cell growth and viability. Environmental and intracellular factors can cause DNA damage. Proteases play a crucial role in DNA repair.38 The ubiquitin–proteasome system plays an important role in the DNA repair pathway in S. cerevisiae.39 Abnormal folding of proteins occurs because of folding failure and mutations in coding genes during protein translation. These abnormally folded proteins are ubiquitinated and eventually degraded by the proteasome, thereby ensuring the stability of S. cerevisiae.40 Therefore, several protease-knockout modifications could not be performed. Subsequently, the expression of the TS protein should be further improved.
N-terminally truncated HMG1 (tHMG1) effectively relieves feedback inhibition and increases the production of monoterpenes, sesquiterpenes, and triterpenes in yeast.43 Ohto et al. promoted GGPP synthesis by fusing the FPP synthase ERG20 with BTS1 while reducing the loss of the ERG20 catalytic product FPP to the squalene synthesis pathway.44 After TS multi-copy integration, we constructed a precursor enhancement expression module, GGPP (Fig. 5C), and gradually integrated the GGPP precursor synthesis module into the HO and TKL2 sites of the high-producing PRB1 strain yZCL124, successfully constructing the GGPP module-enhanced strains yZCL136 and yZCL137. The results showed that with an increase in the copy number of the GGPP precursor module, the taxadiene yield increased significantly. The taxadiene yield was increased by 113.9% from 131.97 mg L−1 to 282.4 mg L−1 gradually (Fig. 5D). The highest taxadiene titer in S. cerevisiae was obtained. Simultaneously, with the enhanced GGPP synthesis pathway, the yield of GGOH increased from 76.9 mg L−1 to 110.4 mg L−1.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4gc03079f |
This journal is © The Royal Society of Chemistry 2024 |