 Open Access Article
 Open Access Article
Xinya Hemu a, 
Xiaohong Zhanga, 
Giang K. T. Nguyenb, 
Janet Toa, 
Aida Serracd, 
Shining Looa, 
Siu Kwan Szea, 
Chuan-Fa Liu
a, 
Xiaohong Zhanga, 
Giang K. T. Nguyenb, 
Janet Toa, 
Aida Serracd, 
Shining Looa, 
Siu Kwan Szea, 
Chuan-Fa Liu a and 
James P. Tam
a and 
James P. Tam *a
*a
aSchool of Biological Sciences, Nanyang Technological University, 637551, Singapore. E-mail: jptam@ntu.edu.sg
bWIL@NUS Corporate Lab, MD6 Centre for Translational Medicine, Wilmar International Limited, National University of Singapore, 117599, Singapore
cIMDEA Food Research Institute, +Pec Proteomics, Campus of International Excellence UAM+CSIC, Old Cantoblanco Hospital, 8 Crta. Cantoblanco, Madrid 28049, Spain
dIproteored – Instituto de Salud Carlos III (ISCIII), Campus UAM, Cantoblanco, Madrid 28049, Spain
First published on 30th June 2021
Butelase-1, an asparaginyl endopeptidase or legumain, is the prototypical and fastest known Asn/Asp-specific peptide ligase. It is highly useful for engineering and macrocyclization of peptides and proteins. However, certain biochemical properties and applications of naturally occurring and recombinant butelase-1 remain unexplored. Here we report methods to increase the yield of natural and bacterial expressed recombinant butelase-1 and how they can be used to improve the stability and activity of two important industrial enzymes, lipase and phytase, by end-to-end circularization. First, the yield of natural butelase-1 was increased 3-fold to 15 mg kg−1 by determining its highest distribution which is found in young tissues, such as shoots. The yield of recombinantly-produced soluble butelase-1 was improved by promoting cytoplasmic disulfide folding, codon changes, and truncation of the N-terminal pro-domain. Natural and recombinant butelase-1 displayed similar ligase activity, physical stability, and salt tolerance. Furthermore, the processing and glycosylation sites of natural and recombinant butelase-1 were determined by proteomic analysis. Storage conditions for both forms of butelase-1, frozen or lyophilized, were also optimized. Cyclization of lipase and phytase mediated by either soluble or immobilized butelase-1 was highly efficient and simple, and resulted in increased thermal stability and enhanced enzymatic activity. Overall, improved production of butelase-1 can be exploited to improve the biocatalytic efficacy of lipase and phytase by end-to-end cyclization. In turn, ligase-improved enzymes could be a general and environmentally friendly strategy for producing more stable and efficient industrial enzymes.
Our laboratory has a long-standing interest in chemoenzymatic ligation to form peptide bonds through a proximity-driven O/S–N acyl migration mechanism, which is shared by both chemical and enzymatic peptide ligation.14–21 We recently discovered butelase-1, a prototypic peptide asparaginyl ligase (PAL).22 PALs belong to the C13 subfamily of cysteine proteases (EC 3.4.22.34), which are well-represented by legumains, also known as asparaginyl endopeptidases (AEPs).23 AEPs cleave the peptide bond after an Asn/Asp (Asx) residue. In contrast, PALs catalyse peptide bond formation after an Asx. PALs recognize the simple tripeptide motif Asx–Xaa–Yaa and form a new Asx–Zaa peptide bond either intramolecularly to give a cyclized product or intermolecularly to give a linear ligated product.
End-to-end or N-to-C cyclization of the peptidyl backbone is an accepted approach for improving protein stability which minimizes proteolytic degradation by exopeptidases and reduces the flexibility of less-structured N- and C-terminal ends.24 However, the molecular size and complexity of enzymes make cyclization by chemical methods challenging. On the other hand, cyclizing a proteinaceous enzyme could be efficiently accomplished by a peptide bond forming-ligase, such as butelase-1 because of its exquisitely high site-specificity.22,25,26 Currently, only a few examples of enzyme-mediated cyclization of industrial enzymes have been reported. They include N-to-C cyclization of phytase by a self-splicing intein27 and by a SpyLigase forming an isopeptide bond.28 In both examples, intein and SpyLigase are co-expressed with the target enzyme as a fusion protein, rendering them non-reusable, in contrast to free-standing and reusable butelase-1.
Here we report the preparation, characterization and use of the natural and recombinant butelase-1 to improve the thermostability and activity of two industrial enzymes by N-to-C cyclization. This work aims to provide a simple, efficient and environmentally friendly ligase-mediated approach to meet the growing needs for improving industrial enzymes. In addition, we show the high level of natural butelase-1 in young plant tissues and the optimized storage condition of butelase-1.
The amount of butelase-1 in various crude tissue extracts was estimated to range from 20 to >60 mg kg−1 by comparing the ligase activity of tissue extracts with purified natural butelase-1 (nBu1) in the cyclization of kB1-HV. Fig. 1A shows that young tissues, such as shoots, displayed the highest level of butelase-1 activity, followed by flowers and 1-week-old pods; aged tissues, such as the 3-week-old pods, produced the lowest ligase activity. It is worth noting that most tissues produced a low level of Asn-specific protease activity, which yielded <5% linear kB1 at pH 6.0. The only exception was old leaves, which produced an equal amount of cyclic and linear kB1. An increasing amount of hydrolysed product (linear kB1) in old leaves suggested high expression level of AEPs, which may be associated with protein degradation and cell death in old tissues.29,30
Since shoot extracts contained the highest ligase activity, these tissues were used to isolate nBu1 instead of pods. The previous isolation protocol31 was simplified to a 3-step chromatography process that includes anion exchange flash chromatography, anion-exchange fast protein liquid chromatography (FPLC), and size exclusion FPLC (Fig. S2†). In addition, the ammonium sulfate precipitation step that leads to butelase-1 degradation was omitted and polyvinylpolypyrrolidone was added to prevent enzyme oxidation by phenolic compounds in plant tissues.32 Fig. 1B shows the purity of the isolated butelase-1 by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). These accumulative improvements resulted in an isolation yield of 15 mg kg−1 of nBu1 from young shoots, a 3-fold increase compared to the previous protocol.31
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 38 to facilitate formation of three conserved disulfide bonds, which significantly reduced the amount of insoluble form produced. Second, the natural cDNA sequence of butelase-1 was used to construct rBu1(ΔSP) with its signal peptide substituted by a His6 tag instead of the previous codon optimization strategy which gave mainly misfolded insoluble products. It is likely that codon-optimized RNA fold into unexpected structures that affect translation efficiency39 although another group has reported bacterial expression of butelase-1 using a codon-optimized sequence.35 In the present case, rBu1(ΔSP) afforded >10 times higher expression level than the codon-optimized sequence Opt-rBu1(ΔSP-Ubi) which contains a His6-ubiquitin tag to replace its signal peptide (Fig. 2B and S3†). Third, the first five residues (IRDDF) were deleted from the pro-domain to derive the truncated construct rBu1(Δ1–25) which further improved expression levels (Fig. 2B). Fourth, acid-induced auto-activation was required to activate rBu1 proenzyme by removing both the cap domain which served as an inhibitor of the active core domain and parts of the N-terminal pro-domain. Using the optimized auto-activation condition of pH 4.0, at 37 °C for 2 h, a single band corresponding to the active form was obtained and the cap domain was completely degraded (Fig. 3A and S4†).
38 to facilitate formation of three conserved disulfide bonds, which significantly reduced the amount of insoluble form produced. Second, the natural cDNA sequence of butelase-1 was used to construct rBu1(ΔSP) with its signal peptide substituted by a His6 tag instead of the previous codon optimization strategy which gave mainly misfolded insoluble products. It is likely that codon-optimized RNA fold into unexpected structures that affect translation efficiency39 although another group has reported bacterial expression of butelase-1 using a codon-optimized sequence.35 In the present case, rBu1(ΔSP) afforded >10 times higher expression level than the codon-optimized sequence Opt-rBu1(ΔSP-Ubi) which contains a His6-ubiquitin tag to replace its signal peptide (Fig. 2B and S3†). Third, the first five residues (IRDDF) were deleted from the pro-domain to derive the truncated construct rBu1(Δ1–25) which further improved expression levels (Fig. 2B). Fourth, acid-induced auto-activation was required to activate rBu1 proenzyme by removing both the cap domain which served as an inhibitor of the active core domain and parts of the N-terminal pro-domain. Using the optimized auto-activation condition of pH 4.0, at 37 °C for 2 h, a single band corresponding to the active form was obtained and the cap domain was completely degraded (Fig. 3A and S4†).
|  | ||
| Fig. 3 Proteomic characterization of nBu1 and activated rBu1. (A) Determination of auto-activation sites and N-glycosylation site. N403*, estimated cleavage site based on intermediate band size and LC-MS/MS sequencing data. (B) Coomassie blue-stained SDS-PAGE gel of nBu1 and the activated rBu1. nBu1, calculated molecular weight (MW) = 32 kDa, observed MW ≈ 38 kDa. rBu1, calculated MW = 31 kDa, observed MW ≈ 37 kDa. (C) Detection of Asp164 in the soluble butelase-1 by LC-MS/MS sequencing in contrast to the Snn164 in the crystal structure of rBu1 superimposed with other AEPs and PALs. Left panel shows one example of MS/MS sequencing of tryptic digested butelase-1 fragments (image obtained by PEAK Studio after DB search and PTM analysis). Right panel includes structures with PDB codes: butelase-1 (6DHI), butelase-2 (6L4V), human legumain–cystatin E/M complex (4N6O), AtLEGγ (5OBT), HaAEP1 (6AZT), and VyPAL2 (6IDV). | ||
Fig. 3A summarizes the processing sites of both nBu1 and rBu1. The major processing sites of both are at Gln37 in the pro-domain and Asn331 in the linker region (Fig. S5†). In 38 kDa-nBu1, additional cleavage also occurred at Asp40 or Asn41 in the N-terminus and Asn322 or Asp323 in the C-terminus. Sequencing of the 43 kDa-rBu1 intermediate suggested that the processing of the rBu1-proenzyme involve an additional step in the cap domain, most likely occurring at Asn403 located after the α8 helix. These data agreed with previous reports that the processing of butelase-1 is similar to other PALs and AEPs, which involve multiple steps and sites on both ends.33,37,41,42
LC-MS/MS analysis revealed the major difference between nBu1 and rBu1, which is the presence of post-translational N-linked glycosylation at Asn94 of nBu1 (Fig. 3A). The N-linked glycans on Asn94 of nBu1 included a complex glycan core (1216 Da) and several truncated forms including Hex3HexNac2 (892 Da) and a single hex-N-acylation (203 Da) (Fig. S6†). Consequently, a 1–2 kDa difference was observed by SDS-PAGE between ∼38 kDa-nBu1 and ∼37 kDa-rBu1, although they share the same amino acid sequence. As such, the active forms of butelase-1, either natural or recombinant, migrated slower in SDS-PAGE gel due to the negative net charge (isoelectric points <5.0),43 which produces a 6–7 kDa mass increase in the gel (Fig. 3B). These results agree with our previous study showing that nBu1 are glycosylated and can be non-covalently immobilized on concanavalin A beads and the immobilized nBu1 can be reused for >100 times without diminishing its catalytic activity.22,44
Thus far, 31 crystal structures belonging to 12 different AEPs and PALs have been determined (Protein Data Bank, Table S1†), 21 of which show a succinimide (Snn) at the conserved Asp164 (butelase-1 numbering) preceding the catalytic His165. Snn, also known as aspartimide, is a dehydrated Asp derivative which forms a 5-membered ring with its neighbouring amino acid, and is particularly facile in Asp–Gly and Asp–His sequences.45 The presence of Snn164–His165 in the catalytic site of AEPs and PALS found in crystal structures of different mammalian and plant proenzymes and active forms has been proposed to play a catalytic role in the religation of cap and core domains at basic pH to re-form an AEP zymogen.46,47 However, our proteomic analysis show that no Asp164-to-Snn conversion (−18 Da by dehydration) was detected in either nBu1 or rBu1 (Fig. 3C and S7†). No mass of iso-Asp (c ion +57 Da or z ion −57 Da) was determined in the MS/MS spectra of any Asp164-containing fragments as demonstrated by DeGraan-Weber et al.48 Since the Snn164 is not found in both nBu1 and rBu1 soluble enzymes but found in crystal structure of rBu1, we speculated that the dehydration of Asp164 commonly observed in the crystal structures of AEPs and PALs may occur under specific conditions, such as exposing to a high-energy source in the crystallization–determination procedure, rather than a post-translational modification.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 1000 in a series of conditions was used, including eight different pH (4.5 to 8.0), and three temperatures (25, 37, and 42 °C). Reactions were quenched after 5 min, and the product profiles analysed using MALDI-TOF MS (Fig. S8†) showed that rBu1 is equally as potent as nBu1. Fig. 4A shows that both forms display optimal catalytic activity at pH 6–6.5 and 37–42 °C to give 75–80% cyclized cGN14 product within 5 min. At 25 °C, pH 7 was found to be optimal for yielding 40% cGN14. No hydrolytic product GN14 (1532 Da) was observed at pH 5.5–8.0. However, at a pH lower than 5.5, trace amounts (<3%) of the hydrolytic product GN14 were observed after 10 min, consistent with the auto-proteolytic ability of butelase-1 at an acidic pH of 4 to 4.5.
1000 in a series of conditions was used, including eight different pH (4.5 to 8.0), and three temperatures (25, 37, and 42 °C). Reactions were quenched after 5 min, and the product profiles analysed using MALDI-TOF MS (Fig. S8†) showed that rBu1 is equally as potent as nBu1. Fig. 4A shows that both forms display optimal catalytic activity at pH 6–6.5 and 37–42 °C to give 75–80% cyclized cGN14 product within 5 min. At 25 °C, pH 7 was found to be optimal for yielding 40% cGN14. No hydrolytic product GN14 (1532 Da) was observed at pH 5.5–8.0. However, at a pH lower than 5.5, trace amounts (<3%) of the hydrolytic product GN14 were observed after 10 min, consistent with the auto-proteolytic ability of butelase-1 at an acidic pH of 4 to 4.5.
To examine the thermal stability of nBu1, rBu1, and the rBu1-proenzyme, thermal shift assays were performed at pH 5–7.5 using enzyme solutions prepared with six different reaction buffers. Fig. 4B shows that the thermal stability of rBu1 and nBu1 is similar and pH-dependent. The highest melting temperature (Tm) was at 52 °C and pH 6.0, while the lowest Tm was at 38–40 °C and pH 7.5. The pH-dependent thermal stability of butelase-1 correlates with its pH-dependent catalytic activity, and provides an explanation for the shift of optimal condition from pH 7.0 at 25 °C to pH 6.0 at 42 °C (Fig. 3). Similar to other PALs,44 the rBu1-proenzyme with Tm at 55–57 °C is more stable and less pH-sensitive than the active forms. The Tm of nBu1 in salt-free reaction buffers was about 1–5 °C higher than the previously reported Tm obtained using buffers containing 0.1 M NaCl,44 suggesting that butelase-1 may be destabilized by salt. The effect of ionic strength was then evaluated by comparing the enzymatic activity in 20 mM sodium phosphate buffers containing varying amounts of NaCl from 0 to 1.5 M. The addition of salt exerted an inhibitory effect: butelase-1 activity decreased with an increasing concentration of NaCl. Only 20% butelase-1 catalytic activity remained in the presence of 1.5 M NaCl, but enzymatic activity was restored by removing NaCl (Fig. 4C). Overall, the absence of surface glycan in E. coli-produced rBu1 does not affect its stability or function.
We also explored conditions for butelase-1 lyophilization. Freeze-drying of proteins involved two main denaturing stresses, freezing stress due to the formation of dendritic ice crystals, increased ionic strength, changed pH, or phase separation and drying stress due to the removal of the protein hydration shell.49 Since the freeze–thaw cycle of butelase-1 caused only about 5% reduction in activity without protectants, the drying phase during lyophilization was expected to be the main cause of activity loss. 16 different conditions include common lyoprotectants disaccharides (sucrose or trehalose), detergents (Tween-20), and bulking reagents (glycine) were examined (Table 1).50 In the absence of additives, <5% activity was recovered upon reconstitution. In contrast, the presence of additives generally improved recovered ligase activities, with a combination of 12% sucrose and 0.01% Tween-20 being the best condition, recovered 93% activity.
| No. | Sucrose (%) | Trehalose (%) | Tween-20 (%) | Glycine (M) | Activity (%) | 
|---|---|---|---|---|---|
| 1 | 0 | 0 | 0 | 0 | <5 | 
| 2 | 0 | 0 | 0.01 | 0 | 8 | 
| 3 | 6 | 0 | 0 | 0 | 72 | 
| 4 | 12 | 0 | 0 | 0 | 81 | 
| 5 | 12 | 0 | 0.1 | 0 | 45 | 
| 6 | 12 | 0 | 0.05 | 0 | 75 | 
| 7 | 12 | 0 | 0.01 | 0 | 93 | 
| 8 | 24 | 0 | 0.1 | 0 | 60 | 
| 9 | 24 | 0 | 0.05 | 0 | 75 | 
| 10 | 24 | 0 | 0.01 | 0 | 49 | 
| 11 | 0 | 10 | 0.1 | 0 | 78 | 
| 12 | 0 | 10 | 0.05 | 0 | 82 | 
| 13 | 0 | 20 | 0.1 | 0 | 65 | 
| 14 | 0 | 20 | 0.05 | 0 | 79 | 
| 15 | 12 | 0 | 0.01 | 0.1 | 54 | 
| 16 | 12 | 0 | 0.05 | 0.2 | 33 | 
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 500 Bu1-to-lipase ratio at pH 6.5 at room temperature.
500 Bu1-to-lipase ratio at pH 6.5 at room temperature.
To circularize phytase, Bacillus phytase PhyC, a well-characterized phytase was selected.53,54 The crystal structure of PhyC reveals a 29 Å gap between the N- and C-termini.55 To produce a cyclic phytase, nine residues were added to the N-terminus, starting with a ligation-promoting dipeptide (Met–Leu), containing a hydrophobic Leu at the P2′′ position to facilitate butelase-1 recognition,22 followed by a His6-tag to facilitate purification (Fig. S11†). The butelase-1 tripeptide recognition signal NHV was added to the C-terminus. Fig. 5B shows that the bacteria-expressed 370-residue linear precursor ML-His6-phytase-NHV was cyclized by nBu1 in 90% yield within 30 min with a Bu1-to-phytase molar ratio of 1![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) :
:![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 100 at pH 6.0 at 37 °C in the presence of 2 mM CaCl2. The success in cyclization between Asn and Met supported the earlier observation that butelase-1 has broad substrate tolerance to the P1′′ residue which could be any amino acids except Pro.22
100 at pH 6.0 at 37 °C in the presence of 2 mM CaCl2. The success in cyclization between Asn and Met supported the earlier observation that butelase-1 has broad substrate tolerance to the P1′′ residue which could be any amino acids except Pro.22
The thermal stability of linear and circular enzymes were examined by a thermal shift assay using SYPRO Orange. Lipase T1.2 is thermophilic with a high melting temperature of 65 °C. Since the N- and C-termini are close to each other and flexible, the effect of cyclization on thermal stability is mild that resulted in 1 °C increase in melting temperature (Fig. S12A†). Nevertheless, the esterase activity of lipase could be significantly improved by cyclization. In a colorimetric assay measuring the lipase-mediated hydrolysis of p-nitrophenyl palmitate (pNPP),56 circular T1.2 produced 35% more hydrolytic product p-nitrophenol (λmax = 405, yellow colour) than the linear T1.2 at the optimal reaction condition (70 °C, pH 9) (Fig. 5C).
The melting temperature of circular PhyC increased 6 °C in the salt-free buffer and 10 °C in the presence of 0.2 M NaCl (Fig. S12B†). This improved thermostability of circular PhyC resulted in an improved tolerance against heat treatment, which is a common disinfection practice in feed and food industries. We treated both linear and circular PhyC at 50–95 °C for 5 min followed by renaturation at room temperature for 1 h. A colorimetric phytase activity assay was performed by determining the amount of inorganic phosphate released from phytate via measuring the formation of phosphomolybdate at 700 nm.57 In the post-treatment activity test performed at 50 °C, circular PhyC retained at least 75% activity. In contrast, linear PhyC lost more than half of its activity after heating at 75–90 °C (Fig. 5D).
The major fundamental significance is the discovery that butelase-1 is present in every plant tissue parts tested, and particularly high in young tissues. Since butelase-like ligases are responsible for the maturation of host defence cyclotides, their abundant occurrence in young tissues is in agreement with their defence role in young and vulnerable tissues.58–61 Moreover, using shoots as the source to increase the isolation yield to 15 mg kg−1 is a viable approach to obtaining already activated peptide ligases. Glycosylation of nBu1 gives an additional advantage for immobilization on lectin columns.
The major practical significance is that this study provides the first example of a reusable ligase-mediated cyclization of industrial enzymes, lipase and phytase. Furthermore, the results show that either free-standing or immobilized butelase-1 can be used. The N-to-C circularized enzymes gain increased stability, heat tolerance, and activity, which are needed for both production and pollution management. This butelase-mediated cyclization represents a simple, efficient, and environmentally friendly enzyme-improving-enzymes approach that is applicable to other important industrial proteins.
| Footnote | 
| † Electronic supplementary information (ESI) available: Materials and methods, supplementary tables and figures. See DOI: 10.1039/d1ra03763c | 
| This journal is © The Royal Society of Chemistry 2021 |