Weixin
Tang‡
a,
Shi-Hui
Dong‡
b,
Lindsay M.
Repka
a,
Chang
He
a,
Satish K.
Nair
*bc and
Wilfred A.
van der Donk
*ab
aDepartment of Chemistry and Howard Hughes Medical Institute, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave., Urbana, IL 61801, USA. E-mail: vddonk@illinois.edu; Fax: +1 217 244 8533; Tel: +1 217 244 5360
bDepartment of Biochemistry, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave., Urbana, IL 61801, USA
cCenter for Biophysics and Computational Biology, University of Illinois at Urbana-Champaign, 600 S. Mathews Ave., Urbana, IL 61801, USA. E-mail: s-nair@life.illinois.edu; Fax: +1 217 244 5858; Tel: +1 217 333 0641
First published on 2nd September 2015
The final step of lanthipeptide biosynthesis involves the removal of leader peptides by dedicated proteases. In vitro characterization of LicP, a class II LanP protease involved in the biosynthesis of the lantibiotic lichenicidin, revealed a self-cleavage step that removes 100 amino acids from the N-terminus. The 2.35 Å resolution crystal structure provides insights into the active site geometry and substrate specificity, and unveiled an unusual calcium-independent maturation mechanism of a subtilisin family member. LicP processes LicA2 peptides with or without post-translational modifications, but dehydrated and cyclized LicA2 is favored. Investigation of its substrate specificity demonstrated that LicP can serve as an efficient sequence-specific traceless protease and may have great utility in basic research and biotechnology. Encouraged by these findings for LicP, we identified 13 other class II LanPs, ten of which were previously unknown, and suggest that these proteins may serve as a pool of proteases with diverse recognition sequences for general traceless tag removal applications, expanding the current toolbox of proteases.
Fig. 1 Biosynthesis of class II lanthipeptides. (A) Biosynthetic pathway of class II lanthipeptides with lichenicidin β (Licβ) shown as an example. The precursor peptide LicA2 is dehydrated and cyclized by the class II lanthipeptide synthetase LicM2. Then, the leader peptide is removed in a two-step process by LicT and LicP (see Fig. S1†). Also shown is the structure of the α component that together with Licβ makes up the two-component lantibiotic lichenicidin. Obu, 2-oxobutyryl group resulting from hydrolysis of an N-terminal Dhb; Abu, α-aminobutyric acid. (B) Dehydration and cyclization reactions that take place during the lanthipeptide post-translational modification process. For the sequence of the leader peptide, see Fig. S1.† |
Lanthipeptides are currently classified according to the synthetases that install the thioether cross-linked amino acids (Fig. S1 in the ESI†).12 Compared to the well-characterized lanthionine synthetases, the proteases responsible for leader peptide removal are much less studied.14–22 Two types of proteases have been reported for the maturation of class I and II lanthipeptides (Fig. S1†) – the subtilisin-like serine protease LanP employed by both classes and the papain-like cysteine protease domain of the LanT transporter protein involved exclusively in the biosynthesis of class II lanthipeptides.14–20 The cysteine protease domain is located at the N-terminus of LanT proteins and typically cleaves its substrate after a double Gly-type motif (GG/GA/GS).17,18 In contrast to LanT protease domains, the sequence specificity of subtilisin-like LanP proteins remains mostly elusive. Thus far, two class I LanP proteases (NisP and ElxP) have been heterologously expressed and reconstituted in vitro with their native substrates,23–25 and the substrate specificity has been characterized in vitro for ElxP. The substrate specificity of class II LanPs, which fall into a different phylogenetic clade,23 has not been explored.
The biosynthesis of only a few class II lanthipeptides involves LanP proteases.12 These enzymes remove a short N-terminal oligopeptide after a LanT protein detaches the majority of the leader peptide at a double Gly-type cleavage site. For example, LicP is an extracellularly located serine protease expressed by some strains of Bacillus licheniformis and is required for the production of the two-component lantibiotic lichenicidin (Fig. 1A).20 After installation of the thioether rings in the precursor peptides LicA1 and LicA2, LicT removes the leader peptide of modified LicA1 to generate Licα. LicT also cleaves off the majority of the leader peptide from modified LicA2 to generate NDVNPE-Licβ (hereafter LicA2′) (Fig. S1†).20 The maturation of Licβ requires one more cleavage step outside the cell, where LicP trims off the six remaining amino acids at the N-terminus of LicA2′ (Fig. S1†).
In this study, we describe the in vitro characterization of LicP, demonstrate that the protein self-cleaves, and show that it recognizes a specific cleavage sequence but otherwise is very tolerant of the peptide sequence. In addition, prompted by the lack of structural information for class II lanthipeptide leader proteases, we solved a 2.35 Å resolution structure of the enzyme, revealing an unanticipated intramolecular activation strategy that stabilizes the mature form of the enzyme in a calcium-independent manner. LicP has potential utility for sequence-specific proteolysis because it does not leave a scar.3,26 Encouraged by these observations, we identified ten new class II LanP proteins with diverse recognition sequences, which may expand the current protease toolbox for specific removal of expression tags in protein chemistry or leader peptides in RiPP biosynthesis.
Such proteolytic processing has been reported for several extracellular class I LanP proteases and was suggested to be autocatalytic14,15 like other subtilisins.28 To test whether this mechanism applied to the class II enzyme LicP, we substituted the predicted catalytic Ser376 with Ala. Unfortunately, His6-LicP-25-433-S376A was expressed almost exclusively in the insoluble fraction. We were able to obtain a very small amount of soluble His6-LicP-25-433-S376A, demonstrating that indeed the proteolytic cleavage after Glu100 was abolished (Fig. S2†). The observation that an inactivated LicP was expressed as the full length protein indicates that the cleavage event is catalyzed by LicP rather than E. coli proteases. Our findings mirror those of a very recent report on a protease that has sequence homology with the lanthipeptide protease EpiP. This homolog employs an autocatalytic mechanism of cleavage between Lys98 and Thr99.29 Using His6-LicP-25-433 and its S376A mutant, we showed that the proteolysis can take place intermolecularly, albeit slowly (Fig. S3†). To obtain a form of LicP with the pro-sequence still covalently attached, we substituted Glu100 at the native cleavage site with Ala. Surprisingly, His6-LicP-25-433-E100A was again expressed and purified as two fragments (Fig. S4†). MS analysis showed that the resulting fragments corresponded to a shifted cleavage site from residue Ala100 to Glu102 (Fig. S4†). Further mutation of Glu102 to Ala abolished the production of soluble protein.
We further investigated whether the enzyme displays a preference for modified or linear LicA2. Liquid chromatography-based kinetic analysis of the concentration dependence of the cleavage reactions was hampered by the poor solubility of the LicA2 and Licβ peptides. Instead, we monitored the proteolysis of linear LicA2 and dehydrated and cyclized LicA2 at one concentration by HPLC. The time dependence of the disappearance of the substrates and appearance of the leader peptide clearly shows that modified LicA2 is a much better substrate than linear LicA2 (Fig. S7†). Modified LicA2 (100 μM) was consumed by 0.4 μM LicP in 5 min, corresponding to a turnover rate of at least 50 min−1, whereas it took much longer for consumption of 100 μM linear LicA2 (Fig. S7†). We also employed a competitive MALDI-TOF MS assay at lower substrate concentration, in which LicP was supplied to an equimolar mixture of modified and linear LicA2 and the production of leader peptides was monitored over time. In order to differentiate the otherwise identical leader peptides after proteolysis, we introduced a Pro to Gly mutation between the hexa-histidine tag and the precursor peptide in linear LicA2 (G-LicA2). The leader peptides obtained by complete proteolysis of equimolar amounts of modified and linear LicA2 exhibited comparable signal intensities when monitored by MALDI-TOF MS, confirming that the Pro to Gly mutation does not alter the ionization efficiency significantly (Fig. 3B). LicP was incubated with an 800-fold excess of modified and linear LicA2 (i.e. enzyme:combined substrates = 1:1600), and MALDI-TOF MS analysis illustrated complete consumption of modified LicA2 peptide within 10 min, corresponding to a rate of at least 80 min−1, whereas the cleavage of linear LicA2 only started after the modified LicA2 had been consumed and required more enzyme to be completed (Fig. 3B and S7†). Collectively, our observations indicate that although both are substrates for the enzyme, LicP prefers modified LicA2.
The experiments with linear and modified LicA2 as well as NDVNPE-ProcA1.7, NDVNPE-NisA and MBP-BamL demonstrated that LicP tolerates Dhb, Thr, Ser and Ile in the P1′ position. To further evaluate its tolerance, we altered the P1′ position in NDVNPE-NisA from Ile to eight other amino acids (Gly, Cys, Thr, Leu, Phe, Trp, Glu and Lys; Fig. S10†). All these mutants were accepted by LicP as in all cases removal of the NisA leader peptide was observed when substrate peptides were supplied in 30 to 300 fold excess over the enzyme (Fig. S11†). The highest activity was observed when the P1′ position was occupied by Thr or Cys. The latter is noteworthy because traceless removal of tags in front of Cys is highly valuable for cysteine-based native chemical ligation-type chemistry.35,36 The removal of the NisA leader peptide in front of a Gly or Ile residue was slightly less efficient, but complete consumption of precursor peptides was still observed. NDVNPE-NisA analogs with Trp, Leu, and Lys at the P1′ position were also accepted by LicP, although some substrate still remained after 30 hours of incubation. Peptides with Glu or Phe in the P1′ position turned out to be poor substrates. Collectively, these results show that LicP serves as a sequence-specific protease for non-native substrates and that its activity is highly portable with respect to the P1′ position (Table 1).
Substrate sequences | P and P′ positions | [LicP] | [Substrate] | Incubation time at RT | Complete reaction |
---|---|---|---|---|---|
a Performed at 4 °C. RT: room temperature. | |||||
LicP-95-105 | NTAVNE|TESVI | — | — | — | Y |
LicP-E100A-97-107 | AVNATE|SVISG | — | — | — | Y |
Modified LicA2 | NDVNPE|DhbDhbPADhb | 0.4 μM | 100 μM | 5 min | Y |
Linear LicA2 | NDVNPE|TTPAT | 0.4 μM | 100 μM | 2 h | Y |
NDVNPE-ProcA1.7 | NDVNPE|TIGGT | 0.2 μM | 180 μM | 4 h | Y |
NDVNPE-NisA | NDVNPE|ITSIS | 2 μM | 65 μM | 30 h | Y |
MBP-BamL | NDVNPE|SGSEN | 0.5 μM | 50 μM | 12a h | Y |
NDVNPE-NisA-I1T | NDVNPE|TTSIS | 0.2 μM | 65 μM | 20 h | Y |
NDVNPE-NisA-I1C | NDVNPE|CTSIS | 0.2 μM | 65 μM | 20 h | Y |
NDVNPE-NisA-I1G | NDVNPE|GTSIS | 2 μM | 65 μM | 30 h | Y |
NDVNPE-NisA-I1W | NDVNPE|WTSIS | 2 μM | 65 μM | 30 h | N |
NDVNPE-NisA-I1L | NDVNPE|LTSIS | 2 μM | 65 μM | 30 h | N |
NDVNPE-NisA-I1K | NDVNPE|KTSIS | 2 μM | 65 μM | 30 h | N |
NDVNPE-NisA-I1F | NDVNPE|FTSIS | 2 μM | 65 μM | 30 h | N |
NDVNPE-NisA-I1E | NDVNPE|ETSIS | 2 μM | 65 μM | 30 h | N |
Fig. 4 Crystal structure of LicP. (A) Cartoon representation of the overall structure of LicP with the prodomain colored in green and the catalytic domain colored in purple and light blue (PDB ID: 4ZOQ). See also Fig. S12.† (B and C) Comparison of the structures of LicP (C) and subtilisin BPN′ (B) in the vicinity of the calcium-binding region, which in LicP is replaced by a hydrophobic pocket into which Trp111 from the linker region binds. In panel B, the residues numbering refers to the subtilisin (SBT) catalytic domain, whereas in panel C and elsewhere LicP numbering is based on the full length protein. (D) Omit electron density map superimposed on a model of LicP bound to a peptide substrate. The density corresponds to residual features of the linker into which the substrate has been modeled. |
The prodomain folds into a single compact domain consisting of four antiparallel β-strands interconnected with two helices. The length of the helices is larger than those found in subtilisin BPN′, and consequently, the LicP prodomain has only limited structural similarity to the prodomain of subtilisin BPN′ (Z-score = 5.7; RMSD of 2.4 Å over 57 aligned Cα atoms). The LicP prodomain has an overall fold that is more reminiscent of the PII proteins that facilitate signal transduction in response to build-up of primary metabolites (PDB code = 3DFE; Z-score = 8.8; RMSD = 1.6 Å over 60 aligned Cα atoms).39 In the subtilisin BPN′ structure, two long α-helices from the catalytic domain were proposed to stabilize the prodomain during the folding process.40 A similar interaction between the prodomain and the catalytic domain is preserved in LicP (Fig. 4A).
A second central feature in subtilisin maturation is that once the prodomain has been cleaved, the N-terminus of the catalytic domain is displaced to a high-affinity calcium-binding loop, where it provides additional residues for coordinating the metal (Fig. 4B).41 In the LicP crystal structure, this calcium-binding site is not conserved, and the long loop that comprises this site in subtilisin BPN′ is shortened. Additionally, the helix that precedes this loop is lengthened in LicP, and these rearrangements result in the formation of a hydrophobic pocket. In the mature form of LicP observed in the crystal structure, the excised N-terminus of the catalytic domain (Val104 through Leu117) is situated in the same place as the calcium-binding site, and Trp111 from this linker inserts into this hydrophobic pocket (Fig. 4C). Several hydrophobic residues, including Leu123, Phe125, Leu161, Met193, Thr349, and Tyr369, define the contours of this pocket (Fig. 4C), and insertion of Trp111 into this cavity creates favorable van der Waals interactions that may be a necessary step in LicP maturation. Perhaps the observed difficulty of obtaining soluble mutants of LicP that prevent proteolytic processing can be attributed to the inability of Trp111 to access this hydrophobic pocket.
All eight crystallographically independent copies of LicP in the asymmetry unit contain spurious but persistent electron density in the vicinity of the active site (Fig. 4D), which may be due to partial occupancy of the cleaved linker within the active site. Because of the paucity of well-defined features, no attempts were made to model residues from the cleaved linker region into this density. Nonetheless, we utilized the contours of this density to derive an informed model of how LicP may interact with a NDVNPE peptide substrate (Fig. 4D). According to this model, the specificity for the P1 Glu of the substrate peptide is dictated by interactions with the side chains of Ser272 and His291 of LicP, as well as weak interactions with the hydroxyl groups of Thr238 and Tyr241. Likewise, a pocket that consists of Leu209, Ile219, and Thr238 appears to dictate the P4 specificity for a small hydrophobic residue such as Val. Lastly, the Asp at the P5 position of the substrate is positioned to make ionic interactions with Lys212 of LicP. The paucity of additional interactions with the enzyme is consistent with the tolerance of LicP for the linker that connects the prodomain and the catalytic domain (sequence NTAVNE) and its substrate peptide (NDVNPE).
The substrate specificity of LicP was further evaluated by an SDS-PAGE-based assay monitoring the time-dependent cleavage of mutants of linear LicA2. The assay was first validated by analyzing samples by both HPLC and SDS-PAGE (Fig. S7†), which showed that the gel-based assay provides a good read-out of relative activity. The presence of a Glu at the P1 position was critical for LicP activity as LicA2-E–1A was not a substrate under the assay conditions and even substitution with structurally related amino acids Gln and Asp was not tolerated (Fig. S13 and S14†). The importance of the P5 position was tested by mutating Asp to Lys, Ala or Val. We found that all three P5 mutants were processed much more slowly, suggesting that the P5 position of the substrate is also important for LicP's activity (Fig. S14–S16†). The P4 position of LicA2 was substituted with three hydrophobic amino acids of varying size, Ala, Leu and Phe, and with a polar amino acid, Asn. LicA2-V–4A, LicA2-V–4L and LicA2-V–4N were still accepted by LicP with slightly reduced cleavage efficiency (Fig. S16 and S17†). However, LicA2-V–4F was not processed (Fig. S15†), indicating that only relatively small amino acids are tolerated at the P4 position of the substrate. The P2 and P3 positions (Pro and Asn, respectively) are not critical for LicP recognition as alanine substitution at both sites did not alter the processing efficiency of LicP significantly (Fig. S18†). The observations that P1 plays a critical role and that D–5V and V–4N mutants are still processed by LicP, albeit at a slower rate, are consistent with the shift in proteolysis site for LicP(-E100A) resulting in cleavage after AVNATE (Table 1). Collectively, our biochemical data are in good agreement with the model based on the LicP crystal structure with respect to the substrate specificity.
Fig. 5 Representative class II lanthipeptide biosynthetic gene clusters containing LanP genes. Genes with unknown functions are indicated with X. For substrate peptides and their proposed recognition sequences, please see Fig. S19.† Clusters are annotated using the standard lanthipeptide biosynthesis nomenclature: LanM proteins catalyze the dehydration and cyclization reactions, LanA peptides are the lanthipeptide precursors, LanT proteins are transporters with an N-terminal Cys protease domain, LanEFGHI proteins are immunity conferring proteins, LanJ are dehydrogenases, and LanHR are regulatory proteins. The cytolysin cluster is annotated differently: the substrates are CylLL and CylLS, CylB is a transporter with a protease domain, and CylA is the class II LanP. |
Comparing the two class I LanP proteases for which a structure has been reported (NisP and an EpiP homolog)24,29 with the class II LicP protein in the current study shows that there are similarities and differences. Both the EpiP homolog and LicP purified with the prodomain non-covalently bound to the catalytic domain. Furthermore, neither the prodomain nor the catalytic domain of the EpiP homolog could be obtained individually,29 which we also observed for LicP, suggesting that prodomain binding and/or the folding process induced by prodomain cleavage is critical for obtaining a stable protein. Conversely, NisP was obtained as an active protein in the absence of the prodomain, which is also not present in its reported structure.24 Although mechanistic details for protease maturation have not been described for either NisP or the EpiP homolog, structural considerations provide a plausible framework. The hydrophobic cavity stabilized by Trp111 in the LicP structure is enclosed by a long loop in the catalytic domains of NisP and the EpiP homolog, which is absent in LicP. This loop superimposes upon the calcium-binding site of subtilisin BPN′, which is missing in both the NisP and EpiP homolog structures. Notably, amino-terminal residues from the N-terminus of the catalytic domain that is generated following proteolysis buttress this loop, suggesting that stabilization of the loop may be involved in class I LanP protease maturation.
In addition to its physiological role of trimming a 6-residue oligopeptide from LicA2′, LicP removed the entire leader peptides from modified and linear LicA2. An estimated substrate consumption rate of 50–80 min−1 was observed for LicA2 with the post-translational modifications installed, indicating LicP is a highly efficient protease. NisP, the class I LanP responsible for the maturation of the lantibiotic nisin, in vivo accepts modified NisA as its substrate but not the linear version.44,45 Our observations suggest that LicP also favors modified substrate in vitro. Such substrate selectivity by LanP proteins is beneficial in the lanthipeptide producing organism as it prevents undesired consumption of unmodified precursor peptides.
Although LicP favors modified LicA2 over linear LicA2, which indicates that post-translational modifications in the core peptide contribute to LicP's substrate recognition in addition to the NDVNPE sequence, our observations with substrate analogs demonstrate its application as a sequence-specific protease for traceless removal of leader peptides and an MBP expression tag. The substrate specificity of LicP was identified using both structural information and biochemical characterizations. The P5, P4 and P1 residues of LicA2 were found to be important for LicP recognition. These three sites were also suggested as the origin of specificity for a class I LanP, ElxP, as determined by kinetic analysis based on LC quantification.23 The similarity in the important positions may suggest a general substrate recognition mechanism by the entire subtilisin-like LanP family.
The thermostability of subtilisin BPN′ and related proteases is enhanced significantly in the presence of calcium ions, which are necessary for maturation, and subsequent stabilization of a large loop in the catalytic domain.46 However, the calcium dependence constitutes a drawback for industrial utility, e.g. in detergents containing metal chelators. Therefore, much effort has been spent on engineering thermostable mutants of subtilisin that function in a calcium-independent manner.47,48 The structural and biochemical analysis of LicP reveals an elegant solution to this limitation, as maturation and subsequent stabilization of the enzyme is facilitated not by metal ions, but rather by the insertion of Trp111, liberated by cleavage of the linker between the prodomain and the catalytic domain, into a hydrophobic pocket located in the same vicinity as the calcium-binding site in subtilisin BPN′. The recent structure of the class I lanthipeptide protease NisP also demonstrated loss of a calcium binding site, although unlike the structure of LicP, the prodomain was not present in the NisP structure and the alternative mechanism of stabilization in the absence of a calcium binding site was not discussed.24 Based on structural comparisons with LicP, we speculate that stabilization of the region in the catalytic domain corresponding to the calcium-binding site of subtilisin BPN′ by the new N-terminus after proteolytic processing may prove to be a general mechanism for LanP maturation.
Over the past several decades, the toolbox of useful proteases has been significantly enlarged. Several proteases with strict recognition sequences have been commercialized for biochemical or industrial applications, including Factor Xa, enterokinase, and TEV protease. Factor Xa and enterokinase exhibit P1 preferences similar to trypsin and cleave after an Arg or Lys. TEV protease recognizes a larger motif and exhibits better reliability in terms of specificity, but TEV protease requires either a Gly or Ser at the P1′ position for efficient cleavage. LicP is complementary in that it specifically cleaves after a Glu in the NDVNPE sequence, and is quite tolerant of various residues in the P1′ position. LicP accepted a range of residues at the P1′ site (Table 1) such as glycine, small polar residues (Ser, Thr, Cys) and large aliphatic residues (Ile). LicP also processed peptides with Leu, aromatic (Phe, Trp) and charged (Lys, Glu) residues at the P1′ position albeit with reduced efficiency. Also noteworthy is the ability to cleave adjacent to post-translationally modified residues such as Dhb. In previous work, commercial proteases have shown considerable reluctance to cleave next to such a post-translationally modified amino acid.31–33,49 Additional favorable properties include the enzyme's stability demonstrated by the persistent activity of LicP after 12 weeks at 4 °C, and no obvious decrease in activity after multiple rounds of freeze–thaw procedures.
This study also identified ten new class II lanthipeptide gene clusters containing lanP genes, suggesting they are more widely distributed than previously expected.12 Although the putative recognition sequences of these newly identified LanPs show a certain level of homology, they also exhibit considerable diversity. Similar to other proteases, most of these LanPs are predicted to cleave after charged residues such as arginine, glutamate or aspartate, but a few appear to cleave after unusual P1 residues such as histidine or alanine that are rarely the site of cleavage for other proteases (Fig. S19†). We confirmed the predicted sites for two examples that have a His and an Arg in the P1 position (Fig. S20†). Hence, this naturally occurring protease family may serve as a basis to construct a general protease pool for traceless tag removal purposes.5,50,51
Footnotes |
† Electronic supplementary information (ESI) available: Molecular biology procedures, protein expression and purifications, procedures for enzyme assays, supplementary figures and tables. See DOI: 10.1039/c5sc02329g |
‡ These authors contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2015 |