Tyler R.
Lambeth
a,
Zhefu
Dai
b,
Yong
Zhang
b and
Ryan R.
Julian
*a
aDepartment of Chemistry, University of California, Riverside, California, 92521, USA
bDepartment of Pharmacology and Pharmaceutical Sciences, School of Pharmacy, Department of Chemistry, Dornsife College of Letters, Arts and Sciences, Norris Comprehensive Cancer Center, and Research Center for Liver Diseases, University of Southern California, Los Angeles, California, 90089, USA. E-mail: ryan.julian@ucr.edu
First published on 17th February 2021
Cathepsin B is an important protease within the lysosome, where it helps recycle proteins to maintain proteostasis. It is also known to degrade proteins elsewhere but has no other known functionality. However, by carefully monitoring peptide digestion with liquid chromatography and mass spectrometry, we observed the synthesis of novel peptides during cathepsin B incubations. This ligation activity was explored further with a variety of peptide substrates to establish mechanistic details and was found to operate through a two-step mechanism with proteolysis and ligation occurring separately. Further explorations using varied sequences indicated increased affinity for some substrates, though all were found to ligate to some extent. Finally, experiments with a proteolytically inactive form of the enzyme yielded no ligation, indicating that the ligation reaction occurs in the same active site but in the reverse direction of proteolysis. These results clearly establish that in its native form cathepsin B can act as both a protease and ligase, although protease action eventually dominates over longer periods of time.
Fig. 1 The active site of catB shown with a bound substrate inhibitor in ribbon representation (left) and molecular surface representation (right). The occluding loop and catalytic cysteine are shown in white, and the substrate is shown in blue. His110 and His111 from the occluding loop bind to the C-terminus of a substrate to favor carboxypeptidase activity. Images constructed from PDB ID:1CSB. |
In addition to the lysosome, catB is found in a variety of cellular locations where it is involved in proteolysis. One secondary role is the activation of enzymes such as trypsin, β-Galactosidase, and renin after synthesis of the respective proenzymes.4 In antigen presenting cells, catB is thought to be involved in pruning sequences along with other cathepsins in endosomes before selection by the major histocompatibility complex (MHC).5 CatB has also been implicated in apoptosis, where its release into the cytosol after lysosomal membrane permeabilization cleaves the apoptotic protein Bid into its active form.6 Another role catB plays outside the lysosome is degradation of the extracellular matrix, often to promote tumor growth in cancer.7 Like other lysosomal proteases, catB is most active at acidic pH, however it can function at near-neutral pH in the cytosol.8 Due to the loss of salt-bridges required for stabilization of the occluding loop, catB operates primarily as an endopeptidase at higher pH. These secondary roles for catB demonstrate its versatility but do not reveal any known function outside proteolysis.
While peptide bonds are facile to cleave using proteases or chemical methods such as acid hydrolysis,9 synthesis of new bonds in aqueous solution is a more difficult challenge. Due to their high stability, carboxylic acids are not naturally reactive enough for amines to attack and spontaneously form a peptide bond. Instead targets must be modified into a more reactive form first. Biologically, new peptides are formed in the ribosome using an active ester as a target for a nucleophilic amine attack, binding the N-terminal amine from an amino acid to the carbonyl from the C-terminus of the growing peptide chain.10In vitro methods commonly exploit this same type of ester–amine reaction, most famously in the solid-phase peptide synthesis and NHS-ester reactions.11 Ester–amine reactions are non-specific to the termini however, and unlike ribosomal synthesis which controls the reaction sterically, any free amine such as a lysine can form an incorrect linkage. For this reason, these reactions require careful protection and subsequent de-protection of the side-chains to increase the reaction efficiency. Native chemical ligation is an alternative to standard ester-based reactions, utilizing a thioester and a cysteine instead.12 It has the advantage of operating orthogonally to canonical amino acids, so no protection of peptide side-chains is required. However, this method still requires chemical modification to produce a reactive C-terminal thioester group which is not found naturally in peptides, and the reaction is also limited to sites with a cysteine residue.
In biology, ligation of existing peptides is performed by enzymes known as ligases, which catalyze connections between the N-terminus and C-terminus of peptide fragments.13,14 These ligases are rare however, and typically highly specific in their native function, as in the case for Butelase 1 which is involved in antigen preparation within the immune system.15 Some proteolytic enzymes with known ligase activity have been engineered to increase the favorability of the ligation reaction.16 For example, mutagenesis of several residues in the active site of subtilisin lead to a significant increase in ligation efficiency when used in combination with activated esters.17,18 A trypsin variat has also been recently engineered into an efficient ligase known as Trypsiligase, although it is limited to a specific sequence motif for coupling.19 In both of these enzymes, modified substrates containing esters or thioesters are used to favor a higher yield of ligation products.20 These ligases have been used for site-specific reactions such as the labelling of free N-termini in complex samples because ligases do not produce side-products at lysine residues as typically occur in chemical additions.21,22
Herein, we examine incubations of catB with synthetic peptide sequences via liquid chromatography and mass spectrometry (LC–MS) to evaluate its ability to perform previously unreported ligation reactions between peptide substrates. Ligated products were detected in multiple incubations, and we explored the sequence specificity for the reaction by substituting amino acids with different functional groups. Additional experiments using a mutated and non-proteolytic form of the enzyme revealed the role of the active site residues and provide further information about the ligation mechanism.
While this behavior has previously gone unnoticed in catB, dual protease/ligase ability has been observed in other enzymes such as Subtiligase, Butelase 1, and the enzyme Legumain.23 Legumain participates in the MHC antigen cleaving pathway but can also ligate cyclic peptides that are involved in plant defense mechanisms. Unlike other dual-activity enzymes however, Legumain ligation operates through a reactive succinimide not directly associated with the proteolytic residue. Instead, the succinimide lies directly adjacent to the histidine of the catalytic triad, which allows it to share the same active site as the proteolytic machinery. For catB, ligase activity appears to target dipeptides such as MR, which may suggest involvement of the same occluding loop and catalytic site responsible for exopeptidase action. Ligase behavior would simply constitute reversal of the protease action as illustrated in Scheme 1.
To elucidate whether the internal removal and ligase activity occurs in one step or multiple steps, two synthetic sequences, APSWFDTGLSEMR and GPSWFDTGLSEGR were incubated with catB as shown in Fig. 3a. While a one-step process would yield the original sequences minus the internal fragment, a two-step process could lead to scrambling of the C-terminal appendages. The color-coded sequences shown in Fig. 3 confirm that scrambling occurred, suggesting that the two C-terminal amino acids are first removed and released, followed by subsequent reattachment following removal of LSE. This illustrates not only that the process occurs in two steps, but also that it is not sequence-specific to the MR portion of the original APSWFDTGLSEMR peptide. Quantitation of the homogeneous ligated products which are comprised of only the original sequence versus the heterogeneous products comprised of mixed sequences is shown in Fig. 3b. The amount of homogeneous ligation is significantly higher, perhaps due to the nearby localization of both necessary pieces after proteolysis.
To further explore the sequence specificity, modifications were made to the C-terminal residue to determine if it operates as a key factor in the ligase activity (see Fig. 4, derived from data in Fig. S2, ESI†). The sequences APSWFDTGLSEMR and GPSWFDTGLSEGR were incubated with the synthetic dipeptides GK and GA, to evaluate whether arginine or any basic residue is required. In this incubation, ligation was detected for both the lysine and alanine substitutions, indicating that neither arginine nor a basic residue is a necessary element in ligation. With this knowledge, an incubation of both dipeptides with a new sequence, VFFAEDVGSNK, was performed and is shown in Fig. 5. In this incubation, the full sequence with appended dipeptide GA was detected, as well as the proteolytic fragment VFF with ligated dipeptides GA and GK in separate peaks. These fragments, especially the appended full sequence, demonstrate that the previous internal remove of LSE before ligation is a coincidental action resulting from competing protease activity, and that proteolysis is not required for ligase activity. The low abundance of these fragments also shows that while the ligase activity ostensibly works for a variety of sequences, it appears to be affected by substrate affinity.
Fig. 4 Incubation of APSWFDTGLSEMR and GPSWFDTGLSEGR with dipeptides GA and GK results in a series of heterogeneous and homogeneous ligation products. Raw LC chromatogram shown in ESI,† Fig. S1. |
Fig. 5 Extracted ion chromatogram for the incubation of VFFAEDVGSNK with CatB. Ligation is detected on proteolytic fragments as well as the full intact sequence. |
This illustrates why ligase activity may have gone unnoticed previously, as the initial incubation substrate APSWFDTG appears to have a high affinity for ligase activity, allowing for easy detection while other ligation products are harder to detect. The competing protease activity also serves to break down any ligated products, and by the end of typical incubations lasting 12 hours or more, these sequences have been fully digested, erasing the evidence. No fragments larger than two residues were attached to the C-terminus of a peptide in any incubation, further implying that the occluding loop may play a role in ligation.
Experiments with isomeric substitutions were conducted to further probe the limits of catB ligation. An incubation of catB with the L-isoAsp isomer of APSWFDTGLSEMR resulted in the ligated fragment FDTGMR, as shown in Fig. 6a. The identity of the fragment was confirmed with CID fragmentation as shown in Fig. S3 (ESI†). This ligation product is also obtained by removal of the internal LSE sequence, however this product does not appear in the incubation with the canonical sequence shown in Fig. 2, which does not show any abundant cleavage at the W–F bond. W–F cleavage results seemingly from the recognition of the isoAsp side-chain as a C-terminus. The altered connectivity of L-isoAsp reduces the side-chain length by one carbon while similarly lengthening the backbone connection. This new orientation produces a structural analogue of a normal C-terminal carboxylate group, as illustrated in Fig. 6b. When this C-terminal mimic is bound by catB, the F residues from APSWFTGLSEMR sit in the pocket normally reserved for the last two residues of a sequence, and cleavage occurs at the W–F bond as it would to remove a dipeptide. This unintended behavior has not been previously reported for catB but has been noted for other carboxypeptidases.24
The ability of catB to recognize L-isoAsp as a C-terminal residue suggests that it could also serve as a site for ligation. In order to explore this possibility, two sequences were incubated together with catB, ADAG and FTG, modelled after the ligated fragment from Fig. 6a. Indeed, after a period of incubation followed by LC–MS analysis, the ligated fragment ADAGFTG was identified in a small quantity as shown in Fig. 6c. This peptide is the first ligation product that does not correspond to attachment of a simple dipeptide. In this case, the first two residues of FTG take on the role of a dipeptide and are ligated together with ADAG. These results reveal that a peptide with an L-isoAsp in the second position can be attached to another peptide regardless of the size limitation normally imposed by the occluding loop. As L-isoAsp can be easily repaired to L-Asp by the repair enzyme Protein L-isoaspartyl methyltransferase,25 this method of utilizing L-isoAsp to ligate longer sequences could be used to produce longer canonical peptide sequences. While Butelase 1 has been previously shown to work on peptides with epimerized residues,26 this is the first report of a ligase recognizing an isoAsp residue. These specificities suggest that catB could be optimized as a complement to N-terminal ligases, since only dipeptides and isoAsp-modified peptides will be added to the C-terminus.
The implied participation of the occluding loop by preferential ligation of dipeptides (or dipeptide mimics) suggests that the same active site may be used for both proteolysis and ligation in catB. To probe whether the same active site is required for ligation, incubations were performed with a mutated form of catB. The C29A mutant lacks the catalytic cysteine required for proteolysis, as demonstrated previously.27 Incubation of catB-C29A with APSWFDTG and the dipeptide MR is shown in Fig. 7a. This incubation did not yield any ligated APSWFDTGMR products when investigated by LC–MS analysis. Within the same timeframe, wild type catB was able to produce the necessary fragments (APSWFDTG and MR) via proteolysis of APSWFDTGLSEMR, and then ligate them together again into the target peptide APSWFDTGMR shown in Fig. 7b. The results in Fig. 7b also represent a replicate of those shown in Fig. 2a, with the exception of a completely different source of catB. Overall, the results indicate that the proteolytic cysteine is necessary for ligation and provide evidence that ligation operates by reversal of the proteolytic mechanism.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0cb00224k |
This journal is © The Royal Society of Chemistry 2021 |