Gabriele Antonio
Zingale
*a,
Irene
Pandino
a,
Daniele
Trivellato
b,
Dario
Cavaterra
c,
Francesca
Munari
b,
Giuseppe
Grasso
d,
Peter A.
Bell
e,
Francesco
Oddone
a,
Alessio
Bocedi
c,
Massimiliano
Coletta
a,
Michael
Assfalg
b,
Mariapina
D'Onofrio
b,
Grazia Raffaella
Tundo
af and
Diego
Sbardella
*a
aIRCCS Fondazione Bietti, Rome, Italy. E-mail: gabriele.zingale@fondazionebietti.it; diego.sbardella@fondazionebietti.it
bDepartment of Biotechnology, University of Verona, Strada Le Grazie 15, 37134 Verona, Italy
cDepartment of Chemical Sciences and Technologies, University of Rome Tor Vergata, 00133 Rome, Italy
dChemical Sciences Department, University of Catania, Viale Andrea Doria 6, 95125, Catania, Italy
eDepartment of Oral Biological and Medical Sciences, University of British Columbia, Vancouver, Canada
fDepartment of Clinical Sciences and Translational Medicine, University of Rome Tor Vergata, 00133 Rome, Italy
First published on 20th August 2025
The ubiquitin proteasome system is a critical regulator of proteostasis and shows altered activity and composition in neurodegenerative diseases affecting both the brain (e.g., Alzheimer's disease) and the retina/optic nerve (e.g., age-related macular degeneration, glaucoma). A common feature of neurodegeneration is the progressive accumulation of amyloidogenic proteins such as beta-amyloid and tau protein (MAPT gene). There is compelling evidence that the aggregation propensity of tau protein is regulated by post-synthetic modifications including phosphorylation and ubiquitylation. These alterations are gaining increasing pathological relevance not only for brain tauopathies but also for the retinal/optic nerve degenerative diseases. In this regard, site-specific mono-ubiquitylated (Ub) tau proteoforms, have been recently identified in neurodegenerative brains. In this work, the cleavage patterns of the uncapped 20S proteasome acting on mono-Ub regio-isomers of tauK18, which covers the 4RD domain, have been unveiled by using SpectraSage, a novel proteomics software conceived for the MS1 identification of complex branched peptide and here introduced for the first time. Ub position was found to affect regio-isomers susceptibility to proteolysis and unexpectedly long Ub-tauK18 branched peptides have been identified, proving distinct catalytic preferences. These findings show that the 20S digests mono-Ub proteins through specific enzymatic mechanisms and the implications of the latter on neurodegeneration are discussed.
Within the PN framework, degradation and turnover of proteins is primarily regulated by a major intracellular proteolytic pathway called Ubiquitin Proteasome System (UPS).4–8
The UPS operates through two consecutive steps: (a) ubiquitin (Ub) conjugation to a substrate protein via an E1–E2–E3 enzymatic cascade; (b) degradation of the ubiquitylated substrate by the 26S proteasome.9,10
Ubiquitylation is a major post-translational modification (PTM) that shows a diversified heterogeneity. Proteins can be decorated with mono-ubiquitylation (one Ub conjugated to a lysine [Lys, K] residue) and poly-ubiquitylation (a chain of two or more Ub molecules). Moreover, poly-Ub chains show diverse topologies (i.e., M1, K6, K11, K27, K29, K33, K48, K63) determined by the amino acid residue of the first Ub involved in the isopeptide bond with the next Ub along the chain.9,11
The Ub code dictates protein fate, with each distinct Ub configuration conferring unique biological outcomes.11,12 In this regard, efficient proteasomal degradation typically requires at least four K48-linked Ub moieties.12–14
The 26S holoenzyme comprises a 20S catalytic core particle and a 19S regulatory particle which recognizes the poly-Ub substrate and performs its ATP-dependent unfolding and translocation into the 20S for degradation.13–16 Indeed, the 20S is composed of four stacked rings (α7β7β7α7) surrounding a central narrow catalytic channel housing three proteolytic specificities: chymotrypsin-like (β5), trypsin-like (β2) and caspase-like (β1).17–19
Several studies have suggested that the 20S exists as a free uncapped particle and that it may have specific catalytic activities and biological roles.20–23 The 20S has been shown to degrade in vitro damaged and oxidized proteins as well as intrinsically disordered proteins (IDP).24,25 Moreover, the 20S was shown to degrade amyloidogenic proteins, including full-length tau in vitro and in cell,26,27 suggesting it plays crucial role in protein quality control regardless of ubiquitylation.20,24,25
A recent study has introduced the perspective that the human 20S cleaves a model mono-Ub substrate (i.e., cyclin B1) in vitro, generating branched fragments stretching across the iso-peptide bond and covering up to four C-terminal residues of Ub (i.e., -GG, -GGR, -GGRL).22
Since this catalytic property may introduce novel biological insights into the mechanisms of targeted proteolysis, herein we explored further this mechanism taking advantage of synthetic and semi-synthetic mono-Ub regio-isomers of the four-repeat domain of tau protein (MAPT gene), also known as tauK18. Altered turnover, dysregulated pattern of post-synthetic modifications and accumulation/aggregation propensities of tau protein show a prominent pathological role in different neurodegenerative conditions. They include primary and secondary tauopathies (e.g., Pick's disease, AD among the others), but also neurodegenerative processes of the retina and the optic nerve, such as AMD and glaucoma where retinal deposits of Ub-tau were found.2,26–28 The Ub-tau species (referred to as proteoforms throughout the manuscript) here investigated were tracked in post-mortem brain biopsies of subjects with different tauopathies and were reported to shape the conformational diversity of proteotoxic aggregates through a disease-specific pattern.28 Accordingly, the mono-Ub regio-isomer were described to modulate their assembly into filamentous aggregates and liquid condensates clarifying the impact of site-specific mono-Ub on structural and aggregating properties of tau.29–32
Herein, we performed biochemical and tailored proteomic studies which allowed us to discover that the uncapped 20S degrades Ub along with the substrate, providing mechanistic insights on this process, namely: (i) Ub is required for tauK18 catalysis and its position affects the susceptibility of tauK18 substrates to the h20S digestion; (ii) digestion of mono-Ub substrates generates Ub remnants on branched peptide products (hereafter referred to as Ub-branched fragments) much longer than initially thought and unidentifiable by current MS/MS approaches; (iii) the h20S has aminoacidic preferences different for tauK18 and Ub.
SpectraSage, a proteomic software that works at MS1 level and here introduced for the first time, was central to the realization of this study. It applies to enzymological studies (enzyme:substrate) in vitro, overcoming the limitations for the identification of complex, non-tryptic cross-linked peptides by conventional MS/MS analysis.
Moreover, we validated SpectraSage performance using a middle-down MS/MS workflow based on Tandem Mass Tag (TMT) proZero labelling of neo N-termini generated by the h20S digestion of Ub-tauK18 proteoforms.
Therefore, a time-course assay was carried out over 0.5 h, 1 h, 2 h and 3 h of incubation. As internal control, the same proteins were left in the absence of h20S (under the same experimental conditions) or fed to a h20S pre-incubated with epoxomicin, an irreversible inhibitor of chymotrypsin-like activity (IC50 ≈ 4 nM).33 Proteins at each time point were resolved by SDS-PAGE (CBB staining) (Fig. 1). The SDS-PAGE pattern indicated that h20S did not digest tauK18, since the band intensity (∼15 kDa) over time was comparable between the presence and the absence of proteasome (Fig. 1A).
Conversely, the electrophoretic pattern of Ub-tauK18 provided a different outcome (Fig. 1B).
In the presence of h20S, a progressive drop over time of the intensity of the band corresponding to Ub-tauK18 (∼25 kDa) was documented. This decrease was slowed down by epox (data shown in uncropped gel, see SI Fig. S4), suggesting that, unlike tauK18, Ub-tauK18 was a h20S substrate (Fig. 1C). A closer inspection of the SDS-PAGE pattern highlighted the absence of h20S digestion fragment. The lack of intermediate species with MW > 8.5 kDa (Ub MW) indeed suggested that Ub was proteolyzed along with tauK18.
Based on this evidence, we performed a pilot MS search of Ub fragments generated by the h20S. Effectively, along with several tauK18 fragments, we detected three distinct Ub proteolytic fragments covering the middle portion of Ub sequence (i.e.2QIFVKTL8, 11KTITLEVEPSD21, 22TIENVKAKIQD32 with XCorr = 0.98, 0.99 and 1.41, respectively) that were identified also by Sahu and co-workers.22
As observed for tauK18, the SDS-PAGE study indicated that tauΔC did not behave as a h20S substrate (SI Fig. S4), suggesting that Cys removal had no effects on tauK18 behaviour in solution in accordance with previous physico-chemical studies.29–31,34 Conversely, SDS-PAGE data reported in Fig. 2 highlighted a significant influence of Ub position along the tauK18 sequence on the rate of digestion of the individual proteoforms by the h20S.
Ub-tau254 turned out to be the h20S preferential substrate, exhibiting, compared to time 0 and the absence of h20S, a remarkable decrease of band intensity after 3 hours of digestion (60% of protein digested on average).
Ub-tau353 displayed a less pronounced but significant reduction over time in the presence of h20S (around 25% digested after 3 hours). By contrast, Ub-tau311 showed a low rate of degradation in the presence of h20S, suggesting that this proteoform was characterized by a greater resistance to h20S-mediated proteolysis, at least under the experimental conditions tested (Fig. 2C). All the regio-isomers proved stability in solution in the absence of h20S, whereas epox slowed down digestion (this can be appreciated for Ub-tau254, and Ub-tau353), envisaging that, like for Ub-tauK18, catalytic subunits others than the chymotrypsin-like were engaged in proteolysis (Fig. 2B). In accordance with SDS-PAGE studies on Ub-tauK18 no accumulation of proteolytic fragments was detectable in the low MW range of the gel (Fig. 2B).
A full description of the software principles is provided in the SI appendix, along with its validation using a calibrating solution of cytochrome c digest (supplied by Thermofisher Scientific). A summary schematic is provided in Fig. 3.
It is important to underline that the approach here proposed is tailored for digestion assays in vitro and universally applicable to any combination of protein/enzyme of interest. It is worth pointing out that a high degree of purity of both preparations is required to maximize the confidence of the output.
Briefly, SpectraSage operates by generating in silico and matching a customized protein-specific list of theoretical proteolytic fragments with the list of potential peptides effectively identified during the LC-MS run and deconvoluted by a standard algorithm (i.e., Xtract). The list of theoretical proteolytic fragments generated in silico includes all the possible combinations of peptides, either cross-linked or not, originating from two peptides/proteins joined through a covalent bond. Thereafter, the list of identified proteolytic fragments across a spectrum is matched to the theoretical ones. Coupled with high resolution instruments, SpectraSage allows for discrimination between peptides, either branched or not, with ambiguous masses. Whilst peptides with ambiguous masses are shown in the software output, greatest confidence should be given to unambiguous peptides.
Herein the following results are based only on peptides with unambiguous masses. Moreover, regardless of ambiguity, mass spectral intensities were normalized using the median intensities across the spectrum.35
In our case, the following results were previously filtered for peptides with no mass ambiguity at resolution 180000.
To this aim, .raw Orbitrap files were deconvoluted and analysed setting the following filtering criteria for identification of h20S digestion peptides, namely: (a) an increase of over 2-fold in intensity from time 0 to 3 hours of incubation in the presence of h20S (notably, h20S was added before harvesting the aliquot of sample at time 0 and immediately quenched); (b) the intensity of the proteolytic fragment in the presence of epox was less than 80% of the intensity of the same proteolytic fragment in the absence of the inhibitor; (c) identification of the peptides in at least two out of the three replicates.
SpectraSage analysis, reported in Fig. 3 and SI Fig. S2 and S3, highlighted notable different aspects along the pattern of digestion of Ub-tau254, Ub-tau311, Ub-tau353. Distinct proteolytic fragment size distributions were observed among the three regio-isomers. Specifically, Ub-tau311 evidenced the presence of several proteolytic fragments meeting the criteria (a–c) introduced above (n = 85). However, they all had quite low intensities and failed to densely populate the three-dimensional space (SI Fig. S2) defined by peptide molecular mass, tauK18 and Ub peptide lengths. Conversely, digestion of Ub-tau353 shared a similar quantity of proteolytic fragments (n = 106) but exhibited a considerably broader dispersion in the 3D space (SI Fig. S3), indicating an enhanced heterogeneity in the resulting proteolytic fragments.
Particularly striking is the scenario observed in the case of Ub-tau254 (Fig. 3). Despite presenting fewer proteolytic fragments (n = 48), compared to the other proteoforms, the identified peptides showed higher intensities and a well-distributed presence across the 3D space (Fig. 3B). This observation was interpreted as a confirmation of SDS-PAGE data about the h20S preference for Ub-tau254. Regardless of the effective rate of digestion, the following discussion is based on the top 15 proteolytic fragments, identified by their normalized intensity and reported in Fig. 3C, and SI Fig. S2c and S3c. Most notably, in accordance with one out of the main goals of SpectraSage development, the unprecedented identification of branched fragments of variable length was here achieved. Inspection of the results retrieved cleavage sites within the (branched) Ub predominantly located near residues 74 to 72 and 60 to 47 of Ub.
Fig. 4 provides compelling evidence for the proteolytic cleavage of Ub-tau254 by the h20S. The XICs clearly show a significant increase in the number of proteolytic fragments upon enzyme treatment, suggesting extensive processing of the protein with extensive release of branched peptide fragments. Although the overall rate of digestion was lower, branched fragments were documented also for Ub-tau311 and Ub-tau353 (SI Fig. S2 and S3, respectively). In particular, in the case of Ub-tau353, branched fragments were found to span from the -GG motif (residues76–75) to amino acids Asn60 and up to Lys33 of Ub. In the case of Ub-tau311, very few branched fragments were observed in the Top 15, with Ub cleaved in proximity of the 75th residue, and leaving a -GG remnant, or, else at Glu51.
While all the investigated regio-isomers released proteolytic fragments of tauK18 and Ub alone, as well as branched peptides, their identities were almost regio-isomer specific (Fig. 5B and C).
However, cleavage site analysis, which included fragments of tauK18 and Ub alone together with branched fragments, identified conserved h20S preferences for residues surrounding the scissile bond, typically denoted as P1, P1′, P2, P2′, and so forth, based on the nomenclature established by Schechter and Berger,36 which indicates the amino acids preceding (P1) and following (P1′) the cleavage site (Fig. 5A).
In fact, in the case of tauK18, amino acid residues including Val, Ser, Lys, and Gly showed the greatest frequency at positions P1 and P1′. Conversely, in the case of Ub a distinct preference for Leu residues, followed by Arg and Gly, was observed over the same catalytic positions.
It is important to recall, based on previous data, that h20S proteolysis is expected to release oligopeptides of tauK18 alone (tauK18-only), Ub alone (Ub-only), and Ub-tauK18 with branches of variable length (see SpectraSage results). With these premises, if the long Ub-tauK18 branched peptides were truly released by the h20S, the following conditions should be met:
(a) In the absence of trypsin digestion, largest identification of TMT labelled Ub proteolytic fragments covering the N-terminal portion of Ub rather than the C-terminal one. This is based on the assumption that, in branched peptides, the C-terminal portion of Ub is still anchored to the protein (tauK18 in this case) after h20S cleavage.
(b) Analogously, there must be an increased detection of TMT labelled Ub peptides covering the C-terminal portion of the protein in the presence (vs. absence) of trypsin. These originally branched peptides can be released by trypsin cut at the Arg residues on the C-terminal portion of Ub.
(c) Ideally, one or more peptides should be identified, showing the features discussed in the previous point and covering one out of the first two Arg cleavage sites of trypsin at the very C-terminus of Ub (Arg residues in 74th and 72nd position), otherwise not identifiable in the absence of trypsin.
In accordance with these considerations, we set up this assay for Ub-tau-254, based on its higher susceptibility to h20S digestion discussed above. Regarding the informatic pipeline, .raw files of samples not exposed to trypsin digestion were analysed by setting “no-enzyme” with unspecific digestion in PD panel, whereas .raw files of samples digested with trypsin were analysed by setting “trypsin” with “semi-tryptic” digestion search in the same panel. All additional features were left constant between methods and, to circumstantiate the findings, data reported below were filtered for: (i) peptides PSMs validated by the Fixed Validator node in PDv2.5, enriched for high and medium-confidence identifications, reflecting that over 75% of peptides had XCorr > 1.5; (ii) missed identification in the absence of h20S and identification with lower intensity (with respect to the presence of h20S) in the presence of epox.
Mass spectrometry analysis results are presented in Fig. 6 (comprehensive data is provided in the SI Data 1 and 2). The outcome of the analysis met the assumption discussed above (see points a–c) for which a point-by-point reply is provided:
(a) In the absence of trypsin different peptides covering the full length of Ub were documented; despite high sequence coverage, the -RLRGG motif remained undetectable, supporting the hypothesis that this remnant, or a shortened form, persists anchored to the tau sequence (Fig. 6A).
(b) In the presence of trypsin, a net increase in peptides covering the C-terminal portion of Ub was observed.
(c) In the presence of trypsin, we identified a proteotypic peptide (see Fig. 6A) (55TLSDYNIQKESTLHLVLR72) with a double TMT label at the N-terminus and at the ε-NH2 group of Lys, that was undetectable in the absence of trypsin.
As evident from Fig. 6B and the comprehensive peptide list in the SI (Data 1 and 2), numerous tauK18 peptides were identified (a complete list is provided in the SI Data 1 and 2). Notably, the presence of a tauK18 peptide decorated with a -GG remnant was detected exclusively in the presence of the h20S both in the presence and absence of trypsin digestion (XCorr = 1.86 and 2.13, respectively).
To confirm that branched peptides, including the peptide discussed above, are released also in the presence of a native isopeptide linkage, we set up and recapitulated the analytical workflow introduced for Ub-tau254 above, using Ub-tauK18 (enzymatic preparation).
Remarkably, in this case, the analysis (SI Fig. S6) retrieved an outcome very similar to that obtained for Ub-tau254 highlighting that:
(i) Based on TMT labelling, Ub and branched peptides were documentable also in the presence of a native isopeptide linkage.
(ii) The (55TLSDYNIQKESTLHLVLR76) peptide was identified in the presence of trypsin but it was undetectable in the absence of this further digestion step (a complete list is provided in the SI Data 3 and 4). Nonetheless, a manual inspection of SpectraSage outcome confirmed detection of this branched fragment in the presence of h20S which is expected to cover the tauK18 sequence covering residues Ser2-Asn47 (see SI Data 5).
(iii) Notably, the only -GG remnant detected in the absence of trypsin was in proximity of Lys254, indirectly confirming that, among the different mono-Ub regio-isomers of the enzymatic preparation, Ub-tau254 was the preferential substrate.
Importantly, a frequency heatmap of tau digestion peptides of Ub-tau254 (SI Fig. S7) identified by MS/MS, confirmed the catalytic preferences already highlighted by SpectraSage. Also, in the case of Ub, for which, compared to SpectraSage, a lower number of peptides was retrieved to compute an heatmap, Leu turned out to be robustly represented across P1 and P1′ positions (this outcome can be deduced by inspection of Fig. 6A).
Specifically, we report that, under the experimental conditions adopted:
(a) The h20S cleaves synthetic enzymatic (native isopeptide linkage) and (Ub conjugated by a disulfide bridge) proteoforms of Ub-tauK18, including three distinct regio-isomers (Ub-tau254, −311, −353) that were found to shape tau intracellular aggregates through a disease-specific pattern;28
(b) Unlike full-length tau, whose four-repeat domain was reported to be preferentially digested by t20S,26 we found that Ub is necessary for tauK18 cleavage; therefore, in the absence of ubiquitylation, it is likely that the four-repeat domain needs to be embedded into a longer sequence to become a h20S substrate. Alternatively, it is worth pointing out that we monitored digestion under reasonable steady-state conditions (i.e., large excess of substrate), using very low h20S concentration (10–50 nM) and over a limited time of incubation (3 h) to explore the mechanism under conditions of limited proteolysis and to maximize the patho-physiological translation of our findings;
(c) The position of Ub along the tauK18 sequence affects the catalytic preference. Our data support the preference of h20S for tauK18 with Ub tag placed at the N-terminus (Ub-tau254), followed by the C-terminus (Ub-tau353) and the middle portion (Ub-tau311). Importantly, this preference was seemingly confirmed also by digestion of the enzymatic Ub-tauK18 and by previous data on full-length tau,26 whose digestion fragments in most cases did fall within the R3 domain and were retrieved also by us. Indeed, in the case of Ub-tauK18, among peptides released by the h20S, a diGLY remnant was identified with a reasonable confidence only in correspondence of Lys254. Regarding the debated h20S preference for the N- over C-terminus of unstructured proteins, it is likely that a certain degree of protein specificity exists, and this study was not intended to clarify this feature;
(d) Ub is cleaved alongside the substrate generating branched fragments that extend beyond the C-terminal -GGRL motif described in the previous study by Sahu and co-workers.22 In this regard, digestion of all Ub-tauK18 regio-isomers released diverse Ub remnants on branched peptide products extending beyond the canonical four C-terminal residues and up to the 17th amino acid. The main implication of this finding is that the h20S is likely endowed with a pulling force sufficient to completely unfold the tightly packed Ub monomer;
(e) Digestion of Ub shows catalytic preferences different from that of tauK18. Specifically, in the majority of identified and quantified peptides a Leu residue in P1, P2 or P1′, P2′ position was required for Ub cleavage, whereas Gly, Ser and Lys residues were preferred in P1 or P1′ position for tauK18. The hypothesis regarding the catalytic preference is strengthened by an additional SpectraSage search carried out by removing the filter concerning the identification of peptides with lower intensity in the presence of epox. Thus, by removing this filter, a robust increase of peptides with Leu in P1 and P1′ position was documented also for tauK18. Since, at the concentration tested, epox is a weak inhibitor of trypsin-like (β2 subunit) and caspase-like (β1 subunit) activities, this observation is consistent with a role of these catalytic subunits, over the canonical chymotrypsin-like, in cleaving Ub. Interestingly, although the identity of tauK18 and Ub fragments (without branches) was in most cases regio-isomer-specific, the catalytic preferences were conserved.
Therefore, we suppose that distinct catalytic subunits drive the enzymatic processing of ubiquitylated substrates, revealing a mechanism where Ub and substrate enzymatic cleavage may be spatially and/or chronologically separated. As a matter of fact, h20S cleavage site preferences, particularly at P1 and P1′, remain controversial in literature. Studies have shown that catalytic subunits (β1, β2, β5) exhibit preferences for small, uncharged residues (e.g., Ala, Leu, Gly, Ser), while bulky or charged residues (e.g., Lys, Arg) are generally disfavoured due to interference with the catalytic mechanism.37–39 In particular, Leu is favoured due to its compatibility with hydrophobic pockets within the proteasome active sites, enhancing substrate binding and cleavage.10,40 While Lys and Arg are typically less preferred, exceptions exist where they contribute to substrate specificity through interactions with the active sites.41 Notably, neighbouring residues and substrate structural features significantly influence proteasomal cleavage specificity, as demonstrated by Nussbaum et al.42 and Groll et al.17
It is important to underline that SpectraSage has been fundamental to realize this study. SpectraSage is a proprietary software specifically designed to identify and quantify peptides separated by nanoUHPLC and analysed at MS1 level. This software was developed to advance in vitro enzymology studies (enzyme:substrate interactions) using MS and, remarkably, to overcome, at least for in vitro studies, the computational challenge associated with the identification of complex cross-linked peptides of non-tryptic origin, such as the branched Ub-peptides here identified. Moreover, for middle-down (or top-down) studies, SpectraSage shows key advantages including a less complex sample preparation and a rapid and automated bioinformatic analysis. In this regard, SpectraSage offers faster and automated data processing, eliminating the need for manual inspection of fragment tables and reconstruction of sequence coverage. Additionally, it automatizes the counting of amino acid occurrences at specific positions relative to putative cleavage sites.
Nonetheless, although MS1 analysis does not allow PSM validation, employing MS1-based precursor ion current quantification is a validated method, which is commonly used for middle-down experiments by virtue of its reliability and accuracy especially for these specific scientific aims.43
However, given the biological relevance that Ub digestion and release of long branched peptides may have in biology, we here adopted a TMT labelling workflow coupled with limited trypsin digestion (SI Fig. S6) to validate the SpectraSage ability and to identify, though indirectly, Ub proteolytic fragments and Ub-tau branched peptides. This workflow is based on labelling of free N-termini, such as those of peptides generated by h20S activity, before trypsin digestion. Moreover, TMT labelling of the ε-amino group of the Lys side chain introduces a missed cleaved site,44 thereby restricting the search to trypsin cut at Arg residues. This protocol turns out very useful for Ub conjugates, since two Arg residues lie in the very proximity of the diGLY (-GG) motif anchored to the substrate.
Remarkably, the TMT workflow has played a key role in validating the outcomes of SpectraSage analysis and identifies a prototypical branched digestion fragment (see SI Fig. S6). Based on the comparison with SpectraSage output, the proteolytic fragment may be coherent with a branched product encompassing Ser241-Asn286 on Ub-tau254 and Gly76-Tyr55 of Ub. Remarkably, this proteolytic fragment was identified either in the case of h20S digestion of both (synthetic) Ub-tau254 and (enzymatic) Ub-tauK18, revealing that the catalytic process occurs regardless of the presence of a disulfide bridge or a native isopeptide linkage.
The biological role of the uncapped 20S is matter of intense debate across the scientific community. Although its “existence” has been validated by cryo-EM studies on living neurons, the possibility that it plays proteolytic activities, distinguishable from those of the 26S, has never been addressed.23 However, recent research indicated that free 20S possesses distinct biological functions and is subject to unique regulatory mechanisms (e.g., DJ-1 and Insulin Degrading Enzyme).21,45–51
First, the catalytic preference of the free 20S for oxidized and unfolded proteins was proposed.21,45 More recently, the possibility that the h20S is able to degrade mono-Ub proteins has been further advanced, an hypothesis strengthened by targeted proteomics in living cells.22
In this framework, the enzymatic and semi-synthetic Ub-tau species enrolled in this study, offered the possibility to investigate how the position of Ub within the domain of a protein affects its structural properties and the susceptibility to proteasome digestion and to deeply explore the catalytic mechanism.29,34 In this context, previous studies demonstrated that enzymatic ubiquitylation of tauK18 inhibits its oligomerization and aggregation propensity.29
While a similar effect was observed for tauΔC, site-specific mono-ubiquitylation at different positions imparted distinct structural properties. Thioflavin-T fluorescence and transmission electron microscopy revealed that, in the presence of heparin induction, both Ub-tau254 and Ub-tau353 retained the ability to form cross-β structures, albeit with significantly slower kinetics compared to tauK18. In contrast, Ub-tau311 did not exhibit this behaviour.29 Whether these structural and aggregating-prone properties have a patho-physiological link with the susceptibility to h20S digestion cannot be addressed at this stage. However, it is unlikely that protein conformation, at least at the concentration tested (5 μM), has prominent roles, since aggregation was not induced by heparin in our system.
In conclusion, the findings here reported can push forward research, and the debate around the biological role of h20S towards natural substrates, mono-ubiquitylated or not, which is worth being explored for its biological and pathological implications. This is particularly relevant for the comprehension of the processes underscoring the onset and progression of degenerative diseases of the brain (primary and secondary tauopathies), retina (AMD) and optic nerve (glaucoma) for which altered metabolism of tau protein has been widely documented. Nonetheless, a detailed mechanistic understanding of these processes is key to envision and foster new therapeutic interventions.
Furthermore, this study introduces SpectraSage a software specifically conceptualized to assist the scientific community in exploring the repertoire of proteolytic fragments generated from simple reaction mixture, allowing to uncover more and new additional mechanistic insights into the enzymology of proteolytic systems and their substrates.
(a) Construct of tau spanning residues Q244-E372 plus initial Met (hereafter tauK18);
(b) The variants: tauK18(C291A/C322A) (here named tauΔC), tauK18(C291A/C322A/K254C), tauK18(C291A/C322A/K311C), and tauK18(C291A/C322A/K353C);
(c) Ub-tauK18 that is tau-K18 mono-ubiquitylated at Lys residues by an enzymatic reaction (see below);
(d) Ub-S-S-tau proteoforms, namely Ub-tau254, Ub-tau311, Ub-tau353, that are proteoforms with Ub installed at specific positions (see below).
All tau proteins were expressed and purified (>95% purity) as previously described.29 Typical chromatograms of Ub-S-S-tau proteoforms are shown in SI (Fig. S1–S3). Ub was expressed and purified following a previously described protocol both in wild type and N-terminal (histidine)6-tagged forms.29,52 Ub-tauK18 was obtained applying an enzyme-based method employing recombinant enzymes E1, UBE2N, and CHIP E3, as previously described.29 Ub-tauK18 is a mixture of mono-ubiquitylated tauK18 species, where Ub is linked to different Lys residues. The relative amount of the different mono-Ub-proteoforms cannot be precisely assessed, but they were proven to be all present to a significant amount through a proteomic approach based on detection of a diGLY remnant released by trypsin digestion.29,34 Ub-S-S-Tau proteoforms were obtained using a semi-synthetic approach based on a disulfide-directed reaction. TauΔC was mutated to introduce a single cysteine in place of Lys254, Lys311 or Lys353. Ub bearing an aminoethanethiol C-terminal group (hereafter Ub-SH) for the disulfide-coupling reaction, was produced and purified as described previously.29 Ub-SH was activated with 5,5 dithio-bis-(2-nitrobenzoic acid) and then incubated with tauΔC variants bearing lysine to cysteine mutations at selected sites. The Ub-S-S-Tau proteoforms were then purified using ion exchange chromatography.29 The site-specificity of ubiquitin conjugation achieved by the semi-synthetic approach has been confirmed in prior studies using site-resolved NMR spectroscopy: 1H–15N HSQC spectra revealed localized chemical shift changes, which were corroborated using 15N/13C isotope labelling and multidimensional heteronuclear NMR experiments.30,53 TauK18 in the construct Ub-tau254, contains additional amino acids (GSHM) at the N-terminus deriving from the cleavage of a tag used for protein purification.
Gel lanes were acquired in a gel documentation system (Azure Gel Imager C280, Azure Biosystem, Sierra CT Suites AB, Dublin, USA, and iBright Imager, Thermo Fisher Scientific, Waltham, Massachusetts, USA). Band intensity was calculated by ImageJ software.
Gels shown in figures have been slightly edited or cropped to increase band visibility or resolution. Raw uncropped gels are provided in SI Fig. S4.
Samples were then injected into an Orbitrap Exploris 240 mass spectrometer (Thermo Fisher Scientific, MA, USA) online with an Ultra High-Performance Liquid Chromatography (UHPLC) Ultimate 3000 system (Thermo Fisher Scientific, MA, USA) with a C18 prefiltration column installed.
In the case of Ub-tauK18, full length protein (∼23 kDa) was filtered out by flowing the sample through 10 kDa-cutoff filter (Merck Millipore, MA, USA) at the end of degradation assay. Thereafter, eluted peptides were cleaned by Pierce™ C18 Tips (Thermo Fisher Scientific, Waltham, Massachusetts, USA), dried in a speed Vacuum system (Labconco, Kansas City, MO, USA) and resuspended in loading solvent (see below) before injection.
Regarding the UHPLC-MS parameters, chromatographic separation was performed with a 2 μm particle size column (ES904, Thermo Fisher Scientific, MA, USA) for peptides generated from the proteasome digestion of Ub-tauK18 and a 3 μm particle size column (ES900, Thermo Fisher Scientific, MA, USA) for the digestion of the semi-synthetic tau proteins.
Both columns had a 100 Å pore size, 75 μm internal diameter, and 15 cm length. Loading solvent: 2% ACN 0.05% TFA; solvent A: 100% H2O 0.1% FA; solvent B: 80% ACN 0.1% FA. UHPLC gradient (88 min) used for Ub-TauK18: 0–62 min 6–31% B, 62–67 min 31–50% B, 67–72 min 50–99% B, 72–80 min 99% B, 80–82 min 99–6% B. UHPLC gradient (75 min) used for semi-synthetic tau proteins: 0–9 min 4.5% B, 9–62 min 4.5–50.6% B, 62–65 min 50.6–99% B, 65–69 min 99% B, 69–70 min 99–4.5% B.
The mass spectrometry parameters were set as follows: ionization mode: nano-electrospray ionization (nano-ESI); ionization voltage: 1500 V; capillary temperature: 30 °C; Scan range (m/z): 375–1650; resolution: 120000 for Ub-tauK18 and 180
000 for semi-synthetic tau proteins; AGC target: 3 × 106; microscans: 3. For Data-Dependent Acquisition (DDA) analysis, the following filters were employed: Precursor ion intensity: 5 × 103, included charge states: 2–8; dynamic exclusion duration: 45 s; exclusion mass width: 10 ppm. MS/MS parameters: fragmentation method: HCD; isolation window (m/z): 2; normalized HCD collision energy: 30%; resolution: 15
000; AGC target: 5 × 104.
Raw files of tauK18, tauΔC, and Ub-S-S-tau species were processed by SpectraSage Software, whose development and properties are described in detail in the Appendix section of the SI. Briefly, .raw files underwent initial processing including spectra deconvolution, followed by peak extraction and deisotoping. Concurrently, a theoretical fragments table was generated based on the expected linear/branched peptide/protein structures. These processed experimental data were then used for matching against the theoretical fragments, leading to identification and subsequent visual representation of the results. While a mass error tolerance of 20 ppm was applied for initial peptide identification to accommodate potential unmodeled modifications, it is noteworthy that approximately 30% of the key Ub-tau branched peptides discussed in this study exhibit a mass error tolerance below 1 ppm, demonstrating high confidence in their identification. Although we explored the use of retention time prediction based on peptide hydrophobicity (using the Sequence Specific Retention Calculator (SSRC) method54) as an orthogonal validation for peptide identification, we found its applicability limited for the unusually long and complex branched peptides discussed in this study, likely due to structural effects not fully captured by these models.
Aliquots were collected and all generated proteolytic fragments were separated from the full-length (undigested) protein (∼23 kDa) using 10 kDa cutoff filters (Merck Millipore, MA, USA).
Eluted peptides were then labelled by Tandem Mass Tag (TMT) pro Zero label reagent (Fisher scientific, MA, USA) addition (10:
1 TMT
:
protein ratio) for 1 h at room temperature (r.t.). The workflow ensured a labelling efficiency ∼95%, which was preferred to rule out eventual over labelling of samples. TMT was then quenched with an excess hydroxylamine, for 20 min, r.t.
Finally, aliquots were divided into two equal ones: the first one was addressed to tryptic digestion (1:
10 trypsin
:
protein ratio), for 1 h at 37 °C, the second one was left undigested (by trypsin) but treated under the same experimental conditions (1 h, at 37 °C). At the end of the incubation time, the reaction was stopped by adding 0.1% TFA (final concentration) and all peptide aliquots were cleaned up by Pierce™ C18 Tips before injection into the UHPLC-MS system.
In this case, chromatographic separation was performed with a 2 μm particle size column (ES904, Thermo Fisher Scientific, MA, USA) 100 Å pore size, 75 μm internal diameter, and 15 cm length. Loading solvent: 2% ACN 0.05% TFA; solvent A: 100% H2O 0.1% FA; solvent B: 80% ACN 0.1% FA. UHPLC gradient (88 min): 0–62 min 6–31% B, 62–67 min 31–50% B, 67–72 min 50–99% B, 72–80 min 99% B, 80–82 min 99–6% B.
The mass spectrometry parameters were set as follows: ionization mode: nano-electrospray ionization (nano-ESI); ionization voltage: 1500 V; capillary temperature: 40 °C; scan range (m/z): 375–1200; resolution: 120000; AGC target: 3 × 106. For Data-Dependent Acquisition (DDA) analysis, the following filters were employed: precursor ion intensity: 1 × 104, included charge states: 2–7; dynamic exclusion duration: 45 s; exclusion mass width: 10 ppm. MS/MS parameters: fragmentation method: HCD; isolation window (m/z): 2; normalized HCD collision energy: 30%; resolution: 15
000; AGC target: 5 × 104.
This journal is © The Royal Society of Chemistry 2025 |