Dana
Shkolnik‡
a,
Subhasis
Dey‡
a,
Mahdi
Hasan
a,
Michael J.
Matunis
b and
Ashraf
Brik
*a
aSchulich Faculty of Chemistry, Technion Israel Institute of Technology, Haifa, 3200008, Israel. E-mail: abrik@technion.ac.il
bDepartment of Biochemistry and Molecular Biology, Johns Hopkins University, Bloomberg School of Public Health, Baltimore, Maryland, USA
First published on 15th November 2024
The Small Ubiquitin-like Modifier (SUMO) is a crucial post-translational modifier of proteins, playing a key role in various cellular functions. All SUMOs are synthesized as precursor proteins that must be proteolytically processed. However, the maturation process of cleaving the extending C-terminal tail, preceding SUMOylation of substrates, remains poorly understood, especially within cellular environments. Chemical protein synthesis coupled with cell delivery offers great opportunities to prepare SUMO analogues to investigate this process in vitro and in live cells. Applying this unique combination we show that SUMO2 analogues containing the native tail undergo rapid cleavage and nuclear localisation, while a Gly93Ala mutation impairs cleavage and alters localisation. Tail mutations (Val94Glu, Tyr95Ala) affected cleavage rates, highlighting roles in SUMO–SENP protease interactions. In cells, SUMO2 analogues containing tail mutations underwent cleavage and subsequently incorporated into promyelocytic leukemia nuclear bodies (PML-NBs). These findings advance our understanding of SUMO2 maturation and provide a foundation for future studies of this process for different SUMO paralogues in various cell lines and tissues.
SUMO proteins are highly conserved across species and are initially expressed as immature precursors with C-terminal extensions of 2–11 amino acids following the conserved Gly–Gly (GG) motif, known as tail. The single gene of SUMO expressed in yeast, Smt3, contains the ATY tail13 and the five mammalian SUMO paralogues contain different tails, altered both in size and amino acid compositions. Interestingly, SUMO tails also vary across orthologues and paralogues, for instance, in most species, SUMO2 proteins feature the tail sequence ‘Val–Tyr’, whereas in some species (e.g., snakes) other amino acids, such as the ‘Met–Asn’, are present (ESI, Section 12†). Mammalian precursors are processed by the same SENP proteases that remove SUMO from modified substrates. SENPs cleave the C-terminal extensions of SUMOs to expose the GG motif, a prerequisite step for SUMOylation. Various studies indicate that SENP family members exhibit distinct subcellular localisation and specificity. The specificity of SENP enzymes for SUMO paralogues in the maturation process is achieved through several structural and biochemical mechanisms.14–16
While there is ample experimental support on the SUMOylation pathway and the modifications by other ubiquitin-like (Ubl) modifiers in general, only a few studies have focused on the maturation process of SUMO precursors. For example, a previous study described the residue specificity of the S. cerevisiae SUMO protease Ulp1 against the (Gly–Gly↓Ala) positions of Smt3 and demonstrated that the conserved GG motif is not strictly required for its maturation.17 In another study, it was shown that replacing the C-terminal tail residues following ‘GG’ in SUMO1 and -2 with that of SUMO3 affects the maturation efficiency.18 Additionally, this study highlighted that the two amino acids immediately after the GG motif are crucial for paralogue specificity. Furthermore, structural studies demonstrated that selectivity in maturation is determined by electrostatic complementarity between the interacting regions in SENP1 and SENP2 with the SUMO paralogue tails.16 A hydrophobic region in SENP2 is complementary to the SUMO2 tail, while a strongly acidic region in SENP1 interacts with the positively charged tail of SUMO1.
Nevertheless, various questions on the impact of mutations of the GG motif and tail region in SUMO precursors remain unexplored. In addition, we still lack in vivo data on the localisation of SUMO precursors compared to the mature forms, the dynamics of the maturation process, as well as the specific SENPs that are involved and how they are affected by cellular stress. Detailed studies on these aspects and others could provide valuable insights into the rationale behind the C-terminal tail and for targeting the SUMO machinery in drug development.
Total chemical synthesis and semi-synthesis of proteins enable selective modifications of both natural and unnatural components providing a significant opportunity to generate custom proteins for functional studies.19 Yet, the studies of these synthetic proteins remain to be conducted in an in vitro setting excluding knowledge of their cellular behaviours. On the other hand, delivering these proteins with minimal cellular stress offers unique opportunities to study them in their native environment. Recently, we began to explore these combinations to shed light on unknown aspects of Ub, and Ub-like modifiers in live cells with efforts to overcome many challenges in these complex studies, in the delivery step, the monitoring process as well as in the data analysis.20–24
Studying the maturation process of endogenous SUMO proteins is challenging because it is highly dynamic and difficult to monitor within live cells. In this study, we harnessed the power of protein synthesis and protein cell delivery19 to assess the SUMO maturation process in vitro and in live cells. We designed and synthesized ten different SUMO2 analogues to study the maturation process focusing on new aspects in a cellular context. Our results revealed, for the first time, a rapid tail cleavage of SUMO2 precursors in live cells. Our studies also investigated the impact of processing on the nuclear localisation of mature SUMO2 compared to the precursor, and the critical role of Gly93 in this process.
SUMO proteins and their conjugated forms were prepared by different groups using either linear SPPS or ligation approaches.21,27–32 In this study the set of SUMO2 protein analogues was synthesized using linear Fmoc-SPPS on 2-chlorotrityl chloride (2-CTC) resin, producing after cleavage the native carboxylic acid at the C-terminus. During the synthesis, we introduced three pseudoproline dipeptides and Gly dimethoxybenzyl (DMB) at two positions to achieve better synthesis with improved yields (Fig. 1a).28 In the SUMO2 sequence, Met residues were substituted with Nle, a widely used mutation in chemical protein synthesis reported across various studies to avoid Met oxidation which could complicate purification.33–35 Our previous study on SUMO2 demonstrated that this mutation did not alter the protein function or localisation in the cells.21,29 Following the complete synthesis of the protein sequence, two PEG molecules were attached to the N-terminus to serve as a linker between the protein sequence and the relative organic fluorophore (Fig. 1b and c). Initially, we synthesized SUMO2 without its tail (1) and with the native tail, VY, having fluorescein (FITC) (2) or 5-carboxytetramethylrhodamine (TAMRA) (3) to follow conjugation and cellular localisation studies. The use of different organic dyes for proteins 1 and 2 permitted their co-delivery to the same cells via MBL and allowed comparison. Purification and characterization of the analogues were performed through RP-HPLC, mass spectrometry, fluorescence gel analysis using the TAMRA channel, and western blot (WB) using SUMO2 antibody (Fig. 2a–h).
With these proteins in hand, we first investigated the localisation of SUMO2 analogues, 1 and 2, at three time points (15 minutes, one hour, and three hours) following protein delivery. The analogues were delivered into U2OS cells using the MBL approach in a final concentration of 2 μM for each protein. Subsequently, cells were incubated at 37 °C for 15 minutes, one hour, and three hours, then were imaged using laser scanning confocal microscopy (LSCM) in the TAMRA and FITC channels. At the initial time point (15 minutes), proteins 1 and 2 exhibited nearly identical localisation, predominantly accumulating in the nucleus (Fig. 3a). This localisation remained consistent throughout all the measured time points. The observed localisation of both analogues suggests rapid tail cleavage, in which 1 and 2 become identical proteins that share the same intracellular behaviour following delivery.
Based on the observed localisation, we examined the conjugation pattern of analogues 1 and 3 to test our assumption that the tail was rapidly cleaved to allow conjugation. Analogues 1 and 3 were delivered into U2OS cells at 5 μM final concentration and incubated for the same time points as the localisation experiments (15 minutes, 1 hour, and 3 hours). Subsequently, cells were lysed using 2× Laemmli sample buffer and analysed by SDS-PAGE. At the earliest 15 minute time point, and at later time points, analogues 1 and 3 exhibited similar high molecular weight bands, consistent with rapid and efficient tail processing and conjugation to protein substrates (Fig. 3b). These results showed a rapid tail cleavage for the first time in live cells.
To further validate these results, we expressed and purified the SENP2 active domain, which cleaves the SUMO2 tail in vitro,14,27 and utilized it to investigate the maturation process. We performed an in vitro assay in which the SENP2 active domain (40 nM) was incubated at 37 °C with 5 μM of SUMO2 analogue 3. Our result indicated that the tail of the SUMO analogue was cleaved (>90%) within 15 minutes (Fig. 3c).
Motivated by these findings, we set out to design a SUMO2 analogue 4 with an uncleavable tail. Relying on previous reports that substitution of Gly to any larger amino acid may affect SUMO–SENP interactions due to the constricted hydrophobic tunnel of the SENP catalytic site,15 we substituted Gly93 of SUMO2 to Ala to produce analogue 4. This analogue was prepared using the procedure mentioned in the general synthetic scheme (Fig. 1). In addition, a SUMO2 analogue with deletion of Gly93 and the C-terminal tail, SUMO2ΔG93 (5), was synthesized to serve as a control for an unconjugatable SUMO2.36
Before initiating cellular experiments with analogue 4, in vitro reactions were conducted to test the hypothesis that the Gly93 to Ala substitution inhibits processing by SENP2. Our results clearly showed that the tail in analogue 4 was completely stable after three hours of incubation with SENP2 (Fig. S10a†). With these results in hand, we checked the localisation of analogue 4, which revealed localisation in the nucleus and the cytoplasm that differs from analogues 1 and 2 which were restricted to the nucleus. The uncleavable analogue 4 showed very similar localisation as the unconjugatable SUMO2 analogue 5 (Fig. 4a). These findings highlight the effect of tail processing and subsequent covalent protein modification on accumulation of SUMO2 in the nucleus. The connection between localisation and protein modification was supported by fluorescence gel analysis, in which high molecular weight protein conjugates of analogues 4 and 5 were not detected (Fig. 4b). Together, these findings demonstrate that mutating the conserved Gly93 to Ala abolishes the cleavage process and results in a unique localisation pattern of immature SUMO2. This effect on localisation suggests that the covalent conjugation of SUMO2 to proteins in the nucleus affects its nuclear–cytoplasmic distribution. The significant impact of mutating Gly93 to Ala on the cleavage process may stem from the narrow tunnel through which the SUMO2 C-terminus must enter for cleavage.15 Such a step can be impeded by any structural changes or the mispositioning of the scissile bond of SUMO2 towards the active site Cys.
To further expand our understanding of the inhibition of the cleavage process when mutating Gly93 to Ala, we synthesized analogue 6 in which this Gly was mutated to D-Ala. Such a mutation could change the dihedral angle of the scissile peptide bond and possibly allow the cleavage to proceed since D-Ala has been proposed to be a mimic of Gly.37 Analysis of the in vitro reaction between SENP2 and analogue 6 revealed that the cleavage also did not proceed with this analogue even after three hours of incubation (Fig. S10b†), suggesting the important role of the narrow tunnel in the preference of the Gly–Gly motif. Previous structural study of the SENP2 catalytic domain in complex with SUMO2 precursor showed that the tail (VY) kinks at Gly92, positioned the GG motif over Trp410 of SENP2.38 Mutation of Trp410 to Ala reduced activity in processing, underscoring its role in positioning the SUMO diglycine motif and scissile bond at the active site. Therefore modifying Gly93 with any larger amino acid could affect the positioning of the scissile bond due to steric clashes with Trp410 and restrict its entry to the catalytic site, which is reflected in our results regarding the inhibition of processing analogues 4 and 6.
After demonstrating the impact of altering the position preceding the C-terminal tail on the cleavage process, we further examined whether changes in the tail itself would affect the cleavage reaction. Previous studies have primarily focused on generating SUMO chimeras, such as SUMO2 fused with the tail of SUMO3 or vice versa.14,15,18 These studies showed that the maturation efficiencies of SUMO2 containing the SUMO3 tail are comparable to those of SUMO3.15,18 The first two amino acids following the GG motif are critical in regulating maturation efficiency.15 Another study demonstrated that the tail region is essential for defining the endopeptidase specificity of SENPs, allowing for the discrimination between pro-SUMO2 and pro-SUMO3.14 Our study extends this research by exploring alternative tail compositions beyond those amino acids that are present in the native SUMO paralogues. We initially prepared SUMO2-AY (7) (SUMO2-Ala94–Tyr95). The reason for mutating Val94 to Ala in the tail was based on the significant impact observed upon replacing Gly93 with Ala. Our gel analysis results after cell delivery and lysis showed high molecular conjugates for analogue 7 at both the 15 minutes and 1 hour time points (Fig. 4b). Despite challenges in the quantification of conjugation levels due to variations in protein analogue delivery to cells by bead loading, our results suggest that the Val94 to Ala substitution did not significantly affect cleavage and subsequent conjugation.
To further investigate flexibility in changing the tail composition, we synthesized, SUMO2-EA (8) (SUMO2-Glu94–Ala95). This analogue features completely different C-terminal tail residues in terms of polarity and hydrophobicity to those in SUMO2 and in SUMO1 and SUMO3. The in vitro processing rate of analogue 8 with SENP2 was slower compared to analogues 3 or 7 (Fig. 3c, S10c and d†). Despite its slower processing in vitro, analogue 8 showed conjugation to proteins in cells within 15 minutes (Fig. 4b). Our findings align with a previous study16 which showed that the region within SENP2 interacting with the SUMO C-terminus is hydrophobic and has a preference for hydrophobic amino acids such as valine over glutamic acid. Based on our findings, it is evident that substitutions in Gly93 preceding the tail (e.g. analogues 4 and 6) inhibit the cleavage process, whereas mutations within the tail (analogues 7 and 8) do not abolish the process but rather affect its rate. These findings are consistent with the previously resolved SENP2–SUMO2 precursor crystal structure,38 wherein the tail is oriented toward the SENP2 surface comprised of residues Trp410, Val477, Gly545, and Met497, which exhibits a preference for hydrophobic amino acids. These interactions likely underlie the observed differences in the in vitro processing rates between the VY, AY, and EA tails. The Val94Glu mutation disrupts these hydrophobic interactions, leading to the reduced observed rate.
Understanding the interaction between SUMO proteins and the enzymes responsible for tail cleavage is crucial for elucidating the SUMO processing step. While previous in vitro studies have examined the distinct processing activities of different SENPs on SUMO paralogs,15,18,38 demonstrating that electrostatic interactions account for the selectivity between the C-terminal tail and the specific SENP,15 information is still lacking in live cells. Of particular importance, the exact SENPs responsible for processing SUMOs in vivo remain unknown. To address this gap, we aimed to synthesize a SUMO2 activity-based probe having dehydroalanine (Dha), as an electrophile to capture the cellular-specific maturation enzymes.
Inspired by our previous study for trapping the catalytic Cys residue of a specific deubiquitinating enzyme (DUB) in the ubiquitination process, we positioned the Dha moiety near the scissile amide bond39–41 to generate the SUMO2-Gly93Dha-tail probe. To construct this probe, we mutated Gly93 to Cys during SPPS of SUMO2 and replaced the internal Cys48 with Ser. Finally, to convert the Cys to Dha, this SUMO derivative was treated with α,α′-dibromo-adipyl(bis)amide to give the desired SUMO2-activity-based probe analogue 9 in 22% isolated yield (Fig. S9†).
Delivery of analogue 9 to live cells showed low reactivity towards endogenous enzymes. Unfortunately, this probe failed to capture any enzyme and did not exhibit any conjugation, suggesting that the tail remained intact (Fig. 5a and S10e†). Two main factors may account for the failure to capture any SENP using this probe. One possibility could be attributed to the steric hindrance caused by the planar double bond group within the narrow tunnel of SENPs, like the Ala case in analogue 4. Another explanation could relate to the positioning of the Dha, whereby the electrophilic group may be located far from the catalytic Cys of SENPs. Indeed, a previous study reported by Reverter et al. also failed to capture SENP2 and SENP7 with a similar Dha probe in vitro.42
This prompted us to alter the position of the Dha from Gly93 to Val94, hoping to change the proximity of this reactive moiety to the catalytic Cys. We generated SUMO2 activity-based probe 10 and subjected it to in vitro processing assays with SENP2. The cleavage reaction and the gel fluorescence assay revealed that this probe underwent cleavage at the expected site, leading to the removal of the Dha as part of the tail, rather than reacting with the enzyme (Fig. 5a and S10f†).
Although alternative reactive groups are available in the literature43 and can be introduced in the C-terminus of the immature SUMO, they may not capture solely the maturation enzymes and are prone to be cleaved with the tail. Additionally, the short length of the tail limits the options available for modification. Nevertheless, these results obtained from probes 9 and 10 again highlight the sensitivity of the changes in Gly93 and the flexibility of the tail composition.
Next, we wanted to evaluate the effect of cleavage of the SUMO2 tail on its incorporation into PML-NBs, which are dependent on SUMOylation for their formation and function.44–46 These NBs are localised inside the nucleus where SUMO2 exists in its conjugated form. Detection of PML-NBs would indicate that SUMO2 has been processed and conjugated to PML and associated PML-NB substrates. Our initial analysis with analogues 1 and 2 suggested a localisation of conjugatable SUMO2 to PML-NBs (Fig. 3a). Here, we analysed cells at higher resolution and included additional analogues. We loaded U2OS cells with our synthesized SUMO2 analogues 1, 2, and 4–8, followed by incubation for an hour. Cells were subsequently fixed and examined by LSCM. The images obtained by LSCM clearly showed the incorporation of analogues 1, 2, 7, and 8 into nuclear foci with the expected spherical shape of PML-NBs47 (Fig. 5b and S11†). On the other hand, SUMO2 analogues 4–6 were not observed in nuclear foci, demonstrating that processing and conjugation are required for PML-NB localisation (Fig. S11†).
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4sc06254j |
‡ The authors have contributed equally. |
This journal is © The Royal Society of Chemistry 2025 |