Open Access Article
Shijun Lina,
Yakun Yi
b,
Yiheng Liua,
Minglun Li
c,
Fan Xia
*a,
Hai-Chen Wu
*b and
Xiaoding Lou
*a
aState Key Laboratory of Geomicrobiology and Environmental Changes, Faculty of Materials Science and Chemistry, China University of Geosciences, Wuhan 430074, China. E-mail: louxiaoding@cug.edu.cn; xiafan@cug.edu.cn
bBeijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China. E-mail: haichenwu@iccas.ac.cn
cState Key Laboratory of Polymer Science and Technology, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, China
First published on 13th May 2026
Deciphering the complex chemical code of post-translational modifications (PTMs) is fundamental to proteomics but remains challenging due to the low abundance of modified proteins and the difficulty in resolving isobaric positional isomers. Conventional mass spectrometry and affinity-based methods often lack the sensitivity or specificity required to capture this full chemical diversity. Here, we present a general strategy for site-specific PTM recognition based on MspA nanopores engineered with polar charged residues. By systematically tuning the constriction site (position N91) with polar charged residues, we introduced specific electrostatic, electroosmotic flow and steric interactions that dramatically enhance molecular recognition capabilities. This engineered interface enabled the label-free discrimination of 10 distinct PTM types across 26 peptides, including phosphorylation, glycosylation, and—reported here for the first time via nanopore sensing—lysine crotonylation and succinylation. Furthermore, the nanopore is capable of distinguishing subtle positional isomers. Coupled with a machine learning algorithm that classifies single-molecule events with >98% accuracy, our findings establish a generalizable principle of electrostatic gating for PTM profiling, offering a versatile chemical tool for next-generation single-molecule proteomics.
Nanopore sensing, which directly detects single molecules as they translocate through a nanoscale pore,20–22 offers unique advantages for PTM analysis.23–26 Nanopore sensing can operate label-free and amplification-free, allowing real-time detection of individual peptides with sub-nanometer resolution.27–29 However, designing nanopores capable of distinguishing a broad spectrum of PTMs with high specificity remains a challenge. Previous studies have relied on engineering approaches, with limited scope across PTM types. To address this gap, we sought to develop a general strategy for PTM detection by systematically tuning nanopore–analyte interactions. Since most PTMs occur on polar amino acid residues (e.g., lysine, arginine, serine), and often involve changes in charge, polarity, and steric bulk,30–32 we hypothesized that introducing polar charged residues at the nanopore constriction would enhance sensitivity and broaden detection capabilities.
MspA, a robust octameric nanopore from Mycobacterium smegmatis, has previously demonstrated utility in DNA sequencing and peptide analysis.33–35 Its constriction site, centered around residue N91, was selected for engineering due to its critical role in analyte sensing. In this study, we executed a systematic engineering of the MspA constriction site, generating nine variants with diverse polarities to precisely modulate nanopore–analyte interactions. Our investigation revealed that variants incorporating polar charged residues—specifically lysine and glutamate—dramatically enhanced sensing capabilities. These engineered nanopores enabled the label-free identification of 10 distinct PTMs at single modification sites, including tyrosine, threonine, and serine phosphorylation; lysine methylation and acetylation; serine acetylation and glycosylation; threonine glycosylation; and notably, lysine crotonylation and succinylation, both reported here for the first time using nanopore sensing. Multiplexed discrimination of up to nine PTM types was achieved in a single measurement. Furthermore, the system distinguished positional isomers of phosphorylated peptides based on unique ion current features. To facilitate accurate event classification, a machine learning model was developed, achieving over 98% validation accuracy. These results highlight the essential role of polar charged residues at the nanopore constriction for enhancing PTM recognition. Our work presents a generalizable and scalable strategy for single-molecule PTM detection, providing a versatile platform for advancing nanopore-based proteomic technologies.
![]() | ||
| Scheme 1 Single-point mutants of MspA were constructed and enabled the highly specific detection of target post-translational modifications (PTMs). | ||
To systematically evaluate the recognition capabilities of MspA mutant nanopores toward diverse PTMs, we constructed a peptide library in which all peptides shared the same sequence but carried one of eight modification groups: phosphorylation, methylation, acetylation, glycosylation, crotonylation, succinylation, nitration, or sulfonation (Fig. 2a). The corresponding chemical structures are shown in Fig. 2b. Nanopore measurements using engineered MspA mutants revealed that five PTMs—phosphorylation, acetylation, glycosylation, crotonylation, and succinylation—produced distinct current signatures, whereas methylation, nitration, and sulfonation could not be clearly resolved under the tested conditions (Fig. S8–S15). Also, to establish a biologically relevant PTM detection strategy, we engineered KT (Tau175-190) peptide by substituting lysine with glutamine/glutamate (Q/E), mitigating charge interference in MspA nanopores (Fig. S16–S28).
To quantify discrimination performance, we analyzed the relative residual current (ΔI/I0) values of the PTM peptides across different MspA variants. Mutants bearing polar charged residues, particularly MspA-K and MspA-E, consistently enabled detection across the widest range of PTM peptides, highlighting the essential role of electrostatic interactions and electroosmotic flow (EOF) in PTM detection (Fig. 2c and S29). The electroneutral pores showed no detectable events, indicating that electrophoretic force alone is insufficient to drive peptide translocation under these conditions. However, this does not exclude the possibility of capture under higher voltages or higher sampling rates. In contrast, the high charge density at the constriction sites of MspA-K and MspA-E provides strong ion selectivity and, consequently, generates EOF. This electroosmotic flow offers a significant driving force that actively pulls peptides through the nanopore, enabling analyte capture and detection.
To elucidate the underlying mechanism, we focused on MspA-K and carried out molecular docking simulations with three representative peptides: a phosphorylated peptide (PD-p), a crotonylated peptide (PD-cr), and the corresponding unmodified peptide (PD). Docking models showed that the docking interface revealed that PD-p formed an extensive hydrogen-bonding network—24 hydrogen bonds in total, including nine with the K91 residue—whereas PD-cr and PD formed only 10 and 9 hydrogen bonds, respectively. The denser hydrogen-bonding and electrostatic complementarity in the PD-p–MspA-K complex likely enhance conformational stability, prolong translocation dwell times, and amplify current differences, thereby improving PTM discrimination (Fig. S30–S32).
In practice, MspA-K was able to clearly distinguish all the nine PTM peptide groups with distinct signal distributions (Fig. 2d). To streamline classification and minimize manual bias, we developed a machine learning pipeline. Each signal event was described by four features: relative residual current (ΔI/I0), dwell time, standard deviation, and instantaneous frequency. We evaluated 30 different models, including decision trees, support vector machines (SVMs), k-nearest neighbors (KNN), discriminant analysis, ensemble methods, and neural networks (Table S4). The Bagged Trees ensemble model exhibited the best performance, achieving 98.2% validation accuracy and 98.3% test accuracy (Fig. 2e and S33), enabling robust and automated identification of PTM types from nanopore data.
In this study, we found that polar charged mutants of MspA can recognize a wide variety of PTMs on peptides, underscoring the importance of charge–charge interactions in PTM detection. This recognition is further facilitated by EOF, which enhances the capture and translocation of peptides through the nanopore, thereby contributing to the sensitivity and discriminative capability of the mutants (Fig. S34–S37). To further evaluate their potential, we used MspA-K to discriminate phosphorylated peptides (Fig. 3a). We designed peptide substrates containing none (HY), one (HY-1p), two (HY-2p), or three (HY-3p) phosphoryl groups. With increasing numbers of phosphoryl groups, the current blockade events displayed distinct graded features corresponding to first-, second-, and third-level blockades (Fig. 3b–h). Notably, such phosphorylation-dependent graded signal characteristics have not been previously reported for any nanopore sensor. We also observed a positive correlation between event frequency and peptide concentration (Fig. S38). Together, these findings demonstrate that the MspA-K nanopore can reliably detect phosphorylation modifications through characteristic current blockades, highlighting its promise as a powerful tool for post-translational modification analysis.
To elucidate the mechanism underlying phosphorylated peptide recognition by MspA-K, we performed all-atom molecular dynamics (MD) simulations. In the setup, the phosphorylation sites within the peptide are positioned within the constriction position of MspA-K nanopore, with the Y2, Y8 and Y10 sites. Notably, for our simulations, the peptide was modeled with double phosphorylation at both Y8 and Y10 sites (referred to as Y8@Y10). The Y2 and Y8@Y10 positions of the peptides were positioned adjacent to MspA-K (Fig. 4a and S39–S40). The combinations of phosphorylation groups at these sites correspond to HY, HY-1p, HY-2p, or HY-3p, allowing quantitative analysis of the binding free energy (ΔGbinding, kcal mol−1) between peptides carrying different numbers of phosphoryl groups and the N91K residue in the constriction zone, along with the ionic currents generated during simulated translocation. The HY peptide was included in simulations as a structural reference to explore potential peptide–pore interactions; while no obvious translocation events were detected for HY experimentally, these simulations offer mechanistic insights into molecular–scale interactions that may occur during peptide–pore encounters. The HY peptide was included in simulations as a structural reference to explore potential peptide–pore interactions; while no obvious translocation events were detected for HY experimentally, these simulations offer mechanistic insights into molecular scale interactions that may occur during peptide–pore encounters. In these simulations, HY, HY-1p, HY-2p, and HY-3p all translocated through the MspA-K nanopore within comparable simulation times. The calculated ΔGbinding exhibited a phosphorylation-dependent enhancement (Fig. 4b), with HY-3p showing the strongest affinity for the nanopore.
Comparison of binding free energies at the two positions further indicated that HY-1p, HY-2p, and HY-3p generate one, two, and three distinct energy barriers, respectively, during translocation (Fig. 4c). These barriers likely account for the experimentally observed graded blockade distributions. The experimentally measured blockade currents (Fig. 4d) showed excellent agreement with the simulated values (Fig. 4e and S41). For the multi-level experimental signals observed in HY-2p and HY-3p, the representative experimental current values plotted in Fig. 4d were determined by extracting the amplitudes of all blockade events and applying a gaussian fit to the resulting distribution. Thus, the plotted experimental values represent the statistically most probable states (weighted average), which are then compared with the simulated current of the predominant binding conformation. Since ionic current is strongly correlated with the effective volume of the nanopore constriction, the markedly reduced residual currents of phosphorylated peptides compared to their unmodified counterparts suggest that the steric bulk of phosphate groups effectively restricts ion flux through the MspA-K channel.
Radial distribution function (RDF) analysis of K91–peptide interaction distances revealed a pronounced probability peak at ∼0.9 nm, with the intensity of this peak decreasing progressively with fewer phosphoryl groups (Fig. 4f). To provide a more intuitive comparison of the differences in radial distribution among different peptides, Fig. 4g shows a magnified view of the region presented in Fig. 4f. From this enlarged view, it can be seen more clearly that the difference in the peak radial distribution distance between HY and HY-3p is approximately 0.1 nm. The distribution probability of the center-of-mass displacement (ΔLz) shown in Fig. 4h (Y2 + MspA-K), further demonstrated that HY exhibits minimal displacement due to its low net charge, which limits effective translocation. In contrast, HY-1p and HY-2p showed larger displacements, corresponding to shorter blockade times. Interestingly, HY-3p exhibited markedly reduced displacement—a seemingly counterintuitive outcome that can be attributed to its high binding free energy with K91, which substantially retards translocation and extends blockade duration. While our current simulations focus primarily on the ionic current blockade caused by a single peptide stably trapped at the constriction zone, we acknowledge that the actual nanopore translocation is a highly complex dynamic process. Mesoscopic phenomena such as voltage-driven polymer stretching,42 local ion concentration polarization induced by multi-phosphate groups,24 and occasional multiple peptide trapping likely play roles in the comprehensive experimental signals. A systematic and multiscale investigation of these dynamic conformational changes and electrokinetic effects will be the subject of our future research.
This analysis revealed that MspA mutants carrying polar charged residues (e.g., MspA-E, MspA-D, MspA-H, and MspA-K) exhibited significantly improved PTM recognition capabilities compared with other variants (Fig. S42–S49). Notably, initial experiments were performed in a buffer solution mimicking physiological conditions (pH 7.4), but the translocation frequency of the target peptide was very low. Therefore, buffers with adjusted pH values were subsequently used according to the isoelectric points of the peptides. Guided by these results, we next applied trypsin digestion to both MUC1 and eukaryotically expressed glycosylated MUC1, generating complex mixtures of peptide fragments (Fig. 5g). Based on the radar plot analysis, MspA-E and MspA-D were selected for detection of these digests. Representative current traces obtained using MspA-E are shown in Fig. 5h, with the corresponding scatter plot and histogram of current blockage ratios (ΔI/I0) in Fig. 5i and S50.
The data revealed five distinct clusters. To interpret these clusters, we first characterized the expected digestion products of the MUC1 construct. Trypsin cleavage is restricted to three specific sites (Table S9) due to proline residues in KP/RP sequences, yielding three major peptide fragments. Each fragment contains multiple serine residues that can be O-glycosylated. Because the protein was expressed in mammalian cells, these serines may carry various mucin-type glycans, including αGalNAc, αNeu5Ac-(2,6)-αGalNAc, and βGal-(1,3)-αGalNAc.43 Thus, the glycopeptides are highly heterogeneous. The five clusters do not directly represent five specific glycan structures; instead, they represent five populations of events resolvable under our conditions. To confirm their glycosylation origin, we analyzed unglycosylated MUC1 (prokaryotic expression) under identical conditions, which produced only one or two clusters (bare peptides). By comparison, the additional clusters in the mammalian sample were attributed to distinct glycosylation states/types. Therefore, the five clusters reflect the heterogeneity of O-glycosylation that MspA-E can discriminate. In contrast, MspA-D nanopores resolved only three distinct clusters under the same conditions (Fig. 5j–k and S51). We further extended this strategy to Tau and phosphorylated Tau proteins (Figure S52). Remarkably, MspA-E resolved three additional current clusters specifically associated with phosphorylated Tau samples. These results not only highlight the pivotal role of polar charged residues in PTM recognition within nanopores, but also provide strong experimental support for the development of multiplex PTM detection platforms using MspA nanopores.
Our work establishes a robust framework for the functional design and optimization of biological nanopores. Incorporating polar and charged residues into the constriction region, while also considering the resultant electroosmotic and polarization effects, significantly broadens the spectrum of detectable PTMs and expands the utility of nanopore sensing for complex proteomic analysis. This strategy offers new technical pathways for high-throughput, multiplexed, and label-free PTM profiling, paving the way for real-time monitoring in systems biology, disease biomarker discovery, and clinical diagnostics. With continued advances in nanopore engineering, we anticipate that such tailored nanopores will become very useful tools for proteomic research and translational applications.
| This journal is © The Royal Society of Chemistry 2026 |