Afsaneh
Orouji‡
a,
Mahdi
Ghamsari‡
a,
Samira
Abbasi-Moayed
b,
Mahmood
Akbari
c,
Malik
Maaza
c and
Mohammad Reza
Hormozi-Nezhad
*a
aDepartment of Chemistry, Sharif University of Technology, Tehran, 111559516, Iran. E-mail: hormozi@sharif.edu
bDepartment of Analytical Chemistry, Faculty of Chemistry, Kharazmi University, Tehran, 15719-14911, Iran
cUNESCO-UNISA-iTALBS Africa Chair in Nanoscience & Nanotechnology (U2ACN2), College of Graduate Studies, University of South Africa (UNISA), Pretoria, South Africa
First published on 11th March 2025
The rapid and precise quantification and identification of proteins as key diagnostic biomarkers hold significant promise in allergy testing, disease diagnosis, clinical treatment, and proteomics. This is crucial because alterations in disease-associated genetic information during pathogenesis often result in changes in protein types and levels. Therefore, the design of portable, fast, user-friendly, and affordable sensing platforms rather than a single-sensor-per-analyte strategy for multiplex protein detection is quite consequential. In the present research, a robust multicolorimetric probe based on the inhibited etching of gold nanorods (AuNRs) allowing unambiguous high-performance visual and spectral quantification and identification of proteins in human urine samples was designed. Most recently, we discovered that N-bromosuccinimide (NBS) can quickly etch AuNRs with a distinct color change, allowing convenient and accurate visual recognition of all amino acids. Herein, further explorations revealed that the presence of proteins, as amino acids’ polymers, reduces the effective concentration of NBS to different amounts and in turn prevents the etching of AuNRs to various degrees, thereby allowing precise quantification and identification of various proteins ranging from phosphatase (ACP), pepsin (Pep), hemoglobin (Hem), and transferrin (TRF) to immunoglobulin G (IgG), lysozyme (Lys), fibrinogen (Fib), and human serum albumin (HSA). The acquired dataset was statistically analyzed using linear discriminant analysis (LDA), partial least-squares regression (PLSR), and hierarchical cluster analysis (HCA) to accurately classify and identify individual proteins and their combinations at various levels. The multivariate regression models indicated that the colorimetric responses were linearly dependent on protein concentrations with low detection limits of around 1 ppm. Most importantly, the proposed multidimensional colorimetric probe was successfully utilized for protein discrimination in real urine samples. The diverse rainbow responses exhibited by the AuNRs in the proposed probe greatly enhance the accuracy of visual detection, making it a practical tool for straightforward protein monitoring in real samples.
In recent decades, several analytical techniques have been developed for the detection of proteins, notably mass spectrometry (MS) with electrospray ionization (ESI) or matrix-assisted laser desorption ionization (MALDI) and enzyme-linked immunosorbent assay (ELISA).18–20 The exceptional precision of MS is generally acknowledged in analyzing and characterizing macro-biomolecules, namely proteins.21,22 Nevertheless, the capacity of the method to identify a diverse array of chemicals is limited by the tedious process, complex technology, and costly equipment. On the other hand, the great specificity and sensitivity of ELISA—owing to the exact binding between antibodies and their target proteins—make it the most used approach for clinical protein quantification. Despite these advantages, ELISA is constrained by the high cost, unstable antibodies, lengthy procedures, large sample volume needs, and the inability to detect multiple analytes simultaneously. In addition to MS and ELISA, high-performance liquid chromatography (HPLC) and other chromatographic techniques are widely employed in clinical settings for protein analysis due to their high resolution, reproducibility, and ability to separate complex mixtures.23,24 However, these methods often require extensive sample preparation, expensive instrumentation, and lengthy analysis times, which limit their practicality for rapid diagnostics. Similarly, spectrofluorimetric techniques provide high sensitivity for protein detection, particularly in monitoring specific biomarkers through fluorescence labeling.25 Despite their accuracy, these methods are constrained by the necessity of fluorescent tags, potential interference from biological matrices, and the requirement for sophisticated instrumentation.
Therefore, it is evident that developing portable, fast, user-friendly, and affordable sensing platforms, rather than a single-sensor-per-analyte approach, is required for detecting and differentiating various proteins and their complex mixtures. The prospective applications of probes in minimizing size while maximizing sensing performance have garnered significant attention in recent years.26–28 The integration of a multidimensional sensing probe with machine learning algorithms offers significant benefits by extracting multiple signals from a single sensing element, effectively creating a virtual sensor array capable of multiplex detection and discrimination of various analytes. In the field of sensing, machine learning provides numerous advantages. It simplifies the processing of complex responses from multidimensional sources and can handle noisy, lower-resolution, or conflicting data, making it a particularly valuable analytical tool. Moreover, advanced multivariate analysis enables machine learning-powered sensors to precisely predict the concentration or identify a target by uncovering correlations between signals and variables.29–31 Therefore, combining machine learning techniques with the exceptional advantages of colorimetric sensors—such as portability and visual detection—presents a promising approach for developing an advanced platform for protein analysis.
Recently, plasmonic nanoparticles (NPs) have attracted considerable interest in the design of diverse colorimetric probes for accurate sensing of a series of analytes due to their visual signals, swiftness, low cost, and simplicity.29,32–35 Over the past decade, despite the existence of a variety of plasmonic NPs with different shapes and compositions, gold nanorods (AuNRs) have emerged as outstanding signal generators, and color labels in multicolorimetric probes have inspired researchers to utilize them in a wide range of applications.36,37 AuNRs have a rod-like, almost one-dimensional structure that creates two distinct localized surface plasmon resonance (LSPR) bands associated with longitudinal and transversal surface plasmon oscillations.36,38 The primary reason that AuNRs have such vivid and highly contrasted rainbow color variations is that the band locations depend highly on the aspect ratio of the nanorods; with a minor increase in the aspect ratio, the transversal band blue-shifts slightly and the longitudinal peak red-shifts significantly.39–41 Correlating this color variation with the identity and quantity of the analyte in the design of visual sensors is quite challenging. In this regard, etching of AuNRs with a suitable oxidizing agent that reacts with different targets can be a potential solution.41,42
Most recently, the impressive capabilities of two cost-effective mild oxidizing agents, N-bromosuccinimide (NBS) and N-chlorosuccinimide (NCS), have been highlighted for their ability to rapidly control the etching of gold nanorods (AuNRs) at ambient temperature.43 Hence, we aim to explore another potentiality of this strategy by developing a robust multicolorimetric probe that inhibits the etching of AuNRs for high-performance visual and spectral discrimination and quantification of proteins. In this strategy, NBS undergoes hydrolysis to form HBrO, which can quickly and softly oxidize AuNRs, resulting in distinct color changes that facilitate qualitative and semi-quantitative detection visible to the naked eye. The presence of various proteins, such as ACP, Pep, Hem, TRF, IgG, Lys, Fib, and HSA, alters the effective concentration of NBS, thus preventing the etching of AuNRs to varying extents and generating unique colorimetric responses for each protein (Scheme 1). The colorimetric responses of the probe were investigated using linear discriminant analysis (LDA) and partial least-squares regression (PLSR) to discriminate proteins and establish the correlation between the concentration matrix and the independent variable matrix. Furthermore, hierarchical cluster analysis (HCA) was employed to accurately cluster individual proteins and their combinations at various levels. Obviously, the multicolorimetric probe functions as a self-contained sensing unit that requires no internal or external adjustments, making it a practical platform for multiplex protein detection. The etching-based colorimetric probe exhibits vibrant rainbow color patterns and interesting spectral variations, enabling effective detection of proteins at low concentrations while maintaining accuracy even in complicated environments.
![]() | ||
Scheme 1 Schematic illustration representing the principle of the proposed multicolorimetric probe for protein discrimination. |
To achieve this, UV-Vis spectroscopy and TEM were used to characterize the synthesized AuNRs, which had an average aspect ratio of 3.7, demonstrated excellent monodispersity, and exhibited two characteristic LSPR peaks at 514 nm (transversal) and 760 nm (longitudinal) (Fig. S2a and S2b†). Upon etching the AuNRs, a shift towards shorter wavelengths in the longitudinal LSPR peak was observed, accompanied by a color change from brown to pink (Fig. S2†). This process also resulted in the appearance of an absorption peak at 525 nm, indicating the formation of gold nanospheres.43 Although the study focused on AuNRs with an aspect ratio of 3.7, the methodology can be adapted to AuNRs with different aspect ratios, provided that experimental conditions remain consistent within each set of experiments. To explore this adaptability, the spectral variations of three AuNRs with different longitudinal LSPR wavelengths during the etching process were recorded (Fig. S3†). It was found that larger aspect ratios produced more pronounced plasmonic shifts during etching, which is crucial for enhancing color tonality and extending the detection range. For example, AuNRs with shorter LSPR wavelengths (e.g., 725 nm) exhibited subtler color variations and blue shifts, leading to a narrower response range. In summary, the use of a single large batch of AuNRs with a fixed aspect ratio of 3.7 minimized variability and improved the reliability and robustness of the detection system. While the methodology is adaptable to AuNRs with different aspect ratios, larger AuNRs offer a broader response range and enhanced color tonality.
The TEM images depicted in Fig. S2c† demonstrate the formation of nanospheres during the etching process, resulting in vibrant multicolor variations. When proteins were incubated with NBS, distinct multicolor variations were observed, along with a reduction in the blue shift in the spectrum of the AuNRs (Fig. 1). At a concentration of 10.0 ppm, the degree of protein oxidation using NBS varied based on the specific amino acid residues in the target proteins, leading to different etching patterns in the AuNRs (Fig. 1). Additionally, aggregation of the AuNRs was observed only upon exposure to Pep, as indicated by the red shift and broadening of the AuNRs’ longitudinal peak (Fig. 1). As shown in Table S2,† this aggregation likely occurs due to the different isoelectric point pH values of Pep (i.e., 1.0) compared to those of the other proteins, as it carries a negative charge at pH levels above 1.0, leading to significant aggregation of the AuNRs rather than etching due to the positive surface charge of the AuNRs. Consequently, the distinct spectral patterns of Pep facilitated its differentiation from the other proteins, with each protein exhibiting a unique absorption pattern, enabling effective discrimination.
![]() | ||
Fig. 1 Absorption spectra and the corresponding images of the etching of the AuNR probe as a blank and in the presence of protein samples (10.0 ppm). |
Based on our previous report, the ideal pH for fast and regulated etching is 7, while providing a complete range of colors to be observed in the development of a visual multicolor probe.43 This process is initiated with the etching of brown-colored AuNRs and ends with the formation of red-colored Au nanospheres and colorless Au(I) during the process. Hence, during the optimization process, the pH was adjusted to 7, and all subsequent analyses were performed accordingly.
First of all, Fig. S4† illustrates that increasing the concentration of NBS significantly enhanced the etching of AuNRs, resulting in a greater blue shift of the longitudinal LSPR peak, shifting it from ∼760 to ∼525 nm. In fact, 75 μmol L−1 of NBS completely etched the AuNRs into red-colored Au nanospheres, exhibiting a single LSPR peak at ∼525 nm. Accordingly, 75.0 μmol L−1 was determined to be the optimized concentration for NBS.
Incubation time is another important factor that significantly impacts the performance of the proposed probe. Preliminary experiments showed that the etching process is inhibited to varying degrees when NBS is incubated with proteins. Specifically, aqueous solutions of each protein were incubated with NBS for a duration ranging from 0 to 20 minutes, after which the resulting mixture was introduced to AuNRs at pH 7. In the absence of proteins, the etching process remains unhindered, forming red-colored Au nanospheres. However, in the presence of proteins, the etching of AuNRs is inhibited to varying degrees, depending on the incubation time, until equilibrium is reached (Fig. S5 and S6†). A fixed time of 10 minutes was selected to ensure the repeatability of the method, as most proteins reach equilibrium within this timeframe, and distinct behaviors between the proteins were observed during this period (Fig. S7†).
![]() | ||
Fig. 2 Color variation patterns and spectral variation responses of the colorimetric multidimensional probe in the presence of eight proteins. |
The main challenge in performing LDA is that the number of samples (rows) in the dataset matrix must be equal to or greater than the number of variables (columns).56 In this investigation, absorbance values recorded at various wavelengths were used as independent variables, while proteins at different concentrations served as samples, resulting in a smaller number of samples than variables. Therefore, PCA was initially performed to reduce the dimensionality of the training sets for each case. The LDA models were then trained using the first three PCs, which captured the most significant variance within each dataset. The LDA models showed remarkable discrimination of the eight protein clusters, achieving 100% accuracy across all 20 concentration levels (Fig. S10 and S11†). Furthermore, as shown in Fig. 3c and h, the 2D LDA plots demonstrated the excellent discriminatory power of the multidimensional colorimetric probe for the eight proteins at two concentration levels (12.5 and 15.0 ppm as samples). The exceptional performance of the models was further confirmed by the corresponding jackknife tables (Tables S3 and S4†), which indicated that both sensitivity and selectivity for the eight proteins at these two concentration levels (12.5 and 15.0 ppm as samples) were 100.0%, with no misclassifications. Next, eight distinct two-dimensional (2D) LDAs were conducted to evaluate the sensitivity of the probe to specific proteins at varying concentrations, producing well-clustered 2D score plots for each protein with no misclassification (Fig. 4).
![]() | ||
Fig. 4 2D score plots for (a) ACP, (b) Pep, (c) Hem, (d) TRF, (e) IgG, (f) Lys, (g) Fib, and (h) HSA at different concentrations. |
Additionally, CIELAB has recently demonstrated promising potential in representing the features in the spectra of AuNRs for both qualitative and quantitative modeling.39 To assess the applicability of this approach in our proposed sensor, CIELAB parameters were extracted from the color images presented in Fig. 2. The resulting dataset was then used to train an LDA model for classification. As shown in Fig. S12 and S13,† the LDA models based on CIELAB parameters achieved 100% accuracy in discriminating proteins only at specific concentrations: 7.5, 10.0, 12.5, 15.0, 17.5, 20.0, 25.0, 32.5, 35.0, and 45.0 ppm. At other concentrations, the accuracy dropped significantly. In contrast, as shown in Fig. S10 and S11,† LDA score plots based on the absorption spectra accurately identified proteins at all 20 concentration levels.
To further evaluate the potential of the probe in discriminating the eight proteins, variations in PC-1 were utilized to perform hierarchical cluster analysis (HCA), a commonly used chemometric technique associated with unsupervised learning. HCA is typically conducted on the sample space to identify clusters within the data. As shown in Fig. S14 and S15,† the HCA dendrograms for discrimination of all eight proteins at each concentration level clearly demonstrate that the multidimensional colorimetric probe accurately clustered all three replicates of the protein samples without any misclassification. Furthermore, as illustrated in Fig. 3d and i, the diverse protein samples at two concentration levels (12.5 and 15.0 ppm) were successfully clustered by HCA without any misclassification.
Another approach for representing the obtained response profiles is through radar plots, a practical graphical method for displaying multivariate data in 2D, where three or more quantitative variables are plotted on axes originating from the same point. Radar plots can also be used to represent the sample space, highlighting the variable with the maximum variance between samples. The radar plots shown in Fig. S16 and S17† were obtained by using the variations in PC-1 for all proteins at different concentration levels. These unique patterns effectively represent complex numerical data, illustrating the unique behavior of each protein in comparison with others. As demonstrated in Fig. 3e and j, the radar plots revealed distinct patterns for the eight proteins at two concentration levels (as two sample concentrations).
The potential of the probe for quantitative analysis was confirmed by the high correlation observed between the predicted concentrations and the actual measurements (Fig. 5). The multivariate regression models indicated that the colorimetric spectral responses from the probe were linearly dependent on protein concentrations within the ranges of 0.8–40.0, 1.4–32.5, 1.2–22.5, 1.5–22.5, 1.0–27.5, 0.8–30.0, 1.4–45.0, and 1.0–40.0 ppm, with detection limits of 0.3, 0.5, 0.4, 0.5, 0.3, 0.3, 0.5, and 0.3 ppm for ACP, Pep, Hem, TRF, IgG, Lys, Fib, and HSA, respectively (Table 1). A series of analytical figures of merit, including accuracy, precision, sensitivity, and response range, were calculated and are presented in Table 1. The robustness of the regression models was further demonstrated by the high R-squared values (R2 > 0.99) and the low root-mean-square error (RMSE) values.
Sample | Opt. LVs | RMSEP | REP% | R 2 | SEN | Anal. SEN | LOD (ppm) | LOQ (ppm) | Linear range (ppm) |
---|---|---|---|---|---|---|---|---|---|
ACP | 4 | 0.4 | 1.8 | 0.9995 | 0.108 | 14.21 | 0.3 | 0.8 | 0.8–40.0 |
Pep | 5 | 0.8 | 4.1 | 0.9981 | 0.044 | 16.21 | 0.5 | 1.4 | 1.4–32.5 |
Hem | 6 | 0.6 | 5.2 | 0.9964 | 0.054 | 21.45 | 0.4 | 1.2 | 1.2–22.5 |
TRF | 5 | 0.8 | 5.7 | 0.9948 | 0.123 | 60.23 | 0.5 | 1.5 | 1.5–27.5 |
IgG | 4 | 0.6 | 4.3 | 0.9975 | 0.268 | 42.28 | 0.3 | 1.0 | 1.0–27.5 |
Lys | 4 | 0.4 | 2.8 | 0.9990 | 0.181 | 30.42 | 0.3 | 0.8 | 0.8–30.0 |
Fib | 6 | 1.0 | 4.7 | 0.9971 | 0.027 | 21.06 | 0.5 | 1.4 | 1.4–45.0 |
HSA | 6 | 0.6 | 3.1 | 0.9988 | 0.058 | 22.06 | 0.3 | 1.0 | 1.0–40.0 |
Additionally, the CIELAB dataset obtained in the previous section was used to train PLSR models for each protein. A comparison between the PLSR models derived from the absorption spectra and those based on the CIELAB dataset revealed that the latter exhibited significantly narrower linear ranges. Specifically, the linear ranges for ACP, Pep, Hem, TRF, IgG, Lys, Fib, and HSA were limited to 10.0–25.0, 2.5–25.0, 1.0–10.0, 2.5–15.0, 5.0–20.0, 5.0–20.0, 5.0–20.0, and 7.5–20.0 ppm, respectively (Fig. S18†). These findings confirm that absorption signals provide a more reliable and robust approach for quantitative analysis, enabling a broader and more comprehensive evaluation of protein interactions across a wider dynamic range.
Most importantly, the proposed strategy is particularly effective for determining the HSA percentage in mixture samples containing HSA/Lys and HSA/TRF binary mixtures, as well as the HSA/Lys/TRF ternary mixtures with varying ratios. Changing the ratio in mixture samples from 9:
1 to 1
:
9 for binary mixtures and from 90
:
5
:
5 to 10
:
45
:
45 for ternary mixtures results in a strong correlation with the inhibition of AuNR etching (Fig. S21†). Consequently, the dataset matrix comprising binary and ternary protein mixtures was used to train three separate PLS-1 models based on the HSA concentration percentage. Three robust regression models were developed and are documented in Table 2, demonstrating exceptional performance in terms of sensitivity, accuracy, precision, and response range. The models exhibited high accuracy, as indicated by large R-squared (R2 > 0.99) and low root-mean square error (RMSE) values. Ultimately, the results clearly indicate that the proposed multidimensional colorimetric probe is highly suitable for accurately analyzing proteins in a quantitative manner.
Sample | Opt. LVs | RMSEP | REP% | R 2 | SEN | Anal. SEN | LOD (%) | LOQ (%) | Linear range (%) |
---|---|---|---|---|---|---|---|---|---|
HSA/Lys | 4 | 2.3 | 4.6 | 0.9959 | 0.008 | 9.90 | 1.5 | 4.6 | 4.6–90 |
HSA/TRF | 6 | 2.5 | 4.6 | 0.9940 | 0.004 | 7.18 | 1.8 | 5.4 | 5.4–90 |
HSA/Lys/TRF | 6 | 1.2 | 2.4 | 0.9989 | 0.005 | 7.87 | 0.9 | 2.7 | 2.7–90 |
Sample | Spiked (ppm) | Found (ppm) | Recovery (%) | RSD (n = 3, %) |
---|---|---|---|---|
ACP-real | 15.0 | 16.2 | 108.2 | 1.1 |
Pep-real | 15.0 | 15.5 | 103.6 | 0.1 |
TRF-real | 15.0 | 16.1 | 107.3 | 2.8 |
Lys-real | 15.0 | 15.1 | 100.8 | 1.3 |
Footnotes |
† Electronic supplementary information (ESI) available: Chemicals and materials; instrumentation and characterization; synthesis of AuNRs; preparation of the Britton–Robinson buffer; statistical analysis; absorbance spectra and TEM images of the synthesized AuNRs and the AuNRs after etching with NBS; UV-Vis absorption spectral variations of different-sized AuNRs; effect of NBS concentration on the AuNRs; effect of incubation time on the probe; variation in the absorption spectra of the probe at different protein concentration levels; 2D LDA score plots, HCA dendrograms, radar plots, and the jackknifed classification matrix; multivariate calibration using PLS regression. See DOI: https://doi.org/10.1039/d4nr04797d |
‡ These authors contributed equally. |
This journal is © The Royal Society of Chemistry 2025 |