Open Access Article
Mohammad
Mohammadi
a,
Sima
Najafzadehkhoei
b,
George G.
Vega Yon
bc and
Yunshan
Wang
*a
aDepartment of Chemical Engineering, University of Utah, Salt Lake City, UT, USA. E-mail: yunshan.wang@utah.edu
bDepartment of Population Health Sciences, University of Utah, Salt Lake City, UT, USA
cDepartment of Internal Medicine, University of Utah, Salt Lake City, UT, USA
First published on 14th October 2025
This study introduces a hybrid approach integrating advanced plasmonic nanomaterials and machine learning (ML) for high-precision biomolecule detection. We leverage aluminum concave nanocubes (AlCNCs) as an innovative plasmonic substrate to enhance the native fluorescence of neurotransmitters, including dopamine (DA), norepinephrine (NE), and 3,4-dihydroxyphenylacetic acid (DOPAC). AlCNCs amplify weak fluorescence signals, enabling probe-free, label-free detection and differentiation of these molecules with great sensitivity and specificity. To further improve classification accuracy, we employ ML algorithms, with Long Short-Term Memory (LSTM) networks playing a central role in analyzing time-dependent fluorescence data. Comparative evaluations with k-nearest neighbors (KNN) and Random Forest (RF) demonstrate the superior performance of LSTM in distinguishing neurotransmitters. The results reveal that AlCNC substrates provide up to a 12-fold enhancement in fluorescence intensity for DA, 9-fold for NE, and 7-fold for DOPAC compared to silicon substrates. At the same time, ML algorithms achieve classification accuracy exceeding 89%. This interdisciplinary methodology bridges the gap between nanotechnology and ML, showcasing the synergistic potential of AlCNC-enhanced native fluorescence and ML in biosensing. The framework paves the way for probe-free, label-free biomolecule profiling, offering transformative implications for biomedical diagnostics and neuroscience research.
MANTs have an aromatic ring structure that emits autofluorescence (AF) when excited by ultraviolet (UV) light. The AF absorption cross-sections are orders of magnitude higher than that of Raman or infrared absorption. Therefore, AF spectroscopy is a promising technique for sensitive, label-free, and probe-free quantification and identification of MANTs. However, the classification of similar MANTs based on their AF profile is challenging due to their overlapping spectrum.18 We have shown in our prior work that the AF of MANTs drop cast on a solid substrate (e.g., a silicon wafer or a plasmonic nano hole array, etc.) decays exponentially over time when continuously exposed to UV light.19–22 The decay rate constants were found to be distinct among similarly structured MANTs, and their differences were enlarged by a UV plasmonic nano hole-array.20 Our prior work focused solely on the decay rate constants and has not realized the full potential of using UV plasmonic-engineered Auto Fluorescence Time Decay Series (AFTDS) in classifying MANTs. In this paper, we demonstrated excellent classification accuracy by combining artificial intelligence with the AFTDS of MANTs deposited on aluminum plasmonic concave nanocubes (AlCNC). The assembly of concave cubes was reproducibly obtained by drop casting a droplet of nanoparticle solution containing AlCNCs in ambient conditions, offering a cost-effective and nanofabrication-free way to form a large area of plasmonic substrates.23,24
Machine learning has increasingly become a powerful tool in nanoscale science, surface chemistry, and biosensors, enabling advances in fabrication, characterization, and property prediction by integrating experimental data with physics-informed models. While ML has been applied to Raman and fluorescence spectroscopy for biochemical sensing, this paper is the first application of ML for the classification of neurotransmitters based on AFTDs. We performed comparative analysis using three ML techniques – Long Short-Term Memory (LSTM), k-nearest neighbors (KNN), and Random Forest (RF).25–27 LSTM on AFTDS collected on AlCNCs achieved the highest classification accuracy, followed by a slightly poorer performance by KNN and RF on AFTDS. KNN and RF on AF collected in the solution phase without plasmonic nanoparticles achieved poorer performances. Our results emphasize the importance of plasmonic-engineered AFTDS in achieving high classification accuracy among similarly structured MANTs. In addition, the superiority of LSTM over KNN and RF in analyzing time-dependent AF data was demonstrated.
Fluorescence measurements of the neurotransmitters were conducted on two substrates: AlCNC and plain silicon wafers as a reference. The AlCNC structure was selected based on our hypothesis that its concave geometry, featuring sharp corners and edges similar to those in aluminum bowtie nano antennas,28,29 enables it to more effectively trap and concentrate light, leading to stronger and more consistent fluorescence enhancement compared to smoother geometries or spherical shapes, such as aluminum hole arrays, and aluminum nanotriangles.21,30 A standard 2-inch silicon wafer was cut into four pieces and treated with a plasma cleaner for 90 seconds at a base pressure of 0.4 torr to render the surface hydrophilic. Subsequently, 5 μL of the AlCNC nanoparticle (NP) solution was drop-cast onto the silicon substrate and allowed to dry in ambient conditions;23 offering a simple and time-efficient fabrication route without requiring complex methods or equipment such as e-beam lithography or sputtering.21 After drying, a multi-layer pattern of the nanoparticles remained on the substrate surfaces. The structural and compositional analysis of AlCNCs was performed using scanning electron microscopy (SEM) and scanning transmission electron microscopy (STEM). Following substrate preparation, solutions of three MANTs in deionized water at varying concentrations were prepared, and 1 μL of each solution was drop-cast24 onto two different substrates. The outer edge of the coffee ring pattern of the AlCNC where high concentration of AlCNCs were found and a Si wafer to measure the fluorescence spectrum. The dried MANTs were then exposed to a 266 nm CW solid state UV laser CryLas FQCW266-10 (incident angle of 60°) with an output power of ∼5 mW and a circular spot size of ∼100 μm in diameter, and AFTDS were collected using a Horiba IHR 550 spectrometer coupled with a CCD camera.19–22 The AFTDS were taken over 2–3 minutes and an exposure time of 0.5 seconds was used for each spectrum within the AFTDS.
We performed a comparative analysis using three ML techniques: first, we used LSTM; a type of recurrent neural network well-suited for sequence prediction, which can capture temporal dependencies in the fluorescence data, potentially improving the accuracy for distinguishing closely related biomolecules. While LSTM has been applied for capturing sequential patterns and forecasting time-series tasks such as molecular generation,31 property prediction,32,33 and dynamic process modeling,34,35 this paper presents the first application of LSTM in classifying molecules based on AFTDS. Secondly, we used KNN; a simple, instance-based learning algorithm that classifies data points based on the closest training examples in the feature space.36 Third, we used RF, an ensemble learning method based on decision trees, to classify the spectral data and assess its performance compared to other algorithms.27 The AFTDS data was recorded as a time-dependent spectrum spanning 280–360 nm, Each fluorescence time series originally contained up to 240–360 spectra (0.5 s interval over 2 to 3 minutes). However, because the number of time-dependent spectra for each illumination spot (sequence length) varied across experiments (ranging from 0–17 to 240–360 spectra after preprocessing), we standardized all AFTDS inputs to the shortest complete sequence length of 17. Longer sequences were truncated to 17 regularly spaced sequences, and shorter ones were padded if necessary. This variability was due to the adjustment of sample position to find a high fluorescence signal spot that leads to delayed collection of fluorescent data after shutter opens at time 0. This length adjustment ensured consistent input dimensions for the LSTM while preserving the overall decay dynamics.
Adjustment details can be found in Fig. S1, Tables S1 and S2 of the SI. These were converted into structured input tensors of shape (time_steps × wavelength_bins) for sequence-based models (LSTM) or flattened into feature vectors for non-sequential models (KNN, RF). Processing the AFTDS with ML was accomplished in different steps; the fluorescence data gathered from multiple experiments was used to train the ML model. Each input sample was labeled with the corresponding neurotransmitter class (DA, DOPAC, NE) to train supervised classifiers. The training process involved pre-processing the fluorescence data, feature extraction, and applying data augmentation techniques to improve the model's accuracy and generalizability.
Distinguishing neurotransmitters with similar structures solely based on their native fluorescence is challenging due to their overlapping fluorescence spectra. To address this challenge, we employed ML classification models and developed a reliable classification framework capable of differentiating neurotransmitters based on their unique fluorescence signatures. This approach allows for more precise detection and identification of neurotransmitters. The integration of ML not only improves classification accuracy but also provides a scalable method for analyzing large experimental datasets 6, 10.
We scaled all fluorescence data with MinMaxScaler and evaluated model performance with 4-fold cross-validation (75% train, 25% test) as Fig. 1 shows the workflow. The dataset was first stratified by neurotransmitter class to maintain balanced representation in each fold. For every split, three folds (75%) were used exclusively for training, and the remaining fold (25%) was held out for validation. This process was repeated four times so that each fold served once as the validation set, and the reported metrics correspond to the mean across folds. To prevent data leakage, preprocessing steps such as MinMax scaling fit only on the training partition within each fold and then applied to the validation data. Importantly, spectra from the same experimental replicate were kept within a single fold, ensuring that no replicate contributed to both training and validation sets. For the in-solution emission spectra, each sample was a static snapshot of paired wavelength–intensity values fed directly to KNN and a 150-tree RF. For the AFTDS, we supplied KNN and RF with flattened, right-aligned intensity vectors, while the LSTM received the raw 17-step sequence, letting its three stacked LSTM layers learn temporal dependencies before batch normalization, a dense layer, and soft-max output. This unified preprocessing lets each model exploit either static spectral structure or time dynamics as appropriate, yielding consistent classification performance across modalities. Further details and the code used to process the data, fit the model, and generate predictions can be found in the data availability section.
The AFTDS collected on AlCNC substrates contain only seventeen regularly spaced intensity values per trace, so the data are sequential and short. A LSTM network suits this setting because its gated recurrent structure keeps information from both the fast initial quench and the slower tail of the decay curve, the two regions that together separate DA, DOPAC, and NE even though they occur at very different times.
The in-solution spectra, by contrast, are single wavelength–intensity snapshots. For these static measurements, we wanted algorithms that run quickly while still capturing subtle spectral patterns. An 11-neighbor KNN classifier provides a simple non-parametric baseline that classifies by proximity in wavelength–intensity space, while a RF with 150 trees captures non-linear interactions among neighboring wavelengths and offers built-in feature importance scores that aid interpretation.
Simpler alternatives such as logistic regression and single decision trees were rejected before testing. Linear decision boundaries cannot recover the class separation that appears only after part of the fluorescence has decayed, and a single tree is sensitive to noise in individual wavelengths. Either option would likely underfit or overfit without reducing inference time, since RF and KNN already respond well under a millisecond on the intended devices. Therefore, we concentrated our experiments on the LSTM model for the sequential traces and on KNN together with RF as fast, interpretable baselines for the static spectra.
All analyses were performed in Python 3.10 using TensorFlow 2.18.0.37 The LSTM achieved 89% accuracy using 17-point standardized sequences derived from the original variable-length fluorescence time series, under 4-fold cross-validation with replicate-level partitioning to avoid data leakage. The training data was partitioned into four equal folds, with three folds (75%) used for training and one-fold (25%) reserved for validation in each iteration. This process was repeated four times, allowing each fold to serve as the validation set once. By exposing the model to diverse training and validation distributions, cross-validation effectively addressed class imbalance, enhanced generalization, and minimized bias. Real-time inference was benchmarked on Google Colab Pro with an NVIDIA A100 GPU and Intel Xeon CPU (2.20 GHz); the trained LSTM processed a single 17-point AFTDS trace in under 1 ms, confirming the framework's suitability for real-time neurotransmitter classification.
Fig. 2 presents the training and validation curves of the LSTM model over 50 epochs, showing model accuracy (blue) and model loss (red). Both training and validation accuracy increase rapidly during the initial epochs and stabilize around 0.89, indicating good generalization and minimal overfitting. The k-fold cross-validation results further support the model's consistency, with accuracy ranging from 88.2% to 90.9% and loss values stabilizing around 0.3 across folds.
![]() | ||
| Fig. 2 The plot presents the training and validation curves of the LSTM model, illustrating model accuracy (blue) and model loss (red) over 50 training epochs. | ||
The dataset used for model training and evaluation consisted of 146
432 samples for models trained on AlCNC and 4113 samples for in-solution models. A 75–25% split was applied, allocating approximately 109
824 samples for training and 36
608 samples for testing in the AlCNC dataset, while the in-solution dataset contained 2817 training samples and 939 test samples the dataset captures both spectral and temporal fluorescence dynamics, providing a rich input for classification.
AF intensity is influenced by concentration, absorption efficiency (ε), excitation light intensity, and experimental conditions.38 While a molecule with a higher quantum yield generally exhibits greater fluorescence intensity,20 the molecule with a higher extinction coefficient can absorb more light, leading to stronger fluorescence despite a lower quantum yield. For example, although NE has a higher quantum yield than DA, the higher extinction coefficient of DA (2110 L mol−1 cm−1 compared to NE's 1070 L mol−1 cm−1) results in higher fluorescence intensity from DA. In contrast, although DOPAC exhibits an extinction coefficient comparable to DA, its much lower quantum yield (1.7%) leads to significantly reduced fluorescence intensity.39,40 The extinction coefficients of DA, NE, and DOPAC were determined from the slope of linear fit data of absorption vs. concentration plots in Fig. 4, using five data points employed in our previous work for quantum yield calculations. The calculated extinction coefficients were 2110, 1070, and 2210 L mol−1 cm−1 for DA, NE, and DOPAC, respectively, as presented in Table 1. The fluorescent data for DA, NE, and DOPAC in deionized water (DI) at 5 different concentrations are shown in Fig. S3 and the corresponding absorption data are shown in Fig. S4.
We employed AlCNCs that were drop cast and dried on a bare Si wafer (as reference) as plasmonic substrates to differentiate and analyze three similar neurotransmitters: DA, DOPAC, and NE. Fig. 5(A–C) presents AFTDS signals, decreasing over time for DA, DOPAC, and NE, and the topmost curve (highest intensity) corresponds to the first spectrum. The AF spectra, shown in Fig. 5D and E, compare the intensities of DA, DOPAC, and NE acquired at 0–0.5 seconds with an acquisition time of 0.5 seconds (I0, the highest intensity spectrum in the AFTDS) on a silicon wafer (Fig. 5D) and AlCNC substrates (Fig. 5E). DA exhibited the highest fluorescence intensity on both substrates, followed by NE and DOPAC. Notably, fluorescence intensities on the AlCNC substrate were significantly higher compared to a silicon substrate, highlighting the fluorescence-enhancing properties of AlCNCs. This enhancement is specifically attributed to the localized surface plasmon resonance (LSPR) effects of AlCNCs in the UV range, which amplifes both the excitation and fluorescent emission (quantum yield) near the nanocube surface, unlike the non-plasmonic silicon substrate.39 This enhancement suggests strong plasmonic amplification or more efficient excitation/emission conditions on the AlCNC substrate while maintaining consistent spectral profiles. Additionally, fluorescence spectra data acquired using a fluorometer for the neurotransmitters dissolved in water are presented in Fig. 5F.
Fig. 6 compares the average integrated fluorescence intensities and net enhancement factors for DA, NE, and DOPAC; each droplet contains 1 μL of 500 μM of MANTs that drop cast and dried on a silicon wafer and AlCNC substrates. The integrated fluorescence intensity and net enhancement factors were calculated using the methods published previously.19,20 To quantify the variability in enhancement measurements, the error was calculated as the standard deviation (STD) across multiple experimental replicates.40 For each neurotransmitter (DA, DOPAC, and NE), fluorescence spectra were collected at multiple distinct spots (n = 5–7) on both AlCNC and silicon (Si) substrates. The integrated fluorescence intensity for each measurement was obtained by subtracting the baseline signal (dark spectrum) and computing the area under the spectrum using the trapezoidal rule. Enhancement factors were calculated by normalizing the integrated intensity on AlCNC to the corresponding average intensity on Si. The reported enhancement values represent the mean of these normalized measurements, and the associated errors reflect the standard deviation across the set of replicates, thereby accounting for experimental variation due to sample heterogeneity, measurement noise, and substrate uniformity. As shown in Fig. 6A, fluorescence intensities on the AlCNC substrate were significantly higher than those on the silicon wafer, with DA exhibiting the strongest signal, followed by NE and DOPAC. Fig. 6B highlights the net enhancement factors achieved on AlCNC, with DA, NE, and DOPAC showing enhancements of 12, 9, and 7 respectively, compared to silicon wafers.
![]() | ||
| Fig. 6 (A) Integrated fluorescence intensity and (B) net enhancement factors from three neurotransmitters: DA, DOPAC, and NE, comparing signal levels between AF data on silicon wafers and AlCNCs. | ||
These results demonstrate that AlCNCs drop cast and dried on a substrate are effective in enhancing the AF signal of MANTs. Although other plasmonic nanostructures have achieved higher fluorescence enhancement factors, the presented method achieves a plasmonic substrate without complicated, lengthy and expensive nanofabrication techniques.
The AFTDS data on different concentrations for model training and evaluation are shown in Fig. S5. The dataset contained three classes—DA, DOPAC, and NE—with slightly varying support: 54
784, 51
712, and 39
936 samples, respectively, in the AlCNC dataset, and 1105, 1,326, and 1326 samples in the in-solution dataset and more details about samples in each class can be found in Fig. S6. Each sample corresponds to a unique fluorescence data point defined as a pair of fluorescence intensity and wavelength values, represented as a function FL(Wv, I). As shown in Fig. S5, each fluorescence time series was acquired over about 2 to 3 minutes measurement timeframe with a 0.5-second acquisition interval, yielding approximately 240–360 spectra per measurement. Each of these spectra spans a 60 nm range (280–360 nm), representing the AFTDS for a specific molecule at a defined concentration. The weighted and macro averages in performance metrics were used to ensure a balanced evaluation across all classes.
The classification performance metrics, precision, recall, and F1 score, for LSTM, KNN, and RF models across three molecular classes: DA, DOPAC, and NE, are presented in Table 2. These metrics, derived from confusion matrices, provide insight into each model's classification, accuracy, and efficiency. We included macro and weighted averages to summarize model performance across all classes. Macro average treats all classes equally, useful for balanced datasets, while the weighted average reflects class distribution, favoring the majority classes. These metrics offer complementary views but can mislead if used alone in imbalanced settings. For better clarity, bar charts are provided in Fig. S7.
The results demonstrate that the LSTM model consistently outperforms the KNN model when applied to AFTDS data collected from MANTs on AlCNCs. The LSTM model achieves a weighted and macro average F1 score of 0.89, indicating strong generalization and superior classification performance across all classes. In contrast, the KNN model yield a slightly lower weighted and macro average F1 score of 0.86, reflecting its comparatively reduced ability to capture dynamic spectral features.
For AF spectra data collected from MANTs in solution (without plasmonic nanostructures), the KNN model exhibits reduced classification performance compared to its performance on AFTDS data. It achieves a weighted and macro average F1 score of 0.68. Notably, KNN performs best on DOPAC but struggles with NE and exhibits the weakest performance on DA. This discrepancy may be attributed to differences in molecular intensity within solution-based measurements. Conversely, classification accuracy using AFTDS data remains relatively consistent across different molecules, underscoring its robustness in molecular classification.
In addition to KNN and LSTM, a RF model was evaluated under the same conditions. On AFTDS data, the RF model achieved a weighted and macro average F1 score of 0.84, slightly lower than the KNN and LSTM models but still demonstrating strong performance. When applied to AF data collected in solution, the RF model outperformed KNN, with a weighted and macro average F1 score of 0.71. This indicates that while RF does not match LSTM performance on AFTDS data, it provides more reliable classification than KNN in solution-based environments, likely due to its ensemble-based structure and resilience to data noise.
The sample size for data collected on AlCNC is much larger than that of solution-based spectra. Since ML benefits from large data sets, our methods of collecting time variant spectra under continuous light illumination are advantageous compared with solution-based spectra collection. For instance, using AlCNC and acquiring AFTDS data with an acquisition time of 0.5 seconds, we obtained 300 spectra from a single molecular concentration in 2 minutes. In contrast, obtaining the same number of spectra with solution-based UV-Vis would require 300 different concentrations and separate measurements, which would be an impractical, time-consuming, and costly approach.
Overall, these findings highlight the LSTM model's advantage in leveraging the time-dependent characteristics of AFTDS signals, allowing for more accurate molecular differentiation. In contrast, the KNN model, which relies on static data, demonstrates lower classification performance. The confusion matrix results further reinforce the LSTM model's effectiveness in distinguishing structurally similar MANTs based on AFTDS data while also showing the RF model as a better alternative than KNN when temporal data is limited or unavailable.
Integrating ML models, particularly LSTM networks, proved critical in analyzing and classifying complex fluorescence data. With a classification accuracy of 89%, the ML models effectively captured subtle variations in spectral data, enabling the reliable identification of neurotransmitters. The study also highlighted the comparative advantage of LSTM over KNN and RF in handling time-dependent fluorescence data (AFTDS). In addition, RF models demonstrated competitive performance, particularly in environments where temporal patterns are less pronounced. While not as effective as LSTM on AFTDS data, the RF model surpassed KNN in classifying solution-based spectra, emphasizing its robustness and versatility as a static-data classifier and suggesting RF as a better approach in scenarios where dynamic signal acquisition may be limited.
These findings underscore the potential of combining nanotechnology and ML to create smart biosensing systems that are both sensitive and selective. To the best of our knowledge, the proposed methodology may pave the way for developing non-invasive, real-time diagnostic tools for complex biological environments, offering new avenues for applications in biomedical diagnostics, neuroscience, and beyond.
| This journal is © The Royal Society of Chemistry 2025 |