Benchmarking Deep Learning Models for Raman Spectroscopy Across Open-Source Datasets

Abstract

Deep learning classifiers for Raman spectroscopy are increasingly reported to outperform classical chemometric approaches. However, their evaluations are often conducted in isolation or compared against traditional machine learning methods or trivially adapted vision-based architectures that were not originally proposed for Raman spectroscopy. As a result, direct comparisons between existing deep learning models developed specifically for Raman spectral analysis on shared open-source datasets remain scarce. In this work, we focus on supervised Raman spectra classification where each spectrum is assigned to a predefined material, bacterial/yeast isolate, drug treatment or pharmaceutical compound. To the best of our knowledge, this study presents one of the first benchmarks comparing three or more published Raman-specific deep learning classifiers across multiple open-source Raman datasets. We evaluate five representative Deep Learning (DL) architectures along with two conventional Machine Learning (ML) methods under a unified training and hyperparameter tuning protocol across three open-source Raman datasets selected to support standard evaluation, fine-tuning, and explicit distribution-shift testing. In this comparative study, we primarily focus on classification because the selected open-source datasets provide classification annotations, while annotations for complete structure elucidation are not available. We report classification accuracies and macro-averaged F1 scores to provide a fair and reproducible comparison of the supervised ML and DL models for Raman spectra based classification.

Supplementary files

Article information

Article type
Paper
Submitted
27 Jan 2026
Accepted
17 Jun 2026
First published
23 Jun 2026
This article is Open Access
Creative Commons BY license

Digital Discovery, 2026, Accepted Manuscript

Benchmarking Deep Learning Models for Raman Spectroscopy Across Open-Source Datasets

A. Sineesh and A. R. Kamsali, Digital Discovery, 2026, Accepted Manuscript , DOI: 10.1039/D6DD00044D

This article is licensed under a Creative Commons Attribution 3.0 Unported Licence. You can use material from this article in other publications without requesting further permissions from the RSC, provided that the correct acknowledgement is given.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements