Rajendhar
Junjuri
ab,
Matteo
Calvarese
a,
MohammadSadegh
Vafaeinezhad
abc,
Federico
Vernuccio
d,
Marco
Ventura
de,
Tobias
Meyer-Zedler
ab,
Benedetta
Gavazzoni
d,
Dario
Polli
de,
Renzo
Vanna
e,
Italia
Bongarzone
f,
Silvia
Ghislanzoni
f,
Matteo
Negro
g,
Juergen
Popp
abc and
Thomas
Bocklitz
*ab
aLeibniz Institute of Photonic Technology, Member of Leibniz Health Technologies, Member of the Leibniz Centre for Photonics in Infection Research (LPI), Albert-Einstein-Strasse 9, 07745 Jena, Germany. E-mail: Thomas.bocklitz@uni-jena.de
bInstitute of Physical Chemistry (IPC) and Abbe Center of Photonics (ACP), Friedrich Schiller University Jena, Member of the Leibniz Centre for Photonics in Infection Research (LPI), Helmholtzweg 4, 07743 Jena, Germany
cMax Planck School of Photonics, Jena, Germany
dDepartment of Physics – Politecnico di Milano, P.za L. da Vinci 32, 20133 Milano, Italy
eIstituto di Fotonica e Nanotecnologie – CNR, P.za L. da Vinci 32, 20133 Milano, Italy
fDepartment of Diagnostic Innovation, Fondazione IRCCS Istituto Nazionale dei Tumori, Via Giacomo Venezian 1, 20133, Milano, Italy
gCambridge Raman Imaging Ltd, Cambridge, UK
First published on 15th July 2024
Broadband Coherent Anti-Stokes Raman Scattering (BCARS) is a valuable spectroscopic imaging tool for visualizing cellular structures and lipid distributions in biomedical applications. However, the inevitable biological changes in the samples (cells/tissues/lipids) introduce spectral variations in BCARS data and make analysis challenging. In this work, we conducted a systematic study to estimate the biological variance in BCARS data of two commonly used cell lines (HEK293 and HepG2) in biomedical research. The BCARS data were acquired from two different experimental setups (Leibniz Institute of Photonics Technology (IPHT) in Jena and Politecnico di Milano (POLIMI) in Milano) to evaluate the reproducibility of results. Also, spontaneous Raman data were independently acquired at POLIMI to validate those results. First, Kramers–Kronig (KK) algorithm was utilized to retrieve Raman-like signals from the BCARS data, and a pre-processing pipeline was subsequently used to standardize the data. Principal component analysis – Linear discriminant analysis (PCA-LDA) was performed using two cross-validation (CV) methods: batch-out CV and 10-fold CV. Additionally, the analysis was repeated, considering different spectral regions of the data as input to the PCA-LDA. Finally, the classification accuracies of the two BCARS datasets were compared with the results of spontaneous Raman data. The results demonstrated that the CH band region (2770–3070 cm−1) and spectral data in the 1500–1800 cm−1 region have significantly contributed to the classification. A maximum of 100% balanced accuracies were obtained for the 10-fold CV for both BCARS setups. However, in the case of batch-out CV, it is 92.4% for the IPHT dataset and 98.8% for the POLIMI dataset. This study offers a comprehensive overview for estimating biological variance in biomedical applications. The insights gained from this analysis hold promise for improving the reliability of BCARS measurements in biomedical applications, paving the way for more accurate and meaningful spectroscopic analyses in the study of biological systems.
Coherent Anti-Stokes Raman Scattering (CARS) technique, on the other hand, provides much more intense signals without the influence from fluorescence.7,8 In CARS, two laser beams, namely pump and Stokes, at frequency ωp and ωS respectively, are focused on the target to capture the vibrational mode of the sample at frequency Ω = ωp − ωS. The nonlinear interaction of the electric field of a third beam, called probe, at ωpr frequency, produces a coherent anti-Stokes signal, at frequency ωaS = ωp − ωS + ωpr, with several orders of magnitude enhancement in signal intensity compared to the spontaneous Raman signal. In its simplest configuration, CARS employs degenerate pump and probe beams, so that ωp = ωpr, therefore the anti-Stokes frequency is simply equal to ωaS = 2ωp − ωS.9,10 In the case of Broadband CARS (BCARS), a large-bandwidth Stokes laser pulse is utilized to simultaneously excite multiple molecular vibrations and provide more comprehensive chemical information.11 BCARS has been proven as a rapid spectroscopic imaging tool in biomedical applications such as biological tissue mapping,12–15 lipid droplet dynamics,16,17 bacterial spores detection,18 live cell imaging,19 and other fields.1,5,6
Despite its numerous advantages, the application of BCARS is influenced by the dynamic nature of biological systems. Biological variance, stemming from inevitable changes in cellular components, tissues, and lipids, introduces extra complexity to the acquired BCARS spectra. Such variance manifests itself as subtle yet crucial alterations in the measured BCARS data, hindering the reliability of cross-sample studies and comparisons. These variations, whether due to differences in cell lines or inherent biological fluctuations, underscore the need for sophisticated analytical approaches. Consequently, understanding and quantifying the biological variance in BCARS data are pivotal for extracting meaningful information from spectroscopic analyses.
This study delves into the intricacies of biological variance in BCARS data, aiming to ensure the reliability of BCARS measurements for their use as a spectroscopic tool in biomedical research. For this task, we choose HEK293 and HepG2 cell lines. Both cells have been invaluable tools in scientific research, contributing to numerous discoveries and advancements in various fields, including cancer biology, pharmacology, and molecular biology.20,21 First, BCARS data from these two cell lines were acquired using two different experimental setups, available at the Leibniz Institute of Photonics Technology (IPHT) in Jena and Politecnico di Milano (POLIMI) in Milano. Then, the corresponding Raman signal was extracted using the Kramers–Kronig (KK) algorithm.22,23 The spontaneous Raman data were also independently acquired at POLIMI to validate the BCARS data. A comprehensive preprocessing step was implemented to standardize the data as an initial step. Subsequently, PCA-LDA analysis was performed on preprocessed data using two cross-validation (CV) methods: batch-out CV and 10-fold CV. Additionally, the analysis considered different spectral regions of the data as input to the PCA-LDA, with the goal of identifying the most relevant vibrational regions, bringing the highest informational content for best chemical selectivity and species identification. Finally, the classification accuracies of the two BCARS datasets were compared with the results obtained from spontaneous Raman data.
For BCARS measurements in Jena, slightly different cultivation conditions were used. The HepG2 cell line was cultured in Dulbecco's Modified Eagle Medium/Nutrient Mixture F-12 (DMEM: F12) (Biochrom, Berlin, Germany) containing 10% fetal calf serum (FCS) (Thermo Fisher Scientific, Langenselbold, Germany), 1% penicillin, and 1% streptomycin (Merck Millipore, Biochrom, Berlin, Germany). HEK293 cells were obtained from the Centre for Applied Microbiology and Research (CAMR; Porton Down, Salisbury, UK) and cultured in DMEM-F12 (Thermo Fisher Scientific, Waltham, Massachusetts, USA) supplemented with 10% fetal bovine serum (FBS). The cells were maintained in a humidified incubator at 37 °C and 5% CO2. This experimental design aimed to reveal the influence of variance originating from biological dynamics and measurement devices; this will facilitate data comparison from different experimental setups and cell lines.
After locating the cells with the bright-field configuration, a snake scan of a given field of view (FOV) was performed by moving the stage. A schematic of the scanning pattern is shown in Fig. 1. The BCARS spectral images consist of 100 × 100 pixels in a FOV, generally between 50 × 50 μm2 and 100 × 100 μm2. The pixel dwell time was typically set to 50 ms (44 ms exposure time + 6 ms readout time). Only for a few cell regions, the dwell time was increased to 100 ms (94 ms exposure time + 6 ms readout time) due to the signal strength in those specific FOVs. The pump and Stokes powers in the sample plane were 9.5 ± 1.5 mW and 3.35 ± 0.35 mW, respectively, and the repetition rate was set to 1 MHz. After acquiring the image data, spectra were retrieved by following a procedure mentioned in section 2.5.
The samples are positioned onto an XY motorized translation stage (U-780.DNS, Physik Instrumente), while its z position is controlled with a second motorized XYZ stage (P-545.3R8S, Physik Instrumente), mounted on top of the scanning stage. Sample illumination and collection are performed with a pair of 0.85-NA air objectives (LCPLN100XIR, Olympus). After the microscope, a short-pass filter (FESH1000, Thorlabs) selects the anti-Stokes component that is coupled into a spectrometer, consisting of a monochromator (Acton SP2150, Princeton Instruments) equipped with a 600 grooves per mm grating and a back-illuminated CCD camera (BLAZE 100HR, Princeton Instruments). For all the experiments, the average power of the pump and Stokes beams at the sample plane was 25 mW and 10 mW, respectively. A sandwich sample configuration has been adopted for the measurements. First, the cells were plated on 170 μm thick quartz coverslips at the density of 320.000 cells per mL for HepG2 cells and 180.000 cells per mL for HEK293 cells. A drop of phosphate-buffered saline (PBS) buffer was added to them. Later, a second 170 μm thick quartz coverslip was positioned on the top and fixed with enamel glue. BCARS images of cells were acquired with 400 × 400 μm2 field of view (FOV), 1 μm pixel size, and 1.8 ms pixel dwell time (1 ms exposure time + 0.8 ms readout time), as shown in Fig. 2. The BCARS spectra retrieval procedure is presented in section 2.5.
For each selected region of the image, a binary mask is generated, featuring values of 1 in correspondence with the selected area and 0 elsewhere. Then, the binary mask was multiplied by the CARS hyperspectral image, resulting in a 3D matrix that gives the average CARS spectrum of the selected region. In this way, 112 BCARS spectra were acquired from the two batches of samples for HepG2 cell lines, and 136 spectra were obtained from the HEK293 cell lines. A similar procedure was performed for IPHT BCARS images, and 41 spectra were acquired for HepG2 and 42 for the HEK293 cell line. The averaging operation provided the BCARS and reference spectra with a signal-to-noise ratio higher than the raw data. Hence, no further spectral denoising (e.g., singular value decomposition) was performed.26 In the case of spontaneous Raman data, 30 Raman spectra were acquired for each cell line.
Fig. 3(a and b) represents the mean CARS spectra retrieved from the BCARS images acquired at POLIMI and IPHT, respectively. The solid lines represent the mean spectra calculated over the entire two batches of the data for each cell line, and the shaded fill area corresponds to their standard deviation. An offset (black dotted line) is added in both figures for better visualization. Despite being measured at the same setup, the HepG2 spectra have shown a higher standard deviation compared to HEK293 spectra for both setups, as shown in Fig. 3. The average relative standard deviation of the HepG2 cell line is found to be ∼1.3 times the HEK293 for the POLIMI data and more than 2 times for the IPHT data. This could suggest the inherent variance present across different cell lines. Further, the spectra from the same cell line look distinct between the two setups due to the differences in the NRB signal, which, in turn, is linked to the different spectral densities of the supercontinuum in the Stokes and also due to the different BCARS processes involved (only 2-color CARS for IPHT and 2-colors and 3-colors for POLIMI).
For better correction, we extracted a reference spectrum for each image by selecting a cell-free area within each FOV. This process has been repeated for all measured cells, mainly those with easily distinguishable edges, as shown in Fig. 2. The average of the retrieved Raman spectra was calculated over the entire two batches for each cell line and each setup; the results are presented in Fig. 4. In the spectra obtained from cells measured by the POLIMI BCARS setup, a broad peak beyond 3000 cm−1 is observed, originating from the OH-stretching of water.
Despite using a reference spectrum from the cell-free region containing PBS medium, the remaining peak in the reconstructed spectra associates with a higher concentration of water from within the cells to their surroundings. Conversely, in the case of the Jena BCARS setup, the signal beyond 3000 cm−1 is notably weak, and the reconstruction through KK does not retrieve this signal. Also, the wavenumber and intensity ranges differ between both setups. Further, it is worth considering that, even though the parameters of the reconstruction are chosen the same, the different ratio between the signal and NRB of different samples can lead to differences in the reconstructed spectra, as studied in our recent work.28 Therefore, to ensure consistency and comparability across datasets from different experimental setups, we standardized the spectral data through preprocessing in the following section.
![]() | ||
Fig. 5 Mean preprocessed Raman spectra: (a) POLIMI data and (b) IPHT data. An offset value of 0.25 is added to the HepG2 spectra for better visualization. |
It is observed that the spectral variations (shaded area) for each cell line reduced after preprocessing.
In the case of spontaneous Raman data, preprocessing was performed using RamApp.32 The same preprocessing steps were applied, and the final mean processed spontaneous Raman spectra are shown in Fig. 6. It is found that variations are very low in spontaneous Raman spectra compared to the Raman spectra retrieved from the BCARS spectra. An inset is added for each cell line to visualize these small variations, as shown in Fig. 6. Overall, this preprocessing step demonstrated the minimization of spectral variations in the Raman data.
All the major vibrational spectral bands in the retrieved Raman spectra matched the spectral bands of the spontaneous Raman data. It confirms that the KK algorithms correctly retrieved all the spectral bands. However, minor variations in peak intensities are observed. For example, the intensity of the spectral band at 1451 cm−1 is higher than the peak at 1661 cm−1 for the spontaneous Raman data. Conversely, an inverse trend is observed for the Raman spectra retrieved from IPHT BACRS data. Meanwhile, both peaks demonstrate nearly identical intensities in the Raman spectra retrieved from the POLIMI BCARS data. Overall, it is noticed that the spectral variations from the spontaneous Raman measurement are lower, with a higher signal-to-noise ratio compared to the remaining two data sets.
Further, the Pearson correlation coefficient (PCC) is calculated to numerically quantify the similarity between the spectra,33,34 and the results are depicted in Fig. 7b. The retrieved PCC value of more than 0.96 indicates a substantial resemblance of the retrieved spectra to the spontaneous Raman spectra. It also ascertains the performance of the KK algorithm. Furthermore, it is evident that the two retrieved datasets exhibit a higher correlation of 0.98 with each other compared to them with the spontaneous Raman spectrum. In the next section, PCA-LDA analysis is performed to quantify the biological variance in the data.
Further, batch-out CV and 10-fold CV is considered for the analysis. In a 10-fold CV, the dataset is divided into ten folds, and the LDA model is trained on nine of these folds and tested on the remaining fold. This process is repeated ten times, with each fold serving as the test set exactly once, and finally, the average accuracy is estimated. In the case of the Batch-out CV, given the two batches of data available for each cell line, one batch is utilized for training and another for testing. Each batch is used once as testing data, and the average accuracy of the cross-validation is estimated. Initially, PCA is applied to each preprocessed dataset, and the corresponding PCs are obtained.
Fig. 8a visualizes the loadings of the first 3 PCs obtained on the preprocessed Raman data retrieved from the POLIMI CARS data. Then, these PCs, serving as features, are systematically chosen as input to construct a linear LDA classifier with a maximum limit of 50 PCs. The model's accuracy is then assessed iteratively for different numbers of PCs, as depicted in Fig. 8b. The optimal number of PCs required for the final classification model was determined using balanced accuracy as the primary metric. It is calculated as the arithmetic mean of the per-class accuracies, where the accuracy of each class is defined as the ratio of correctly predicted instances to the total number of instances in that class.
Fig. 8b illustrates the balanced accuracy estimated as a function of the number of PCs input to the LDA classifier. The dots on the graph represent balanced accuracy values, while the solid line represents a curve fitted to those values using a 3-piece linear function.29 This function utilizes a piecewise linear model with three segments to capture different trends in the data and identifies a breakpoint, as shown in Fig. 8b with a black circle. The number of PCs at this point can be considered optimal, and the corresponding accuracy is utilized for further analysis. For instance, 16 PCs are optimal for a 10-fold CV where the estimated balanced accuracy is 100%, which decreases with a further increase in PCs. In the case of batch-out CV, three PCs are found to be optimal, and the corresponding accuracy is 83.3%. This approach not only effectively balances the model complexity and performance but also mitigates the risk of overfitting, ensuring the efficiency of the LDA-based classification model.
The same procedure is applied to the remaining two datasets (1. Raman data retrieved from the IPHT CARS data and 2. Spontaneous Raman data). The corresponding loadings and balanced accuracy curves are presented in ESI Fig. 3 and 4,† respectively. The Raman data retrieved from the IPHT CARS spectra has given the accuracy of 92.7% for batch-out CV and 100% for 10-fold CV. It was also noticed that 7 PCs were found to be optimal for both CV methods. On the other hand, the accuracy of spontaneous Raman data was found to be 100% for both validation methods with optimal PCs of 12.
Further, the loadings represent the coefficients of the original variables in the PCs. Higher loading values indicate a stronger influence of the corresponding variable on the PC.
For example, both fingerprint and CH band regions exhibit higher loading values in all three PCs, as illustrated in Fig. 8a. The same scenario is observed for the remaining two datasets (1. Raman data retrieved from the IPHT CARS data and 2. Spontaneous Raman data), as shown in ESI Fig. 3a and 4a.† These loading plot visualizations underscore the significance of the fingerprint region and, particularly, the CH band region in classifying cell lines for all three datasets. This observation suggests that a PCA-LDA model based on selected spectral regions could be more effective, helping to mitigate the influence of noisy regions within the spectrum. Hence, the PCA-LDA analysis is done by considering different regions of the spectrum as an input to the model, as illustrated in Fig. 9.
Initially, the entire spectrum was divided into fingerprint (F) and CH band (CH) regions guided by the outcomes of the loading plot visualizations. The exclusion of the silent region (1800–2770 cm−1) from the analysis was imperative as it predominantly contains noise. Then, the fingerprint region (600–1800 cm−1) was subdivided into sub-spectra (S1, S2, S3, and S4), each highlighting specific peaks and having a bandwidth of 300 cm−1.
For example, the S1 region mainly contains the characteristic peak of nucleic acids at 785 cm−1 associated with the symmetric phosphodiester stretch.37 The S2 region displays three major peaks at 934, 1004, and 1094 cm−1, which are characteristic of nucleic acids and lipids.3 The last two frequencies specifically denote the ring breathing modes and phosphate backbone vibrations, respectively. The S3 region contains vibrational features of both proteins and lipids. The peak at 1240 cm−1 belongs to the Amide-III band, associated with protein content. Additionally, the peaks at 1338 and 1450 cm−1 represent the CH2 bending modes, indicating lipid content. Finally, the last region, S4, contains an Amide-I band at 1654 cm−1 with higher intensity, primarily associated with CO stretching vibrations of peptide bonds in proteins.38 Further, the CH band, in conjunction with the other regions, was also considered, resulting in F-CH, S1-CH, S2-CH, S3-CH, and S4-CH combinations In total, 11 specified regions (S1, S2, S3, S4, CH, F, F-CH, S1-CH, S2-CH, S3-CH, and S4-CH) were used as input for PCA-LDA each time, and the balanced accuracies were estimated. Finally, the results were compared with the accuracy obtained using the total data as input.
Fig. 10 illustrates the batch-out CV analysis results where the spontaneous Raman data has demonstrated superior performance compared to IPHT and POLIMI data for any spectral combination. Notably, a 100% classification accuracy was achieved for all inputs of spontaneous Raman data, excluding specific sub-spectrum regions. Even within these sub-spectrum regions, the minimum accuracy was found to be more than 96.5%. In contrast, a different scenario was observed for the two BCARS datasets, where the accuracy significantly varied based on the input data. Additionally, it was observed that the IPHT BCARS data consistently showed higher accuracy than the POLIMI data for all combinations, except in the S3 and S4 regions.
Further, the fingerprint region exhibited lower accuracy compared to the CH band. Also, any combination involving the CH band as an input demonstrated increased accuracy. For example, POLIMI data accuracy in the S1 region improved from 59.8% to 86.2% when combined with the CH band. Similarly, for IPHT data, accuracy increased from 62.5% to 97.6%, and spontaneous Raman data increased from 96.7% to 100%. This trend was consistently observed for all remaining combinations. Particularly, the S4-CH combination exhibited the highest accuracy, surpassing that achieved with the total data as an input to the model. For instance, POLIMI total data accuracy was only 83.3% and increased to 92.4% for the S4-CH combination. It is worth considering that the same scenario was noticed for the remaining two datasets (1. Raman data retrieved from the IPHT CARS data and 2. Spontaneous Raman data).
The same analysis is repeated for the 10-fold CV, and the results are presented in Fig. 11. An overall increase in accuracy is observed compared to the results obtained through batch-out CV. However, the key distinction lies in the intricacies of the 10-fold CV procedure. Due to the random partitioning of the entire dataset (two batches of data), there exists a higher likelihood of data overlap or leakage from one batch to another within the individual folds. This contrasts with the batch-out CV method, explicitly designed to mitigate biases by segregating distinct batches for training and testing, providing an unbiased and stringent estimate of the model's performance. Hence, the lower accuracy in batch-out CV is attributed to batch variance present in each cell for all three data sets. Also, similar observations were noticed in our previous works.39
In the case of spontaneous Raman data, 100% classification accuracy was obtained for all inputs except the S4 region. Even the S4 region has demonstrated an accuracy of 98.3%. Additionally, both the total data and the S4-CH combination have demonstrated 100% accuracy across all three datasets. Further, the remaining combinations involving CH also achieved 100% accuracy for Spontaneous Raman and IPHT data, with only a negligible difference observed for the POLIMI data, where the accuracy differed by approximately 1%. Overall, accuracies varied between 78–90% for the sub-spectrum regions of the POLIMI and IPHT data, with the minimum observed for the S1 region and the maximum for the S4 region. The insights derived from both CV methods demonstrate that adopting a targeted region approach, e.g., focusing solely on the CH band or the combined S4-CH region, can minimize the data acquisition times and is particularly helpful for improving the efficiency of imaging applications.
Even though the present study has supported the understanding and quantification of biological variance in BCARS data of HepG2 and HEK293 cell lines, there are avenues for further research to enhance the applicability of BCARS in biomedical investigations. First, exploring additional cell lines with more batches of the data and experimental conditions could provide a more comprehensive understanding of the generalizability of our findings. Investigating the influence of specific cellular components on spectral variations would deepen insights into the molecular basis of biological variance.
Further, it is worth considering that the variance can be introduced from the biological sample, preprocessing (including phase retrieval), and the CARS measurement devices. The present study is primarily focused on the biological aspect (which is probably higher than the other two, at least for spontaneous Raman spectra). The second aspect is also addressed by applying the proper preprocessing steps. Finally, the last part requires a ring trial like we did for Raman spectra to check the influence of the CARS device and its data processing on the spectral data, leading to an understanding of the compatibility.40,41 Also, in the future, deep learning (DL) models can be explored for efficient removal of NRB and retrieval of resonant Raman information.17,42,43 Pursuing these future directions holds the potential to refine our understanding of BCARS spectroscopy and its application in biomedical research, ultimately advancing our ability to harness spectroscopic imaging for meaningful insights into biological systems.
Footnote |
† Electronic supplementary information (ESI) available: For the Supplementary Fig. 1–4. See DOI: https://doi.org/10.1039/d4an00648h |
This journal is © The Royal Society of Chemistry 2024 |