Data augmentation method based on the Gaussian kernel density for glioma diagnosis with Raman spectroscopy

Qingbo Li; Jianwen Wang; Yan Zhou

doi:10.1039/D3AY00188A

Data augmentation method based on the Gaussian kernel density for glioma diagnosis with Raman spectroscopy

Qingbo Li,

*^a Jianwen Wang

^a and Yan Zhou*^b

Author affiliations

* Corresponding authors

^a School of Instrumentation and Optoelectronic Engineering, Precision Opto-Mechatronics Technology Key Laboratory of Education Ministry, Beihang University, Beijing 100191, China
E-mail: qbleebuaa@buaa.edu.cn

^b Department of Neurosurgery, PLA Air Force Medical Center, Beijing 100142, China
E-mail: zhouyandr@126.com

Abstract

Glioma is an intracranial malignant brain tumor with high infiltration. It is difficult to identify the glioma boundary. Raman spectroscopy can potentially detect this boundary accurately in situ and in vivo during surgery. However, when building a classification model for an in vitro experiment, fresh normal tissue is difficult to obtain. The number of normal tissues is far less than that of glioma tissues, which leads to a classification bias toward the majority class. In this study, a data augmentation algorithm GKIM based on the Gaussian kernel density is proposed for the data augmentation of normal tissue spectra. A weight coefficient calculation formula is proposed based on the Gaussian density instead of a fixed coefficient to synthesize new spectra, which increases sample diversity and improves the robustness of modeling. Additionally, the fuzzy nearest neighbor distance replaces the general fixed neighbor number K to select the original spectra for synthesis. It automatically determines the nearest spectra and adaptively synthesizes new spectra according to the characteristics of the input spectra. It effectively overcomes the problem of the newly generated sample distribution being too concentrated in specific spaces for the common data augmentation method. In this study, 769 Raman spectra of glioma and 136 Raman spectra of normal brain tissue corresponding to 205 and 37 cases, respectively, were collected. The Raman spectra of the normal tissue were extended to 600. The accuracy, sensitivity, and specificity were 91.67%, 91.67%, and 91.67%. The proposed method achieved better predictive performance than traditional algorithms for class imbalance.

This article is part of the themed collection: Analytical Methods HOT Articles 2023

Article information

https://doi.org/10.1039/D3AY00188A

Article type

Paper

Submitted

06 Feb 2023

Accepted

17 Mar 2023

First published

03 Apr 2023

Download Citation

Anal. Methods, 2023,15, 1861-1869

Permissions

Request permissions

Data augmentation method based on the Gaussian kernel density for glioma diagnosis with Raman spectroscopy

Q. Li, J. Wang and Y. Zhou, Anal. Methods, 2023, 15, 1861 DOI: 10.1039/D3AY00188A

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Analytical Methods

Data augmentation method based on the Gaussian kernel density for glioma diagnosis with Raman spectroscopy

Abstract

Article information

Download Citation

Permissions

Data augmentation method based on the Gaussian kernel density for glioma diagnosis with Raman spectroscopy

Social activity

Search articles by author

Spotlight

Advertisements