FTIR-based machine learning for prediction of malignant transformation in oral epithelial dysplasia

Rong Wang *a, Roya Sabzian b, Tanya M. Gibson c and Yong Wang a
aDepartment of Oral and Craniofacial Sciences, School of Dentistry, University of Missouri Kansas City, Kansas City, MO, USA. E-mail: wangrong@umkc.edu; wangyo@umkc.edu; Tel: +816-235-2176
bDepartment of Restorative Dentistry, Rutgers School of Dental Medicine, Newark, NJ, USA. E-mail: Rs2430@rsdm.rutgers.edu
cDepartment of Oral Pathology, Radiology & Medicine, School of Dentistry, University of Missouri Kansas City, Kansas City, MO, USA. E-mail: gibsontm@umkc.edu

Received 30th January 2025 , Accepted 8th May 2025

First published on 14th May 2025


Abstract

Oral squamous cell carcinoma (OSCC) is an aggressive cancer with a poor prognosis. Oral epithelial dysplasia (OED) is a precancerous lesion associated with an increased risk of malignant transformation (MT) into OSCC. However, current histopathological methods for diagnosing OED are subjective and ineffective in assessing MT risk. FTIR provides a comprehensive biochemical profile of tissues, known as “biomolecular fingerprinting”. Previously we developed an FTIR-based OSCC-Benign classifier that accurately distinguishes OSCC from benign tissue. In this study, we evaluated whether this classifier could also predict MT risk in OED. Thirty OED patient biopsies with documented MT outcomes were analyzed, including 12 with and 18 without MT. FTIR images were acquired from six regions of interest (ROIs) per tissue section, yielding an average epithelial spectrum for each ROI, and a total of 180 spectra for model evaluation. The OSCC-Benign classifier achieved an accuracy of 81.7% with an F1 score of 0.77 at the ROI level, and an accuracy of 83.3% with an F1 score of 0.8 at the biopsy level in predicting MT in OED. Our findings suggest that OEDs with biomolecular fingerprints similar to OSCC carry a higher risk of MT, while those resembling benign tissue carry a lower risk, providing new insight into the malignant transformation process. In summary, the FTIR-based machine learning approach outperforms traditional histopathology in predicting MT risk in OED, potentially offering a quantitative and objective tool for clinical diagnosis.


Introduction

Oral cancer refers to a subgroup of head and neck malignancies that affect various intra-oral sites, including the lips, tongue, gingiva, floor of the mouth, and buccal mucosa. It poses a significant global health concern, with an estimated 377[thin space (1/6-em)]713 new cases and 177[thin space (1/6-em)]757 deaths reported in 2020, predominantly in low- and middle-income countries.1 More than 90% of oral cancers are oral squamous cell carcinoma (OSCC). Despite the accessibility of the oral cavity for clinical examination and advancements in therapeutic strategies, the overall five-year survival rate for OSCC remains low, ranging from 50% to 60%. Early detection is critical for improving patient prognosis and survival outcomes.2,3

Oral carcinogenesis is a highly complex, multifactorial, and multistep process. It often begins with benign hyperplasia/hyperkeratosis, progressing to oral epithelial dysplasia (OED), carcinoma in situ, and finally invasive OSCC.4 OED, a microscopically diagnosed precancerous lesion, is associated with an increased risk of malignant transformation (MT) into OSCC. It is characterized by cytological and architectural alterations that disrupt the normal maturation and stratification of the epithelium. An OED can be graded as mild, moderate, or severe based on the three-tier classification system established by the World Health Organization (WHO).5 It has been reported that the likelihood of MT varied by OED grade, with an estimated rate of 7%–50% for severe, 3%–30% for moderate, and <5% for mild OED.6–8 Although these histological grades statistically correlate with clinical outcomes, the WHO's gold standard diagnostic system is subjective and ineffective in predicting MT risk in individual OED patients.9,10 To overcome these limitations, researcher have proposed integrating clinical and molecular biomarkers into the diagnostic process; however, none have achieved the desired clinical effectiveness.10,11

Fourier transform infrared (FTIR) spectroscopy is a versatile analytical technique used to characterize molecular structures by analyzing how chemical bonds interact with infrared radiation. This technique generates a unique spectral “fingerprint” that provides qualitative and quantitative insights into the biochemical composition of a specimen, including proteins, nucleic acids, lipids, and carbohydrates.12,13 When combined with light microscopy, FTIR imaging enables spatially resolved biomolecular analysis producing a three-dimensional hyperspectral dataset. Due to its label-free, non-invasive, and highly sensitive nature, FTIR spectroscopy and imaging techniques have been explored for the detection and diagnosis of various cancers.14–17 FTIR imaging generates high-dimensional data, often requiring multivariate chemometric and machine learning methods for data analysis and interpretation.17

In a previous study, we applied FTIR imaging and machine learning to differentiate between OSCC, OED and benign (hyperkeratosis) oral biopsy tissues. We developed three supervised discriminant models – partial least squares discriminant analysis (PLSDA), support vector machines discriminant analysis (SVMDA), and extreme gradient boosting discriminant analysis (XGBDA) – to discriminate OSCC from benign tissues. Using 10-fold Venetian blinds cross-validation, we reported 100% sensitivity and specificity for the PLSDA model, compared to 95% sensitivity and 96% specificity for the SVMDA and XGBDA models.18 These preliminary results were very encouraging, though the potential risk for overfitting remains a concern due to the limited sample size. Exploratory analysis of the OED biopsy samples (hereafter referred to as biopsies) revealed partial spectral overlap with both OSCC and benign tissues. When evaluated using the OSCC-Benign PLSDA model, some OED spectra were classified as OSCC-like, while others Benign-like. Since clinical outcomes for these OED biopsies were previously unknown, we hypothesized that OED spectra classified as OSCC-like indicate a higher risk of MT, whereas those classified as Benign-like suggest a lower risk.

In the current study, to test this hypothesis, we analyzed 30 OED patient biopsies with documented clinical outcomes, including 12 that underwent MT and 18 that did not. The original histopathological diagnoses of these biopsies were verified by a collaborating oral pathologist and used as the “ground truth” for validation. FTIR images were acquired from six regions of interest (ROIs) per biopsy, and the corresponding average epithelial spectra were computed, resulting in a dataset of 180 spectra labeled according to the verified MT outcomes. Using this OED FTIR dataset as a test set, we assessed the feasibility and effectiveness of the OSCC-Benign PLSDA model for predicting MT risk in OED. In comparison, we also trained a new PLSDA model using the OED FTIR dataset itself and evaluated its performance using cross-validation.

Experimental

Fig. 1 illustrates the schematic of the experimental workflow, which is described in detail in the following sections. All experiments were performed in accordance with the Ethical Principles and Guidelines for the Protection of Human Subjects of Research, and were approved by the Institutional Review Board of the University of Missouri–Kansas City (UMKC) under project number 2087842. Patient identifiers were removed prior to experimentation to ensure privacy and confidentiality.
image file: d5an00117j-f1.tif
Fig. 1 Schematic illustration of the experimental workflow. Two consecutive thin sections of FFPE (formalin-fixed paraffin-embedded) oral biopsy tissues were subject to H&E staining and FTIR imaging. Machine learning analyses were performed on FTIR spectra from the OED biopsies to assess their MT risk.

Patient biopsy identification and sample preparation

The patient database in the UMKC School of Dentistry Pathology Department was searched to identify suitable biopsy samples for the study. OED patient biopsies with documented MT were selected based on an initial diagnosis of dysplasia followed by a subsequent OSCC diagnosis in the same patients. OED biopsies without MT were selected based on an initial dysplastic diagnosis followed by a later diagnosis that did not progress to OSCC. Since the timeline for malignant transformation varies, definitive determination of OED cases without MT remains challenging. Considering the reported average transformation time of 1.82 ± 1.55 years for OED,19 we used a practical selection criterion requiring a minimum two-year interval between the initial and follow-up diagnoses for cases without MT. Consequently, 30 formalin-fixed paraffin-embedded OED blocks were identified from the Pathology Department, including 12 with MT from 9 patients and 18 without MT from 17 patients. Four patients had multiple OED diagnoses over time before the final diagnosis. These biopsies were included in our investigation as independent samples and are discussed in groups by patients. De-identified demographic and diagnostic information was provided by the collaborating oral pathologist and summarized in Table 1.
Table 1 Deidentified demographic and diagnostic information for the thirty OED patient biopsies and ROI-level risk prediction by the OSCC-Benign PLSDA model
Biopsy information Initial diagnosis Follow-up diagnosis Model prediction
ID Patient birth year Sex Clinical description Histological diagnosis Clinical description Histological diagnosis ROI-level
Location Location
Month/year Month/year
HK: hyperkeratosis, VHK: verrucous hyperkeratosis, SCC: squamous cell carcinoma, sf-invasive: superficially invasive, EH: epithelial hyperplasia, acan: acanthosis, atyp: atypic, epi: epithelial, mod: moderate, sev: severe, w: with, diff: differentiated.
OED biopsies with MT
05 1930 F Leukoplakia, ulceration Mod to sev OED Leukoplakia, ulceration Mod to well diff. SCC 6/6 low risk
Lateral/ventral Tongue Lateral tongue
10/2014 03/2015
06 1945 F Not given VHK w mod OED erythro-Leukoplakia Sf-invasive SCC 5/6 high risk
Buccal vestibule Retromolar region 1/6 low risk
10/2014 08/2019
94 1945 F Not given HK & acan w mod OED erythro-Leukoplakia Sf-invasive SCC 6/6 high risk
Retromolar pad area Retromolar region
09/2018 08/2019
42 1952 F erythro-Leukoplakia Mod to sev OED erythro-Leukoplakia Mod to well diff. SCC 5/6 high risk
Palatal papilla Buccal mucosa 1/6 low risk
09/2016 04/2018
46 1932 F Granular tissue mass Atyp EH w OED Raised dense tissue Well diff. SCC 6/6 high risk
Buccal mucosa Edentulous ridge & buccal mucosa
12/2016
03/2017
51 1939 M Mixed erythro-leukoplakia Atyp EH w sev OED Ulceration Mod diff. SCC 5/6 high risk
Anterior maxilla Palatal mucosa & gingival margin 1/6 low risk
02/2017
01/2018
55 1939 M erythro-Leukoplakia Atyp EH w sev OED Ulceration SCC 5/6 high risk
Anterior maxilla Palatal mucosa & gingival margin 1/6 low risk
04/2017
01/2018
63 1952 M Hyperkeratosis leukoplakia Mod OED Leukoplakia SCC 6/6 low risk
Lateral posterior tongue Anterior floor of mouth
11/2017 07/2018
67 1963 F Leukoplakia Mod OED Leukoplakia Sf-invasive SCC 5/6 high risk
Lateral tongue Left tongue 1/6 low risk
01/2018 05/2020
83 1963 F Leukoplakia Mild to mod OED Leukoplakia Sf-invasive SCC 6/6 high risk
Ventral surface tongue left tongue
03/2018 05/2020
80 1959 F erythro-Leukoplakia Atyp EH w OED Erythroplakia ulceration Mod to well diff. SCC 6/6 high risk
Gingiva and mucosa Buccal gingiva & mucosa
03/2018
03/2018
90 1945 F erythro-Leukoplakia Sev OED Mass with rolled borders Well diff. SCC 6/6 high risk
Buccal gingiva Gingiva
07/2018 12/2019
OED Biopsies without MT
04 1960 F Multiple leukoplakia Mod OED Rough leukoplakia HK w mild OED 6/6 low risk
Left upper vestibule Buccal gingiva
07/2014 08/2018
02 1967 M Leukoplakia Mild OED Leukoplakia Mild OED 6/6 low risk
Lateral tongue Lateral tongue
07/2014 06/2018
08 1967 M Leukoplakia Mild to mod OED Leukoplakia Mild OED 6/6 low risk
Lateral tongue Lateral tongue
11/2014 06/2018
09 1963 M erythro-Leukoplakia Mild to mod OED Erythroplakia ulceration Sev OED 6/6 low risk
Buccal mucosa Left cheek
11/2014 10/2018
10 1951 M Leukoplakia Mild to mod OED Leukoplakia Mod OED 6/6 low risk
Lateral tongue Lateral tongue
11/2014 02/2018
11 1983 F Leukoplakia VHK w mild OED Leukoplakia VHK w mild OED 6/6 low risk
Lingual Lingual
12/2014 12/2018
12 1960 M Leukoplakia Mild OED Leukoplakia Mild OED 6/6 low risk
Lateral border tongue Ventral lateral border
02/2015 02/2020
17 1982 M Not given Mod to sev OED Erythroplakia Mod to sev OED 5/6 high risk
Lateral tongue Lateral tongue 1/6 low risk
06/2015 01/2021
20 1940 F Leukoplakia w hyperkeratosis VHK w epi atyp Raised leukoplakia VHK 6/6 low risk
Palatal Apical gingival
09/2015 11/2021
41 1934 F Multiple leukoplakia VHK w mod OED Leukoplakia Mild OED 5/6 low risk
Left cheek, buccal & lingual upper left Left buccal mucosa 1/6 high risk
11/2020
08/2016
50 1936 F Leukoplakia w ulceration Mod OED Hyperplastic tissue Mild to mod OED 6/6 low risk
Buccal gingiva Buccal
01/2017 06/2019
54 1967 M Leukoplakia Mod OED Leukoplakia Mild to mod OED 5/6 high risk
Left tongue Lateral tongue 1/6 low risk
04/2017 01/2020
56 1952 M Leukoplakia VHK w mod OED Leukoplakia HK w epi atyp 6/6 low risk
Facial mucosa Edentulous site
06/2017 04/2020
59 1951 F Granular Mild OED Leukoplakia HK & acan. w epi atyp 5/6 low risk
Marginal gingiva Marginal gingiva 1/6 high risk
07/2017 08/2019
62 1942 M Verrucous leukoplakia HK & acan. w epi atyp Leukoplakia HK & acan. w epi atyp 6/6 low risk
Facial Gingival margin
08/2017 07/2020
70 1952 M Erythroplakia Mild OED erythro-Leukoplakia Mild to mod OED 6/6 low risk
Buccal mucosa Left buccal mucosa
01/2018 07/2021
86 1956 M Leukoplakia Mild OED Leukoplakia VHK 6/6 low risk
Palatal vault Gingiva
04/2018 07/2021
89 1957 M erythro-Leukoplakia Sev OED Rough leukoplakia Sev OED 3/6 high risk
Gingiva papilla Lingual 3/6 low risk
07/2018 03/2021


Two 4-micron-thick sections were cut from each block using a microtome (Leica RM2125, RTS, Leica Biosystems Inc., Buffalo Grove, IL, USA). One section was affixed onto a positively charged glass slide (Cat. No.: 22034979, Fisher Scientific, Waltham, MA, USA) for Hematoxylin and Eosin (H&E) staining and histological evaluation. The adjacent section was placed on an IR transparent BaF2 substrate measuring 25 × 2 mm (REFLEX Analytical Corporation, Ridgewood, NJ, USA) for FTIR imaging. The H&E-stained sections were digitally scanned at ×40 magnification using a Leica Aperio CS2 digital scanner (Leica Biosystems W. Lake Cook Road, Deer Park, IL, USA) and sent to our collaborating oral pathologist for verification. Then the digital H&E images were manually annotated for the dysplastic regions using the open-source software QuPath (v0.4.0) and used as references for ROI selection during FTIR imaging. The biopsy tissue sections on the BaF2 substrate underwent deparaffinization by immersion in histological grade xylene (CAS number 1330-20-7, Sigma-Aldrich, St Louis, MO, USA) three times for five minutes each at room temperature. The deparaffinized sections were air-dried and stored in a vacuum desiccator for FTIR imaging within a week.

FTIR image acquisition

FTIR images were acquired in transmission mode using a PerkinElmer FTIR Spectrum Spotlight imaging system (Spectrum one, Spotlight 300, PerkinElmer, Waltham, MA, USA). The Spotlight 300 imaging system is equipped with a dual-mode detector featuring a 1 × 16 narrow band mercury cadmium telluride (MCT) array and 100 μm medium band MCT single point detector, operating at liquid nitrogen temperature. The FTIR image acquisition parameters were set as follows: spectral resolution 4 cm−1, spectral range 4000–950 cm−1, pixel resolution 6.25 μm, and co-adding number 16 per pixel. First, a survey image was generated using the built-in light microscope in the FTIR Spotlight system for the tissue section. Subsequently, more than six ROIs in the size of 400 μm × 100 μm (64 × 16 pixels) were defined on the survey image, in reference to the annotated digital H&E image. The ROIs were selected to encompass the basal epithelium and adjacent stroma, while avoiding areas with poor structural integrity. Before acquiring FTIR images for each ROI, a background spectrum was obtained from a clean area of the substrate outside the tissue section for background correction. FTIR image acquisition was carried out using the Spectrum IMAGE software by PerkinElmer.

Data analysis

The same spectral preprocessing, outlier removal, and clustering procedures used in the development of the OSCC-Benign models in our previous study were used in the current study and are briefly described below.18 The data analysis and machine learning modeling were performed using the Eigenvector PLS_Toolbox 9.3 and MIA_Toolbox 3.1 (Eigenvector research incorporated, Inc., Manson, WA, USA) in MATLAB (R2020b, MathWorks).

Spectral preprocessing

Raw hyperspectral images were preprocessed as follows: 1, transmission to absorbance conversion (A = log(1/T)); 2, fingerprint region selection (1800–950 cm−1); 3, Savitzky–Golay smoothing (order = 2, window = 13 pt, tails: polyinterp); 4, EMSC (extended multiplicative signal correction) for light-scattering; 5, baseline correction using automated weighted least squares (order = 3); and 6, vector normalization (length = 1). During model training, validation and testing, a second derivative transformation (order = 2, window = 7 pt) was further applied to the input spectral data to enhance model performance.

Outlier removal

Outliers may be present in the pixel-level spectral data, particularly from areas with compromised tissue quality or integrity. To address this, unsupervised principal component analysis (PCA) was performed after spectral preprocessing to identify and remove outliers. PCA transforms the original coordinate system into a new one defined by principal components (PCs) that capture the most significant variance in the dataset. For each ROI image dataset, a Reduced Hotelling's T2versus Reduced Q residual scatter plot was generated. Spectra with values exceeding 1 in either Reduced Hotelling's T2 or Reduced Q residual were flagged, examined, and subsequently removed to ensure data integrity.

Representative spectra extraction

Each ROI typically contain both epithelial and stromal tissues. To distinguish FTIR spectra of different tissue components, unsupervised hierarchical cluster analysis (HCA) was employed to cluster pixel-level spectra in each ROI using Ward's method.20 The histological identity of each cluster was determined by referencing the corresponding H&E-stained section. Then the average spectra from epithelial clusters in each ROI were extracted and used as the representative spectra for subsequent analyses. Not all acquired ROI images yielded high quality spectral data. To ensure data reliability, six high-quality ROIs were randomly selected from each biopsy, resulting in a total of 180 representative spectra from thirty OED biopsies, which were combined into one OED FTIR dataset for further analysis. Each spectrum was assigned a class label according to the MT outcome of the corresponding patient (1: with MT; 2: without MT). The class-average spectra of OED with and without MT were then compared to those of benign and OSCC tissues from our previous study to identify spectral differences.

Fig. 2 illustrates an overview of the process, including ROI selection, FTIR image acquisition, spectral preprocessing, HCA clustering, and the extraction of average epithelial spectra.


image file: d5an00117j-f2.tif
Fig. 2 Illustration of (a) ROI selection and FTIR image acquisition, (b) spectral preprocessing, (c) FTIR image clustering and average epithelial spectrum extraction.

Model evaluation

We previously developed three discriminant models, PLSDA, SVMDA, and XGBDA, to differentiate average epithelial spectra between eleven OSCC and twelve benign oral tissue biopsies. Based on 10-fold Venetian blinds cross-validation, the PLSDA model achieved 100% sensitivity and specificity, while the other two models generated 95% sensitivity and 96% specificity.18 PLSDA is a supervised machine learning technique that extends Partial Least Squares (PLS) regression by incorporating linear discriminant analysis (LDA) for classification. Instead of predicting a continuous response, PLSDA uses a categorical response matrix (Y) to represent class labels. The algorithm identifies latent variables (LVs) from the spectral dataset (X) that best differentiate the classes while maximizing covariance between X and Y. The OSCC-Benign PLSDA model consisted of four LVs, explaining 94.50%, 4.48%, 0.38% and 0.29% of the spectral variance between OSCC and benign tissues, respectively. In the current study, we tested the OSCC-Benign PLSDA model using the OED FTIR dataset to evaluate its performance in predicting MT risk in OED. Under our hypothesis, an OSCC-class prediction indicated high MT risk, while a Benign-class prediction suggested low risk. The dataset contained 72 spectra from twelve OED biopsies with MT and 108 spectra from eighteen OED biopsies without MT, resulting in a class ratio of 40/60 or 2[thin space (1/6-em)]:[thin space (1/6-em)]3. Model performance at the ROI level was assessed using ROC (receiver operating characteristic) curves with AUC (area under the curve) scores, confusion matrices, sensitivity, specificity, accuracy, precision, and F1 scores.21 To determine MT risk at the biopsy level, we applied a majority voting rule: if at least three out of six ROI spectra from a biopsy were classified as high-risk, the entire biopsy was labeled as high-risk. This approach enabled estimation of the OSCC-Benign PLSDA model's predictive performance at the biopsy level.

Meanwhile, we trained another OED PLSDA model using the 180 OED spectra and evaluated its performance. We employed 10-fold Venetian blinds cross-validation for hyperparameter tunning and model performance evaluation. To ensure the rigor of model training and evaluation, we assigned all six spectra from each OED biopsy to a single blind so that they remained together during the split. Model performance at the ROI-level was evaluated using the same metrics as before. However, due to cross-validation and potential variations in ROI-level predictions across iterations, estimating biopsy-level prediction accuracy was not performed. The top four LVs of the OED model were identified and compared to those of the OSCC-Benign model.

Results

Fig. 3 presents a comparison of class average spectra for Benign, OSCC, and OED biopsies with and without MT, highlighting spectral differences mainly in four regions: around 1650 cm−1, 1600–1500 cm−1, 1350–1180 cm−1, and 1160–950 cm−1. The 1650 cm−1 band corresponds to the amide I band of proteins, mainly associated with C[double bond, length as m-dash]O stretching vibrations in the peptide backbone. The comparison revealed a descending trend in Amide I band intensity in the order of Benign > OED-without-MT > OED-with-MT/OSCC, with OED-with-MT exhibiting a band intensity closely resembling that of OSCC. The 1600–1500 cm−1 region represents the amide II band of proteins, mainly linked to the bending vibration of N–H bond and the stretching vibration of C–N bond in the peptide backbone. The comparison showed a descending band intensity at 1548 cm−1 following the order Benign > OED-without-MT > OED-with-MT > OSCC. Additionally, band broadening was observed in the order OSCC/OED-with-MT > OED-without-MT > Begin, along with a red shift toward lower wavenumbers from 1548 cm−1 to 1500 cm−1, following the trend OSCC > OED-with-MT > OED-without-MT/Benign. The spectral region of 1350–1180 cm−1 is attributed to the amide III band of proteins (1350–1250 cm−1), the asymmetric vibration of –PO2 (1240 cm−1) and the deformational modes of the CH3/CH2 groups in phospholipids and nuclei acids. The comparison disclosed a descending band intensity at 1310 cm−1 in the order of Benign > OED-without-MT > OED-with-MT/OSCC, with OED-with-MT closely resembling OSCC. Similarly, the band at 1240 cm−1 exhibited decreasing intensity in the order OSCC > OED-with-MT > OED-without-MT > Benign, accompanied by a red shift in the same order. The spectral region of 1160–950 cm−1 is attributed to the stretching vibrations of the C–O/C–C groups in carbohydrates (e.g., glycogen) at 1154 and 1030 cm−1, as well as the symmetric vibration of –PO2 in phospholipids and nuclei acids at 1080 cm−1. The comparison indicated a descending band intensity at 1080 cm−1 following the order OSCC/OED-with-MT > OED-without-MT > Benign, with OED-with-MT closely resembling OSCC.12,22–25
image file: d5an00117j-f3.tif
Fig. 3 Comparison of class average FTIR spectra in the fingerprint region (1800–950 cm−1) for Benign, OSCC, OED with and without MT. Inserts show the enlarged spectral regions for (a) Amide I, (b) Amide II, (c) Amide III, DNA/RNA, and (d) Glycogen, DNA/RNA.

Table 1 summarizes the demographic and diagnostic information for thirty OED biopsies from twenty-six patients. Notably, biopsy pairs #06 and #94, #51 and #55, #67 and #83, and #02 and #08 represent biopsies from the same patients. Among the twenty-six patients, there were an equal number of males and females, with thirteen of each. The patients’ ages ranged from 35 to 86 years at the time of their follow-up diagnoses. The time interval between the initial and follow-up diagnoses for the OED with MT ranged from three to fifty-eight months, with one exception. Biopsy #80 had an OSCC diagnosis within one month of the initial OED diagnosis, which may indicate a misdiagnosis due to mis-sampling, as moderate or severe dysplasia may occur near carcinoma. Nonetheless, we retained this biopsy to evaluate whether our approach could identify such misdiagnoses. The time interval between diagnoses for the OED without MT ranged from twenty-five to seventy-four months. The MT risk prediction for each of the six ROIs in the OED biopsies, as determined by the OSCC-Benign PLSDA model, is shown in the last column. Using the majority voting rule, our analysis identified five OED biopsies with incorrect predictions, including two false positives and three false negatives, as indicated by the grey rows in Table 1.

Fig. 4 illustrates the MT risk probability distribution for the 180 OED representative spectra predicted by the OSCC-Benign PLSDA model. Each biopsy is represented by six consecutive markers of the same color and shape. OED spectra with and without MT are represented in solid and empty markers respectively. The two false positives and three false negatives are indicated by empty and solid red arrows respectively. Among the thirty OED biopsies, twenty exhibited a homogeneous risk distribution across all six ROIs, nine demonstrated consistent risk predictions for five of the six ROIs. One particular biopsy #89 showed an equal number of high-risk (three) and low-risk (three) ROIs. Based on the predefined majority voting rule, biopsy #89 was assigned as high-risk for MT.


image file: d5an00117j-f4.tif
Fig. 4 MT risk probability for 180 representative FTIR spectra from 30 OED biopsies, predicted by the OSCC-Benign PLSDA model. Each biopsy is represented by six consecutive markers in the same color and shape. Red empty and solid arrows indicate false positive and false negative predictions, respectively.

Fig. 5 summarizes the performance of the OSCC-Benign PLSDA model and the OED PLSDA model for MT risk prediction. Fig. 5a shows the ROC curve with AUC score and confusion matrix for testing 180 OED representative spectra using the OSCC-Benign PLSDA model. The AUC score was 0.89, and the confusion matrix indicated a sensitivity of 76.4%, specificity of 85.2%, accuracy of 81.7%, precision of 77.5%, and an F1 score of 0.77 at the ROI level. When applying the majority voting rule, the OSCC-Benign PLSDA model achieved sensitivity, specificity, and accuracy of 83.3%, precision of 76.9%, and an F1 score of 0.8 at the biopsy-level. Fig. 5b shows the cross-validation ROC curve with AUC score and confusion matrix for the new OED PLSDA model. The AUC score was 0.85, and the confusion matrix indicated a sensitivity of 76.4%, specificity of 88.9%, accuracy of 83.9%, precision of 82.1%, and an F1 score of 0.79 at the ROI level. Among the evaluation metrics, both F1 and AUC scores provide insights into a model's accuracy. While the accuracy score indicates the proportion of correct predictions at a particular decision threshold, the AUC provides a broader evaluation of model performance across different thresholds. The AUC is an effective combined measure of sensitivity and specificity, reflecting the inherent validity of a diagnostic test. An AUC score of 0.8–0.9 indicates very good diagnostic accuracy.26 The F1 score, which combines precision and recall (sensitivity), is particularly useful for evaluating models with imbalanced classes. In the current study, with a class ratio of 2[thin space (1/6-em)]:[thin space (1/6-em)]3, the F1 score may provide a more reliable evaluation than the accuracy score. Fig. 5c provides a summary of the performance metrics, including sensitivity, specificity, accuracy, precision, F1 and AUC scores, of the two PLSDA models. For the OED PLSDA model, the Root Mean Square Error of Calibration (RMSEC) was 0.29 and the Root Mean Square Error of Cross-Validation (RMSECV) was 0.39.


image file: d5an00117j-f5.tif
Fig. 5 Model performance in ROC with AUC and confusion matrix for MT risk prediction in OED at the ROI-level for the (a) OSCC-Benign PLSDA model, in which the OED FTIR dataset was used as a test set, and (b) OED PLSDA model, which was trained using the OED FTIR dataset with 10-fold veteran blinds cross-validation. (c) A summary of performance metrics for both models.

Fig. 6 shows the top four LVs of the two PLSDA models in their second derivative format. In the OSCC-Benign PLSDA model, the loadings of LV1, LV2, LV3, and LV4 explained 94.50%, 4.48%, 0.38% and 0.29% of the spectral variations between OSCC and benign tissues, respectively. In the OED PLSDA model, the loadings of LV1, LV2, LV3, and LV4 explained 96.32%, 2.27%, 0.20% and 0.17% of the spectral variations between OED biopsies with and without MT, respectively. A visual comparison shows that LV1 and LV2 are very similar in both models, collectively explaining 99.0% of the spectral variations between OSCC and benign tissues, as well as 98.5% of the spectral variations between OED tissues with and without MT.


image file: d5an00117j-f6.tif
Fig. 6 Comparison of the top four latent variables (LVs) between the (a) OSCC-Benign PLSDA model and (b) OED PLSDA model. The LVs are in second derivative form of the FTIR spectra.

Eleven prominent FTIR bands were identified from the top two latent variable loadings (LV1 and LV2) of both PLSDA models as discriminative spectral biomarkers for differentiating OED biopsies with and without MT, as summarized in Table 2. These discriminative bands are attributed to alterations in the quantities and molecular structures of proteins (1670 cm−1, 1654 cm−1, 1550 cm−1, 1516 cm−1, 1080 cm−1, 970 cm−1), nucleic acids (1237 cm−1, 1080 cm−1, 970 cm−1), lipids (1704 cm−1, 1482 cm−1), and carbohydrates (1080 cm−1, 1428 cm−1, 1024 cm−1) within the tissues.12,24

Table 2 Important FTIR spectral biomarkers for differentiating between OEDs with and without MT
Wavenumber (cm−1) Vibrational modes and biochemical assignments
1704 Ester carbonyl C[double bond, length as m-dash]O stretching, fatty acid esters, lipids
1670 Amide I, secondary structure of proteins
1654 C[double bond, length as m-dash]O stretching of Amide I
1550 C–N & CN–H stretching of amide II
1516 Amide II, secondary structure of proteins
1482 Deformation vibrations of –CH3, lipid
1428 CH2 bending (polysaccharides, cellulose)
1237 Asymmetric phosphodiester stretching νas (–PO2), lipid, DNA/RNA, Amide III (C–N stretching, N–H bending)
1080 Symmetric phosphodiester stretching νs. (–PO2), protein phosphorylation, phospholipids, collagen, DNA/RNA
1024 Glycogen (C–O stretching associated with glycogen)
970 C–O stretching of the phosphorylated proteins, DNA/RNA


Discussion

OED is a histologically defined oral lesion that carries an increased risk of progression to OSCC. The primary goal of OED grading is to evaluate tissue abnormalities and assess a patient's risk of malignant transformation. The current gold standard for OED diagnosis is the WHO's three-tier system, which grades dysplastic lesions as mild, moderate, or severe based on histological features. Moderate or severe OED has been shown to be statistically associated with a greater risk of MT compared to mild OED, with an odds ratio of 2.4 (99% CI 1.5–3.8).27 However, there are several issues with the WHO diagnostic system, including substantial subjectivity and wide inter- and intra-observer variability, especially for moderate OED.28 To address these issues, Kujan et al. proposed a binary grading system that categorizes dysplastic lesions as either low or high grade using morphological criteria similar to those of the WHO system.29 Although some studies have suggested that the binary system improves observer consistency compared to the three-tier system,30,31 a recent meta-analysis indicated that both grading systems exhibited limited effectiveness in predicting MT in individual OED patients.32 In some cases, lower-grade lesions may progress to OSCC, while higher-grade lesions may remain unchanged.33,34 Additionally, patients without an OED diagnosis may still develop OSCC.35

At the molecular level, it remains unclear why some OED lesions progress to OSCC while others don't, and the mechanisms driving OED progression are still poorly understood.36 Oral carcinoma is thought to arise from the progressive accumulation of molecular changes, with the final change establishing the cancer genotype and triggering invasion.37 The process of MT may involve a range of genetic and molecular alterations, including enhanced oncogene activity, reduced tumor suppressor gene function, changes in cell cycle regulators and growth factors, as well as alterations in protein phosphorylation, mitosis, and cell proliferation.38–41 However, research on genetic changes in dysplastic lesions and OSCC has yielded conflicting results. While some research identified shared genetic changes between dysplastic lesions and carcinoma samples, other studies showed that molecular changes in precancerous lesions had little relationship to those in subsequent carcinomas.42–45 Many molecular biomarkers have been studied for their potential in predicting MT in OED. These include biomarkers of cell proliferation and apoptosis (e.g., Ki-67 and p53), cell adhesion and junction proteins (e.g., podoplanin and S100A7), immune and inflammatory markers (e.g., CD68 and PD-1), as well as biomarkers associated with genomic instability, cancer stem cells and miRNAs.46,47 Unfortunately, no single biomarker has demonstrated sufficient clinical reliability for OED diagnosis.48 Multi-omics approaches offer new opportunities to integrate data across different cellular function levels, including genomics, epigenomics, transcriptomics, proteomics, metabolomics, and microbiomes, to enhance our understanding of complex diseases such as oral cancer and precancerous oral lesions. Despite their promise, clinical translation and implementation of multi-omics approaches face significant challenges, including the need for specialized infrastructure, advanced computational resources, and the high costs associated with data generation and analysis.49

We propose the integration of FTIR with machine learning to address the challenges in OED diagnosis. One notable advantage of the FTIR technique is its capability to provide a comprehensive molecular fingerprint of a biological sample, capturing information across the entire “omics” landscape simultaneously. In a previous study, we applied FTIR to evaluate oral biopsy tissues with histological diagnoses of hyperkeratosis (benign), moderate-to-severe dysplasia, and OSCC. We developed discriminant models that accurately differentiated epithelial FTIR spectra between OSCC and benign tissues. Interestingly, we observed that some OED epithelial spectra closely resembled OSCC, while others were more similar to benign tissue. This led us to hypothesize that the OSCC-like subgroup of OED carries a high risk of MT, while the Benign-like subgroup carries a lower risk. In the current study, we tested this hypothesis by applying the OSCC-Benign PLSDA model to a labelled OED FTIR dataset with documented MT outcomes. The model demonstrated good predictive performance, with a sensitivity of 76.4%, specificity of 85.2%, precision of 77.5%, F1 score of 0.77, and accuracy of 81.7% at the ROI-level. Applying the majority voting rule for each biopsy further enhanced model performance, achieving sensitivity, specificity and accuracy of 83.3%, precision of 76.9%, and an F1 score of 0.8 at the biopsy level. In contrast, the gold standard WHO three-tier system has been reported to provide only 24–40% accuracy in predicting MT in OED.32,50 Similarly, a meta-analysis of the histological binary grading system showed that the MT rates for low-grade and high-grade OED cases were 12.7% and 57.9%, respectively.51 When comparing our model's predictions with WHO histological grading, all eight mild OED biopsies were classified as low-risk, while five out of six severe or moderate-to-severe OED biopsies (except for #05) were classified as high-risk for MT. This indicates a strong correlation between histological grading and our model's predictions, particularly for mild and severe/moderate-to-severe OED biopsies. Moderate OED diagnosis presents the greatest challenge within the WHO three-tier system, often exhibiting poor inter-observer agreement and a lack of clear clinical management guidelines (28, 52). In our study, out of the 12 moderate/mild-to-moderate OED biopsies, our model correctly classified ten cases (four high-risk and six low-risk), achieving an accuracy of 83%. This suggests that our FTIR-based machine learning approach could offer a more objective and reliable method for risk assessment and management of moderate OED biopsies.

At the biopsy-level, our model produced two false positives and three false negatives. Specifically, two OED biopsies #05 and #63, from patients with MT, were predicted as low risk. This discrepancy may result from mis-sampling during the initial biopsy procedure or inaccuracy of the model. On the other hand, three OED biopsies #17, #54, and #89 from patients without MT were predicted as high-risk, which may be due to inaccuracy of the model, possible clinical interventions following the initial OED diagnoses, or a potential MT process beyond the follow-up period. Four of the five misdiagnosed biopsies had false predictions from all (6/6) or the majority (5/6) of the ROIs, while one misdiagnosed biopsy (#89) had false predictions from half (3/6) of the ROIs, suggesting biochemical heterogeneity within some OED tissues. Tissue heterogeneity will be the subject of our future studies to better understand how it influences the prognosis and risk assessment of OED.

In the literature, we found two other studies from the same research group that also investigated an FTIR-based machine learning approach for MT prediction in OED. Ellis et al. applied principal component analysis followed by linear discriminant analysis (PCA-LDA) to FTIR spectra from 17 OED biopsies and produced a mean sensitivity of 79% and specificity of 76% in MT prediction.52 The authors subsequently conducted a follow-up study on the same dataset using a different machine learning algorithm (MLA) and generated a mean sensitivity of 84% and specificity of 79% in predicting the MT of OED.53 Our study differs from these two studies in several key aspects, including sample size (30 vs. 17), FTIR instrumentation and acquisition protocols, spectral pre-processing methods, discriminant models (PLSDA vs. PCA-LDA and MLA), and the use of average spectra versus pixel-level spectra for model training or testing. A particularly significant distinction lies in our model development approach. While the two other studies used the conventional method of splitting their OED spectral dataset into training and test sets, we explored a novel strategy: applying a model trained for OSCC-Benign discrimination to predict OED risk. This novel approach provides a biologically meaningful rationale for OED diagnosis: OED biopsies with biochemical profiles similar to OSCC are associated with a high risk of MT, whereas those resembling benign tissues carry a lower risk. For further comparison, we developed another OED PLSDA model using the 180-spectrum OED dataset and evaluated its performance via 10-fold cross-validation. Notably, the two PLSDA models demonstrated similar performance in predicting MT risk in OED. The loadings of the top two LVs of the two models were nearly identical. Comparison of the class average spectra revealed similar spectral features between high-risk OED and OSCC. Our findings suggest that high-risk OEDs share distinct biochemical characteristics with OSCC, reflecting underlying carcinogenic pathways and molecular changes that lead to cancer development in these OED patients.

This pilot study has several limitations. First, the generalizability of the findings is constrained by the small sample size, which was drawn from a single center. Our future studies aim to expand the sample size and include data from multiple centers, which will help enhance the robustness and reliability of our prediction model across a more diverse patient population. Second, we utilized six ROIs per biopsy to capture the spatial distribution of the tissue. However, the manual selection of ROIs may introduce potential bias. Besides, some biopsies exhibited spatial heterogeneity, which we accounted for using a majority voting rule for biopsy-level predictions. To refine this approach, we plan to capture FTIR images of entire biopsy tissues and perform a more comprehensive analysis of the whole-tissue dataset. We will also explore better methods for interpreting and analyzing spatial heterogeneity. Third, this study focused on the PLSDA model, which was the highest performing model in our previous OSCC-Benign study. Moving forward, we aim to investigate more advanced machine learning techniques, including deep learning architectures such as Convolutional Neural Networks (CNNs), to further enhance the model performance. Deep learning models excel at automatically learning relevant features from complex, high-dimensional data, and often outperform traditional machine learning models, particularly with large datasets. Through these efforts, we aim to establish and validate the FTIR-machine learning approach for clinical applications in OED diagnosis, ultimately contributing to improved patient outcomes.

Conclusions

In summary, our results demonstrate the potential of the FTIR-based machine learning approach as a more accurate tool for predicting MT risk in OED than traditional histological methods. Our FTIR-based PLSDA model generated 83.3% accuracy on the biopsy level in predicting the MT of OED, compared to less than 40% accuracy reported for the histological method. This approach provides a quantitative and objective diagnostic tool with enhanced biological interpretability, assisting clinicians in the assessment and management of OED patients. Low-risk OED patients can follow conservative surveillance protocols and avoid unnecessary medical procedures, while high-risk OED patients can benefit from timely interventions such as surgery, laser therapy, topical chemotherapy, and photodynamic therapy.54–56 A high-risk classification may also necessitate additional sampling, more frequent follow-ups, and re-examination of other oral regions to ensure comprehensive assessment.36 In addition to an objective and accurate diagnostic tool, the quality of biopsy, including factors such as size, depth, orientation and fixation, also contributes to reliable OED diagnosis. Early detection of oral cancer at its precancerous stage can significantly reduce their transformation rates and enhance patient survival outcomes.50,57,58 Future studies should focus on increasing sample size and developing advanced machine learning models to improve diagnostic accuracy. Additionally, validating these models across diverse multicentric cohorts will enhance their generalizability and readiness for clinical translation.

Author contributions

R.W. conceived and designed the study, secured funding, analyzed and interpreted the data, drafted the original manuscript, critically revised the manuscript, and led the project execution. R.S. contributed to sample preparation and data collection. T.G. provided biopsy samples, along with patient demographic and histological diagnostic information. Y.W. co-conceptualized the study, co-secured funding, reviewed and critically revised the manuscript. All authors approved the final version of the manuscript.

Data availability

The authors confirm that the data supporting the findings of this study are available within the article. Raw data that support the findings of this study are available from the corresponding author, upon reasonable request.

Conflicts of interest

The authors declare no conflicts of interest.

Acknowledgements

This work was supported by National Institute of Dental and Craniofacial Research, National Institute of Health, under grant number R21DE032560.

References

  1. J. Ferlay, M. Colombet, I. Soerjomataram, D. M. Parkin, M. Piñeros, A. Znaor and F. Bray, Int. J. Cancer, 2021, 149, 778–789 CrossRef CAS PubMed.
  2. R. L. Siegel, K. D. Miller and A. Jemal, CA-Cancer J. Clin., 2020, 70, 7–30 CrossRef PubMed.
  3. L. Pekarek, M. J. Garrido-Gil, A. Sánchez-Cendra, J. Cassinello, T. Pekarek, O. Fraile-Martinez, C. García-Montero, L. Lopez-Gonzalez, A. Rios-Parra, M. Álvarez-Mon, J. Acero, R. Diaz-Pedrero and M. A. Ortega, Oncol. Rep., 2023, 50, 213 CrossRef CAS PubMed.
  4. F. D. Shah, R. Begum, B. N. Vajaria, K. R. Patel, J. B. Patel, S. N. Shukla and P. S. Patel, Indian J. Clin. Biochem., 2011, 26, 326–334 CrossRef CAS PubMed.
  5. S. Muller and W. M. Tilakaratne, Head Neck Pathol., 2022, 16, 54–62 CrossRef PubMed.
  6. S. Shirani, N. Kargahi, S. M. Razavi and S. Homayoni, Iran J. Med. Sci., 2014, 39, 406–417 Search PubMed.
  7. S. Warnakulasuriya, J. Reibel, J. Bouquot and E. Dabelsteen, J Oral Pathol. Med., 2008, 37, 127–133 CrossRef CAS PubMed.
  8. I. van der Waal, Med. Oral Patol. Oral Cir. Bucal, 2014, 19, e386–e390 CrossRef PubMed.
  9. M. L. Goodson, P. Sloan, C. M. Robinson, K. Cocks and P. J. Thomson, Br. J. Oral Maxillofac. Surg., 2015, 53, 831–835 CrossRef CAS PubMed.
  10. K. Ranganathan and L. Kavitha, J. Oral Maxillofac. Pathol., 2019, 23, 19–27 CrossRef PubMed.
  11. A. Celentano, I. Glurich, W. S. Borgnakke and C. S. Farah, Oral Dis., 2021, 27, 848–880 CrossRef PubMed.
  12. A. C. S. Talari, M. A. G. Martinez, Z. Movasaghi, S. Rehman and I. U. Rehman, Appl. Spectrosc. Rev., 2017, 52, 456–506 CrossRef CAS.
  13. Z. Movasaghi, S. Rehman and D. I. ur Rehman, Appl. Spectrosc. Rev., 2008, 43, 134–179 CrossRef CAS.
  14. S. Kumar, A. Srinivasan and F. Nikolajeff, Curr. Med. Chem., 2018, 25, 1055–1072 CrossRef CAS PubMed.
  15. J. D. Pallua, A. Brunner, B. Zelger, R. Stalder, S. H. Unterberger, M. Schirmer and M. C. Tappert, Pathol., Res. Pract., 2018, 214, 1532–1538 CrossRef CAS PubMed.
  16. L. Bel’skaya, J. Appl. Spectrosc., 2019, 86, 165–188 CrossRef.
  17. R. Wang and Y. Wang, Int. J. Mol. Sci., 2021, 22, 1206 Search PubMed.
  18. R. Wang, A. Naidu and Y. Wang, Diagnostics, 2021, 11, 2133 CrossRef CAS PubMed.
  19. S.-F. Chiu, C.-H. Ho, Y.-C. Chen, L.-W. Wu, Y.-L. Chen, J.-H. Wu, W.-S. Wu, H.-K. Hung and W.-F. Chiang, Medicine, 2021, 100, e24934 Search PubMed.
  20. J. H. Ward Jr, J. Am. Stat. Assoc., 1963, 58, 236–244 Search PubMed.
  21. O. Rainio, J. Teuho and R. Klén, Sci. Rep., 2024, 14, 6086 Search PubMed.
  22. A. Barth, Biochim. Biophys. Acta, 2007, 1767, 1073–1101 CrossRef CAS PubMed.
  23. M. J. Baker, J. Trevisan, P. Bassan, R. Bhargava, H. J. Butler, K. M. Dorling, P. R. Fielden, S. W. Fogarty, N. J. Fullwood, K. A. Heys, C. Hughes, P. Lasch, P. L. Martin-Hirsch, B. Obinaju, G. D. Sockalingum, J. Sulé-Suso, R. J. Strong, M. J. Walsh, B. R. Wood, P. Gardner and F. L. Martin, Nat. Protoc., 2014, 9, 1771–1791 CrossRef CAS PubMed.
  24. G. Bellisola and C. Sorio, Am. J. Cancer Res., 2012, 2, 1–21 Search PubMed.
  25. E. Wiercigroch, E. Szafraniec, K. Czamara, M. Z. Pacia, K. Majzner, K. Kochan, A. Kaczor, M. Baranska and K. Malek, Spectrochim. Acta, Part A, 2017, 185, 317–335 CrossRef CAS PubMed.
  26. A.-M. Šimundić, EJIFCC, 2009, 19, 203–211 Search PubMed.
  27. O. Iocca, T. P. Sollecito, F. Alawi, G. S. Weinstein, J. G. Newman, A. De Virgilio, P. Di Maio, G. Spriano, S. Pardiñas López and R. M. Shanti, Head Neck, 2020, 42, 539–555 CrossRef PubMed.
  28. O. Kujan, A. Khattab, R. J. Oliver, S. A. Roberts, N. Thakker and P. Sloan, Oral Oncol., 2007, 43, 224–231 CrossRef PubMed.
  29. O. Kujan, R. J. Oliver, A. Khattab, S. A. Roberts, N. Thakker and P. Sloan, Oral Oncol., 2006, 42, 987–993 CrossRef PubMed.
  30. L. Krishnan, K. Karpagaselvi, J. Kumarswamy, U. S. Sudheendra, K. V. Santosh and A. Patil, J. Oral Maxillofac. Pathol., 2016, 20, 261–268 CrossRef PubMed.
  31. P. Nankivell, H. Williams, P. Matthews, S. Suortamo, D. Snead, C. McConkey and H. Mehanna, Oral Surg. Oral Med. Oral Pathol. Oral Radiol., 2013, 115, 87–94 CrossRef PubMed.
  32. B. S. de Freitas Silva, D. C. R. Batista, C. F. de Souza Roriz, L. R. Silva, A. G. C. Normando, A. R. dos Santos Silva, M. A. G. Silva and F. P. Yamamoto-Silva, Clin. Oral Invest., 2021, 25, 4329–4340 CrossRef PubMed.
  33. P. M. Speight, Head Neck Pathol., 2007, 1, 61–66 CrossRef PubMed.
  34. A. K. El-Naggar, J. R. Grandis, T. Takata and P. J. Slootweg, WHO Classification of Head and Neck Tumours, World Health Organization, Geneva, Switzerland, 2017 Search PubMed.
  35. S. Silverman Jr., M. Gorsky and F. Lozada, Cancer, 1984, 53, 563–568 CrossRef CAS PubMed.
  36. E. Odell, O. Kujan, S. Warnakulasuriya and P. Sloan, Oral Dis., 2021, 27, 1947–1976 CrossRef PubMed.
  37. C. R. Leemans, P. J. F. Snijders and R. H. Brakenhoff, Nat. Rev. Cancer, 2018, 18, 269–282 CrossRef CAS PubMed.
  38. K. Ranganath, A. L. Feng, R. A. Franco, M. A. Varvares, W. C. Faquin, M. R. Naunheim and S. V. Saladi, Cancers, 2022, 14, 5581 CrossRef CAS PubMed.
  39. G. Cervino, L. Fiorillo, A. S. Herford, U. Romeo, A. Bianchi, S. Crimi, C. D'Amico, R. De Stefano, G. Troiano, R. Santoro, L. Laino, G. Laino and M. Cicciù, Dis. Markers, 2019, 2019, 8040361 Search PubMed.
  40. S. Choi and J. N. Myers, J. Dent. Res., 2008, 87, 14–32 CrossRef CAS PubMed.
  41. H. K. Williams, Mol. Pathol., 2000, 53, 165–172 CrossRef CAS PubMed.
  42. H. M. Wood, C. Conway, C. Daly, R. Chalkley, S. Berri, B. Senguven, L. Stead, L. Ross, P. Egan, P. Chengot, J. Graham, N. Sethi, T. K. Ong, A. High, K. MacLennan and P. Rabbitts, J. Pathol., 2015, 237, 296–306 CrossRef CAS PubMed.
  43. C. S. Farah, M. Jessri, N. C. Bennett, A. J. Dalley, K. D. Shearston and S. A. Fox, Oral Oncol., 2019, 96, 42–50 CrossRef CAS PubMed.
  44. R. Veeramachaneni, T. Walker, T. Revil, A. Weck, D. Badescu, J. O'Sullivan, C. Higgins, L. Elliott, T. Liloglou, J. M. Risk, R. Shaw, L. Hampson, I. Hampson, S. Dearden, R. Woodwards, S. Prime, K. Hunter, E. K. Parkinson, J. Ragoussis and N. Thakker, Sci. Rep., 2019, 9, 11992 CrossRef PubMed.
  45. E. Makarev, A. D. Schubert, R. R. Kanherkar, N. London, M. Teka, I. Ozerov, K. Lezhnina, A. Bedi, R. Ravi, R. Mehra, M. O. Hoque, I. Sloma, D. A. Gaykalova, A. B. Csoka, D. Sidransky, A. Zhavoronkov and E. Izumchenko, Cell Death Discovery, 2017, 3, 17022 CrossRef PubMed.
  46. M. Crawford, E. H. Johnson, K. Y. P. Liu, C. Poh and R. Y. L. Tsai, Cells, 2022, 11, 1033 CrossRef CAS PubMed.
  47. P. M. Speight, S. A. Khurram and O. Kujan, Oral Surg. Oral Med. Oral Pathol. Oral Radiol., 2018, 125, 612–627 CrossRef PubMed.
  48. X. Cai, J. Zhang, H. Zhang and T. Li, J. Zhejiang Univ., Sci., B, 2023, 24, 868–882 CrossRef PubMed.
  49. O. Menyhárt and B. Győrffy, Comput. Struct. Biotechnol. J., 2021, 19, 949–960 CrossRef PubMed.
  50. H. M. Mehanna, T. Rattay, J. Smith and C. C. McConkey, Head Neck, 2009, 31, 1600–1609 CrossRef PubMed.
  51. F. Yan, P. D. Reddy, S. A. Nguyen, A. C. Chi, B. W. Neville and T. A. Day, Eur. Arch. Oto-Rhino-Laryngol., 2020, 277, 2967–2976 CrossRef PubMed.
  52. B. G. Ellis, C. A. Whitley, A. Triantafyllou, P. J. Gunning, C. I. Smith, S. D. Barrett, P. Gardner, R. J. Shaw, P. Weightman and J. M. Risk, PLoS One, 2022, 17, e0266043 CrossRef CAS PubMed.
  53. J. Ingham, C. I. Smith, B. G. Ellis, C. A. Whitley, A. Triantafyllou, P. J. Gunning, S. D. Barrett, P. Gardener, R. J. Shaw, J. M. Risk and P. Weightman, IOP SciNotes, 2022, 3, 034001 CrossRef PubMed.
  54. I. van der Waal, Oral Oncol., 2010, 46, 423–425 CrossRef PubMed.
  55. S. I. Kvaal and T. Warloe, J. Environ. Pathol., Toxicol. Oncol., 2007, 26, 127–133 CrossRef CAS PubMed.
  56. Z. Wang, R. Polavaram, C. F. Fuentes and S. M. Shapshay, Arch. Otolaryngol., Head Neck Surg., 2003, 129, 869–873 CrossRef PubMed.
  57. S. Warnakulasuriya, N. W. Johnson and I. van der Waal, J Oral Pathol. Med., 2007, 36, 575–580 CrossRef CAS PubMed.
  58. S. Silverman Jr., A. R. Kerr and J. B. Epstein, J. Cancer Educ., 2010, 25, 279–281 CrossRef PubMed.

This journal is © The Royal Society of Chemistry 2025
Click here to see how this site uses Cookies. View our privacy policy here.