Diana
Frimpong
a,
Angela C.
Shore
a,
Benjamin
Gardner
c,
Claire
Newton
b,
Joya
Pawade
d,
Jonathan
Frost
e,
Laura
Atherton
e and
Nick
Stone
*c
aNIHR Exeter Clinical Research Facility and Clinical and Biomedical Sciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, Devon, UK
bUniversity Hospitals Bristol and Weston NHS Trust, Bristol, UK
cPhysics and Astronomy, Faculty of Environment, Science and Economy, University of Exeter, Exeter, Devon, UK. E-mail: n.stone@exeter.ac.uk
dSevern Pathology Services, North Bristol NHS Trust, Bristol, UK
eRoyal United Hospitals Bath NHS Foundation Trust, Bath, UK
First published on 27th February 2025
During post chemotherapy surgery for ovarian cancer, it is important to ensure that any residual disease is carefully assessed and removed. The assessment remains subjective, despite clear evidence of the benefits of complete macroscopic resection. In this work, we have considered Raman spectroscopy as a possible tool for residual disease assessment by exploring its ability to correctly classify ovarian cancer from benign and borderline tissues. Samples from seventy-three participants were analysed (n = 20 benign, n = 11 borderline and n = 42 cancer) using a multivariate analysis model. All models shown utilised validation with leave one participant out cross-validation. In ovarian tissue this model achieved 94% sensitivity and 98% specificity for prediction of cancer from benign and 98% sensitivity and 89% specificity for prediction of cancer from borderline. Thorough assessment of the surrounding peritoneal tissues is extremely important. For these peritoneal tissues taken from participants with advanced ovarian cancer, the model achieved 78% sensitivity and 84% specificity for prediction of cancerous peritoneum from benign peritoneum in participants who had primary surgery and 68% sensitivity and 81% specificity in participants who had post chemotherapy surgery. This demonstrates viability of Raman spectroscopy for assessment of ovarian cancer.
Current evidence is clear on the survival benefits of achieving complete macroscopic resection or as a minimum, less than 1 cm residual disease,4 however, this remains a subjective assessment, as no objective tool for residual disease volume assessment exists. Delaying surgery for primary chemotherapy has been demonstrated to be better for post operative outcomes,5 and non-inferior to primary surgery for progression free survival.5,6 However, achieving complete resection at interval debulking surgery (IDS) can be complicated by the fibrotic changes that occur as a result of primary chemotherapy.6
There is a need for a non-destructive tool that can accurately differentiate normal and fibrotic tissue from cancer, that is amenable to intraoperative use and can relay results in real time. One tool that has been explored for its biological applications is Raman spectroscopy.7 This is a technique that uses the inelastic scattering of light to determine the molecular composition of a sample.8 This technique has great potential for biomedical use. It has been demonstrated to be accurate for tumour excision margin assessment9 and lymph node assessment10 in breast cancer surgery and meets the aforementioned criteria for intraoperative assessment in ovarian cancer treatment.
There is previous work exploring the use of Raman spectroscopy to classify ovarian cancer however the body of work is limited when compared to other tumour sites and in exploring ovarian and peritoneal tissue in this context. The peritoneum is a common site of spread for ovarian cancer11 and presence of peritoneal disease suggests advanced stage ovarian cancer (stage III or IV).12 It would be futile to explore novel diagnostics for advanced ovarian cancer without including tissue from peritoneal disease. In 2007, Krishna et al. examined the biological differences of malignant ovarian tissue compared to normal and benign tissues using Raman spectroscopy. They concluded that lipids and DNA vibrations were the main differentiating features.13 This finding was supported by the work by Maheedhar et al. in 2008 where they identified amide I and III from proteins, lipids, shift in δCH2 and DNA as the features with relatively higher expression that distinguish cancer from normal. They also achieved a sensitivity of 88% and specificity of 84% for classifying cancer using principal component analysis-based model (PCA), with only eight participants without cancer and seven with ovarian cancer. Independent testing of this model with a limited number of participants achieved a sensitivity and specificity of 100%.14 Similar accuracy results using leave one patient out cross validation (sensitivity 93% and specificity of 88%) were achieved by David et al. for cancer detection using ovarian and endometrial cancer tissue samples further highlighting the efficacy of the technique.15
Mice modelling of this concept of residual disease targeting using Raman spectroscopy was completed by Andreou et al. in 2019 where peritoneal cavities of mice with ovarian cancer were scanned using probes after local administration of nanoparticles.16 This work highlights the feasibility of targeting metastases with vibrational spectroscopy techniques, however, although nanoparticles were locally applied, the currently unknown risks around retention of gold nanoparticles in this patient group is not ideal. Techniques to optimise Raman spectroscopy measuring only the native molecular signals, to achieve a similar aim are currently more likely to be adopted.
In this study, we aimed to assess Raman spectroscopy against histology for cancer detection in peritoneal and ovarian tissue and the effect of primary chemotherapy on detection accuracy. As such we have used tissue from women referred with suspected ovarian cancer, the outcome of which was either benign ovarian pathology, borderline ovarian tumour or ovarian cancer, of varying grades and stages of extent of disease. The importance of accurately discriminating cancer from these two ovarian pathology groups is highlighted by the Prostate, Lung, Colorectal and Ovarian cancer trial which demonstrated harm to women having extensive procedures to investigate suspected cancer.17
Demographic information | |||
---|---|---|---|
Benign | Borderline | Cancer | |
n = 20 | n = 11 | n = 42 | |
NACT = Neoadjuvant chemotherapy. | |||
Age (years) | |||
Mean (range) | 57 (36–81) | 55 (32–81) | 62 (37–88) |
No. of comorbidities | |||
0–2 | 8 | 4 | 21 |
3–5 | 9 | 5 | 19 |
6–8 | 3 | 2 | 2 |
Menopausal status | |||
Pre | 4 | 3 | 2 |
Peri | 0 | 2 | 1 |
Post | 16 | 6 | 39 |
Smoking status | |||
Never smoked | 13 | 7 | 23 |
Ex smoker | 6 | 2 | 15 |
Smoker | 1 | 2 | 4 |
Diagnosis | |||
Fibroma | Serous tumour | High grade serous | |
Dermoid | Mucinous tumour | Low grade serous | |
Strum ovarii | Torsion/necrosis | Clear cell | |
Endometriosis | Mucinous | ||
Benign with small vessel vasculitis | Endometroid | ||
Mucinous cystadenoma | Granulosa cell tumour | ||
Serous cystadenoma | |||
Samples | |||
Ovarian | 18 | 8 | 13 (2 NACT) |
Peritoneal | 9 | 1 | 24 (19 NACT) |
As both ovarian and peritoneal samples were collected from two thirds of participants, there were 52 ovarian samples and 63 peritoneal samples. Samples were excluded due to damage, being too small to measure after sectioning and morphology being obscured by blood. Where there was a duplicate of the same tissue type and pathology from one participant, one slide as excluded. The remaining 39 ovarian samples and 34 peritoneal samples were used for this analysis.
Tissue samples were defrosted at room temperature for a minimum of 10 minutes and then scanned using the white light image function of the InVia WiRE software. The area of interest previously marked on the H&E slide by a consultant histopathologist was matched to the Raman slide and a square grid of 300 μm by 300 μm was drawn over the area. Point measurements at 50 μm intervals were taken of the area covered by the grid, resulting in 49 spectra per area of interest. Each measurement was acquired for five seconds, with three accumulations using 100 mW laser power. RensihawWiRE software automated cosmic ray removal was enabled.
Multivariate analysis techniques were then used to explore models based on the data for the prediction of pathology. Principal component analysis (PCA), a technique used to the reduce data dimensions, was used to identify the variables (principal components) that explain the variance in the data. For each principal component (PC), the loadings represent the weight of the original spectral variables, and the scores explain the contribution of each PC to the individual spectrum.18 Analysis of variance (ANOVA) was used to determine the statistically significant PCs. The scores of the PCs with statistical significance were used in combination with the pathology data to create a linear discriminant model for the separation of the data into groups. The classification performance of the model was then calculated. Leave-one-out (participant) cross validation, i.e. all samples and spectra from each individual in turn were removed from the dataset used to create the classification model and then they were used to evaluate the performance of the model for predicting the pathology of those samples. Sensitivity, specificity, accuracy, area under the curve and F1 score were calculated for the prediction performance. The F1 score is a blend of the precision and recall of the model and as such, takes into account how the data is distributed and penalizes models with a high false negative rate.19
Tentative peak assignments have been made using reference tables.20–22
![]() | ||
Fig. 1 Mean spectra of ovarian tissue for pathology groups benign (green), borderline (blue) and cancer (red). |
Separation within the groups to varying levels is apparent even in this simple plot of group means. It is evident in the group comparisons in Fig. 1 that there is some overlap of variance within the pathology groups however there is still separation seen, such as in the cancer versus benign group at 1332 cm−1 suggesting increased nucleotides in the cancer group compared to the benign group. This is supported by the difference between the mean plot Fig. 2(d), which highlights the peaks suggesting an increase in concentration of lipids (545, 1074, 1119 & 1438 cm−1), amino acids (642, 1001 & 1205 cm−1), nucleotides (663, 711, 724, 777, 1256, 1332 & 1573 cm−1) and amide I group (1652 cm−1) in cancer. Peaks with a negative intensity value suggest lower concentration in cancer of phosphate minerals (591 cm−1), collagen (812, 856, 919 & 935 cm−1), amino acids (1035 & 1176 cm−1), carotenoids (1159 cm−1), amide III groups (1240 & 1276 cm−1), CH3 modes in proteins (1400 & 1415 cm−1) and deoxyribose (1465 cm−1) when compared to the benign group.
The cancer and borderline groups, Fig. 2(b) & (e), show their greatest differences with increased intensity in phosphates (1069 & 1095 cm−1), collagen (892 cm−1), ribose (912 cm−1), amino acids (1000, 1360 & 1597 cm−1), lipids (1118, 1442 cm−1), cytosine (1255 cm−1), nucleic acids (1327 & 1573 cm−1), CH3 bending mode (1386 cm−1) and amide I (1650 cm−1) in the cancer group and lower concentrations of thymine and guanine (broad peak 647–690 cm−1), DNA (807 & 831 cm−1), amino acids (1005, 1034 & 1176 cm−1), collagen (861 & 936 cm−1), carotenoids (1519 & 1158 cm−1), deoxy-ribose (1464 cm−1) and amide I group (1667 cm−1) when compared to the borderline group.
The comparison of the two non-cancer groups, Fig. 2(c) & (f), suggests a higher concentration of amino acids (643, 1003 & 1555 cm−1), nucleotides (666, 716, 981, 1336 & 1576 cm−1), DNA (827 cm−1), lipids (1074 & 1127 cm−1) and amide I (1655 cm−1) in the borderline group and lower concentrations of disulphide stretch in proteins (532 cm−1), collagen (814, 855, 918 & 935 cm−1), amino acids (1026 & 1040 cm−1), amide III group (1242 cm−1), CH3 deformation (1415 cm−1) and amide I group (1635 & 1690 cm−1) compared to benign group.
PCA-LDA models using the significant PCs on ANOVA testing (p = 0.001) were used to classify the spectra into their groups e.g. benign, cancer etc. The model was able to classify cancer from benign and cancer spectra with a sensitivity of 98% and specificity of 100%. Leave one participant out cross validation of 882 benign and 539 cancer spectra achieved a sensitivity of 94% and specificity of 98%. The F1 score for this validation model was 0.95 and mean area under the curve following two-fold cross validation of 0.97 (Fig. 3). Exploring the principal component loadings as seen in the ESI (Fig. S1†) it is clear the key molecular differences are very similar to those identified from the difference spectra in Fig. 2(d) as discussed above.
Classification of cancer from borderline and cancer spectra achieved a sensitivity and specificity of 99%. Leave one participant out cross validation of 392 borderline and 539 cancer spectra achieved a sensitivity of 98% and specificity of 89%. The F1 score for this validation model was 0.94 and mean area under the curve following two-fold cross validation of 0.99.
Classification of borderline from benign and borderline spectra achieved a sensitivity of 90% and specificity of 95%. Leave one participant out cross validation of 882 benign and 392 borderline spectra achieved a sensitivity of 72% and specificity of 93%. The F1 score for this validation model was 0.77 and mean area under the curve following two-fold cross validation of 0.88 (Fig. 3).
![]() | ||
Fig. 4 Mean spectra of peritoneal tissue for the pathology groups benign (green), cancer (red) and, interval debulking surgery (IDS) groups, benign IDS (orange) and cancer IDS (grey). |
PCA-LDA model using the significant PCs on ANOVA testing (p = 0.001) achieved a sensitivity of 86% and specificity of 98% for classifying cancer from benign and cancer spectra of tissue taken at primary surgery. Leave one participant out cross validation of 441 benign and 245 cancer spectra achieved a sensitivity of 78% and specificity of 84%. The F1 score for this validation model was 0.83 and mean area under the curve following two-fold cross validation of 0.86 (Fig. 3).
In the group with spectra from tissue collected at interval debulking surgery, as seen in other cancer and benign comparisons, the difference between the mean spectra suggested increased intensity in phosphates (1103 cm−1 DNA stretch mode), DNA (827 cm−1), amino acids (642, 1003, 1360 & 1597 cm−1), lipids (1126 cm−1), nucleic acids (665, 722, 779, 1332, 1485 & 1573 cm−1), amide I (1658 cm−1 stretch mode) and a subtle uplift in carotenoids (1156 & 1526 cm−1) in the cancer group, and lower concentrations of thymine (748 cm−1), collagen (812, 867, 919 & 922 cm−1), phenylalanine (1026 cm−1), amide III beta sheet (1240 cm−1), antisymmetric CH3 deformation (1415 cm−1), aromatic amino acids (1597 cm−1) and amide I group (1628 & 1698 cm−1) in the cancer group when compared to the benign group. The PCA-LDA model classified cancer from benign and cancer tissue (interval surgery) with a sensitivity of 81% and specificity of 91%. Leave one participant out cross validation of 931 benign and 931 cancer spectra achieved a sensitivity of 68% and specificity of 81%. The F1 score for this validation model was 0.73 and mean area under the curve following two-fold (50% training data and 50% held out) cross validation of 0.79 (Fig. 3).
The classification model in peritoneal tissue for the primary surgery group performed slightly better than the primary chemotherapy group (AUC 0.86 vs. 0.79). Whilst the sample size was smaller in the primary surgery group, 17 vs. 38, it does not account for the difference in performance as a smaller sample size is more likely to cause a type II error as opposed to improving diagnostic performance. That said, both models achieved high cross validation accuracies for cancer detection suggesting that the potential impact of chemotherapy on diagnostic performance, if any, is small and Raman spectroscopy can accurately differentiate between fibrosis and cancer.
To our knowledge, this is the first work assessing borderline tumours with Raman spectroscopy. The extremely reassuring results for cancer classification (AUC 0.99) against borderline tissue makes a very promising case for translation of this technique to use in place of frozen section should tissue measurements with a portable device29 yield similarly high accuracies. The availability of such a device could potentially decrease pathology workload and operative and anaesthetic time, with the biggest impact being in units that do not have pathology services in the same building or site as the operating theatre. As ovarian cancer surgery is currently an open surgical procedure, and the likely clinical application of a device would be intraoperative measurement excised areas of concerns, constraints of size of probe as seen in endoscopic devices does not apply however, these probes have demonstrated high accuracy in cancer detection and might offer a readily available first step to translation of this technique.30–33
In previous work looking at Raman spectroscopy of biological samples, two or three pathologists were consulted due to a known lack of consensus between pathologists for cancer and pre-cancer.34 All participants had a formal histological diagnosis as part of their clinical care by one of the gynaecology pathologists at their respective hospital, and there was no discrepancy between their clinical diagnosis and the diagnosis given on assessment of their research samples by a second histopathologist. A potential limitation of this study is that only one pathologist was consulted for the identification of the specific location on the tissue at which Raman spectroscopy measurements were taken, however, this was considered a relatively trivial process.
Whilst overall recruitment for this study was on par with, if not higher than, previous work on ovarian cancer, a limitation of this work is the small participant numbers when comparing pathology groups. Consequently, meaningful independent group testing could not be performed.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d4an01293c |
This journal is © The Royal Society of Chemistry 2025 |