Ana C. O. Nevesa,
Priscila P. Silvaa,
Camilo L. M. Moraisa,
Cleine G. Mirandab,
Janaina C. O. Crispimb and
Kássio M. G. Lima*a
aInstitute of Chemistry, Biological Chemistry and Chemometrics, Federal University of Rio Grande do Norte, Natal 59072-970, RN, Brazil. E-mail: kassiolima@gmail.com; Tel: +55 84 3342 2323
bHealthy Sciences Center, Federal University of Rio Grande do Norte, Natal 59010-180, RN, Brazil
First published on 14th October 2016
Cervical cancer is the fourth most frequent cancer in women worldwide and the third in Brazil. Screening methods can substantially reduce new cases of cervical cancer by identifying pre-cancerous lesions, making it possible to offer correct management and treatment. For this purpose, this work reports the use of attenuated total reflection Fourier-transform infrared (ATR-FTIR) spectroscopy coupled with principal component analysis (PCA) and variable selection techniques, such as successive projections algorithm (SPA) and genetic algorithm (GA) associated to linear or quadratic discriminant analysis (LDA/QDA), to classify samples for negative for intraepithelial lesion or malignancy (NILM), n = 43, and squamous intraepithelial lesion (SIL), n = 40, directly from blood plasma. Furthermore, the possibility to categorize SIL subclasses according to low-grade squamous intraepithelial lesion (LSIL) and high-grade squamous intraepithelial lesion (HSIL) lesion degrees was evaluated. Application of variable selection algorithms, especially GA, considerably improved the classifications by choosing spectral variables that reflect the chemical differences between a healthy and pre-cancerous plasma sample. This method was able to correctly classify NILM vs. SIL with sensitivity and specificity for both classes varying around 77% using LDA. With QDA, the results were enhanced to sensitivity around 90% and specificity of 83%. NILM vs. LSIL presented sensitivity and specificity ranging between 67–94% and 82–94%, respectively. In addition, NILM vs. HSIL were found to have sensitivity and specificity from 76–97% to 73–100%, respectively, where QDA substantially provided better classifications. These findings highlight the potentiality of ATR-FTIR spectroscopy combined with multivariate analysis as a screening tool for pre-cancerous cervical lesions, which could contribute to reduce cervical cancer incidence.
HPV is a small non-enveloped virus, a member of the family papilloma viruses, with a circular double-stranded DNA genome, which infects the epithelia of skin and mucosa. More than 180 HPV types have been identified, and can be separated into high-risk, intermediate-risk and low-risk, according to their potential to induce cancer in infected tissues. The high-risk HPVs most related to cervical carcinogenicity are HPV 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59 and 66, where HPV 16 and 18 are the most prevalent types and responsible for more than 70% of all cases of invasive cervical cancer.3–7 Although HPV infection is the most frequent sexually transmitted disease worldwide, approximately 90% of all infected women are able to clear the virus within 2 years after infection by natural action of their immune system. If the immune system does not properly fight against the virus, the infection can develop to cause cervical intraepithelial neoplasia (CIN 1, CIN 2, CIN 3, according to the severity of the lesions), which is initially an asymptomatic condition that can either spontaneously regress to normal without any treatment, or can progress to invasive cervical cancer in 5–20 years.8,9 These pre-cancerous lesions have different rates of progression to invasive cancer, where CIN 1 has a low rate, and CIN 3 has a high rate if left untreated.10,11 In the Bethesda system of classification of cervical cytology, CIN 1 is classified as low-grade squamous intraepithelial lesion (LSIL), and CIN 2 and CIN 3 are grouped together as high-grade squamous intraepithelial lesions (HSIL).12–16 In this context, it is essential to identify the occurrence/recurrence of these cervical lesions early to guarantee correct treatment and to avoid the risk of developing invasive cancer in the following years.
Some screening methods commonly used today include tests for HPV, tests to detect cervical lesions by cytology (Pap smear) and unaided visual inspection with acetic acid (VIA), being the Pap smear most currently employed in developing countries.17–19 Implementation of the Pap smear as a screening method worldwide has substantially decreased the morbidity and mortality from squamous carcinoma of the cervix. In the UK, the screening program using the Pap smear has considerably reduced the incidence of cervical cancer to become the eleventh most common female cancer in this region.20,21 However, considering the human subjectivity present in this method due to the sampling and sample management being interpreted by the cytologist, the sensitivity (meaning the percentage of true positive cases detected) and/or specificity (meaning the percentage of true negative cases that are negative) of the Pap smear are 51% (30–87%) and 98% (86–100%), respectively. Sensitivity is particularly affected by the inter observer variability, and this lack of accuracy can lead to high false-negative rates that can induce failures in preventing cervical cancer, mainly in women that do not follow the correct periodicity of the screening programs.7,20 Furthermore, some questions like poorly developed healthcare services, cultural and religious factors, limited resources and information can play a role in putting up barriers to implement Pap smearing as a screening method, especially in developing or rural regions where these issues are still a strong reality.21
Infrared spectroscopy is a vibrational technique that has the capacity to analyze biological systems, since complex molecules such as proteins, lipids, carbohydrates and nucleic acids exhibit distinct vibrational behaviors according to their molecular structure and conformation.22 ATR-FTIR spectroscopy is a powerful alternative to be employed in resolving biological issues, considering its ability to reflect on the composition and variability of samples, and especially in the region of the “bio-fingerprint” (1800–900 cm−1), where many important biomolecules have individual absorbing frequencies, thereby allowing scientists to search for biomarkers and metabolic profiles.23 Remarkably, ATR-FTIR is a fast, non-destructive and clean method, making it possible to analyze a considerable number of samples in a day and to reuse them after spectra acquisition, thus avoiding the necessity of many reagents and sample handling steps, and promoting a reduction of waste generation and making the experiment more simple and cost-effective.20,22 ATR-FTIR has been attracting great attention in cancer research as a powerful tool, leading to relevant publications over the last few years.24 Theophilou and co-workers have used this technique to analyze ovarian tissues and to discriminate them between normal, borderline or malignant,22 while Lima and co-workers have applied ATR-FTIR to classify blood plasma or serum samples according to their ovarian cancer stage.23 Moreover, Purandare and co-workers have showed the capability of vibrational spectroscopy to segregate low-grade cervical cytology-based samples considering their potential to regress, remain static or progress.25 Also in this field, Lima and co-workers have successfully classified cervical cytology specimens between high-risk or low-risk HPV infection.26
ATR-FTIR is definitely a remarkable tool for studying chemical species due to its ability to provide a high number of substantial information, however, when biological samples are taken into account, this technique itself may not provide enough specificity in the search for biomarkers since there are many biomolecules contributing to the whole signal, leading to a high amount of complex data. On the other hand, multivariate analysis has been proven to be effective in overcoming this drawback, allowing for the successful use of ATR-FTIR for biological purposes. This is especially evident in the possibility to extract essential information related to biomarkers, which reflects the particularity of each chemical system. In this context, SPA and GA have made it possible to select the most significant variables from complex spectral data, which can be associated to biomarkers. For classification, the employment of these algorithms is commonly associated to LDA, in the way that samples can be separated into groups based on their spectral similarities, and the classification model is used to predict unknown samples.26
The 21st century has been characterized by the search for alternative tools in several medical fields, and in this context the combination of inexpensive spectroscopic techniques and computational treatments emerges as a very promising strategy for screening cervical cancer. In this paper, we report our findings in the application of ATR-FTIR spectroscopy and multivariate analysis to differentiate NILM and SIL classes directly from blood plasma. Furthermore, we investigated the ability of this method to separate cervical squamous intraepithelial lesions into low-grade and high-grade lesions (LSIL and HSIL), respectively. Chemometric approaches were based on the use of PCA, SPA and GA algorithms associated to linear and quadratic discriminant analysis (LDA and QDA, respectively). To the best of our knowledge, this is the first work involving screening cervical pre-cancer stages in Brazilian women using ATR-FTIR and chemometrics. In addition, PCA-QDA, SPA-QDA and GA-QDA have never been reported in literature for this purpose. Considering the high incidence of cervical cancer all over the world and the relevance of its early detection, this fast, simple and inexpensive methodology may substantially contribute to cancer prevention, especially in developing countries.
Collection of fasting blood samples (containing the anticoagulant EDTA) was performed per patient prior to cytology smears or large loop excision surgery of the transformation zone (LLETZ), accounting for a total of 83 blood samples. In this study, atypical squamous cells (ASC) of undetermined significance (ASC-US and ASC-H) were excluded. Within two hours after blood collection by venipuncture, blood plasma was separated by density gradient, and aliquots were transferred into cryogenic tubes and stored at −80 °C until analysis. Before analysis, cytological samples (Pap smear) were obtained from women who were referred either in NILM or SIL groups. For women undergoing LLETZ surgery, histopathological analysis was performed on sections from paraffin blocks in 4 μm thickness and stained with hematoxylin/eosin. Cytology and histopathology are reported according to the Bethesda system:27 43 patients (NILM), 16 patients (LSIL) and 24 patients (HSIL).
![]() | (1) |
![]() | (2) |
To obtain a discriminant profile, the LDA classification score (Lij) is calculated for a given class k by the following equation:
![]() | (3) |
On the other hand, the QDA classification score (Qij) is estimated using the variance–covariance for each class k and an additional natural logarithm term, as follows:
![]() | (4) |
Additionally, the main differences between these discrimination methods are that QDA forms a separated variance model for each class and does not assume classes having similar variance–covariance matrices; whereas LDA does not take into account different variance structures in each class, assuming that the analyzed classes have similar variance–covariance matrices.31 The GA-LDA/QDA calculations were performed during 40 generations with 80 chromosomes each. One-point crossover and mutation probabilities were set to 60% and 10%, respectively. Moreover, the algorithm was repeated three times, starting from different random initial populations. The best solution (in terms of the fitness value) resulting from the three GA repetitions was employed.
The classification models were built for ATR-FTIR spectra pooled into three different cases:
(1) NILM (430 spectra) vs. SIL (LSIL and HSIL) (400 spectra);
(2) NILM (220 spectra) vs. low-grade lesions (LSIL) (160 spectra);
(3) NILM (220 spectra) vs. high-grade lesions (HSIL) (240 spectra);
Calculations of sensitivity (probability that a test result will be positive when disease is present) and specificity (probability that a test result will be negative when disease is not present) were performed for this study as important quality measures of model accuracy. Both parameters have a maximum value of 1 and a minimum of 0, and can be obtained by using the following equations:
![]() | (5) |
![]() | (6) |
![]() | ||
Fig. 1 ATR-FTIR mean spectra of NILM (blue), LSIL (red) and HSIL (green) samples, in the region of 900–1800 cm−1. |
It is possible to verify that the spectra present strong similarity related to absorption bands, in addition to being highly overlapped, in a way that it becomes difficult to categorize samples only considering the complex spectral information available. In this sense, application of multivariate algorithms is an essential strategy to extract the important spectral information, enabling the discrimination of samples between NILM or SIL classes based on their pathophysiological condition reflected in the spectral bands. Furthermore, variable selection algorithms such as SPA and GA are powerful tools to be used in the search for biomarkers in blood plasma, allowing that less complex models be obtained. In this study, all spectra were pre-processed by applying normalization (amide I band) and baseline correction, and the classification models (PCA-LDA/QDA, SPA-LDA/QDA and GA-LDA/QDA) were built using both the processed and the raw data in order to compare results. In general, sensitivity and specificity values of models were higher when classification was performed using the raw data, and the best results can be appreciated in the following discussions. Considering the importance of screening methods for reducing the new cases of cervical cancer, the main objective of this study was to apply chemometric tools to extract the biochemical information of samples representing women with or without cervical lesions, making it possible to separate samples in to the two classes of NILM and SIL. Additionally, more specific models were also investigated to categorize samples in attempt to show the potentiality of the proposed classification method, taking into account the existence of subgroups in the cervical lesion (SIL), LSIL and HSIL classes. This approach could be of great interest in clinical routine, since medical conduct is totally different in face of a patient with a low-grade lesion or high-grade lesion condition. Therefore, the whole NILM dataset (430 spectra) was divided approximately by half for this purpose, and a NILM dataset of only 220 spectra (22 samples) was used for models in order to have a similar data size compared to the LSIL and HSIL datasets. In all cases, a comparison between LDA and QDA models was performed by analyzing the sensitivity and specificity values obtained for both linear and quadratic models.
Model | LDA | QDA | |||
---|---|---|---|---|---|
Sensitivity (%) | Specificity (%) | Sensitivity (%) | Specificity (%) | ||
NILM vs. SIL | PCA | 37/80 | 38/75 | 74/52 | 26/52 |
SPA | 40/78 | 40/78 | 61/42 | 37/58 | |
GA | 77/78 | 75/78 | 89/83 | 90/82 | |
NILM vs. LSIL | PCA | 60/75 | 60/75 | 79/37 | 21/62 |
SPA | 63/71 | 60/71 | 76/47 | 24/58 | |
GA | 76/83 | 82/87 | 94/67 | 94/83 | |
NILM vs. HSIL | PCA | 54/97 | 54/97 | 67/44 | 76/42 |
SPA | 45/94 | 45/94 | 48/28 | 51/72 | |
GA | 76/94 | 73/86 | 88/97 | 91/100 |
It is possible to observe from Table 1 that the GA-LDA model using the 68 selected wavenumbers from a whole 450 spectral variables improved classification rates for prediction samples when compared to PCA-LDA and SPA-LDA results. The GA-LDA model presented sensitivity of 77 and 78% for NILM and SIL, respectively, and also maintained very similar specificity results for both classes (75 and 78% for NILM and SIL classes, respectively). Using quadratic discriminant analysis associated to GA algorithm provided even better classification models, according to Table 1.
The wavenumbers selected by GA are shown highlighted in Fig. 2A. GA-QDA model presented sensitivity and specificity values of 89 and 90%, respectively, for NILM class; and the model achieved sensitivity and specificity of 83 and 82% for SIL class, respectively, maintaining agreement between the classification indexes for both classes. Sample separation into the two categories is shown for GA-LDA and GA-QDA in Fig. 2B and C, respectively. Two clusters are adequately visualized, where samples are softly and more correctly grouped into their own classes with the GA-QDA model. GA-LDA/QDA have selected particularly interesting wavenumbers (Fig. 2A); namely, the variables at 1747 and 1724 cm−1, associated to CO stretching vibrations of lipids and aldehydes, respectively. The major peaks of 1639 cm−1 (amide I) of C
O stretching vibration of the amide group coupled to the N–H bond bending and the C–N bond stretching, as well as 1539 cm−1 (amide II) of C–N stretching and N–H deformation were observed. Finally, there are methyl and methylene groups of lipids and proteins at 1400 and 1454 cm−1, respectively, asymmetric and symmetric stretching vibrations of phosphate at 1219 and 1080 cm−1, respectively, and C–O groups of carbohydrates at 1155 cm−1 which also were observed.
In this case, GA-LDA/QDA selected some interesting variables (see Fig. 2D), namely the wavenumbers at 1724 and 1461 cm−1 associated to CO stretching vibrations of aldehydes and methylene lipid groups; amide III from proteins at 1334 cm−1; asymmetric and symmetric stretching vibrations of phosphate at 1221 and 1089 cm−1; and out-of-plane C–H bending at 960 cm−1. It is worth mentioning that some of these variables are coincident with those selected for NILM vs. SIL classification, as described above.
In this case, GA-LDA/QDA have selected some interesting variables (see Fig. 2G), namely: variables at 1758 and 1729 cm−1 are associated to CO stretching vibrations of lipids and aldehydes, respectively, major peaks at 1639 cm−1 (amide I) of C
O stretching vibration of the amide group coupled to the bending of the N–H bond and the stretching of the C–N bond, the right and side amide II at 1531 cm−1, methylene lipid groups at 1467 cm−1, amide III from proteins at 1342 cm−1, out-of-plane C–H bending at 968 cm−1, and the variables at 1043 and 1063 cm−1 representing glycogen band due to OH stretching coupled with bending and CO–O–C symmetric stretching of phospholipids and cholesterol esters, respectively.23
This journal is © The Royal Society of Chemistry 2016 |