Maria
Paraskevaidi
*a,
Camilo L. M.
Morais
a,
Daniel L. D.
Freitas
b,
Kássio M. G.
Lima
b,
David M. A.
Mann
c,
David
Allsop
d,
Pierre L.
Martin-Hirsch
e and
Francis L.
Martin
*a
aSchool of Pharmacy and Biomedical Sciences, University of Central Lancashire, Preston PR1 2HE, UK. E-mail: mparaskevaidi@uclan.ac.uk; flmartin@uclan.ac.uk
bInstitute of Chemistry, Biological Chemistry and Chemometrics, Federal University of Rio Grande do Norte, Natal 59072-970, Brazil
cClinical Neuroscience Research Group, Division of Medicine and Neuroscience, University of Manchester, Greater Manchester Neurosciences Centre, Hope Hospital, Salford M6 8HD, UK
dDivision of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster LA1 4YQ, UK
eDepartment of Obstetrics and Gynaecology, Central Lancashire Teaching Hospitals NHS Foundation Trust, Preston PR2 9HT, UK
First published on 15th August 2018
Alzheimer's disease (AD) is currently under-diagnosed and is predicted to affect a great number of people in the future, due to the unrestrained aging of the population. An accurate diagnosis of AD at an early stage, prior to (severe) symptomatology, is of crucial importance as it would allow the subscription of effective palliative care and/or enrolment into specific clinical trials. Today, new analytical methods and research initiatives are being developed for the on-time diagnosis of this devastating disorder. During the last decade, spectroscopic techniques have shown great promise in the robust diagnosis of various pathologies, including neurodegenerative diseases and dementia. In the current study, blood plasma samples were analysed with near-infrared (NIR) spectroscopy as a minimally-invasive method to distinguish patients with AD (n = 111) from non-demented volunteers (n = 173). After applying multivariate classification models (principal component analysis with quadratic discriminant analysis – PCA-QDA), AD individuals were correctly identified with 92.8% accuracy, 87.5% sensitivity and 96.1% specificity. Our results show the potential of NIR spectroscopy as a simple and cost-effective diagnostic tool for AD. Robust and early diagnosis may be a first step towards tackling this disease by allowing timely intervention.
Infrared (IR) spectroscopy has advanced significantly over the last few decades, specifically in the field of biomedical investigation.3,4 By exploiting the vibrational movements of the chemical bonds within a sample after excitation, IR spectroscopy can provide quantitative and qualitative information about a sample. Technological advancements have also simplified the previously expensive and complicated instrumentation, thus facilitating the wider use of these systems. In this study, near-IR (NIR) spectroscopy was employed to study the region of the electromagnetic spectrum ranging between ∼750–2500 nm. The most prominent bands in the NIR include overtones and combinations of fundamental vibrations of –CH, –NH, and –OH groups.5 It has been previously shown that NIR spectroscopy holds promise for biomedical applications,6 including the study of human skin (skin carcinomas, atopy and leprosy),7 diabetes,8 breast and colorectal cancers,9,10 Alzheimer's disease11 and chronic fatigue syndrome.12
Spectroscopic techniques have been previously employed by independent research groups for the investigation of neurodegenerative disorders, either by analysis of brain biopsies or biofluids, such as cerebrospinal fluid (CSF) and blood samples.13–17 The objective of the current study was to detect AD using a minimally-invasive, but at the same time rapid and inexpensive, blood test. Our aim was to use a large number of individuals and add further evidence to the current literature about the diagnostic capabilities of spectroscopy as a diagnostic tool.
The use of a suitable substrate in spectroscopy is also of major importance as it could distort the resultant spectral information and lead to falsified conclusions; for this reason, numerous studies have previously used costly and/or fragile substrates, such as calcium/barium fluoride or gold substrates, to avoid signal interference.18–22 At the same time, however, the substrate of choice should be relatively inexpensive in order to be welcomed to a clinical setting. Therefore, a secondary aim of this study was to investigate whether the signal from the commonly-used and inexpensive low-E glass slide23,24 could be removed from the samples’ spectra without affecting the diagnostic result.
Alzheimer's disease | Healthy control | |
---|---|---|
Number of cases | ||
111 | 173 | |
Age | ||
<65 | 60/111 | 90/173 |
≥65 | 51/111 | 79/173 |
Unknown | — | 4/173 |
Gender | ||
Female | 50/111 | 103/173 |
Male | 61/111 | 68/173 |
Unknown | — | 2/173 |
All participants were recruited at Salford Royal Hospital (Salford, UK) with informed consent obtained prior to enrolment in accordance with Local Ethical Approval (05/Q1405/24 conferred by North West 10 Research Ethics Committee Greater Manchester North). Blood samples were collected in EDTA tubes following standard operating procedures. To acquire the plasma, whole blood was centrifuged for 10 min at 2000 rpm and 4 °C and the supernatant was collected in new microtubes. Plasma samples were aliquoted and kept at −80 °C until needed for the spectroscopic analysis. Samples were thoroughly thawed before depositing 50 μL onto IR-reflective glass slides (MirrIR Low-E slides, Kevley Technologies, USA) and left to dry overnight at room temperature.
Fig. 1 Average (a) raw and (b) pre-processed NIR spectra for healthy controls (HC) and Alzheimer's disease (AD) patients. |
Classification was performed using principal component analysis with quadratic discriminant analysis (PCA-QDA). A PCA-QDA model is based on a PCA decomposition followed by a Mahalanobis distance calculation. PCA reduces the original dataset into a few number of principal components (PCs) accounting for the majority of the variance across the spectra. As a result, a scores and a loading array are generated for each PC representing the variance on the sample and variable (e.g., wavelength) directions, respectively.26 PCA also solves problems with ill-conditioned data (data matrix with a large condition number) by reducing redundant information across the data and solving collinearity problems. For this reason, PCA is commonly employed prior to discriminant analysis, with the PCA scores used as input variables for the QDA algorithm.
As aforementioned, QDA is a classification algorithm based on a Mahalanobis distance calculation. QDA assumes classes having different variance structures, calculating an individual variance-covariance matrix for each class.27 This improves the classification capacity of QDA in comparison with linear methods (e.g., linear discriminant analysis – LDA) when classes with different variances are being analysed, which occurs often in complex datasets. The QDA classification scores were calculated in a non-Bayesian form in order to reduce the degree of overfitting, as follows:28
Qik = (xi − k)TCk−1(xi − k) | (1) |
Outliers were identified using a Hotelling T2versus Q residual test.29 This test enables someone to create a chart containing the Hotelling T2 values in the x-axis and the Q residuals in the y-axis, where all samples far from the origin [0,0] are considered to be outliers. The Hotelling T2 values represent the sum of the normalised PCA scores, which is the distance from the multivariate mean to the projection of the sample onto the PCs; and the Q residuals are the sum of squares of each sample in the error matrix, which are the residuals between the sample and its projection via PCA.
In addition, a receiver operating characteristics (ROC) curve was generated using easyROC version 1.3 (http://www.biosoft.hacettepe.edu.tr/easyROC/),31 where the area under the curve (AUC) value was calculated as a general indicator of how well the model distinguished between the classes.
For classification, the PCA-QDA algorithm was applied using 2 PCs (67.24% cumulative explained variance). PCA scores and loadings are depicted in Fig. 2a and b, respectively. The scores profile on the two PCs was superposed for HC and AD samples, with no clear separation observed between the classes. The loading profiles (Fig. 2b) indicated greater differences close to regions corresponding to a combination of O–H stretch/C–O stretch second overtone (∼1860 nm), second overtone CO stretching (H-bonded) in peptides (1908 nm), and a combination of bands consisting of N–H bend second overtone, C–H stretch/CO stretch, and CO stretch/N–H in-plane bend/C–N stretching in proteins (2100 nm, 2111 nm, 2150 nm).32–34 These bands correspond to the most important spectral features used by the QDA classifier in PCA-QDA. The PCA-QDA model distinguished between AD and HC individuals with 92.8% accuracy, 87.5% sensitivity and 96.1% specificity (Table 2). The ROC curve and AUC value for PCA-QDA are shown in Fig. 3. The AUC value (0.928) is close to the maximum of 1, indicating its excellent predictive response.
Fig. 2 (a) PCA scores on PC1 and PC2 for healthy controls (HC) and Alzheimer's disease (AD) samples (explained variance for each PC inside parenthesis); (b) PCA loadings based on PC1 and PC2. |
Parameter | Value (%) |
---|---|
Accuracy | 92.8 |
Sensitivity | 87.5 |
Specificity | 96.1 |
PPV | 93.3 |
NPV | 92.5 |
Blood biomarkers indicative of disease constitute a developing field with great promise in the area of neurodegenerative disorders. A great number of new, blood-based molecular tests have emerged over the years, suggesting different biological markers for the detection of AD and other dementias.35–41 Even though the diagnostic capability of the above-mentioned tests is satisfactory, the high cost and laborious experimental work of these methods are great disadvantages for the development of a clinical test.
In contrast to conventional molecular techniques, spectroscopic tests allow cost-effective and rapid results. Previous studies using the mid-IR region for the diagnosis of AD have achieved comparable diagnostic results to the current NIR study. For instance, using ATR-FTIR in the mid-IR region, Paraskevaidi et al. achieved 86% sensitivity and specificity for individuals who carried one or two alleles of apolipoprotein e4 (APOE ε4);42 Carmona et al. used the mid-IR region (as well as Raman spectroscopy) to differentiate between healthy elderly and demented patients with a sensitivity of 89% and a specificity of 92%;14 Peuchant et al. also employed mid-IR spectroscopy and achieved 98.4% diagnostic accuracy.43 Other preliminary studies have also successfully applied spectroscopic approaches (IR or Raman spectroscopy) for the diagnosis of AD or other types of dementia;15,44,45 however, the small number of samples in these studies was a limitation, preventing more general conclusions. According to the NIR results of the present study, most of the differences between healthy and demented individuals seem to be related mainly to protein bands. Some of the well-known characteristics of AD include the building-up of Aβ plaques and neurofibrillary tangles (primarily consisting of tau protein) in the brain, and therefore we can speculate that the observed changes in the NIR region may be attributed to such protein changes.
The current study has achieved exceptionally high diagnostic accuracies using NIR spectroscopy in the transmission mode. The PCA-QDA classification model presented 92.8% accuracy, 87.5% sensitivity and 96.7% specificity which are comparable, and even superior, to current conventional diagnostic biomarkers. Using NIR spectroscopy also decreases the instrumental cost substantially, as instrumentation is much cheaper than other IR or Raman systems and it can be easily translated to portable systems. Our results, coming from a large cohort, add to the current literature by validating previous spectroscopic work, thus taking spectroscopy one step forward towards clinical implementation. Indeed, repetition and validation in independent research groups is of crucial importance for every new biomarker or diagnostic test prior to clinical trials. Herein, we have also shown that after appropriate spectral pre-processing, the low-E signal can be subtracted from the spectra, therefore allowing direct comparison of the sample information without interference from the slide. It should be noted here that the patient cohort in this study was already diagnosed with the disease, therefore the potential of blood-based NIR spectroscopy for pre-symptomatic detection remains to be further explored. Nevertheless, we are optimistic as previous spectroscopy studies have demonstrated segregation between early-stage/mild AD and healthy controls,15,44,46 but a larger number of early-stage patients are still required.
To conclude, this study detected a blood signature for AD, showing great promise in the accurate, simple and minimally-invasive diagnosis of the disease. Our main objective in the current study was to show whether NIR spectroscopy, in the transmission mode, could provide satisfactory diagnostic performance after removal of the substrate's signal. Future studies should focus on the recruitment of more participants, including asymptomatic individuals or patients with mild cognitive impairment (MCI). This would be the next big step in this field, as accurate identification of MCI individuals would allow immediate management and recruitment into clinical trials; the latter may also prove crucial for the development of new therapeutic strategies.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8an01205a |
This journal is © The Royal Society of Chemistry 2018 |