Open Access Article
Lieshu
Tong
a,
Josef
Kauer
bc,
Xi
Chen
de,
Kaiqin
Chu
a,
Hu
Dou
*d and
Zachary J.
Smith
*a
aDepartment of Precision Machinery and Precision Instrumentation, University of Science and Technology of China, Hefei, Anhui, China. E-mail: zsmith@ustc.edu.cn
bBeuth Hochschule für Technik Berlin, Berlin, Germany
cNeuroCure Clinical Research Center, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Berlin, Germany
dDepartment of Clinical laboratory, Ministry of Education Key Laboratory of Child Development and Disorders, Key Laboratory of Pediatrics in Chongqing, Chongqing International Science and Technology Cooperation Center for Child Development and Disorders, Children's Hospital of Chongqing Medical University, Chongqing, China. E-mail: 375009808@qq.com
eCenter for Clinical Molecular Medicine, Children's Hospital of Chongqing Medical University, China
First published on 28th September 2018
Anemia affects more than ¼ of the world's population, mostly concentrated in low-resource areas, and carries serious health risks. Yet current screening methods are inadequate due to their inability to separate iron deficiency anemia (IDA) from genetic anemias such as thalassemia trait (TT), thus preventing targeted supplementation of oral iron. Here we present an accurate approach to diagnose anemia and anemia type using measures of pediatric red cell morphology determined through machine learning applied to optical light scattering measurements. A partial least squares model shows that our system can accurately extract mean cell volume, red cell size heterogeneity, and mean cell hemoglobin concentration with high accuracy. These clinical parameters (or the raw data itself) can be submitted to machine learning algorithms such as quadratic discriminants or support vector machines to classify a patient into healthy, IDA, or TT. A clinical trial conducted on 268 Chinese children, of which 49 had IDA and 24 had TT, shows >98% sensitivity and specificity for diagnosing anemia, with 81% sensitivity and 86% specificity for discriminating IDA and TT. The majority of the misdiagnoses are IDA patients with particularly severe anemia, possibly requiring hospital care. Therefore, in a screening paradigm where anyone testing positive for TT is sent to the hospital for gold-standard diagnosis and care, we maximize patient benefit while minimizing use of scarce resources.
Anemia can have many underlying causes. In this work we are primarily concerned with nutritional and genetic causes. The most common form of anemia worldwide is iron deficiency anemia (IDA), which can be easily treated with iron supplementation. To address the heavy global burden of anemia, some researchers have explored widespread, population-level iron supplementation,10 including iron-fortified staple crops such as millet and rice.11 However, anemia persists as a public health issue due to the potential dangers of iron over-supplementation. The famous Pemba study, found that widespread iron supplementation in areas where genetic anemias are prevalent led to an overall adverse outcome for subjects who received supplementation but were already iron replete,12 with this result confirmed in additional clinical trials.13–15 While the cause of iron-associated toxicity is not fully elucidated, it is believed to be related to the fact that iron supplements bypass the body's typical mechanisms for iron extraction and storage from food, increasing serum iron levels and interfering with inflammatory and other processes in the body,16 including interfering with pregnancy,17 and increasing susceptibility to and severity of Plasmodium falciparum malaria.18 This is particularly critical for areas where Thalassemia trait (TT) is endemic. TT is a collection of genetic anemias where the body has a reduced capacity for synthesis of either the α, β, or δ chain of hemoglobin. TT carriers have a markedly reduced capacity to process iron, and therefore excess iron can easily lead to toxic iron overload.19,20
In many parts of the world genetic anemias can be highly prevalent. For example, in South-eastern China, estimates of TT prevalence are greater than 10% of the population.21,22 Thus, iron supplementation must be accompanied by population-level screening to determine anemia status in order to safely deliver potential benefits without risk to the otherwise healthy. Point of care technologies to screen for anemia have a long history, with methods such as Tallquist's paper-based colorimetric scale having nearly 100 years of use.23 Methods have traditionally focused on measuring hemoglobin concentration, with commercial point-of-care assays such as HemoCue24 already part of established clinical practice. Newer noninvasive tests of hemoglobin concentration based on diffuse light propagation through tissue are also under development and have seen limited deployment,25 while paper-based assays have also seen renewed interest.26 However, these methods all rely on hemoglobin measurements that cannot determine anemia type, rendering it of limited use for supplementation purposes in areas where genetic anemias are common.
Clinical tests for nutritional deficiency measure serum iron, serum ferritin, zinc protoporphyrin, and other chemical markers. Hennig et al. have developed a non-invasive method of screening for iron deficiency by determining serum ferritin concentration using autofluorescence of zinc protoporphyrin (ZPP) measured in the inner lip.27 However, ZPP levels can fluctuate with inflammation and other disorders, and thus as a screening method it lacks sensitivity. In developed countries with low levels of hemoglobinopathies and parasitic illnesses, ZPP has shown promise,28 but elsewhere it has limited diagnostic ability29,30 with some cautioning against its use as a screening indicator.31 Meanwhile, Srinivasan et al. recently reported a paper-based test of serum iron using a cell phone as a colorimetric reader, however with rather poor accuracy when used in whole human blood.32 Further, neither of these methods test for the presence of genetic disorders. Thus, multiparametric tests that screen for, and differentiate, both iron and genetic status are still needed.
The most robust method for testing genetic status is through gel electrophoresis or polymerase chain reaction (PCR). However, these tests require highly trained operators working in well-staffed clinical laboratories, rendering them impractical for population-level screening in areas where genetic anemias are common. When considering genetic anemias, besides reductions in hemoglobin concentrations and alterations in hemoglobin structure, these anemias also manifest through altered red cell morphology (with the crescent-shaped blood cells of sickle disease being the most famous example). For TT, researchers have long proposed using cell morphology, including mean cell volume (MCV), red cell distribution width (RDW), and mean cell hemoglobin concentration (MCHC) as a sensitive and specific method for anemia screening.33,34 We previously reported that quadratic discriminant analysis (QDA) applied to the three red cell parameters MCV, RDW, and MCHC outperformed established indices in discriminating healthy from anemic patients, and IDA from TT patients in Chinese children.35 However, these parameters are currently measured in the hospital by a complex flow cytometry system that requires regular maintenance and operation by a highly trained user. This makes their measurement by standard methods unsuitable for wide-spread population-level screening. Further, variations in genetic profiles worldwide also lead to varying performance of established diagnostic indices in different populations. MCV, RDW, and MCHC are related to the size, polydispersity, and average refractive index of red blood cells, respectively. Elastic light scattering is an established metrology method with nanometer-scale precision that can extract exactly these parameters from polydisperse suspensions such as latexes, nanoparticles, as well as biological cells.36–38 We previously demonstrated the proof-of-concept of a cost-effective red cell analyzer for measuring light scattering from whole blood, without sample flow or any moving parts, to quickly determine red cell morphology.35,37 However, as our prior work was primarily focused on establishing, through historical review, that cell morphology could accurately separate anemia types, the validation of our instrument was strictly limited to a 10-patient proof of concept study of healthy adult subjects.
In this study we expand on our prior work through three advances: (1) a full-scale clinical study on more than 200 pediatric subjects, including a substantial fraction suffering from IDA and TT. (2) A machine-learning based analysis scheme, not previously reported on elastic light scattering data, which improves both robustness and accuracy compared with our prior reported physics-based model. (3) Improved construction of our system to enable it to be transported out of a laboratory setting, enabling all of the measurements to be performed at a field site. The results show that the sensitivity and specificity using our simple instrument coupled with machine learning methods outperforms prior results using established morphometric indices and gold-standard laboratory equipment. Further, our method is easy to perform, requiring only 10 μL of blood that is simply obtained via finger stick or heel prick, and has a per-test cost of ∼US$1. This makes it amenable to operation by minimally trained users in field settings, as phlebotomy is not required. It gives results in minutes, allowing relatively high throughput for large population screening. The high sensitivity and specificity of the method, particularly for separating healthy and anemic subjects, indicates it holds great promise for use as a wide-spread screening method for nutritional and genetic anemias in Southeast Asia and elsewhere that TT and IDA are endemic.
. Our experiments were performed using discarded, anonymized samples that were collected as part of routine clinical practice at the Children's Hospital of Chongqing Medical University and not for the purposes of this study. Therefore, informed consent was not required.
WHO diagnostic criteria were used to distinguish different types of anemia. Patients 6 months to 6 years old with hemoglobin less than 110 g L−1 and those 6 to 14 years old with hemoglobin less than 120 g L−1 were considered anemic. For diagnosis of IDA, the serum iron of patients must be less than 11 μmol L−1. For deletion α-thalassemia patients, polymerase chain reaction reverse dot blot (PCR-RDB) technology was used to detect the −α3.7, −α4.2 and −SEA α-thalassemia deletion genes. For mutation α-thalassemia patients, PCR-RDB technology was used to detect the common Quong Sze (QS), Constant spring (CS) and Westmead (WS) mutation sites. For β-thalassemia patients, PCR-RDB technology was used to detect the following common mutation sites and start codons: CD41-42(−TCTT), IVS-2-654 C → T, CD17 A → T, -28 A → G, CD26 G → A, CD71-72(+A), CD43 G → T, -29 A → G. PRC-RDB was also used to identify the following nine rare mutation sites: ATG → AGG, CD14-15(+G), CD27-28(+C), -32 C → A, -30 T → C, IVS-1-1 G → T, IVS-1-5 G → C, CD31(−C), CAP +40−+43 (−AAAC).
After being collected, blood samples were stored in an ethylene diamine tetra-acetic acid (EDTA)-coated anticoagulation tube. Reference clinical values of red cell parameters were measured using Sysmex XE-2100 hematology analyzer, a specialized flow cytometer where blood is split into multiple measurement channels and analyzed using a combination of fluorescence, forward- and side-scattered light, and electrical impedance. Serum iron was measured using a Johnson & Johnson VITROS 5.1 FS biochemical analyzer. Genetic testing was performed using an ABI Verity PCR amplifier, UVP HB-100 hybridizer, and Bio-Rad electrophoresis and gel imaging systems with its supporting reagents. Thalassemia genetic testing utilized a PCR amplifier (Verity, ABI Corporation, USA), HB-1000 hybridizer (UVP Corporation, USA), and electrophoresis and gel imaging systems. In performing these analyses, we utilized Yaneng BIO α-thalassemia and β-thalassemia point mutation gene detection kits.
As described later in the text, PLS regression was used to extract clinical parameters from the scattering data, and was implemented using a linear kernel. PCA–SVM and QDA were used to classify the samples as healthy, IDA, or TT. PCA–SVM was performed using a radial basis function kernel and standardized variables. PLS regressions, QDA classifications, and PCA–SVM classifications were all validated using 10-fold cross validation, where 90% of the data is used to construct the regression or classification model, and the remaining 10% is tested. The calibration and testing procedure is repeated 10 times until all samples have been used as both calibration and test samples.
| Variable | HC | IDA | TT |
|---|---|---|---|
| a Significantly different compared with HC group, p < 0.05. b Significantly different compared with IDA group, p < 0.05. | |||
| No. of samples | 195 | 49 | 24 |
| MCV (fL) | 85.50 ± 3.35 | 70.98 ± 7.79a | 63.15 ± 6.56ab |
| MCHC (g L−1) | 32.72 ± 0.73 | 31.61 ± 1.91a | 31.02 ± 0.93a |
| RDW (%) | 13.02 ± 0.78 | 16.44 ± 3.25a | 16.72 ± 2.04a |
A comparison between the clinical values and the PLS predictions based on the scattering patterns are shown in Fig. 2. The top row shows the correlation between our method and the clinical analyzer, while the bottom row presents the results of a Bland–Altman analysis. The results of our analysis on the three groups of blood samples demonstrates quite close agreement with clinical results, especially for MCV and MCHC, where the majority of the points fall within the 95% CI of the clinical analyzer. This agreement is in spite of the fact that clinical analyzers measure cells one at a time using complex flow cytometry, guaranteeing highly accurate results. By contrast, our method requires no flow or moving parts and measures population-level information in a single shot. Therefore we obtain highly accurate information despite using substantially simpler instrumentation.
These extracted parameters can then be used to separate healthy and anemic subjects, as well as IDA from TT subjects. Further, these values may find use in other clinical tasks such as in diagnosis or monitoring of other diseases known to alter cell morphology, such as macrocytic anemias due to liver dysfunction, B12 or folate deficiencies.42
| Index | AUC | Sens | Spec | YI |
|---|---|---|---|---|
| CLI + QDA | 99.46 | 97.44 | 97.26 | 0.94 |
| PLS + QDA | 99.28 | 98.46 | 98.63 | 0.97 |
| PCA + SVM | 99.75 | 99.49 | 93.15 | 0.93 |
| EF | 98.79 | 95.89 | 99.49 | 0.95 |
| E | 94.00 | 80.82 | 96.41 | 0.77 |
| Si | 98.94 | 95.89 | 99.49 | 0.95 |
| Index | AUC | Sens | Spec | YI |
|---|---|---|---|---|
| CLI + QDA | 83.50 | 87.50 | 85.71 | 0.73 |
| PLS + QDA | 79.59 | 79.17 | 81.63 | 0.61 |
| PCA + SVM | 84.86 | 81.17 | 85.71 | 0.67 |
| EF | 78.53 | 75.51 | 87.50 | 0.63 |
| E | 78.23 | 87.67 | 66.67 | 0.54 |
| Si | 78.32 | 75.51 | 87.50 | 0.63 |
As described above, PLS expects a linear model between the observed data and underlying latent variables. This assumption is violated to some degree in our dataset, potentially limiting the PLS performance. While the PLS-determined clinical values are useful for clinical interpretation, our goal is simply classification of a sample into healthy, IDA, or TT. Therefore, another option is to forgo the intermediate step of using PLS to extract the red cell parameters, and create a machine learning model that directly uses the raw scattering data to classify the patients into healthy, IDA, or TT. This has the advantage of not requiring any model of the data but relying purely on pattern matching. A principal component analysis decomposition of the dataset shows that even the first three principal component scores demonstrate strong visual separation of the data into healthy, IDA, and TT groups (see ESI† Fig. S6), indicating that a classification method based on a simple PCA decomposition of the raw data may have good performance.
To implement this, we created a principal components analysis–support vector machines (PCA–SVM) classification model starting from the raw data, and validated using 10-fold cross validation. For each round of validation, the PCA decomposition utilized only training data, preventing any information from the test set from being used in the calibration process. The first 10 principal component scores for each sample were used to create the SVM classification model. The remaining test samples were projected into the PCA space defined by the calibration set, and then classified using the established SVM model. As seen in the red line in Fig. 4B, the PCA–SVM model outperforms the PLS–QDA model, with an AUC similar to the QDA model using the gold-standard clinical data, albeit with somewhat reduced sensitivity. Similarly to the PLS validation, the optimum number of PC scores to pass to SVM was selected via cross validation. However, as detailed in the ESI,† the PCA–SVM performance does not strongly vary with model rank, indicating the robustness of the method.
Our results can also be compared with previously developed diagnostic functions that use red cell morphology to discriminate IDA and TT. While several diagnostic functions have been reported in the literature, our prior results have indicated the top-performing indices are:
(1) England and Fraser index (EF): MCV-RBC-(5 *Hb)-5.19;46
(2) Ehsani index (E): MCV-10*RBC;47 and
(3) Sirdah index (Si): MCV-RBC-3*Hb.48
Note that these functions require additional parameters, typically including the red cell count and the hemoglobin value, available only using complex instrumentation. When examining the ROC curves and Tables 2 and 3, we see that, despite using values determined by complex gold-standard clinical equipment, these indices all have lower performance than our method for discriminating IDA and TT, indicated by lower AUC and YI values.
Data from our system was used to separate healthy, IDA, and TT subjects in a large cohort of Chinese children, with excellent performance. In particular, the performance of our system using a support vector machines classification model was approximately equivalent to classification using raw data from the gold standard instrument, further confirming that our system can accurately probe red cell morphology, despite not requiring any flow or other moving parts. The majority of misclassifications of IDA vs. TT (∼75% of misclassifications) were IDA patients that were misclassified as TT. As the majority of the classification power in our data is found in the MCV, and as MCV for IDA subjects is correlated with hemoglobin concentration, the IDA patients misdiagnosed as TT may indicate moderate to severe anemia.
In a wide-scale population screening program for anemia, the goal is to provide the greatest benefit to the population while minimizing use of scarce resources. A proposed paradigm is to measure all subjects using the portable light scattering instrument. Based on the red cell morphology, subjects classified as healthy or those with IDA will be discharged or prescribed iron supplementation, respectively. Those registering as TT will be sent to the hospital for further gold-standard testing. Given that the IDA misclassifications are typically those IDA subjects with more serious anemia, and given that those with severe anemia may already be suffering deleterious effects of iron deficiency, evaluation in a hospital setting may be called for. Therefore, if all patients who register as TT on our instrument are sent for gold-standard testing, we can speculate that the “wasted” resources of IDA patients being sent to the hospital may be mitigated by the fact that their symptoms may warrant observation by a doctor. Additionally, the sensitivity and specificity of our system may be improved by adding an additional measurement channel to probe hemoglobin through simple absorbance measurements.
Further, in low-resource settings IDA often persists despite nutritional interventions due to helminth and other parasitic infections of which IDA is merely a symptom.49 Therefore, our system can act not only as a screening tool, but also as a method to conveniently monitor response of patients to therapy, identifying those patients for whom nutritional interventions are not sufficient.
Finally, as described in the introduction, point of care tests of iron status are currently under active research. These chemical tests may be combined synergistically with the morphological assay presented here, providing orthogonal and multiparametric information about anemia status at the point of care.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c8lc00377g |
| This journal is © The Royal Society of Chemistry 2018 |