Prateek
Verma
a,
Elizabeth
Adeogun
a,
Elizabeth S.
Greene
b,
Sami
Dridi
b,
Ukash
Nakarmi
c and
Karthik
Nayani
*a
aDepartment of Chemical Engineering, University of Arkansas, Fayetteville, Arkansas, USA. E-mail: knayani@uark.edu
bDepartment of Poultry Science, University of Arkansas, Fayetteville, Arkansas, USA
cDepartment of Computer Science and Computer Engineering, University of Arkansas, Fayetteville, Arkansas, USA
First published on 18th September 2024
Novel biomaterials that bridge the knowledge gap in coupling molecular/protein signatures of disease/stress with rapid readouts are a critical need of society. One such scenario is an imbalance between bodily heat production and heat dissipation which leads to heat stress in organisms. In addition to diminished animal well-being, heat stress is detrimental to the poultry industry as poultry entails fast growth and high yields, resulting in greater metabolic activity and higher body heat production. When stressed, cells overexpress heat shock proteins (such as HSP70, a well-established intracellular stress indicator) and may undergo changes in their mechanical properties. Liquid crystals (LCs, fluids with orientational order) are facile sensors as they can readily transduce chemical signals to easily observable optical responses. In this work, we introduce a hybrid LC–cell biomaterial within which the difference in the expression of HSP70 is linked to optical changes in the response pattern via the use of convolutional neural networks (CNNs). The machine-learning (ML) models were trained on hundreds of such LC-response micrographs of chicken red blood cells with and without heat stress. The trained models exhibited remarkable accuracy of up to 99% on detecting the presence of heat stress in unseen microscopy samples. We also show that cross-linking chicken and human RBCs using glutaraldehyde in order to simulate a diseased cell was an efficient strategy for planning, building, training, and evaluating ML models. Overall, our efforts build towards designing biomaterials that can rapidly detect disease in organisms that is accompanied by a distinct change in the mechanical properties of cells. We aim to eventuate CNN-enabled LC-sensors that can rapidly report the presence of disease in scenarios where human judgment could be prohibitively difficult or slow.
Heat stress (HS) occurs when an animal is unable to regulate its body temperature in response to high environmental temperature, resulting in hyperthermia (increased body temperature). HS is detrimental to the well-being of an animal, causing discomfort, organ damage, or even death. In the livestock and poultry industry, HS is known to lead to massive economic losses in addition to decreased welfare of the animals.7 Increasing global temperatures due to climate change and the ever-increasing demand for meat production have prompted research efforts toward better understanding the effects of heat stress and ways to alleviate them.8,9 HS is a particularly important stressor for the poultry industry, as poultry entails fast growth and high yields, resulting in greater metabolic activity, higher body heat production, and decreased thermo-tolerance.7,10–13 In fact, it is estimated that the amount of metabolic heat produced by the modern broiler has increased by 30% over the last 20 years.9 In poultry, studies of HS and its effect on feed intake,10,14,15 immunosuppression,16–18 growth,10,13 gut health,10,11,15 meat yields,10,13,19etc., as well as its effect on physiological responses, such as increased production of heat shock proteins (HSPs, such as HSP70)20,21 or any other biomarker (such as GRP75 or orexin),22–24 have gained momentum recently. Facile methods to rapidly characterize the health of poultry and livestock are important in a broad range of contexts, which include understanding their health/stress status, welfare, and prediction of diseases and stressors.7,9 However, there remains a wide knowledge gap in coupling molecular/protein signatures of disease/stress to rapid readouts. Aside from economic concerns, overall animal well-being is greatly diminished by HS and has become a prominent concern for consumers. Therefore, there is an urgent need to develop rapid-reporting methods that can inform on whether the organism is experiencing HS.
Here, we introduce the concept of rapidly characterizing the mechanical properties of red blood cells (RBCs) using LCs. A key aspect of the development of our LC–cell sensor platform involves building ML-based convolutional neural networks (CNNs) that can generate classifiers to separate image sets of RBCs dispersed within LCs into healthy ones and those of chickens experiencing HS.25–27
The fundamental hypothesis that drives our research is that cells overexpressing well-established intracellular stress chaperones such as heat shock proteins (HSP70) also undergo cellular changes, for instance, the mechanical properties of cellular membranes, which, in turn, can be detected by dispersing them in LCs. The expression levels of HSP70 in the LC–cell biomaterial will be used to define our classes for the CNN framework we develop. Current methods for monitoring stress rely on the identification of molecular and protein markers such as corticosterone and HSPs.28,29 Although methods that report on molecular and protein markers have increased our understanding of HS, these methods are usually time-intensive and are not immediately accessible to the end user (farmer, technician on a production line) seeking to make informed decisions on the health and stress levels of chickens. Therefore, there is a critical need to identify reliable and rapid ways to monitor HS in poultry.28,29
A key innovation in this work and our methodology is to connect the expression of HSP70 to rapid optical readouts, which characterize the health of the blood cells of chickens. Recently, an LC-based technique has been employed to rapidly report on the mechanical properties of human RBCs.30 The underlying principle is depicted in Fig. 1. Molecules of LC (blue ellipsoids) are perturbed from the preferred parallel orientation when an inclusion, for instance, a colloidal particle, is present within the LC fluid (blue ellipsoids bend around the yellow particle in Fig. 1A). This creates an orientational strain within the LC, as depicted in Fig. 1A. However, if the inclusion is soft, such as an RBC, the LC can stretch out the cell and release some of the strain contained within the fluid. This sharing of strain is intimately coupled with the mechanical properties of RBCs, which we expect to change as they experience HS. Critically, we have previously shown that the mechanical properties of individual cells (elastic shear modulus from 2 to 16 × 10−6 N m−1) are stretched by an LC to different extents. The energetic cost associated with straining an LC about an RBC was estimated as Ka, in which K ∼ 10 pN (a Frank elastic constant of the LC) and a ∼10 μm (size of a cell), corresponding to ∼104kT, which is a sufficiently large energy for deforming a cell membrane.30 This simple scaling argument leads to the prediction that the magnitudes of the elastic energies associated with deformation of the LC and RBC are comparable, and thus that RBCs dispersed in LCs would exhibit shape-responses that reflect an interplay of the mechanical properties of both the LC and RBC. Thereby we hypothesize that LCs enable rapid readouts of the mechanical properties of cells, for instance, a simple experiment of dispersing a few μl of blood in LCs can be used to understand the health status of over a thousand cells within a few minutes.30,31
Physiological mechanisms of chickens' response to HS or to any ‘cure' employed to fight HS are far from being understood. Such studies require controlled and careful broiler studies, spanning weeks, and more often than not, the blood (for elaborate examination of genes or the biomarkers) or even sacrificing the chickens. Steps are being taken towards non-invasive examination of HS, such as using feather HSP70 (a specific HS protein).21 To help with this, the authors ideate that the wealth of existing information, and more easily obtainable information, could be put to good use by training ML algorithms to aid in rapid identification of HS, HS biomarkers, HS susceptibility of various chicken subspecies, effectiveness of HS treatments, and so on.
The motivation for this work lies in our initial observation that there was a dramatic difference in the extent of the strain of RBCs of modern-day broiler chickens and their jungle fowl ancestors, as presented in Fig. 2. The LC we use is disodium cromoglycate, whose disc-shaped molecular structure is shown in Fig. 2A.30,32,33 The self-assembly of the disc-like DSCG molecules into rod-like stacks is depicted in Fig. 2B.1,30,32,34,35Fig. 2C depicts the ailments experienced by chickens experiencing heat stress. Consistent with past observations, we see in Fig. 2D that the HSP70 expression is distinct between two poultry species, namely, jungle fowl and the Cobb 700 strain (strain D).10,21,23 Our RT-qPCR studies using the 2−ΔΔCT method reveal that HSP70 is overexpressed in the Cobb 700 strain whereas the jungle fowl expresses HSP70 at normal levels when the cells are subjected to HS (maintained at 45 °C for 2 h) as shown in Fig. 2D. Briefly, the 2−ΔΔCT method is a simple formula used in order to calculate the relative fold HSP70 expression of samples when performing qPCR. The key takeaway from Fig. 2D is that the HSP70 expression is significantly higher in the Cobb 700 strain in comparison with the jungle fowl under heat stress conditions. Our preliminary observations reveal that transfer of chicken blood cells into LCs formed by DSCG leads to qualitative changes in the shapes of the RBCs as presented in Fig. 2E and F. Chicken blood cells that assume ellipsoidal shapes under stress-free conditions in a buffer solution are strained to spindle-like morphologies with pointed tips when placed within an LC (Fig. 2E and F). Fig. 2E comprises RBCs extracted from a commercial boiler strain (Cobb 700) while Fig. 2F shows RBCs extracted from south-east jungle fowl. The micrographs of Fig. 2E and F are feature rich and reveal several key differences. For instance, the extent of stretching of RBCs, the orientation of RBCs and the texture of the LC around the RBCs are all distinct in the two micrographs. These image features are a direct result of the sharing of the strain between the RBCs and the LCs (as depicted in Fig. 1).
Motivated by the scaling arguments, we employ CNNs in this study to classify the state of health of the cells. CNNs have emerged as an ideal machine learning architecture for classification of images. Image classification through CNNs works by identifying and separating critical information (features) in the image using nonlinear convolutional operation and finding an optimum hyper plane/threshold for classification (Fig. 3A). Since the advent of architectures such as AlexNet36 and VGGNet,37 classification of images has become faster and more accurate. CNNs consist of convolutional layers that perform a mathematical convolution operation on the incoming image using a small filter (also called ‘kernel’) of size such as 3 × 3 pixels (shown in green in Fig. 3B). CNNs learn by optimizing the values of the filter which results in correct identification of the images. Sets of images with known labels (also called ‘classes’) are fed through the CNNs repeatedly for learning until a good enough accuracy is achieved. Through convolutional learning, CNNs have shown to be able to detect edges, shapes, and other, sometimes imperceptible, features of an image which enables them to perform the classification. Fig. 3B shows progression of data through a typical CNN, composed of convolutional layers, each followed by a max-pooling layer (that reduces the 2D-image size) ending in a single output (for binary classifications) that denotes the probability of the data belonging to one out of the two classes. In this study, we train the CNN algorithm on healthy human and chicken cells, heat-stressed chicken cells and also crosslinked human cells which simulate unhealthy RBCs (glutaraldehyde was used to crosslink the RBCs and stiffen them). Crosslinking of cells with glutaraldehyde has been used as a mimic for diseased cells, for instance for cells inflicted with malaria, wherein availability of real samples might be challenging.38 Therefore, we started training the CNN model with glutaraldehyde crosslinked data for mimicking unhealthy cells for scenarios where getting actual sample data might be challenging. In this particular study, we have plenty of actual sample data of the unhealthy cells and therefore the crosslinked data set provides additional information for the algorithm to classify cells as unhealthy. Optical micrographs of healthy RBCs in DSCG and of crosslinked and heat-stressed RBCs in DSCG were used to train a simple convolutional neural network (CNN).39 Eventually, chicken RBCs expressing HSP70 were dispersed in LCs to test our hypothesis and confirm whether HS could be detected through visual observation as well as through CNN classification.
Since RBCs naturally strain when put in a DSCG solution, strained cell samples were obtained by adding 2 μL of dispersed RBCs to 60 μL of DSCG solution and gently swirled. To prevent straining in DSCG and obtain crosslinked cell samples, glutaraldehyde was used to crosslink and stiffen the RBCs. A stock solution of 5% v/v of glutaraldehyde in water was used; 5 μL RBCs were slowly pipetted into 0.2 μL of this stock to effect crosslinking. The final glutaraldehyde concentration in the cells was chosen to be around 0.2% to make sure that the individual cells are fixated and do not form aggregates. The solution was slowly mixed on a shaker for an hour to allow glutaraldehyde to completely crosslink. About 2 μL of crosslinked RBCs were then added to 60 μL of DSCG solution and gently swirled. HSP70 RBCs were collected from 21 day old broiler chickens that were exposed to acute heat stress (35 °C for 2 hours). Whole blood was collected into EDTA coated tubes and the RBCs were isolated from the whole blood by centrifugation and washing with PBS three times.
For imaging, RBC samples were transferred (post swirling) to microscope slides. Micrographs were obtained using an Olympus BX41 optical microscope fitted with a 40× objective lens. Polarized and brightfield micrograph images were captured in the presence and absence of a polarizer, respectively.
Collected images were at least 1800 px in height and either 3:2 or 4:3 in aspect ratio (width to height ratio), and were saved in jpeg, bmp, tiff, or raw format. All images were RGB (containing information in red, green, and blue channels). Before building the dataset, all images were cropped and resized to the same size and converted to jpeg format. A square section from the center of the image was selected. This was done to (1) discard the sides which sometimes contained portion of the microscope slide or stage outside of the actual sample and (2) standardize the aspect ratio to 1:1 from 3:2 or 4:3. The resulting square image was scaled down in size to exactly 1000 px wide and 1000 px tall using the bicubic resizing algorithm in the PIL.
Each image was assigned labels (such as species, chemicals used, magnification, etc.) that were stored in a tabular form within excel files. These labels were used to programmatically find images matching a certain criterion. For instance, polarized images of crosslinked chicken RBCs were found by logical querying of these labels: polarized is “True”, chemicals used contain “glutaraldehyde”, species is “chicken”, and cell type is “RBC”.
Ten distinct subsets of the dataset (called classes) were built using this process and are summarized in Table 1. Each class was shuffled and split into training (train), validation (val), and test (test) sets in a 70:15:15 ratio, respectively.
Class or dataset name | Train | Val | Test |
---|---|---|---|
Brightfield micrograph | |||
A. Crosslinked chicken | 112 | 24 | 24 |
B. Healthy chicken | 585 | 125 | 125 |
C. Crosslinked human | 350 | 75 | 75 |
D. Healthy human | 350 | 75 | 75 |
E. HS chicken | 489 | 105 | 105 |
Polarized | |||
---|---|---|---|
A. Crosslinked chicken | 798 | 171 | 171 |
B. Healthy chicken | 1264 | 271 | 271 |
C. Crosslinked human | 390 | 83 | 83 |
D. Healthy human | 350 | 75 | 75 |
E. HS chicken | 423 | 90 | 90 |
A simple binary classifier CNN, meaning that it classified images into one of two input classes, was built. The model is schematically shown in Fig. 3B. The network begins with an input layer of 500 × 500 pixel images, which undergo convolution operations using a filter. These convolutional layers, each followed by a ReLU (rectified linear unit) activation function, extract features such as edges and textures from the images. Max-pooling layers are employed after each convolutional layer to downsample the feature maps, reducing their spatial dimensions while retaining essential information. The final layers involve flattening the pooled feature maps into a single vector, which is passed through a dense layer with a sigmoid activation function (maps the input to a value between 0 and 1) to classify the images at a single neuron. This neuron yielded the probability that a particular image belonged to one of the two classes. This architecture efficiently identified distinguishing features between healthy and stressed cells. The model's parameters are shown in Table 2. The model was compiled using the optimizer Adam and the loss function sparse_categorical_crossentropy. The performance of the model was measured using accuracy as the metric.
Layer name (type) | Output shape | Params |
---|---|---|
conv2d_1 (conv2D) | (None, 498, 498, 16) | 448 |
maxPooling-1 (maxpooling2D) | (None, 249, 249, 16) | 0 |
conv2d_2 (conv2D) | (None, 247, 247, 32) | 4640 |
maxPooling-2 (maxpooling2D) | (None, 123, 123, 32) | 0 |
conv2d-3 (conv2D) | (None, 121, 121, 64) | 18496 |
maxPooling-3 (maxpooling2D) | (None, 60, 60, 64) | 0 |
Flatten_1 (flatten) | (None, 230400) | 0 |
Dense_1 (dense) | (None, 1) | 230401 |
Total params: 253985 | ||
Trainable params: 253985 |
A receiver operating characteristic (ROC) curve was plotted for each experiment for a range of decision thresholds (101 linearly spaced threshold values from 0 to 1) for predictions made on validation sets. The best threshold was selected to be the one that was closest to the point (0, 1) in the ROC plot. The confusion matrix corresponding to the best threshold has been reported. A confusion matrix was also calculated for the predictions made on the test set using the best threshold (that was evaluated on the validation set) and has been reported.
Clearly, polarized images posed a greater challenge to CNNs. Polarized images have richer information concerning the cellular response in an LC field. The color gradients around a cell are representative of the LC director being disrupted by the presence, the shape, and the physical properties of the cell. These color gradients, which are absent in unpolarized images, may contain information about the cell’s physical properties, its biochemical signatures, and hence its health, which are not readily perceptible to human visual analysis. Longer range color gradients visible in the polarized images, which are not centered on the cell and not perturbed by the presence of the cells or foreign objects (like dust or a piece of fiber), likely don't contain information linked directly to the cell's response. Because the data labels for crosslinked or healthy samples were derived solely from the fact whether glutaraldehyde was used or not, the CNN's accuracy denoted the level at which it could successfully map the complex information in the images (especially the polarized ones) to the effects of glutaraldehyde on the cells. In this light, the accuracy of 97% was remarkable, given the complexities in the images.
Individual ‘accuracy vs. epoch curves’ revealed that accuracy improved faster for unpolarized images when compared to polarized images. A training accuracy of 99% was achieved, as early as, at the end of epoch 6 for AB, epoch 13 for CD, epoch 49 for EG, and epoch 41 for HI. Thus, learning was more difficult on polarized images. Additionally, a training duration of 100 epochs seemed to be sufficient for achieving the highest possible accuracy (indicated by the plateau in the curve), except in the case of EG, where just the training (not validation) accuracy looked like it could improve further if the training wasn't stopped. Neither the accuracy nor the loss curves showed any quintessential signs of over- or under-fitting. The imbalance and the diversity in the sizes of the classes did not affect the learning in any noticeable way as observed here, in turn, demonstrating the applicability of this approach in a testing environment where it will not be feasible to collect much data or collect data in a balanced way.
The ROC curves helped determine a binary decision threshold (probability above which the trained model decides that a particular image sample belongs to a particular class) that yielded the highest possible true positive rate at the lowest possible false positive rate. In simpler terms, the best threshold is one for which a point on the ROC curve is closest to the point (0, 1). There was some variation in the appearance of the ROC curves, with the curvature increasing as training became more difficult, but most data points were still nicely bunched up towards the upper left corner and edges. The confusion matrices corresponding to this ‘best threshold' for both the validation and test datasets revealed nothing out of the ordinary. Predictions were near perfect on the unpolarized images and at least 97% accurate on the polarized images, concurrent with the accuracies calculated by the model at the end of the training. A test accuracy of 100, 98.7, 97.7, and 98.1% was obtained for the experiments AB, CD, EG, and HI, respectively.
Here too, polarized images posed a greater challenge to the CNN algorithm. As described before, the color gradients around a cell, present in polarized images and absent in unpolarized images, are representative of the LC director being disrupted by the presence, the shape, and the physical properties of the healthy/HS cell and representative of the cell's physical properties, its biochemical signatures (like the production of HSP70), and health. Because the classification labels were derived solely from the fact whether the cell was heat-stressed or not, the CNN's performance denoted the level at which it could successfully map the complex information present in the images (especially the polarized ones) to the effects of HS. In this light, the accuracy of 92/94% for validation/test sets and higher was remarkable, given the myriad of humanly indistinguishable information present in the images: these complexities presumably being greater than the effects of glutaraldehyde alone.
Similar to the glutaraldehyde crosslinked system, individual ‘accuracy vs. epoch curves’ for healthy/HS cells revealed that accuracy improved faster for unpolarized images when compared to polarized images indicating that learning was more difficult on polarized images. A training duration of 100 epochs seemed to be sufficient for achieving the highest possible accuracy (indicated by the plateau in the curves). Neither the accuracy nor the loss curves, especially in conjunction with the ROC curves and the confusion matrices, showed any quintessential signs of over- or under-fitting. The dataset was still imbalanced and small for the healthy/HS pair but did not affect the learning in any noticeable way as observed here, in turn, demonstrating the applicability of this approach in a testing environment where it will not be feasible to collect much data or collect data in a balanced way.
The visual differences between healthy and crosslinked cells were more prominent than the differences between healthy and HS cells (Fig. 4). We characterized the shapes of the healthy and heat-stressed chicken RBCs by quantifying the cell major (rx) and minor axes (ry). A higher aspect ratio (rx/ry) indicates a more significant strain. Chicken RBCs, prior to straining, had an average aspect ratio of 1.5; upon applying mechanical strain, the aspect ratio increased to an average of 2.12 ± 0.64 for healthy cells and 1.85 ± 0.32 for heat-stressed cells. By comparing the variations in aspect ratio values of healthy and heat-stressed chicken RBCs, we observed that the healthy cells have a slightly higher strain than the heat-stressed cells; however, the datasets are not statistically different. Critically, the ML algorithm is able to identify the HS chicken cells with excellent accuracy (100% for unpolarized) even though the data sets are not statistically different.
The 250k parameter model is easily trainable on GPU equipped personal computers using just a few hundred study-specific micrographs. The trained model is lightweight enough to fit in a sensor computer and fast enough to virtually instantly perform the classification on every new ‘photograph’. In the context of the current rise of foundation vision models, whose fine-tuning and training are prohibitively expensive and possible only at very large research laboratories, our approach is geared towards empowering labs and individuals all around the world to construct and train their own ML models on general or niche tasks related to imaging and/or sensing.
In experiments, it was found that crosslinking chicken and human RBCs using glutaraldehyde in order to simulate a diseased cell was an adequate strategy for planning, building, training, and evaluating valid ML models ahead of collecting actual training data. In our case, no model-tweaking was found to be necessary while going from the simulated to the real heat-stressed cells. Because biological data could often be available in less quantities and later in the study, we believe that our simulation example might come in handy for researchers looking to work on their ML models while waiting, or those looking to simply augment their data in an appropriate manner. The successful implementation of the CNN algorithm to rapidly identify HS also opens up new questions related to our line of investigation including: i) are final static responses (micrographs) enough to develop a ML classifier to discriminate (specify) different RBC strains? ii) How do the dynamic conditions (response of cells as a function of time under HS) contribute to developing efficient and specific classifiers? iii) Can static images under TN conditions predict the expression levels of HSP70 under HS conditions? To answer these questions, in future work we will focus on collecting and maintaining a large and diverse image database of strained and unstrained samples and develop dedicated deeper CNNs that are computationally efficient through the use of ResNets. Additionally, CNN frameworks that process multi-channel image datasets, that include spectral channels, and dynamic images, beyond conventional 3 channel RGB images will be developed.
Optical microscopy images used for training and validating the models can be provided upon request to the corresponding author.
This journal is © The Royal Society of Chemistry 2024 |