Nagavendra Kommineni‡
a,
Navneet Kaur‡b,
G. S. Sainaga Jyothi Vaskuri‡c,
Nikesh Narang‡d,
Mohammad Mirazul Islamd,
Pratap Basima,
Mark Woollame,
Lalitkumar K. Voraf,
Mangilal Agarwaleg,
Jonathan F. Lovell
h and
Rajendra Prasad Meena*i
aThermo Fisher Scientific Inc., Cincinnati, OH 45237, USA
bDepartment of Chemical and Biomedical Engineering, FAMU-FSU College of Engineering, National High Magnetic Field Laboratory, Tallahassee, FL 32310, USA
cDepartment of Pharmaceutical Sciences, University of Tennessee Health Science Center (UTHSC), Memphis, TN 38163, USA
dDepartment of Ophthalmology and Visual Sciences, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA
eIntegrated Nanosystems Development Institute, Indiana University Indianapolis, Indianapolis, Indiana 46202, USA
fSchool of Pharmacy, Queen's University Belfast, 97 Lisburn Road, BT9 7BL, UK
gDepartment of Biomedical Engineering & Informatics, Indiana University Indianapolis, Indianapolis, Indiana 46202, USA
hDepartment of Biomedical Engineering, University at Buffalo, State University of New York, NY 14260, USA
iSchool of Biochemical Engineering, Indian Institute of Technology (BHU) Varanasi, Uttar Pradesh 221005, India. E-mail: rajendra.bce@iitbhu.ac.in
First published on 18th August 2025
Early-stage cancer diagnosis is considered a grand challenge, and even though advanced analytical assays have been established through molecular biology techniques, there are still clinical limitations. For example, low concentration of target biomarkers at early stages of cancer, background values from the healthy cells, individual variation, and factors like DNA mutations, remain the limiting factor in early cancer detection. Volatile organic compound (VOC) biomarkers in exhaled breath are produced during cancer cell metabolism, and therefore may present a promising way to diagnose cancer at the early stage since they can be detected both rapidly and non-invasively. However, there are challenges in VOC analysis, especially regarding standardization of sampling, necessity for preconcentration, and cancer-specificity of biomarkers. There are also additional challenges, including the design and development of highly sensitive miniaturized sensors that detect VOC biomarkers at low concentrations with minimum cost efforts. In the present article, we have reviewed the potential impact of VOCs in cancer detection in the context of traditional methods such as liquid biopsies, which are typically employed at advanced stages of cancer progression. Described ultrasensitive technologies such as gas chromatography-mass spectrometry (GC-MS) and electronic noses using a variety of nanomaterials have been considered as technologies for breath-based early cancer detection.
Cancer is one of the major causes for deaths and is projected to increase to 15.3 million deaths per year by 2040.6 Amongst cancers, lung, colon, prostate and breast cancer have been the most prominent ones.7 Early cancer detection is crucial for identifying cancer prior to clinical manifestation, enabling therapeutic intervention well before metastatic progression, which could significantly improve patient outcomes and quality of life. Traditional cancer diagnoses majorly rely on procedures and techniques like biopsies, radioimmunoassays, imaging techniques (positron emission tomography-computed tomography scan CT PET scan), and endoscopic examinations (colonoscopy, mammography, gastroscopy, etc.). However, method sensitivity is limited to the size of the tumor, and therefore these tests generally do not facilitate early detection.8–12 Several efforts are in progress for detecting cancer-specific biomarkers including cell-free nucleic acids, circulating tumor cells, proteins, and lipids via analysis of blood or other biofluids.12–14 However, the low detection values, background values from the healthy cells, individual variation, and factors like DNA mutations from non-malignant clonal hematopoiesis of indeterminate potential, remain the limiting factor in early cancer detection.15 For instance, interleukin-6 (IL-6) is a biomarker associated with cancer progression, but it is also elevated in conditions like lung fibrosis and chronic kidney disease, complicating its use as a cancer-specific marker.16–18 Similarly, other pro-inflammatory markers (matrix metalloproteinases/interleukins/cytokines, ratios of neutrophils/lymphocytes)19,20 are not cancer-specific and therefore may lead to ambiguous signals. In contrast, VOC analysis may be a faster approach for detecting cancers where the specific biomarkers are released through exhaled breath, sweat, or urine, thus providing an accessible means for monitoring metabolic changes associated with cancer.4,21 The emerging significance of VOCs in early cancer can be understood through recent addition of VOCs in clinical trials involving early ovarian cancer detection (NCT06613230), non-small cell lung cancer detection (NCT06707519) and improving the detection of early lung cancer in a diverse population (NCT06628102).
The presence of cancer is often accompanied by hypoxia, hyperproliferation of cells, change in microenvironments, and excessive inflammatory activities, resulting in the release of VOCs both locally and systemically.21 Consequently, the VOC profile is shifted indicating the metabolic and genetic changes that may not be present in healthy individuals. Thus, sensing spectral dysregulation amongst the markers that would either not be aggressively released or not released in detectable limits in the healthy subjects, is the translational approach for VOC sensing. In comparison to traditional methods, including low-dose CT (lung cancer), mammography (breast cancer), gastroscopy (gastric cancer), colonoscopy (colorectal cancer), CA125 (cancer antigen 125) and transvaginal ultrasound (ovarian cancer), VOC analysis demonstrates comparable or even superior sensitivity, making them a promising option for cancer screening in the future.22
Various techniques has been employed for VOC biomarker detection (Fig. 1) including gas chromatography-mass spectrometry (GC-MS), gas chromatography-ion mobility spectrometry (GC-IMS), proton transfer reaction-mass spectrometry (PTR-MS), selected ion flow-tube mass spectrometry (SIFT-MS), and electronic-noses (e-noses).4,22–26 Mass spectrometry (MS) techniques are the most useful when identification of unknown VOCs is of ultimate importance, and can be broken down into offline (GC-MS, GC-IMS) and online (PTR-MS, SIFT-MS) methodologies. Online methodologies infuse sample VOCs directly into the MS, where analytes are ionized at ambient conditions, and therefore are relatively more rapid. Offline techniques on the other hand have the highest sensitivity/resolution, and therefore are the most well situated for biomarker discovery research although they are time-consuming. Even though MS techniques are well-suited for VOC identification, these instruments tend to occupy a relatively large footprint and require trained personnel to run assays. Therefore, there is a significant interest in designing and developing miniaturized gas sensors (e-noses) for rapid detection of biomarkers at a point-of-care. Further comparative breakdown of detection efficiencies is listed in Table 127–38 and several common VOCs (acetone, isoprene, hexanal, limonene, 2-pentanone, acetaldehyde, 2-butanone, and ethylbenzene) reported in literature have been studied for quantification of concentration levels in breath using these techniques.39–48
Technique | Detection limit | Key ref. |
---|---|---|
GC-MS (gas chromatography-mass spectrometry) | 10–90 ppt | 27 and 28 |
GC-IMS (with ion mobility) | 50 ppt–7 ppb | 29 and 30 |
PTR-MS (proton transfer reaction MS) | 60 ppt–3ppb | 31 and 32 |
SIFT-MS (selected ion flow tube MS) | 500 ppt–7 ppb | 33 and 34 |
Electronic nose (e-Nose) | 100 ppb–10 ppm | 35 and 36 |
Colorimetric arrays | 20 ppb–1 ppm | 37 and 38 |
Among VOCs and their respective functionalities, aldehydes have been most recognized amongst candidate biomarkers, showing significantly elevated levels in cancer patients compared to healthy individuals. Their relatively high solubility in blood allows for rapid detection through breath analysis. Aldehyde production in cancer cells is linked to cytochrome P450 through the lipid-oxidation of omega-3 and -6 polyunsaturated fatty acids. Cytochrome P450 is overexpressed in some cancers and is one of the reasons for higher release of aldehydes as VOCs in cancer patients. Moreover, endogenous aldehyde production can also be linked to a higher amount of saturated lipids in cancer cell membranes. Thus, aldehyde production is an indicator of overexpression of cytochrome P450, membrane lipid composition, and increased oxidative stress, which is often accompanied in cancer cells.49–52 In addition to aldehydes which are prevalent in many types of cancer, studies have indicated alkanes and esters as VOCs produced in breast cancer, ketones in breast, colon and stomach cancers, alcohols in lung, colon, prostrate and stomach cancers, and ethers in lung cancer, further expanding the potential of VOCs as biomarkers for early cancer detection.53
Apart from endogenous VOCs, recent studies have also explored the potential of synthetic probes which release VOC reporters in exhaled breath after interacting with a designated cancer target. For example, D5-ethyl-β-D-glucuronide (synthetic probe) was previously designed to be intravenously administered and readily metabolized to D5-ethanol (VOC reporter) by β-glucuronidase in tumor-bearing mice. β-Glucuronidase is extracellularly accumulated in tumor microenvironments in comparison to healthy cells where it is found intracellularly. The research strategy included capitalizing on the hydrophilic nature of D5-ethyl-β-D-glucuronide, where D5-ethanol release is an indicator of β-glucuronidase and tumor activity. This innovative approach has shown promise in distinguishing between healthy and cancer-bearing subjects and has been extended to lung cancer models and human clinical trials.54,55
As research in VOCs continues to advance, it is essential to standardize collection methods, data analysis, and biomarker validation to facilitate the translation of VOC technology into clinical practice. The potential for VOC-based cancer screening and monitoring represents an exciting frontier in early cancer detection and personalized medicine. In this review, we explain the contemporary techniques for cancer detection as the founding diagnostic stage, emphasize mechanistic insights, and address recent progress and interests in VOCs as potential biomarkers for early cancer detection. We also expand on their characteristics, modes of detection, practical diagnostic examples, and discuss prospects of the methodology and technology.
The enrichment and retrieval of CTCs from peripheral blood samples represent a major technological challenge for successfully implementing CTCs as a liquid biopsy approach.60 Most notably, there is substantial diversity in the efficiency and specificity of CTC isolation techniques. Most CTC isolation platforms are still hampered by a low number of detected CTCs in the blood. Moreover, this is a complex procedure with high cost that often leads to low purity of the retrieved CTCs. Numerous techniques for isolation have been developed recently, such as negative selection by the depletion of all unwanted blood cells, positive selection using potential cancer markers, and strategies based on the unique physical characteristics of cancer cells (such as size or deformability). Enrichment of CTCs based on immunogenicity is a commonly employed method for separating CTCs. Certain biomarkers that are expressed on the cell surface are used to trap the cells, which are then attached to a magnetic material or device surface. Since CTCs express a range of surface markers, there isn't a single, universal CTC antigen.61 There are numerous ways to positively enrich CTCs. AdnaTest (QIAGEN) employs cancer-specific antibody-coated beads, and the real-time polymerase chain reaction is used to ascertain the expression patterns on the cells.62 Cells containing magnetic nanoparticles affixed to antibodies are gathered by magnetic-activated cell sorting (MACS). A robotically controlled magnetic rod and antibody-coated magnetic beads are used in MagSweeper (Stanford University, previously developed by Illumina), an immunomagnetic enrichment method utilized to separate CTCs.63 The only CTC isolation technique with FDA approval, CellSearch® (Menarini Silicon Biosystems), uses epithelial cell adhesion molecule (EpCAM) which is widely utilized for positive selection and the gold standard as a cancer marker. However, EpCAM is often lost during processes such as the epithelial-to-mesenchymal transition (EMT) that take place when cancer cells intravasate into the bloodstream.64
ctDNA has gained a lot of attention for its practical abilities to efficiently detect malignancies. Recently, a study found that using ctDNA presents a promising method for early cancer identification.66 By refining sequencing methods to identify ctDNA without requiring prior knowledge of tumor mutations, this approach advances the field. Approximately 60–70% of the 200 patients with stage 1 and stage 2 tumors (colon, breast, lung, and ovarian cancers) were recruited for this study and assessed for ctDNA. In asymptomatic individuals, this strategy may have a significant impact on early cancer identification and lead to better patient outcomes. One of the main areas of continuing study is the capacity to use ctDNA to detect tiny, asymptomatic cancers. Empirical evidence suggests that it may not be possible to identify very small tumors, such as those with a diameter of less than 5 mm. Because of the low quantity of ctDNA in the blood, tumors of this size are generally challenging to detect. For instance, 1 mm-diameter tumors are linked to extremely low ctDNA concentrations that may be undetectable with existing techniques. The likelihood of identifying ctDNA increases with tumor size, up to about 5 mm in diameter, although the low quantity of circulating tumor DNA makes detection difficult even in these cases.67 The sensitivity and specificity of the current approach have limits, notwithstanding its potential. Because the technique depends on finding mutations that are found in a very small percentage of the ctDNA, its sensitivity may be hampered. Additionally, sampling errors may make it difficult to detect extremely small tumors. Furthermore, the existence of somatic mutations unrelated to cancer and technical noise may compromise the excellent specificity reported in previous studies. For example, one of these previous studies reported a sensitivity of 62% and a specificity of >99% in detecting early-stage cancers.68 Overall, ctDNA exhibits potential, although its current use in identifying very early-stage or asymptomatic malignancies may be restricted. Further developments and integrated diagnostic methods could improve its efficacy in the early diagnosis of cancer.
In cancer, processes like overexpression of cytochrome P450 enzymes, angiogenesis, oxidative stress, and altered glycolysis lead to the production of specific biomarkers, with aldehydes, alkanes, and alcohols being key VOC classes associated with the disease.49–51 Aldehydes are produced in many types of cancers due to the above mentioned metabolic processes. As for other functional groups, their prevalence in different types of cancers already been discussed in the introduction (alkanes and esters: breast cancer, ketones: breast, colon and stomach cancers, alcohols: lung, colon, prostrate and stomach cancers, and ethers: lung cancer). Ethane and pentane, which are saturated alkanes, are generated as byproducts of lipid peroxidation and are widely recognized as potential biomarkers for a host of different medical conditions. Lipid peroxidation is an indicator of several underlying conditions like inflammation, aging, and cancer. Alkanes have very low solubility in the blood, thus preventing further metabolism and ensuring they appear in breath within minutes, making them important VOC markers.73,74
Aldehyde production can also be altered due to the presence of ALDHs (aldehyde dehydrogenase) and ADHs (alcohol dehydrogenase), as ALDHs oxidize aldehydes to acids.75,76 It is reported that in ovarian cancer cell lines, the downregulation of 10-formyltetrahydrofolate dehydrogenase leads to increased aldehyde levels.61,77 Thus, based on the enzymatic conversion, the levels of volatile aldehydes can be increased. For example, the ADHs that convert alcohol to aldehydes is suggested to play a role in metastasis and ALDH overexpression in lung cancer.61,62 Moreover, some aldehydes are consumed by cancer cells (as well as other surrounding cells) which may alter the overall concentration level of aldehydes. Nonetheless, C ≥ 6 aldehydes (hexanal, heptanal, octanal, nonanal) are more stable and are associated with several cancers.4 Notably, higher amounts of nonanal have also been associated with apoptosis.75
Ketone production is hypothesized to occur through beta-oxidation of long chain fatty acids and branched chain fatty acids in the mitochondria. ADHs also catalyze the conversion of alcohols to ketones with a preference for primary alcohols. In liver cancer, the ADHs activity is highly elevated. Therefore in hepatocellular cancer, the ketone production is higher due to the long-chain fatty acid metabolism.78 Acetone, the simplest ketone, is produced through two main pathways: decarboxylation of acetoacetate and dehydrogenation/ADH metabolism of isopropanol in the body. Despite its distinctive odor, acetone's utility as a biomarker is limited due to its production in various physiological processes. On the contrary, other ketones such as 2-nonanone, 2-butanone, 3-heptanone, 4-heptanone, and cyclohexanone have shown to be potential biomarkers.78–81
Alcohols are also important VOCs owing to their diffusion into the blood and appearance in breath from the gastrointestinal tract. Although numerous biomarkers have been reported for various types of cancer, it is important to note that a single marker is generally insufficient for a definitive diagnosis. Instead, a comprehensive profile or “map” of VOCs is essential for accurate early cancer detection. This approach recognizes the complexity of cancer biology and the potential for overlapping biomarkers across different cancer types and other physiological conditions. Furthermore, the interpretation of VOC biomarkers is complicated by their varying blood-breath-fat partition coefficients. These coefficients describe how VOCs distribute between different compartments in the body, affecting their concentrations in various biological samples such as breath, blood, or urine. Currently, corrective measures to account for individual variations in VOC production, metabolism, and distribution are not well-defined. This presents a significant challenge in standardizing VOC-based diagnostic approaches across diverse patient populations.82
Aside from lung cancer, many studies have been conducted to discover non-invasive biomarkers for breast cancer, as the current methods for screening/diagnosis often lead to overdiagnosis and overtreatment. Previous work for breast cancer has been done in murine models, with urinary volatile ketones and terpenes/terpenoids implicated as significant candidates.87 Regarding previous studies focusing on breath-based biomarker discovery, Phillips et al. was one of the first research groups to profile breast cancer. Biomarker candidates were tentatively identified as ethylidenecyclopropane, octamethylcyclotetrasiloxane, limonene, 1,2,4,5-tetramethylbenzene, tridecane, 2,7,10-trimethyldodecane, tetradecane, longifolene, 2-ethyl-1-octanol, and 2,6-di-tert-butylbenzoquinone. The sample size for this study was relatively large (n > 250), and the diagnostic results were relatively modest with sensitivity = 75.3% and specificity = 84.8%.88 Beyond this study, an array of volatile aldehydes including hexanal, heptanal, octanal, and nonanal have also been implicated as potentially clinically relevant markers.88,89 Urinary biomarkers have also been studied, as Kure et al. previously built a predictive model based on just two VOCs (2-butanone and 2-propanol), which could distinguish breast cancer with >90% accuracy.90 Beyond these compounds, another study conducted by Silva et al. identified other biomarkers including but not limited to p-cymene, acetic acid, dimethyl sulfide, and 4-heptanone.91
Perhaps the most important cancer type to improve screening technologies for, prostate cancer has also been studied for VOC biomarkers with a large emphasis on urine given its local proximity to the tumor.92 Current screening methods (i.e., the prostate-specific antigen (PSA) test) have limited accuracy in prostate cancer detection as an array of factors not related to cancer can elevate PSA levels. Limited studies have been undertaken to establish a breath-based profile of prostate cancer biomarkers using technologies beyond electronic nose. Nonetheless, there have been multiple studies aiming to identify prostate cancer biomarkers using human urine samples. In the first study of its kind, in 2015 Khalid et al. identified a biosignature of just four VOCs in urine, that when coupled to the results of the PSA test, could distinguish prostate cancer with approximately 70% accuracy (dihyrdromyrcenol, pentanal, 3-octanone, and 2-octanone).93 Lima et al. undertook a similar analysis, which revealed that VOCs (volatile carbonyl compounds (VCCs) in particular) were dysregulated by prostate cancer. A biomarker panel of just 6 analytes (hexanal, 2,5-dimethylbenzaldehyde, 4-methylhexan-3-one, dihydroedulan IA, methylglyoxal, and 3-phenylpropionaldehyde) could distinguish prostate cancer with a sensitivity of 89% and specificity of 83%.94 Finally, a previous study sought to identify VOC biomarkers that could not only distinguish prostate cancer from healthy controls, but also from patients diagnosed with bladder cancer. These results showed that there were 7 VOCs associated with prostate cancer (toluene, phenol, acetic acid, 2-ethyl-1-hexanol, dimethyl disulfide, among others). Using biomarker panels, prostate cancer could be classified with area under the curve (AUC) of 0.83–0.97. Furthermore, the two urinary cancers could be stratified from one another with AUC 0.73–1.0.95
Colorectal cancer, although not as abundantly studied as some of the other cancer types mentioned above, is also another important carcinoma in which VOCs could aid in developing accurate screening technologies. Research has focused on identifying biomarkers in fecal samples, having identified an array of potential biomarkers including 2-propanol, 2-hexanone, p-xylene, and other compounds.96 Although feces may be the ideal sample of interest for this malignancy, there is also research focusing on urine and breath biomarkers. For example, Amal et al. previously demonstrated that breath-based biomarkers were statistically significantly dysregulated in patients diagnosed with colon cancer (p-value <0.017). These compounds included ethanol, acetone, ethyl acetate, and 4-methyloctane.97 Another published study identified an array of other breath-based compounds beyond these VOCs to be potentially useful in colorectal cancer diagnosis.98 Some of these VOCs incorporated 2-ethyl-1-hexanol, acetic acid, decanal, benzaldehyde and others including saturated hydrocarbons. VOC biomarkers have also been elucidated in urine samples, with Arasaradnam et al. identifying a biosignature that could distinguish colorectal cancer with 88% sensitivity and 60% specificity. These compounds consisted of acetaldehyde, acetone, 2-pentanone, 4-heptanone, allyl isothiocyanate, and other VOCs.99 An overview of different VOC biomarker candidates associated with some of the most studied cancer types is illustrated in Fig. 4. Given exhaled breath and urine are the most common sample types for viable VOC analysis, compounds are listed for these matrices separately.
![]() | ||
Fig. 4 Summary of VOC biomarkers in breath and urine for different cancer types that have been widely studied in the literature.83–85,88,89,91,93–99,102–106 |
As demonstrated through compiling results from some of the most widely cited studies, there is little to no agreement regarding the specific VOC biomarkers for different cancer types. For example, different biological sample types will have fundamental differences in VOC profiles and therefore are not directly comparable. For breath-based studies, given there is no gold standard method for sampling/collection, many different techniques are implemented across literature. It should also be noted that not all studies run pure analytical standards to confirm VOC identification, making biomarker comparison among literature difficult.100 Finally, it should also be noted that many studies report pilot data with relatively small sample sizes. In these studies, a degree of skepticism should be held especially when machine learning or pattern recognition is used to report diagnostic accuracies. In turn, all these factors make it difficult or unachievable to demonstrate VOCs are disease-specific through literature review and qualitative analysis. To combat these challenges, large and/or multi-center studies using comprehensive/standardized methods for collection and analysis should be undertaken on a host of different cancer types to quantitatively demonstrate VOC profiles are cancer specific. Some studies have taken this initiative99,101 with Nakhleh et al. identifying a subset of 13 VOCs that could distinguish 17 different medical conditions (including different types of cancer) with a clinically relevant accuracy of 86%.101
Breath analysis via miniaturized sensors is an emerging science that uses the distinct chemical profiles of volatile compounds in exhaled breath to potentially diagnose diseases, including cancer, in a non-invasive manner. The noninvasiveness, speed, and potential cost-effectiveness of this approach make it appealing. Previous research108,109 has demonstrated a novel method of cancer diagnosis utilizing cutting-edge sensor technology for breath analysis. The goal was to use an ultrasensitive, molecularly modified silicon nanowire field-effect transistor (SiNW FET) to analyze exhaled breath to detect gastric cancer. The trichloro-(phenethyl)-silane (TPS) layer on this sensor improves its capacity to identify VOCs correlated to gastric cancer (2-propenenitrile, furfural). The sensor can distinguish between common environmental VOCs found in breath samples and particular VOCs at incredibly low concentrations (down to 5 parts per billion) by combining components of lock-and-key and cross-reactive sensing. The sensor was tested in a clinical setting on breath samples from 107 participants, including healthy controls and those who had gastric cancer. Over 85% of the time, the sensor was able to accurately distinguish between conditions that were malignant and those that weren't. This accuracy held true even when controlling for gender and other confounding variables like tobacco use. The study's blind analysis demonstrated the sensor's potential as a trustworthy diagnostic tool and confirmed its efficacy. The fact that this technology is non-invasive is one of its main benefits. This breath analysis method provides a more convenient and less invasive option to traditional biopsy methods. Additionally, the diagnostic procedure is made simpler, portable, and cost-effective compared to more complex sensor arrays by using a single molecularly altered SiNW FET sensor.
In another study,109 SiNW FET was used for the identification and categorization of disease breath prints. Based on the VOCs found in exhaled breath, this technique can diagnose and differentiate between several illnesses, including asthma, chronic obstructive pulmonary disease (COPD), lung cancer, and gastric cancer. The sensors were tuned through a training procedure that linked the sensitivity and selectivity of the SiNW FET sensors to VOCs associated with specific diseases. The optimized sensors were then evaluated in a clinical setting using breath samples from 374 patients. The outcomes showed that, in most situations, the SiNW FETs could reliably identify and distinguish between disease states with over 80% accuracy. This technology provides a direct, non-invasive diagnostic tool without the need for needles, surgery, or radioactive materials, which gives it a substantial edge over older procedures. The study emphasizes how SiNW FETs can revolutionize diagnostic procedures. These sensors are ideal for widespread clinical and point-of-care application because they are very sensitive, react quickly to changes in VOC concentrations, and are reasonably priced to manufacture. Additionally, they provide the opportunity to track the advancement of the disease, which can be very helpful in treatment planning. To properly evaluate and improve this technique, more multicenter clinical studies with bigger sample sizes are necessary.
Adding to that, breast cancer is another major area for which early detection using VOC detection must be addressed. Breast cancer is still a major oncology concern because of its intricacy and its need for accurate, non-invasive diagnostic techniques. A unique method for identifying molecular subtypes of breast cancer based on VOCs found in exhaled breath has been made possible. This method seeks to overcome the drawbacks of conventional diagnostic techniques, including the high expense and technical challenges of biopsies and gene expression profiling. The breath samples collected from 276 female volunteers were divided into three groups (bearing benign conditions, ductal carcinoma in situ (DCIS), and malignant lesions) and analyzed using GC-MS and artificially intelligent nano sensor arrays. GC-MS revealed a strong association between the existence of cancer and 23 VOCs with notable variations in concentration across BC patients with different molecular subtypes. By utilizing artificial intelligence, the nanoarray was able to attain an astounding 83% accuracy rate in differentiating between molecular subtypes of breast cancer and carcinogenic versus non-cancerous cases. In cross-validation experiments, the method produced accuracies between 82% and 87%, sensitivities between 81% and 88%, and specificities between 76% and 96%. This volatilomic approach has the benefit of offering a non-invasive, economical, and time-efficient method for the diagnosis and subtype classification of breast cancer. Conventional techniques, such as gene expression profiling, can be impeded by sample degradation and the intricacy of post-translational modifications. They are also restricted by the requirement for large and superior tissue samples. By identifying VOC patterns connected to genetic and protein alterations linked to cancer, breath analysis provides a means of molecular subtype profiling. This technique can distinguish between several BC subtypes, including Triple Negative, Luminal A, Luminal B, and HER2+. It may also help with early identification, patient risk assessment, and the choice of targeted therapy.110
Colorectal cancer is a major global health concern, with forecasts of 2.2 million new cases and 1.1 million deaths by 2030.111,112 Fecal immunochemical assays are commonly used for non-invasive screening; however, their sensitivity and specificity might vary. This results in false positives and needless treatments, even though colonoscopy is the gold standard for diagnosis.113 Investigating the utilization of VOCs in fecal samples (which should be an ideal sample for studying colorectal cancers) as biomarkers for CRC detection is one way. There were eighty individuals in all, twenty-four of whom had adenomatous polyps, twenty-four of whom had adenocarcinomas, and thirty-two of whom had no lymphadenopathy. For individuals without cancer, feces were taken 48 hours prior to the colonoscopy; for those with colorectal cancer, they were taken 2–4 weeks after the procedure. Using magnetic graphene oxide as the extractant, magnetic headspace adsorptive extraction (Mag-HSAE) was used to isolate the VOCs. This was followed by thermal desorption-gas chromatography-mass spectrometry (TD-GC-MS). This technology provides a reliable approach for VOC identification in fecal samples by utilizing magnetic graphene oxide, which has a high capacity to extract aromatic chemicals at a low cost. Significant biomarkers for colorectal cancer (CRC) were found to include p-cresol, 3(4H)-dibenzofuranone, and 4a,9b-dihydro-8,9b-dimethyl-3(4H)-dibenzofuranone. Cancer samples had much higher concentrations of p-cresol, which had an area under the curve of 0.85, 83% sensitivity, and 82% specificity. Comparably, 3(4H)-dibenzofuranone,4a,9b-dihydro-8,9b-dimethyl-3(4H)-DBZ displayed 78% sensitivity and 75% specificity with an AUC of 0.77. The AUC increased to 0.86 when both biomarkers were combined, increasing sensitivity to 87% and specificity to 79%. Additionally, p-cresol showed promise in identifying pre-malignant lesions, with 83% sensitivity, 63% specificity, and an AUC of 0.69. These results suggest that Mag-HSAE-TD-GC-MS is a sensitive and non-invasive technique for early CRC detection and differentiation from cases of pre-malignancy.114
In a different study, a novel method of cancer detection based on flexible sensors fabricated using molecularly modified gold nanoparticles (GNPs) is presented. These sensors are intended to function as a component of a dynamic cross-reactive diagnostic array that can identify and categorize VOCs associated with disease in exhaled breath. The sensors can offer a rich dataset from each device by bending GNPs, which changes their spatial structure and improves diagnostic accuracy. The flexible GNP-based sensors are designed to react to VOCs at parts per billion (ppb) concentrations, which have been linked to ovarian cancer. Owing to this technology, the sensors can distinguish between VOCs that are associated with ovarian cancer and those that come from unrelated environmental sources. The sensors have the potential to be utilized for reliable, non-invasive cancer diagnosis because of their great accuracy in detecting these VOCs. The sensors were even able to identify breath samples from women with ovarian cancer (82% accuracy). Using GNP sensors has several benefits, including portability, low cost, and the ability to function without intrusive procedures or sophisticated equipment. These sensors’ dynamic bending states improve their diagnostic capabilities by allowing them to record a broad variety of VOC interactions. The study also emphasizes how these sensors could be used as a non-invasive, extra alternative for early disease identification and monitoring in addition to current ovarian cancer screening techniques. Breath-based diagnostics have advanced significantly because of the incorporation of molecularly altered GNPs into flexible sensor arrays. This strategy could revolutionize cancer screening and follow-up by providing a straightforward, affordable, and non-invasive diagnostic instrument that overcomes some of the present obstacles in the detection and management of diseases.115
Biosensing using these natural systems has the potential to revolutionize diagnostic procedures by providing extremely specific, sensitive, and non-invasive testing techniques. Nevertheless, there are several difficulties in converting these concepts from biology into useful biosensor technology. Traditional diagnostic methods, such as those for disorders of the respiratory system, frequently need expensive equipment, centralized laboratories, and labor-intensive procedures. Modern biosensors, on the other hand, take advantage of developments in micro- and nanotechnology to enhance performance indicators including sensitivity, specificity, and reaction time.118 Even with these developments, the stability and functionality of biological components such as ORs outside of their native membranes remain a major challenge for biosensor integration. The use of synthetic peptides and molecularly imprinted polymers (MIPs), which replicate the features of receptors while improving stability and performance, are examples of recent advancements in the production of biosensors that offer promising solutions.119,120
The development of bioelectronic noses (B-ENs) and electronic noses (e-noses) is a significant step toward the practical use of artificial olfaction. These devices can assess complicated gas mixtures without the requirement for separation by utilizing an array of chemical sensors and highly developed computing systems.101 VOCs may be analyzed qualitatively and quantitatively using e-noses, which offer a “fingerprint” of the compounds. These systems’ designs frequently take inspirations from the neurophysiology of the human olfactory system to mimic its capacity for pattern recognition. Even though the e-noses that are currently in use are efficient in some applications, research is constantly being conducted to improve them so that they can more accurately detect and identify a larger variety of VOCs. Simultaneously, the field is progressing in the application of biological components in biosensors, like ORs and odorant binding proteins (OBPs). Peptides that can imitate the action of these biological receptors have been developed because of advancements in synthetic biology and genetic engineering. These peptides provide several benefits, such as increased reusability, simpler immobilization, and increased stability. Additionally, these peptides are being designed and optimized using computational technologies like molecular dynamics and virtual docking, which improves their specificity and sensitivity for VOC detection. Biosensors that are more useful and efficient are being made possible by the convergence of biological knowledge and technological innovation. There is great evidence that this field will continue to progress with future research efforts to enhance sensor performance and develop biomimetic materials.116
![]() | ||
Fig. 5 (a) Shows the study design, involving breath sample collection from 1404 subjects across five countries for disease diagnosis, (b) presents a heat map of sensor responses to 17 diseases using nanomaterial-based sensors.101 |
Low concentrations of VOCs linked to lung cancer significantly affected the signal obtained on the smartphone-based chemiresistor device. Notably, excessive humidity and the presence of chemicals that interfere with signal transmission, like ammonia, carbon dioxide, and hexane, had no effect on the sensor's function. This robustness suggests that breath analysis could be used to diagnose lung cancer non-invasively using this technology. Another study developed a different high-performance chemiresistive sensor123 to detect VOCs, which may be lung cancer biomarkers. This sensor makes use of core–shell hybrid nanostructures made of conducting polymer poly(3,4-ethylenedioxythiophene) (PEDOT) and Fe3O4 magnetic nanoparticles (MNPs). In the presence of polymerized ionic liquids (PILs), which served as a stabilizing agent and a linker to bind the MNPs with PEDOT, the MNPs were synthesized using a microwave-assisted method. The resultant Fe3O4 hybrids modified via PEDOT–PIL were assessed as the active sensing component in a chemiresistive sensor, exhibiting remarkable sensitivity and low noise levels for the detection of VOCs, such as acetone, a potential marker in the breath of lung cancer patients. With a 38.8% increase in sensitivity and an 11% decrease in noise, the developed PEDOT–PIL–Fe3O4 sensor demonstrated significant gains over its PEDOT–PIL–counterpart. The benefits of embedding MNPs in conducting polymers for VOC detection are shown by this improved performance. The sensor exhibited high sensitivity towards VOCs at about 1 ppm, thus indicating its promise in lung cancer diagnosis.
In a 2023 study by Cai et al.,124 a non-invasive diagnostic method for gastric cancer using a paper based colorimetric assay with integrated smartphone sensing was reported. The method was tested on breath samples from 40 gastric cancer patients and 40 healthy subjects. The sensor array was designed using porous nylon filter paper incorporated with functionalized nanoplasmonic materials and chemically responsive organic dyes. The sensors changed color when exposed with the VOCs which was analyzed using a smartphone camera and orthogonal partial least squares discriminant analysis (OPLS-DA). The sensing efficiency was confirmed using GC-MS which identified 25 VOCs differentiating gastric cancer patients from the healthy subjects. The key biomarkers detected in gastric cancer patients included acetone, furfural, benzaldehyde, isoprene, and phenyl acetate. The sensor demonstrated a detection limit in the ppb range (acetone ∼33 ppb using 2-pyridinethiol modified GNPs, furfural ∼228 ppb using 2-phenylethanethiol modified GNPs, benzaldehyde ∼69 ppb using 2-naphthalenethiol modified GNPs, isoprene ∼78 ppb using 2-aminoethanethiol modified GNPs, phenyl acetate ∼67 ppb using 11-mercaptoundecanol modified GNPs) and differentiated the gastric cancer subjects with an accuracy of 90%, showing high promise of the breath analysis in cancer detection. Another example of a colorimetric sensor used chemically reactive dyes immobilized on a nanoporous matrix of organically modified siloxanes to study VOCs in the breath of 229 study subjects (92 with lung cancer and 137 controls). The nanoporous structure increased surface area, improving sensitivity to VOCs. The results were printed onto a disposable cartridge capturing the color changes used for determining VOC changes to identify unique breath biosignatures. The sensor exhibited a sensitivity of 70–91% and specificity of 73–86% amongst different lung cancer types (lung cancer, adenocarcinoma, squamous cell carcinoma, small cell lung cancer).125
A novel method was presented126 which combined genetically modified M13 bacteriophages with metallic nanostructures to create a gap plasmonic color sensor system. Because their structural protein capsids can be genetically altered to preferentially bind to target analytes, M13 bacteriophages were chosen for this application. Based on their functional groups, this alteration makes it possible to precisely identify VOCs. These genetically modified bacteriophages (engineered to display specific peptides on its surface which can interact with VOCs of interest) enable the sensor system to distinguish between distinct VOCs with more efficiency, increasing selectivity and overall performance. By employing hierarchical cluster analysis, the multiarray biosensor, which made use of the gap plasmonic color film, was able to identify VOCs in breath samples. A great degree of discrimination between various chemicals was made possible by this method. The method was used to categorize breath samples taken from 50 lung cancer patients and 70 healthy individuals during clinical trials. The technology identified lung cancer breath samples with an accuracy rate higher than 89%, which was a promising outcome. Machine learning analysis, which improved the sensor's capacity to discern between malignant and healthy breath samples, further corroborated this high rate of classification accuracy. Through non-invasive breath analysis, this hybrid approach not only increases the sensor's sensitivity and selectivity but also offers a strong foundation for the early identification of lung cancer. The technology is promising in clinical diagnostics and for the development of sophisticated, portable sensing devices for disease detection. This is highlighted by the effective classification of VOCs and its high accuracy in identifying samples that are malignant.
Exemplary approaches include using D5-ethyl-β-D-glucuronide which is intravenously administered in mice and readily metabolized to D5-ethanol (VOC) by β-glucuronidase.54 Therefore, the release of free D5-ethanol is an indicator of tumor microenvironments’ activity using this assay. In another example, Chan et al. devised intrapulmonary delivery of hydrofluoroamine (VOC reporter)-conjugated activity based nano sensors which are susceptible to serine protease neutrophil elastase as an indicator of lung disease.127 Thus, VOC detection using exogenous probes producing non-endogenous disease related breath markers can be the future of the technology in producing robust and reproducible disease signals. Additionally, the improvements in sample collection measures and instrumentation will accelerate the translation of the technology. For endogenous VOCs, further research in disease biology and understanding of VOC origins will facilitate the identification of robust biomarkers, including early indicators of aggressive cancers, which can be accepted by both the clinical and scientific community.
More hope can also be seen from newer techniques like cavity ring-down spectroscopy. Larracy et al. demonstrated non-small cell lung cancer detection with 51.61%–66.13% sensitivity and 73.96%–97.92% specificity.128 As the research further progresses to find unique molecules for different cancer types, tunable diode laser absorption spectroscopy can be a highly effective approach. Further, as an extension to nature-inspired approaches, neuron-based VOC biosensors can be highly promising, employing olfactory receptor neurons (ORNs) or reconstituted receptor proteins enabling ultrasensitive detection of disease specific VOCs. These approaches are very promising, as ORNs are naturally sensitive to VOCs at trace concentration levels. They also have high selectivity and therefore may be able to be tuned for the discrimination of structurally similar targets in which traditional e-Noses cannot. It should be noted that for portable sensor technologies, unknown VOC interferents during collection/sampling may bias results in a manner that cannot be normalized. Nonetheless, this can be overcome through ensuring that sensors are cross-sensitive and selective to the VOC targets of interest and implementing analytically validated breath sampling methodology that does not introduce interfering compounds.
VOC | Volatile organic compound |
GC-MS | Gas chromatography-mass spectrometry |
IL-6 | Interleukin-6 |
GC-IMS | Gas chromatography-ion mobility spectrometry |
SIFT-MS | Selected ion flow-tube mass spectrometry |
PTR-MS | Proton transfer reaction-mass spectrometry |
ctDNA | Circulating tumor DNA |
CTCs | Circulating tumor cells |
NGS | Next-generation sequencing |
MACS | Magnetic-activated cell sorting |
EpCAM | Epithelial cell adhesion molecule |
EMT | Epithelial-to-mesenchymal transition |
ALDHs | Aldehyde dehydrogenase |
PSA | Prostate-specific antigen |
VCCs | Volatile carbonyl compounds |
PET | Positron emission tomography |
MRI | Magnetic resonance imaging |
CT | Computed tomography |
SiNW FET | Silicon nanowire field-effect transistor |
TPS | Trichloro-(phenethyl)-silane |
COPD | Chronic obstructive pulmonary disease |
DCIS | Ductal carcinoma in situ |
Mag-HSAE | Magnetic headspace adsorptive extraction |
TD-GC-MS | Thermal desorption-gas chromatography-mass spectrometry |
GNPs | Gold nanoparticles |
TAARs | Tracing amine-associated receptors |
GPCRs | G-protein coupled receptors |
FPRs | Formyl peptide receptors |
OBPs | Odorant binding proteins |
BET | Brunauer–Emmett–Teller |
SEM | Scanning electron microscopy |
PEDOT | Polymer poly(3,4-ethylenedioxythiophene) |
MIPs | Molecularly imprinted polymers |
OPLS-DA | Orthogonal partial least squares discriminant analysis |
Footnotes |
† We dedicate this article to the memory of the late Professor Sanjiv Sam Gambhir, a molecular imaging scientist. |
‡ These authors have contributed equally to this work. |
This journal is © The Royal Society of Chemistry 2025 |