Juan Pablo Betancourt Arango,
Alejandro Patiño Ospina
,
Jhon Alexander Fiscal Ladino
and
Gonzalo Taborda Ocampo
*
Department of Chemistry, Research Group on Chromatography and Related Techniques, University of Caldas, Manizales, Caldas, Colombia. E-mail: gtaborda@ucaldas.edu.co
First published on 16th July 2025
Introduction: Omics sciences, particularly metabolomics and its subfield volatilomics, investigate small molecules to understand biochemical dynamics. Volatilomics targets volatile organic compounds (VOCs), which act as biomarkers for physiological changes, environmental stress, and xenobiotic exposure. Advances in GC-MS and HS-SPME have enabled precise VOC profiling. A critical issue in food safety is pesticide contamination, notably organochlorines like endosulfan, which bioaccumulate and disrupt plant metabolomes. Hass avocado (Persea americana Mill.), rich in lipids and terpenoids, offers an ideal matrix for studying xenovolatilomic responses. Objective: This study evaluated volatilomic alterations induced by endosulfan in Hass avocado using headspace solid-phase microextraction (HS-SPME) coupled with gas chromatography-mass spectrometry (GC-MS). It aimed to identify potential toxicity biomarkers associated with pesticide exposure, contributing to rapid, reliable detection methodologies for agricultural products. Methodology: Avocado peel, pulp, and seed were experimentally exposed to endosulfan for 8 and 20 days under controlled conditions. VOCs were extracted by HS-SPME and analyzed by GC-MS. Data were processed and subjected to multivariate statistical analyses, including Principal Component Analysis (PCA), Partial Least Squares-Discriminant Analysis (PLS-DA), random forest, variable importance in projection (VIP) scores, and receiver operating characteristic (ROC) curve analysis to identify VOCs differentially expressed under pesticide exposure. Results: Random forest and PLS-DA analyses identified five key VOCs as potential toxicity biomarkers: (E)-2-octenal (V93), oct-3-en-2-one (V86), decanal (V129), hexanal (V29), and nonanal (V102). These compounds exhibited significant concentration changes based on exposure time (8 and 20 days) and tissue type. Additionally, an unknown compound (VX83) emerged as a potential biomarker requiring future characterization. Conclusions: This study constitutes the first xenovolatilomic investigation in Hass avocado and validates the use of (E)-2-octenal, oct-3-en-2-one, decanal, hexanal, and nonanal as potential toxicity biomarkers for the early detection of pesticide-induced biochemical alterations. The integration of volatilomic profiling with multivariate statistical and biochemical analyses provides a solid foundation for developing rapid diagnostic tools and advancing computational metabolomics models for predicting pesticide-induced enzymatic inhibition processes. These findings have implications for food safety, export quality assurance, and the economic sustainability of agricultural production systems in regions like Caldas, Colombia.
Sustainability spotlightThe increasing presence of pesticide residues in tropical crops like Hass avocado poses a significant threat to food safety, consumer health, and export sustainability. This work presents a novel xenovolatilomic strategy for the early detection of agrochemical contamination by identifying potential toxicity biomarkers through VOC profiling and multivariate analysis. By enabling rapid, non-destructive monitoring of biochemical alterations in food matrices, this study advances sustainable food quality control. It aligns with the UN Sustainable Development Goals (SDGs), particularly Goal 2 (Zero Hunger), Goal 3 (Good Health and Well-being), and Goal 12 (Responsible Consumption and Production), by promoting safe agricultural practices and reducing toxic exposure in food systems. This approach supports long-term food security and environmentally conscious production. |
Volatilomics has gained relevance in diverse areas, including ecological interactions, environmental quality assessment, natural product bioprospecting, and food safety. This discipline is based on analyzing VOCs, which are naturally emitted by organisms and biological matrices, and whose presence and concentration can indicate physiological alterations.3 It enables the evaluation of differential metabolite expression in response to environmental factors, physiological stress, and contaminant exposure.4 In this context, VOCs have also been employed as tools in taxonomic and ecological studies across different species.5 More recently, specialized volatilomic approaches have emerged, such as exhalomics (analysis of VOCs in exhaled air) and xenovolatilomics, which examines volatile profiles induced by xenobiotic agents.6 These approaches have been integrated with multi-omics strategies, combining volatilomics with lipidomics and other disciplines to gain a deeper understanding of chemical variability in biological and food samples.7
The characterization of VOCs requires advanced extraction and analytical methodologies. Among them, solid-phase microextraction in static headspace (SHS-SPME) and dynamic headspace (DHS-SPME) are widely used for VOC collection due to their sensitivity, reproducibility, and solvent-free operation. For compound identification, gas chromatography (GC) remains the primary technique, often coupled with various detection systems such as mass spectrometry (MS),8 time-of-flight mass spectrometry (TOF/MS), triple quadrupole mass spectrometry (QTOF/MS),9 Orbitrap-MS, tandem mass spectrometry (MS/MS), and ion mobility spectrometry (MS-IMS).10 These platforms enable comprehensive structural elucidation of VOCs and in-depth exploration of the chemical behavior of biological samples. In terms of quantification, gas chromatography with flame ionization detection (GC-FID),11 is widely regarded as a robust technique due to its excellent linearity and reproducibility across a wide concentration range. Additionally, GC-MS operated in selected ion monitoring (SIM),12 mode enhances quantification by providing improved sensitivity and selectivity for targeted analytes. The combination of GC-FID and GC-MS (in SIM mode) is often employed to leverage the strengths of both techniques—GC-MS for compound identification and GC-FID or targeted MS modes for accurate quantification—thereby ensuring reliable and reproducible data. Volatilomic analysis is further supported by advanced statistical tools, including both supervised and unsupervised methods. Techniques such as principal component analysis (PCA), hierarchical cluster analysis (HCA), and linear discriminant analysis (LDA) allow for pattern recognition, sample classification, and evaluation of the variables contributing to chemical differentiation between groups.13
One of the major challenges in food safety today is the presence of pesticide residues in agricultural crops. Organophosphate and organochlorine pesticides have been extensively used for pest control, yet their uncontrolled application raises concern due to their accumulation in fruits, vegetables, and other agricultural products.14 These compounds, being foreign to biological systems, are classified as xenobiotics and can induce biochemical alterations in exposed organisms, even at low concentrations.15 Globally, organochlorine pesticides such as endosulfan have been restricted or banned in many countries due to their high persistence, lipophilicity, and tendency to bioaccumulate within ecosystems and food chains.16 The inclusion of endosulfan in the Stockholm Convention on Persistent Organic Pollutants (2011) marked a turning point in international recognition of its environmental and health risks.17 Nonetheless, residues of endosulfan and its isomers (α, β, and sulfate) continue to be reported in soil, water, and food commodities, particularly in tropical and developing regions. Toxicological studies have demonstrated endosulfan's potential to induce neurotoxicity, reproductive dysfunction, oxidative stress, and enzymatic inhibition in a range of biological systems, including plants, animals, and humans.18
In Colombia, pesticide use is regulated to ensure food safety and facilitate international trade in agricultural products. However, organochlorine pesticides such as endosulfan have raised significant environmental and health concerns due to their persistence and bioaccumulation. Notably, endosulfan isomers (α, β, and sulfate) exhibit substantial toxicity to both organisms and ecosystems.19,20 This situation underscores the need for rapid, cost-effective, and reliable research methods. Xenometabolomic,21–23 and xenovolatilomic studies provide valuable strategies for identifying secondary metabolites expressed in response to xenobiotic exposure, aiming to develop omics-based methodologies that characterize metabolites for potential toxicity biomarker identification.
Hass avocado (Persea americana Mill.) is among the most widely cultivated and commercially significant avocado varieties worldwide, representing a key agro-industrial product in countries such as Colombia, Mexico, and Peru. Its chemical composition, rich in lipids, phenolic compounds, and terpenoids, makes it an ideal model for volatilomic studies, as its VOCs are associated with quality, ripening, and environmental responses. However, avocado production is exposed to pesticide applications for pest and disease control, raising concerns about potential pesticide residues in the fruit.24 Notably, endosulfan has been identified as an environmental contaminant in avocado crops, with the potential to alter its metabolism and volatilomic profile. Xenovolatilomic studies have demonstrated that pesticide exposure can induce the formation of specific VOCs, which may serve as biomarkers of contamination and toxicity. In this regard, Hass avocado serves as an optimal matrix for evaluating volatilomic profile alterations induced by endosulfan, with the aim of identifying potential toxicity biomarkers.
A toxicity biomarker,25 is characterized as a response metabolite that emerges in the presence of a xenobiotic agent. These biomarkers provide a rapid methodology for early detection of contamination in samples exposed to xenobiotic agents such as pesticides. This study conducted a xenovolatilomic analysis of Hass avocado (Persea americana Mill.) using headspace solid-phase microextraction (HS-SPME) and gas chromatography-mass spectrometry (GC-MS).26 The objective is to assess alterations in the volatilomic profile induced by endosulfan and its potential for identifying toxicity biomarkers in this fruit.
As additional information, for each metabolite identified, key data were collected including the PubChem ID, common name, signal-to-noise ratio (S/N), molecular formula, and the corresponding codes from HMDB, KEGG, Lipid Maps, and MetaCyc, in order to support the construction of associated biochemical pathways. Details regarding both identified and unidentified xenovolatilomic metabolites observed during the two exposure periods are provided in the ESI.†
Specifically, metabolite data for peel (Tables S1 and S2), pulp (Tables S3 and S4), and seed (Tables S5 and S6†) are included to document changes across tissues and exposure times. Tables 1 and 2, summarize the xenovolatilomic profiles of Hass avocado peel, pulp, and seed after 8 and 20 days of endosulfan exposure, respectively. After 8 days, a total of 19 compounds were identified in the peel, 10 in the pulp, and 32 in the seed. After 20 days, the number of compounds increased to 27 in the peel, 19 in the pulp, and 35 in the seed, reflecting a time-dependent metabolic response. This progressive increase in detected VOCs may be attributed to the cumulative oxidative stress induced by prolonged exposure to endosulfan, leading to sustained lipid peroxidation, enzymatic inhibition, and continued formation of aldehydes, ketones, and secondary volatile metabolites. Fig. 2 provides a visual representation of the overlap and distribution of identified VOCs among the three fruit tissues after 8 days (Fig. 2A) and 20 days (Fig. 2B) of exposure, illustrating the interconnectedness and potential metabolic convergence triggered by the xenobiotic agent.
Compound number | Peel with endosulfan 8 days | Pulp with endosulfan 8 days | Seed with endosulfan 8 days | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | |
1 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847 | 2,2-Dimethylhexane | — | 3.205 | 705.8 | 2,2,4-Trimethylpentane | — | 3.224 | 706.5 |
2 | (2E)-Heptenal | LMFA06000019 | 11.261 | 959 | Hexanal | LMFA06000109 | 5.424 | 791.6 | Hexanal | LMFA06000109 | 5.424 | 791.6 |
3 | Benzaldehyde | C00261 | 11.580 | 967 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847.1 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847.1 |
4 | Beta-myrcene | C06074 | 12.590 | 991 | (2E)-Heptenal | LMFA06000019 | 11.261 | 959.3 | Alpha-pinene | C09880 | 9.996 | 929.1 |
5 | Eucalyptol | C09844 | 14.525 | 1033 | Benzaldehyde | C00261 | 11.580 | 966.9 | (2E)-Heptenal | LMFA06000019 | 11.261 | 959.3 |
6 | (Z)-Beta-ocimene | — | 14.875 | 1041 | 1-Octen-3-ol | LMFA05000090 | 12.595 | 991.2 | Benzaldehyde | C00261 | 11.580 | 966.9 |
7 | (E)-Beta-ocimene | LMPR0102010021 | 15.285 | 1050 | Oct-3-en-2-one | HMDB0033547 | 14.940 | 1042 | (−)-Beta-pinene | C06307 | 12.015 | 977.3 |
8 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | Beta-myrcene | C06074 | 12.590 | 991.1 |
9 | Nonanal | LMFA06000040 | 18.140 | 1111 | Nonanal | LMFA06000040 | 18.140 | 1111 | 1,4,4-Trimethyl-3,5-dimethylenecyclopentene | — | 14.160 | 1026 |
10 | Decanal | HMDB0011623 | 22.870 | 1216 | 1-Chlorodecane | — | 20.875 | 1168 | Limonene | HMDB0004321 | 14.505 | 1033 |
11 | Alpha-ylangene | HMDB0301856 | 26.940 | 1379 | (Z)-Beta-ocimene | — | 14.875 | 1041 | ||||
12 | (−)-Alpha-copaene | HMDB0061851 | 27.060 | 1385 | Oct-3-en-2-one | HMDB0033547 | 14.940 | 1042 | ||||
13 | 1,5-Di-epi-bourbonene | — | 27.215 | 1393 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | ||||
14 | (−)-Beta-elemene | HMDB0061848 | 27.305 | 1397 | 4-Methyl-3-(1-methylethylidene)-1-cyclohexene | — | 17.020 | 1087 | ||||
15 | (−)-trans-Caryophyllene | C09629 | 27.875 | 1432 | Dehydro-p-cymene | HMDB0029641 | 17.325 | 1093 | ||||
16 | (+)-Delta-cadinene | C06394 | 29.413 | 1533 | Nonanal | LMFA06000040 | 18.140 | 1111 | ||||
17 | Zonarene | HMDB0303004 | 29.465 | 1537 | 4,8-Dimethyl-1,3,7-nonatriene | — | 18.500 | 1118 | ||||
18 | Methyl palmitate | HMDB0061859 | 34.175 | 1937 | Alpha-pinocarvone | C09884 | 20.830 | 1167 | ||||
19 | Tetratriacontane | LMFA11000587 | 40.935 | 2712 | n-Decanal | HMDB0011623 | 22.870 | 1216 | ||||
20 | Octyl acetate | LMFA07010197 | 22.985 | 1220 | ||||||||
21 | 1,9-Decadiene | HMDB0244254 | 25.095 | 1291 | ||||||||
22 | 10-Undecenal | HMDB0031128 | 25.598 | 1312 | ||||||||
23 | Alpha-ylangene | HMDB0301856 | 26.940 | 1379 | ||||||||
24 | (−)-Alpha-copaene | HMDB0061851 | 27.060 | 1385 | ||||||||
25 | Beta-maaliene | — | 27.233 | 1394 | ||||||||
26 | Decyl acetate | LMFA07010212 | 27.640 | 1418 | ||||||||
27 | Opposita-4(15),7-diene | — | 27.900 | 1434 | ||||||||
28 | Selina-4(15),7-diene | — | 28.512 | 1472 | ||||||||
29 | Alpha-amorphene | CPD-8797 | 28.735 | 1486 | ||||||||
30 | (+)-Delta-cadinene | C06394 | 29.413 | 1533 | ||||||||
31 | (Z,Z)-1,8,11-Heptadecatriene | HMDB0302479 | 31.220 | 1672 | ||||||||
32 | cis-11-Tetradecen-1-ol | — | 31.475 | 1692 |
Compound number | Peel with endosulfan 20 days | Pulp with endosulfan 20 days | Seed with endosulfan 20 days | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | Name | Code KEGG/Lipid Maps/HMDB/MetaCyc | RT | KI | |
1 | 2,2-Dimethylhexane | — | 3.205 | 706 | 2,2-Dimethylhexane | — | 3.205 | 705.8 | Pentanal | HMDB0031206 | 3.400 | 713.3 |
2 | Pentanal | HMDB0031206 | 3.400 | 713 | Pentanal | HMDB0031206 | 3.400 | 713.3 | Pentanol | — | 4.935 | 772.7 |
3 | trans-2-Pentenal | CPD-13228 | 4.393 | 752 | Pentanol | — | 4.930 | 772.5 | Hexanal | LMFA06000109 | 5.424 | 791.6 |
4 | Pentanol | — | 4.935 | 773 | Hexanal | LMFA06000109 | 5.424 | 791.6 | Furfural | C14279 | 6.850 | 838.6 |
5 | Hexanal | LMFA06000109 | 5.424 | 792 | Furfural | C14279 | 6.850 | 838.6 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847.1 |
6 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847 | (2E)-Hexenal | HMDB0031496 | 7.118 | 847.1 | 2-Heptanone | LMFA12000004 | 8.340 | 886.1 |
7 | 2-Heptanone | LMFA12000004 | 8.340 | 886 | Heptanal | HMDB0031475 | 8.885 | 902.6 | Heptanal | HMDB0031475 | 8.885 | 902.6 |
8 | Heptanal | HMDB0031475 | 8.885 | 903 | Cyclohexane carboxaldehyde | — | 10.500 | 941.2 | Pentyl acetate | — | 9.290 | 912.3 |
9 | (E)-2-Heptenal | LMFA06000019 | 11.261 | 959 | (E)-2-Heptenal | LMFA06000019 | 11.261 | 959.3 | Alpha-pinene | C09880 | 9.988 | 928.9 |
10 | Benzaldehyde | C00261 | 11.580 | 967 | Benzaldehyde | C00261 | 11.580 | 966.9 | Cyclohexane carboxaldehyde | — | 10.500 | 941.2 |
11 | 1-Octen-3-ol | LMFA05000090 | 12.595 | 991 | Octanal | HMDB0001140 | 13.785 | 1018 | (E)-2-Heptenal | LMFA06000019 | 11.261 | 959.3 |
12 | Octanal | HMDB0001140 | 13.785 | 1018 | 5-Ethyl-1-formylcyclopentene | — | 14.520 | 1033 | Benzaldehyde | C00261 | 11.580 | 966.9 |
13 | (2E,4E)-Hepta-2,4-dienal | LMFA06000024 | 13.771 | 1017 | Oct-3-en-2-one | HMDB0033547 | 14.940 | 1042 | (−)-Beta-pinene | LMPR0102120013 | 12.015 | 977.3 |
14 | Oct-3-en-2-one | HMDB0033547 | 14.940 | 1042 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | Oct-1-en-3-one | HMDB0031309 | 12.065 | 978.5 |
15 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | Nonanal | LMFA06000040 | 18.140 | 1111 | Octanal | HMDB0001140 | 13.785 | 1018 |
16 | Nonanal | LMFA06000040 | 18.140 | 1111 | (Z)-Non-2-enal | LMFA06000041 | 20.840 | 1167 | o-Cymene | HMDB0037050 | 14.160 | 1026 |
17 | (E)-2-Nonenal | LMFA06000041 | 20.910 | 1169 | Decanal | HMDB0011623 | 22.870 | 1216 | 5-Ethyl-1-formylcyclopentene | — | 14.520 | 1033 |
18 | Decanal | HMDB0011623 | 22.870 | 1216 | (E)-2-Decenal | LMFA06000053 | 24.650 | 1276 | Oct-3-en-2-one | HMDB0033547 | 14.940 | 1042 |
19 | Octyl acetate | LMFA07010197 | 22.985 | 1220 | Undecan-6-one | LMFA12000062 | 24.805 | 1281 | Benzeneacetaldehyde | C00601 | 15.262 | 1049 |
20 | (E)-Dec-2-enal | LMFA06000053 | 24.650 | 1276 | (E)-2-Octenal | HMDB0013809 | 15.995 | 1065 | ||||
21 | (+)-Cyclosativene | HMDB0302487 | 26.935 | 1379 | Dehydro-p-cymene | HMDB0029641 | 17.325 | 1093 | ||||
22 | (−)-Alpha-copaene | HMDB0061851 | 27.060 | 1385 | (Z)-4-Decenal | HMDB0041014 | 22.415 | 1201 | ||||
23 | (−)-trans-Caryophyllene | C09629 | 27.875 | 1432 | Decanal | HMDB0011623 | 22.870 | 1216 | ||||
24 | trans-Alpha-bergamotene | — | 28.050 | 1443 | Acetic acid octyl ester | LMFA07010197 | 23.000 | 1221 | ||||
25 | (+)-Delta-cadinene | C06394 | 29.413 | 1533 | (E)-Dec-2-enal | LMFA06000053 | 24.650 | 1276 | ||||
26 | Zonarene | HMDB0303004 | 29.465 | 1537 | 10-Undecenal | HMDB0031128 | 25.598 | 1312 | ||||
27 | Phytone | — | 33.305 | 1855 | (−)-Alpha-copaene | HMDB0061851 | 27.060 | 1385 | ||||
28 | (Z)-Dec-4-enyl ethyl carbonate | — | 27.265 | 1395 | ||||||||
29 | trans-Alpha-bergamotene | — | 28.050 | 1443 | ||||||||
30 | Selina-4(15),7-diene | — | 28.580 | 1476 | ||||||||
31 | (+)-Alpha-muurolene | CPD-8798 | 28.795 | 1490 | ||||||||
32 | (+)-Delta-cadinene | C06394 | 29.413 | 1533 | ||||||||
33 | 1-Heptadecanol | — | 33.765 | 1897 | ||||||||
34 | Eicosane | HMDB0059909 | 33.875 | 1908 | ||||||||
35 | Tetratriacontane | LMFA11000587 | 40.935 | 2712 |
As shown in Fig. 2, increasing the exposure time to the xenobiotic leads to a significant rise in the number of detected metabolites in each fruit section. However, some of these metabolites begin to appear in all three parts simultaneously, indicating the formation of new, biochemically correlated compounds shared across tissues. The variations observed in the volatilomic profiles (Fig. 1) reflect this temporal differentiation, with new signals emerging and others disappearing as exposure time increases. This behavior results from metabolic fluxes in which endosulfan interacts with the natural cellular biochemical pathways in the peel, pulp, and seed of Hass avocado. This interaction triggers the generation of response metabolites (RMs), which act as potential toxicity biomarkers in response to the biological and biochemical alterations induced by the pesticide.
Additionally, in xenovolatilomic profile 1 (Fig. 2A), 9 unique metabolites were identified in the peel, 3 in the pulp, and 20 in the seed, with the seed showing the highest volatile diversity under this condition. Five common metabolites were also found across the three tissues. In xenovolatilomic profile 2 (Fig. 2B), 9 VOCs were unique to the peel, 3 to the pulp, and 16 to the seed. The seed exhibited a reduction in metabolites after 20 days of exposure, suggesting possible metabolization or biotransformation over time. A total of 11 metabolites were common to all three fruit parts, representing a significant increase compared to profile 1. This analysis indicates a notable increase in the number of shared metabolites among the different fruit tissues as exposure progresses, which could be attributed to the diffusion or migration of compounds, biotransformation processes, or a convergent adaptive response of the fruit matrices to the presence of the xenobiotic.30–33
![]() | ||
Fig. 3 (A) Summary of the enrichment analysis performed on the metabolite set. (B) Bubble plot showing the significance analysis of the enriched metabolite classes. |
The bubble plot results indicate that carbonyl compounds and monoterpenoids exhibit the lowest p-values (expressed as −log(p)) and relatively small bubble sizes, suggesting that these compound families are statistically significant, although less enriched. In contrast, phenylpropenes stand out due to their high level of significance and representativeness within the xenovolatilome, whereas fatty acid esters display low significance and minimal enrichment. These results indicate that these volatile compound families are significantly present in the xenovolatilome of Hass avocado during both exposure periods, with a strong dominance of phenylpropenes-type VOCs, followed by other aromatic compounds such as phenylacetaldehydes and cumenes. The enrichment analysis of these compounds suggests a specific chemical response of the fruit to stress induced by the xenobiotic agent, indicating a relationship with metabolic defense mechanisms, lipid oxidation processes, and enzymatic detoxification pathways.
Although the total explained variance is relatively low (12.4%), the analysis allows for a preliminary visualization of distribution patterns and groupings among the samples. The information represented in the PCA is validated by the different quality controls applied (CQC, FQC, and SQC), confirming the absence of instrumental interferences, ensuring high reproducibility of the technique, verifying the quality of the experiments performed, and supporting the validity of the model without noise interference from the technique or the biological samples themselves. Additionally, the PCA shows clear differentiation between doped and undoped samples, indicating that the endosulfan doping applied to the peel, pulp, and seed samples distinctively and consistently affects the volatilomic profile of these plant matrices.
![]() | ||
Fig. 6 (A) Hierarchical cluster analysis (HCA). (B) Partial least squares discriminant analysis (PLS-DA). (C) Top variable importance in projection (VIP) scores generated from the PLS-DA model. |
To further assess treatment separation, a Partial Least Squares Discriminant Analysis (PLS-DA) was applied (Fig. 6B). This plot projects the samples into a reduced-dimensional space, maximizing differences between predefined groups: blue circles (undoped samples) and yellow crosses (treated samples). The model shows that PLS1 accounts for 62.73% of the variance, while PLS2 explains 7.27%, indicating a clear separation between the two groups and confirming that the PLS-DA model effectively discriminates between doped and undoped samples. Additionally, no overlapping of samples is observed, reinforcing the hypothesis of a significant biological effect induced by endosulfan as a xenobiotic agent. Based on the PLS-DA results, the 30 most important variables (VIP) contributing to the separation between groups within the model were identified (Fig. 6C).
The determination of VIP scores generated from the PLS-DA model indicated that variables with VIP > 1.0 are considered relevant for classification; however, for this study, only those with VIP > 3.0 were selected, identifying V13, V3, V54, and V29 as the most discriminatory metabolites within the model. To verify these results, cross-validation was performed, yielding an optimal classification threshold of 0.9251 and a PLS-DA model accuracy of 0.75.
Given this moderate accuracy, the data were further evaluated using a random forest model, which generated a balanced working subset for the class groups. Cross-validation of this model achieved an accuracy of 1.00, demonstrating that this method efficiently classifies doped and undoped samples. VIP scores were then determined from the random forest model (Fig. 7A), with variables having VIP > 0.025 considered significant. The most important contributors to the group differentiation were identified as VX83, V29, V54, V102, V13, V86, V129, and V3. Consistently, both the random forest and PLS-DA models identified V13, V3, and V54 as influential variables in discriminating between doped and undoped sample groups. To further evaluate the potential of these compounds as toxicity biomarkers, ROC and corrected AUC-ROC analyses were performed (Fig. 7B) to assess the individual predictive capacity of each variable. The results obtained from the corrected ROC-AUC analysis (Fig. 7B) indicate that the metabolites V93, V129, V86, V102, V29, V54, V57, V13, V139, V218, V43, V3, and V1, along with the unidentified signals VX83, VX130, and VX281, can be considered volatile organic compounds (VOCs) with potential as toxicity biomarkers for the preventive detection of xenobiotic agents, such as endosulfan, in Hass avocado crops.
The p-values for this test were calculated using a custom Python script (version 3.13.5), specifically employing the scipy.stats library. However, to confirm this information, an ROC curve analysis was conducted for all 16 variables (Table 3). In this way, metabolites with an ROC-AUC value ≥ 0.9 indicate a very high ability to correctly separate the two evaluated groups (doped and non-doped samples), suggesting that they can be considered effective potential toxicity biomarkers.
Compound number | Code | Compound | U statistic (Mann–Whitney) | p-Value | Curve ROC-AUC | Potential biomarker of toxicity (significance) |
---|---|---|---|---|---|---|
** Acceptance criterion for ROC-AUC ≥ 0.90. (Ref 34). * Significance U stadistic (Mann-Whitney). | ||||||
1 | V93 | (E)-2-Octenal | 2580.00 | 0.0000 | 0.97 | ** |
2 | V129 | Decanal | 2523.00 | 0.0000 | 0.94 | ** |
3 | V86 | Oct-3-en-2-one | 2500.50 | 0.0000 | 0.94 | ** |
4 | V102 | Nonanal | 2418.00 | 0.0000 | 0.90 | ** |
5 | V29 | Hexanal | 2395.50 | 0.0000 | 0.90 | ** |
6 | V54 | (E)-2-Heptenal | 2319.00 | 0.0000 | 0.87 | * |
7 | V57 | Benzaldehyde | 2290.50 | 0.0000 | 0.86 | * |
8 | V13 | Hexane, 2,2-dimethyl- | 2223.00 | 0.0000 | 0.83 | * |
9 | V139 | (E)-2-Decenal | 2182.50 | 0.0000 | 0.82 | * |
10 | V218 | Zonarene | 2083.50 | 0.000 | 0.78 | * |
11 | V43 | Heptanal | 1980.00 | 0.000 | 0.74 | * |
12 | V3 | Acetonitrile | 1930.50 | 0.0000 | 0.72 | * |
13 | V1 | Methyl isocyanide | 1878.00 | 0.0000 | 0.70 | * |
14 | VX83 | — | 2484.00 | 0.0000 | 0.93 | ** |
15 | VX130 | — | 2355.00 | 0.0000 | 0.88 | * |
16 | VX281 | — | 2118.00 | 0.0000 | 0.79 | * |
This study determined the ROC-AUC values for the 16 candidate biomarker compounds, of which the ROC-AUC curve indicates a superior or equal performance to 0.9 for the variables V93 (Fig. 8A), V129 (Fig. 8B), V86 (Fig. 8C), V102 (Fig. 8D), V29 (Fig. 8E), and an unknown metabolite categorized as VX83 (Fig. 8F). Therefore, these 6 compounds were identified as potential toxicity biomarkers, as they individually exhibit a strong ability to discriminate between the doped and non-doped sample groups in Hass avocado. To date, no previous studies have reported the identification of potential toxicity biomarkers in Hass avocado. However, studies have been reported on the exploration of physicochemical and metabolomic markers developed during the fruit's ripening process.35
![]() | ||
Fig. 8 AUC-ROC curves for the following compounds: (A) V93 ((E)-2-Octenal). (B) V129 (Decanal).39 (C) V86 (Oct-3-en-2-one). (D) V102 (Nonanal).39 (E) V29 (Hexanal).39 (E) V29 (Hexanal).39 (F) VX83 (unknown compound). |
Additionally, in the context of omics research in Hass avocado, genomic analyses have been employed to assess the genotoxicological effects of the ethanol extract obtained from the seed of this fruit.36 Transcriptomic studies focus on identifying hormonal biomarkers to determine the physiological age of the fruit, as well as understanding the expressed biochemical changes. These studies have revealed the overexpression of genes associated with DNA replication, auxin transport, cell wall modifications, and the biosynthesis of gibberellins, brassinosteroids, and flavonoids.37 Regarding proteomic studies, there is a comparative analysis between the peel and pulp, evaluating their shelf life after harvest.38
Most current research on this fruit focuses on exploring this matrix to identify bioactive compounds such as fatty acids present in the tissue, for antimicrobial purposes, food additive properties, and the identification and quantification of phytochemicals generated during the fruit's ripening stage.40,41 In metabolomic studies, a combination of GC-MS and HPLC-UV-VIS techniques, based on chemometric methods, has been used to discriminate between Hass avocado samples that present and do not present physiological disorders such as black spot.42 This indicates that this biological matrix has been extensively studied from an exploratory perspective to identify its biological and nutritional properties. Chemically, the fruit is characterized by the presence of alcohols, fatty acids, phenolic compounds, carotenoids, carbohydrates, furan derivatives and furanones, diterpenes, lignans, among others.43 These compounds contribute to the nutritional value of this matrix and enhance its consumption, highlighting its potential. Therefore, the importance of the preventive determination of toxicological alterations in this fruit lies in this context. This study marks the beginning of the first xenovolatilomic research focused on identifying potential toxicity biomarkers, expressed through the controlled exposure of endosulfan to the peel, pulp, and seed of Hass avocado, with exposure periods of 8 and 20 days, respectively.
This analysis aimed to identify the involvement of pathways such as: pyruvate metabolism (p = 0.0032856, −log(p) = 2.4834, Holm p = 0.30556, impact = 0.03409), glycolysis or gluconeogenesis (p = 0.0041979, −log(p) = 2.377, Holm p = 0.3862, impact = 0.03155), fatty acid biosynthesis (p = 0.01884, −log(p) = 1.7249, Holm p = 1.0, impact = 0.01123), sesquiterpenoid and triterpenoid biosynthesis (p = 0.0435, −log(p) = 1.3615, Holm p = 1.0, impact = 0.0), phenylalanine metabolism (p = 0.047375, −log(p) = 1.3244, Holm p = 1.0, impact = 0.0), cutin, suberin and wax biosynthesis (p = 0.070352, −log(p) = 1.1527, Holm p = 1.0, impact = 0.0), biosynthesis of unsaturated fatty acids (p = 0.085412, −log(p) = 1.0685, Holm p = 1.0, impact = 0.0), fatty acid elongation (p = 0.089145, −log(p) = 1.0499, Holm p = 1.0, impact = 0.0), glycosylphosphatidylinositol (GPI)-anchor biosynthesis (p = 0.10762, −log(p) = 0.96811, Holm p = 1.0, impact = 0.0), fatty acid degradation (p = 0.14009, −log(p) = 0.8536, Holm p = 1.0, impact = 0.0).
This indicates the activation of lipid oxidation pathways, leading to the production of oxylipins,44 and other lipid peroxidation products as oxidative damage progresses under endosulfan exposure. As the exposure time increases, metabolic dysregulation intensifies, resulting in the biosynthesis and accumulation of specific potential toxicity biomarkers. In this study, five volatile compounds were identified as key biomarkers: (E)-2-octenal (V93), oct-3-en-2-one (V86), decanal (V129), nonanal (V102), and hexanal (V29). These VOCs are directly associated with unsaturated fatty acid degradation and lipid peroxidation, processes exacerbated by the inhibition of detoxification enzymes affected by endosulfan, making them reliable indicators of xenobiotic-induced oxidative stress in Hass avocado.
In Fig. 11A, a lower density of connections between the metabolites identified in the pulp is observed after an 8 day exposure to endosulfan. This exposure process induces the alteration of pathways such as lipid metabolism, phospholipid pathway, sphingolipid pathway, fatty acid biosynthesis, and toluene degradation. It can be observed that, at this early stage, endosulfan begins to cause lipid alterations and degradation pathways of VOCs, but no massive disruption occurs in the pulp of the Hass avocado yet. After 20 days (Fig. 11B), a higher density of connections between metabolites and biochemical pathways is observed, such as lipid metabolism, sphingolipid pathway, furfural degradation, toluene degradation, phospholipid pathway, and fatty acid biosynthesis. A greater number of enzymes involved in the biochemical alteration processes is observed, indicating that endosulfan causes significant disruption in lipid homeostasis and other pathways for aldehyde degradation. This explains the increase in analytical signals of compounds like hexanal, as the accumulation of the xenobiotic is biochemically enhanced, inhibiting key enzymes and inducing lipoperoxidation processes and degradation of unsaturated fatty acids, leading to the formation and increase in the number of aldehyde-type compounds.
For the case of the seed, Fig. 12A shows a similar trend to the metabolite connection network in the peel and pulp. Identifying, at an 8 day exposure period, few connections between biochemical pathways and metabolites, with the activation of pathways such as: TCA cycle, toluene degradation, lipid metabolism, monoterpenoid biosynthesis, lipid biosynthesis, sesquiterpenoid and triterpenoid biosynthesis, phospholipid pathway, pinene, camphor and geraniol degradation, and fatty acid biosynthesis. Based on the involvement of these biochemical pathways, an initial response to endosulfan toxicity is evident, primarily affecting lipid metabolism and the degradation of monoterpenes and sesquiterpenes. This suggests the induction of oxidative processes, as these pathways are associated with an early stage of metabolic stress and cellular adaptation. At 20 days of exposure in the seed (Fig. 12B), interaction with endosulfan leads to the activation of biochemical pathways such as: sphingolipid pathway, furfural degradation, pinene, camphor and geraniol degradation, phenylalanine metabolism, lipid biosynthesis, toluene degradation, sesquiterpenoid and triterpenoid biosynthesis, phospholipid pathway, fatty acid biosynthesis, TCA cycle, and lipid metabolism. The number of implicated biochemical pathways increases, with the additional involvement of the phenylalanine metabolism pathway. Furthermore, it is observed that after 20 days of exposure, there is a marked intensification of lipid and oxidative degradation, activating aromatic detoxification pathways, increasing enzymatic participation, and revealing a systemic disturbance affecting critical metabolic routes. This process drives Hass avocado into a state of generalized metabolic stress, amplifying enzymatic activity and triggering the opening of new detoxification and degradation pathways.45
This lipophilic organochlorine pesticide is a known inhibitor of several key enzymes, including cytochrome P450 monooxygenases (CYPs)—essential for lipid metabolism, aldehyde dehydrogenases (ALDH)—which convert toxic aldehydes into carboxylic acids, lipoxygenases (LOX)—catalysts in the oxidation of polyunsaturated fatty acids and the formation of hydroperoxides,46 alcohol dehydrogenases (ADH)—involved in converting alcohols into aldehydes, fatty acid synthase (FAS) enzymes—which drive the development of precursor compounds for oxylipins and aldehydes, and peroxisomal enzymes such as acyl-CoA oxidase and catalase—both affected by the accumulation of free radicals, and involved in the β-oxidation of fatty acids.47
The inhibition of these enzymes leads to the accumulation of oxidized lipids,48 further enhancing lipid peroxidation and promoting the formation of VOCs such as hexanal, (E)-2-octenal, and nonanal. These compounds serve as chemical signals that reflect the plant's—and specifically the fruit's—response to chemical stress caused by environmental contaminants like endosulfan.49 The production of these compounds exacerbates mitochondrial dysfunction, favoring pathways such as fatty acid biosynthesis, sphingolipid metabolism, and the lipoxygenase (LOX) pathway.50 As a result, a specific distribution of the affected enzymes and their metabolic impact can be observed in each part of the fruit. In the peel, enzymes such as CYPs, LOX, and ALDH are disrupted, leading to increased lipid peroxidation and the production of six- and nine-carbon aldehydes. In the pulp, enzymes like ADH, CYPs, and FAS are primarily affected, promoting the accumulation of aldehydes and ketones.51 In the seed, the inhibition of CYPs, ALDH, and peroxisomal enzymes alters the deeper defense metabolism, causing the retention of aldehydes.52
Chemically, these toxicity biomarkers are formed through the lipoxygenase (LOX) and hydroperoxide lyase (HPL) pathway, which represents the main route for the generation of volatile aldehydes and ketones in plants, triggering the onset of lipid peroxidation.53 Unsaturated fatty acids such as linoleic acid and oleic acid are oxidized by LOX to form fatty acid hydroperoxides. These hydroperoxides are subsequently cleaved by the action of HPL,54 producing six- and ten-carbon aldehydes, some of which can undergo isomerization, as in the case of (E)-2-octenal, or be reduced to their corresponding alcohols.
Additionally, β-oxidation in peroxisomes contributes to the formation of intermediate compounds such as medium-chain aldehydes and ketones. These biomarkers are involved in plant defense pathways against pathogens or oxidative stress induced by xenobiotic agents.55 Furthermore, they participate in the biochemical routes responsible for fruit aroma and flavor, modulating organoleptic properties, plant signaling processes, and lipid oxidation.56 Table 4, presents the codes of the main enzymes involved in the biochemical alterations induced by endosulfan, along with selected representations of crystalline structures reported in the Protein Data Bank (PDB), providing a basis for future computational metabolomics research applied to these plant enzymes.
Enzymes | EC code | Main function | Example representative PDB entry | Species |
---|---|---|---|---|
Cytochrome P450 monooxygenase (CYP) | EC 1.14.-.- | Oxidation of organic compounds by the incorporation of an oxygen atom | 6L8H | Arabidopsis thaliana (Cytochrome P450) 57 |
Aldehyde dehydrogenase (ALDH) | EC 1.2.1.3 | Oxidation of aldehydes to carboxylic acids using NAD+/NADH as cofactor | 4PXL | Zea mays58 |
Lipoxygenase (LOX) | EC 1.13.11.12 | Oxidation of polyunsaturated fatty acids to form hydroperoxides | 1IK3 | Glycine max59 |
Alcohol dehydrogenase (ADH) | EC 1.1.1.1 | Reversible conversion of alcohols to aldehydes or ketones using NAD+/NADH | 1YQD | Populus tremuloides60 |
Fatty acid synthase (FAS) | EC 2.3.1.85 | Synthesis of long-chain fatty acids from acetyl-CoA and malonyl-CoA | 2IX4 | Arabidopsis thaliana61,62 |
1W0I | ||||
Acetyl-CoA carboxylase (ACC) | EC 6.4.1.2 | Carboxylation of acetyl-CoA to form malonyl-CoA, a key step in lipogenesis | 1OD4 | Saccharomyces cerevisiae63,64 |
Acyl-CoA oxidase | EC 1.3.3.6 | Oxidation of acyl-CoA to trans-2-enoyl-CoA in the β-oxidation of fatty acids | 1W07 | Arabidopsis thaliana65 |
Catalase | EC 1.11.1.6 | Decomposition of hydrogen peroxide into water and oxygen, protecting against oxidative stress | 1 A4E | Saccharomyces cerevisiae66 |
This study makes a pioneering contribution to the emerging field of xenovolatilomics by identifying a series of volatile organic compounds as potential toxicity biomarkers in Hass avocado following pesticide exposure. The identification of (E)-2-octenal, oct-3-en-2-one, decanal, hexanal,67 and nonanal provides a solid foundation for future development of rapid diagnostic tools aimed at detecting agrochemical contamination. Moreover, the study underscores the tissue-specific enzymatic disruptions caused by xenobiotics like endosulfan, offering a novel plant-based model for investigating stress-induced metabolic responses.68
It is important to highlight that currently, only a limited number of crystallized and isolated enzymes from Persea americana are available in the Protein Data Bank (PDB). As a result, the enzymatic structures referenced in this study are drawn from representative plant and model species such as Saccharomyces cerevisiae, Arabidopsis thaliana, Populus tremuloides, Glycine max, and Zea mays. These examples provide a valuable biochemical framework and serve as analogs for future applications in computational metabolomics and enzyme modeling in non-model tropical crops like Hass avocado. Establishing these foundational comparisons may foster new research pathways in structural enzymology and strengthen predictive insights into xenobiotic interactions within food systems.
In addition, this work established the biochemical context for the formation of these biomarkers by identifying key enzymatic targets, listing their respective EC codes, and providing representative crystal structures reported in the Protein Data Bank (PDB). This biochemical mapping lays the foundation for future research, including the development of validation models for toxicity biomarkers and the application of computational metabolomics to predict enzyme inhibition patterns induced by xenobiotics.
Importantly, the unknown compound VX83 was tentatively identified as 4-pentenal, with a similarity index (SI) of 83%, highlighting its potential as a novel biomarker pending full structural and functional elucidation. Moreover, this study emphasizes the relevance of response metabolites formed under xenobiotic stress, which serve as additional indicators of biochemical disruption. The xenovolatilomic profile revealed previously unreported VOCs in Hass avocado tissues, including 2-heptanone (V40), cyclohexanecarboxaldehyde (V49), octyl acetate (V132), veratrole (V111), dipentyl ketone (V140), and 1,9-decadiene (V142)—each contributing to the overall stress signature associated with endosulfan exposure. Overall, this research successfully integrates xenovolatilomic profiling with statistical and biochemical modeling, confirming the reliability of these VOCs as toxicity indicators. It also demonstrates temporal metabolic differentiation (8 vs. 20 days of exposure), and addresses critical challenges in the early detection of emerging contaminants. These findings have direct implications for food quality monitoring, export compliance, and the long-term economic sustainability of avocado production systems in the Caldas region.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5fb00163c |
This journal is © The Royal Society of Chemistry 2025 |