Open Access Article
Georgia M. Sinclair
a,
Sarah L. Green
ab,
Ryan Lester
ac,
Katherine J. Jeppe
d,
Steven D. Melvin
c,
Sara M. Long
e,
Oliver A. H. Jones
f and
David J. Beale
*a
aEnvironment, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Ecoscience Precinct, Dutton Park, Queensland 4102, Australia
bDepartment of Ecological, Plant and Animal Sciences, La Trobe University, Albury-Wodonga Campus, Victoria, Australia. E-mail: David.Beale@csiro.au
cAustralian Rivers Institute, Griffith University, Gold Coast Campus, Southport, Queensland 4215, Australia
dMonash Proteomics & Metabolomics Platform, Department of Biochemistry and Molecular Biology, Biomedicine Discovery Institute, Monash University, Clayton, Victoria 3800, Australia
eAquatic Environmental Stress Research Group (AQUEST), School of Science, RMIT-University, Bundoora, Victoria 3083, Australia
fAustralian Centre for Research on Separation Science (ACROSS), School of Science, RMIT University, Bundoora West Campus, Bundoora, Victoria 3083, Australia
First published on 16th April 2026
Field-based environmental metabolomics offers a powerful tool for examining how organisms respond to complex mixtures of chemical and environmental stressors. When coupled with traditional ecotoxicology, metabolomics can help reveal mechanistic pathways; however, laboratory-based tests cannot fully replicate the dynamic, multifactorial conditions of natural ecosystems. This review synthesises current advances, challenges, and opportunities in applying metabolomics to ecotoxicology under real-world field conditions. We highlight the growing use of wild-caught organisms, caged exposures, mesocosms, and laboratory studies using field-collected samples to detect sub-lethal metabolic disruptions associated with contaminants such as PFAS, metals, organic pollutants, and wastewater-derived mixtures of compounds. Key themes include the sensitivity of metabolomics to early physiological changes, integration with complementary chemical and ecological data, the challenges in distinguishing natural variability from contaminant effects, the importance of establishing baselines and dose–response relationships, and the need for improved QA/QC and metadata reporting. As the methodological and logistical challenges are overcome, metabolomic profiles from field-exposed organisms are increasingly demonstrating value for environmental risk monitoring and forecasting. Environmental metabolomics has been successfully used for environmental monitoring, supporting regulatory frameworks, and identifying mechanistically grounded biomarkers of exposure and effects. However, to realise its full potential, coordinated efforts among current and future metabolomics practitioners are still needed to advance the current Metabolomics Standards Initiative (MSI) guidance. The MSI should, ideally, be expanded to include common standardised workflows, strengthen bioinformatics infrastructure, expand case studies, and fully embed and integrate metabolomics within routine environmental assessment and decision-making processes, thereby transitioning these ‘academic’ approaches into practical regulatory tools.
Environmental metabolomics is maturing as a powerful tool for investigating biochemical responses to environmental stress.1 Metabolomics provides a comprehensive snapshot of an organism's physiological state and is highly sensitive to early changes induced by contaminants and other stressors.1 Because metabolites are downstream products of gene expression and protein activity, metabolomics can detect subtle, sub-lethal effects (biomarkers) before traditional endpoints are observed (i.e., reduced growth or impaired reproduction).5 This sensitivity makes metabolomics particularly valuable for assessing contaminants, including per- and polyfluoroalkyl substances (PFAS), metals, organic pollutants, and other complex mixtures in natural environments. It also highlights its potential role in effects-based monitoring and investigating cumulative effects on wildlife.1,5–11
Despite this promise, metabolomics is still not widely used in environmental regulation or risk assessment. Several factors underpin this gap. Natural variability makes it challenging to distinguish contaminant-driven effects from background noise; baseline metabolic ranges are poorly defined for most species; and consistent frameworks linking metabolic changes to adverse outcomes or regulatory thresholds remain limited. In addition, inconsistencies in field sampling, analytical workflows, QA/QC, metadata reporting, and bioinformatic interpretation continue to hinder reproducibility and comparability across studies. As a result, although metabolomics can provide mechanistic insight, its operational use in routine monitoring and regulatory decision-making remains constrained.
Laboratory-based and field-based metabolomics each address different components of the challenge to make metabolomics mainstream. This complementary relationship is illustrated in Fig. 1, which contrasts the controlled mechanistic resolution of laboratory metabolomics with the natural environmental exposure conditions and integrative exposure profiles captured in field-based studies. The figure illustrates how laboratory studies provide controlled, mechanistic insight into metabolic disruptions caused by specific contaminants, while field studies capture ecologically realistic responses shaped by variable environmental conditions and mixed stressors. Together, these complementary perspectives strengthen the interpretation of metabolic biomarkers and enhance the ecological relevance of metabolomics in environmental assessment. When integrated, the two approaches provide a more complete picture of exposure and effect. Field-based metabolomics cannot replace controlled experiments; rather, it complements them by capturing ecologically realistic exposure conditions that help generate and prioritise hypotheses for mechanistic testing. Addressing the practical and interpretive challenges of field-based metabolomics, along with laboratory and mesocosm validation, is essential for advancing metabolomics toward regulatory readiness.
To avoid overinterpretation, field-based metabolomics measurements should, ideally, be embedded within a framework that includes direct environmental chemistry (e.g., water, sediment, soil), organism contaminant burdens (bioaccumulation), and controlled experiments that constrain dose and other confounding variables. In this hybrid continuum, field studies provide exposure realism and relevance, while laboratory and mesocosm studies establish causality, mechanism, and quantitative thresholds. This mirrors the established trajectory of environmental health science, where biological mechanisms are first resolved under controlled laboratory conditions and are subsequently confirmed and contextualised through field observations of exposure, bioaccumulation, and ecological impact. The objective of this review is to synthesise key developments, challenges, and opportunities for the application of metabolomics in field-based ecotoxicology and environmental assessment. Rather than presenting an exhaustive survey, we focus on major themes emerging from studies using wild-caught organisms, caged deployments, mesocosms, field-derived samples, and wildlife-derived cell lines. We highlight where metabolomics has already demonstrated value in detecting sub-lethal effects, identifying biomarkers, and linking exposure to mechanistic pathways. We also consider where limitations continue to impede adoption. By drawing together methodological insights and identifying areas where standardisation, improved data infrastructure, and clearer interpretative frameworks are needed, we aim to clarify how field-based metabolomics can contribute effectively to environmental monitoring and risk assessment in future.
The sensitivity of metabolomics is, however, somewhat of a double-edged ‘biochemical’ sword. On the one hand, a specific early warning signal of potential harm to an organism or population would allow action to be taken before permanent damage occurs. On the other hand, how do we know that any observed changes will cause adverse effects, specific and large enough to warrant action? This is analogous to a metabolomics point-of-departure (POD) concept (i.e., a dose, concentration, or exposure level at which a defined biological response is first detected and is then used as the starting point for deriving health- or environment-based guideline values).14 Changes in metabolism occur constantly, including diurnal fluctuations, circalunar rhythms, seasonal variations in light and temperature, and fluctuations related to feeding and normal homeostatic mechanisms. In a human context, it is possible to detect metabolic changes before and after our morning coffee, or before and after taking medication, food, or alcoholic drinks.15 In each case, there might be a detectable biological change; it might even be a significant change from the normal functioning, yet remain biologically inconsequential. How do we separate natural changes from damaging alterations in metabolism with potentially irreversible downstream adverse effects, and, if we could, would such biochemical data be sufficient for environmental regulators to act? These challenges, along with suggestions for addressing them, are discussed in Beale et al.16
The key question for metabolomics is not whether a change is detectable, but whether the exposure to a stressor causes a metabolic shift, and how that shift fits within the broader cause-effect continuum—i.e., contextualised PODs. Addressing this requires asking several follow-up questions, such as:
1. Is the change outside normal ranges for the metabolites involved in the species in question? If so, how many metabolites are outside normal ranges?
2. How long does any observed change persist? Is it temporary or lasting?
3. Do the metabolic shifts lead to downstream biological effects?
4. Do these changes indicate potential harm at the organism, population, or community level, and can they be linked to Adverse Outcome Pathways (AOPs) – linking the biological response following a chemical stressor to adverse outcomes?
The first question is the most challenging because the normal ranges for most metabolites are not well-defined, even in humans. There are some exceptions; the normal range of blood glucose concentrations is well defined, for example, to enable the diagnosis of diabetes. There are also reference ranges for many hormones and enzymes, although these are proteins rather than metabolites. In addition, metabolomics is a static technique that cannot directly capture data on dynamic processes; however, this limitation can be addressed with appropriate experimental designs that incorporate sufficient longitudinal sample collection. In wild populations, such temporal inference is typically derived from repeated cross-sectional sampling rather than true longitudinal tracking of individuals, as repeated sampling of the same organism is often infeasible.
Although the remaining questions are central to interpreting metabolomic perturbations, they are not explored further in this introductory section. Our aim here is to outline the conceptual challenges that arise when attempting to disentangle natural variability from contaminant-driven effects in field-based metabolomics. Detailed consideration of persistence, biological consequences, and links to AOP frameworks requires study-specific evidence—including longitudinal datasets, matched contaminant burdens, mechanistic assays, and ecological context—which are addressed later in the review through dedicated case studies and methodological discussions. The distinction is important because, at this stage, these questions are intended to underscore the inherent complexity of interpreting field-based metabolomics. At the same time, their resolution relies on the acquired data and applied frameworks, which will be addressed in later sections of this review.
To bridge these exposure-driven considerations with analytical interpretation, the following section outlines how different metabolomics acquisition and quantification modalities are selected to address specific biological questions arising from field, caged, and mesocosm study designs.
There are two broad approaches to metabolomics, namely non-targeted and targeted. In a non-targeted approach, the analyst attempts to detect, identify and at least semi-quantify as many endogenous metabolites as possible, then looks for any differences between groups, usually via some form of multivariate statistical analysis.1 In a targeted approach, only specific metabolites or biochemical pathways are assessed, allowing precise quantification of known metabolites linked to defined mechanisms.5,21 There is a third approach, widely targeted metabolomics, which is less commonly used and combines elements of the first two. Here, untargeted metabolomics is used for initial metabolite screening for a particular factor; targeted metabolomics is then used for validation and to explore and identify metabolite classes from step one in more depth.22 Extremely beneficial to environmental field-based metabolomics are methods that simultaneously perform quantification and discovery (SQUAD) from the same biological extract within a single analytical run.23 Modern hyphenated or hybrid mass-spectrometry workflows, particularly those using QToF (Quadrupole Time-of-Flight) instruments, now enable acquisition modes that capture both untargeted and targeted metabolite data simultaneously (e.g., Mass Spectrometry with Elevated Energy (MSE), Data-Independent Acquisition (DIA), Sequential Window Acquisition of All Theoretical Mass Spectra (SWATH)). These approaches effectively bridge the gap between traditional methods by broadening metabolite coverage while retaining the sensitivity and quantification accuracy of targeted analysis. It also makes it possible to include non-target contaminant/suspect screening in a metabolomics workflow, helping to characterise the full profile of exposure stressors under study. This framework is illustrated in Fig. 2 (Panels A and B).
Targeted metabolomics is helpful if one knows the specific metabolic pathways that might be affected by a pollutant(s) of interest. Some have argued that targeted metabolomics is not true metabolomics, but rather metabolic profiling, since, by definition, metabolomics (and any omics science) seeks to identify as many things as possible. However, if there are specific metabolites and/or metabolic pathways that are changing, and some that are not, it makes sense to focus on the former and not ‘dilute the signal’ (so to speak) with non-relevant data. This focus becomes especially valuable when those metabolites clearly map onto known Molecular Initiating Events (MIEs) or Key Events (KEs) within AOPs,24 allowing targeted metabolomics to directly quantify AOP-relevant biochemical changes. Noting that MIEs are the first measurable biological interactions at the molecular level within a biological system and initiate a pathway leading to an adverse effect, KEs represent the necessary biological change along the pathway, and an AOP links a MIE to an adverse outcome through a series of causally connected KEs. By measuring metabolites already associated with toxicity pathways, which are cellular response pathways that can lead to adverse biological effects when perturbed by chemical stressors, and validated under controlled laboratory conditions, targeted metabolomics applied to a field-based setting strengthens mechanistic interpretation and enhances the ability to link pollutant exposure to potential adverse outcomes. The practical implementation of these analytical strategies in environmental settings introduces additional constraints that influence study design, data quality, and interpretability.
Correlating metabolite changes with pollutant concentrations is also not straightforward. Most environmental omics-based studies attempt to assess impact after it has occurred, but applying this knowledge outside the lab requires knowing the state of the environment in its ‘unaffected’ state. Shah et al.27 addressed this challenge by applying both targeted and untargeted metabolomics to surface sediments (n = 50) from four pristine estuaries along the Western Cape York Peninsula in Far North Queensland, Australia. Their analysis revealed clear taxa–function relationships that predicted microbial community metabolic potential, with pathways such as carbon metabolism and amino acid biosynthesis showing strong positive correlations with community-level metabolic outputs—including 2-oxisocaproate, tryptophan, histidine, citrulline, and succinic acid. These findings establish a valuable baseline microbial metabolic blueprint against which future ecosurveillance efforts can assess environmental change. They also provide a necessary foundation for emerging effects-based and weight-of-evidence approaches in ecological risk assessment, although routine regulatory adoption remains in its early stages.
Subsequent research incorporated targeted screening for both organic contaminants (e.g., pesticides) and inorganic pollutants (e.g., metals) to determine how perturbed systems diverge from this baseline across similar geographical regions.28–31 This integrative approach enables researchers to link shifts in community metabolism not only to natural environmental variation but also to specific contaminant pressures, strengthening the diagnostic power of ecosurveillance frameworks.3
Bioaccumulation is another important factor to consider. This occurs when an organism absorbs pollutants from the environment at a rate greater than it can excrete and/or metabolise them, resulting in a concentration in the body much higher than in the surrounding media.32–35 Beale et al.36,37 demonstrated a significant bioaccumulation of perfluoroalkyl sulfonic acids (PFSAs), a 100-fold increase in serum relative to surrounding water concentrations, and that these elevated body burdens were associated with a range of adverse biological effects on freshwater turtles (Emydura macquarii macquarii). The effects included altered egg composition (particularly in magnesium to calcium ratios, potentially affecting eggshell strength); altered biochemical profiles and hatchling deformities (due to adult females offloading PFAS into eggs). Similar observations have been made in snakes, toads and frogs.11,38,39 These findings highlight the critical need to overcome such experimental challenges by measuring contaminants directly within the surrounding environment—such as water, sediment, soil, and local flora and fauna—because exposure concentrations, exposure routes (e.g., waterborne uptake, dietary intake, dermal adsorption) and the rates at which organisms take up and eliminate contaminants all vary substantially across species, habitats, and environmental conditions. Without such context, it is difficult to determine whether observed metabolic or developmental effects stem from direct rapid/recent toxicant exposure or long-term, trophic transfer, or cumulative environmental loading.
Matrices, such as soils or sediments, must often be finely crushed under liquid nitrogen and extensively sonicated to ensure that all microbial metabolites are released from the matrix;42 samples from marine systems may contain high concentrations of salts that need to be removed.43 Some biofluids may need to be treated with β-glucuronidase enzymes to convert hydroxylated and carboxylated metabolites back to the parent form.44 Samples from complex matrices, such as plants, may require fractionation or the use of QuEChERS (Quick, Easy, Cheap, Effective, Rugged, and Safe) sample preparation to separate out pigments and other large molecules.44 This type of work is particularly challenging in remote locations, where logistical constraints can limit sampling frequency, accessibility, and the ability to comprehensively characterise exposure sources. In addition, collected samples must be kept cold to prevent post-collection metabolic changes during transport, making strict cold-chain maintenance essential to preserve sample integrity and ensure reliable metabolomic and contaminant analyses. Several studies have directly compared field-compatible preservation methods (e.g. RNAlater™ and chemical quenching) with conventional −80 °C storage, showing that some approaches can yield comparable metabolomic profiles for specific tissues and metabolite classes.45,46 These findings support their use in field-based metabolomics where immediate ultra-cold storage is impractical, while also highlighting the need for method-specific validation.
![]() | ||
| Fig. 3 Conceptual framework for field-based metabolomics approaches and its application to ecological assessment and regulatory decision-making. | ||
Species-specific metabolic responses have also been observed, with differences in contaminant sensitivity and biotransformation capacity influencing metabolomic outcomes. For example, ringed seals showed stronger metabolic responses to PFAS than polar bears despite lower contaminant concentrations, highlighting the importance of physiological traits in interpreting exposure effects.47 This cross-species analysis of exposure and metabolome across regions allows for the identification of correlations across broader contamination gradients/profiles and species.
Several studies have proposed oxidative stress and disruptions to energy metabolism as common pathways affected by metals, organohalogens and many other contaminants,49,50 with some metabolites identified as potential biomarkers of exposure. Gago-Tinoco et al.50 reported that while certain metabolites showed promise as biomarkers of exposure, some of these were tissue-specific, though some were found to be common across all sample types within a given site. Despite discussing metals and metabolites in parallel, the study did not analyse correlations between contaminant concentrations and metabolic profiles, limiting the strength of mechanistic inference. In contrast, Gauthier et al.49 highlighted that PCB concentrations in Arctic charr (Salvelinus alpinus) muscle were 29 ng g−1, within ranges known to induce reproductive toxicity in other fish species, and linked the metabolic differences observed between populations in two lakes to these elevated PCB levels. Melvin et al.41 demonstrated, using mosquitofish (Gambusia holbrooki) from a metal(loid)-contaminated wetland, that suites of metabolites, owing to their integrative roles within biochemical pathways, are sensitive indicators of metal stress. Nevertheless, their application as environmental biomarkers depend on rigorous validation and quantitative assessment to confirm reliability and ecological relevance in monitoring programmes.
In another example, metabolomics was applied as part of a regulatory investigation into PFAS contamination in freshwater turtles (Emydura macquarii macquarii) in Queensland (Australia), providing sensitive biochemical evidence of sub-lethal impacts linked to pollutant exposure.32,36 Turtles sampled from PFAS-impacted catchments showed extreme PFAS bioaccumulation, with serum PFOS concentrations reaching up to 235-fold higher than surrounding water.51 Metabolomic analyses revealed consistent alterations in purine metabolism, glycerophosphocholine and lipid metabolism, immune-related pathways, and central carbon metabolism, all of which correlated with elevated PFAS burdens.52 Integrated microbiome–metabolome studies further showed disruptions to amino acid, butanoate, and nucleotide metabolism, demonstrating combined host–microbiome responses to PFAS exposure.53,54 Additional sampling identified hormone disruption, increased gout risk in adult turtles (which is deadly for reptiles), altered mineral ratios in eggs, and hatchling deformities, with population modelling predicting long-term decline at highly contaminated sites.36 Collectively, these findings demonstrate that metabolomics, embedded within a regulatory assessment framework, can provide mechanistic evidence linking PFAS exposure to biological impairment and support improved ecological risk assessments for contaminated wildlife. Other studies have examined reptiles and amphibians in similar regulatory settings,11,38,39 but unlike the freshwater turtle study, they did not focus on establishing proof of environmental harm.
Despite these insights, several important limitations persist across studies. Many did not establish direct correlations between contaminant concentrations and metabolite responses50,55 or the correlations were not strong,11,38,39 limiting mechanistic interpretations.56,57 For example, Nzabanita et al.55 measured differences in metabolites between Pacific black ducks (Anas superciliosa) based on site location, but observed metabolite patterns did not correlate with feather metal concentrations. Importantly, this study did not measure non-metal contaminants and therefore could not assess whether unmeasured chemical exposures contributed to the observed metabolomic variation. This finding raises concerns about the development of biomarkers, as the metabolic impacts of all stressors/variables present in an environment must be understood to infer the drivers of metabolomic responses.
In the study by Nzabanita et al.,55 tissue mismatch introduces additional limitations, with the authors measuring the accumulation of metals in one tissue and the metabolome in another, potentially obscuring relationships due to differential accumulation and metabolic activity. Additionally, small sample sizes, sex bias, and limited geographic coverage reduced statistical power and generalizability.
Analytical constraints can impact data quality. For instance, suboptimal sample storage temperatures reaching higher than the recommended −80 °C for long-term biobanking47 and semi-quantitative metabolomics approaches55 may affect metabolite stability and detection. In some cases, contaminant concentrations were not reported directly but referenced from prior studies,49 complicating exposure assessment. However, a further methodological challenge lies in the fundamental mismatch between metabolomics workflows, which aim to capture all measurable metabolic changes, and traditional contaminant analyses, which typically rely on targeted methods that measure only a predefined list of chemicals. This discrepancy creates an inherent imbalance in data-acquisition breadth, making it difficult to align wide-scope metabolic responses with narrow contaminant panels. Yet this gap is surmountable: modern high-resolution mass spectrometry within metabolomics pipelines now enables both targeted and untargeted acquisition modes, supporting broad suspect-screening and non-targeted contaminant discovery in the same biological sample. Integrating these complementary datasets, together with improved annotation and mapping of metabolomic pathways, will expand the network of potential biochemical interactions and strengthen our ability to link biological changes directly to environmental stressors, improving correlation and, ultimately, mechanistic inference from wild-caught organisms.
Mussels are particularly well-suited for such studies due to their sedentary lifestyle, filter-feeding behaviour, contaminant bioaccumulation capacity, and ease of deployment. Studies using Mytilus galloprovincialis in Italy demonstrated that 1H NMR metabolomics, combined with histology and immunohistochemistry, can sensitively detect petrochemical pollutants such as mercury and PAHs.59–61 Disruptions in osmoregulation, energy metabolism, and neurotransmission were observed across gill, muscle, and digestive gland tissues, with carbonic anhydrase proposed as a biomarker. However, regulatory relevance was limited by single-site exposures, narrow chemical profiling, and a lack of dose–response data.
Similar findings were reported using a clam species (Ruditapes decussatus) transplanted to contaminated agricultural and urban runoff sites in Spain. Digestive gland metabolomics using LC-MS revealed elevated levels of amino acids, osmotic protectants, and nucleotides after 7 d of exposure, likely reflecting disturbances in osmoregulation and energy metabolism. Metabolite levels dropped below those of control clams following 22 d, suggesting a two-phase metabolic response: initial compensation followed by metabolic exhaustion.62 Taurine emerged as a potential biomarker for complex contaminant mixtures, though the authors emphasized that metabolomic responses in field studies should be interpreted by considering environmental parameters such as trophic conditions and potential time-dependent patterns.
Caged zebra mussels (Dreissena polymorpha) were used to demonstrate that combining metabolomics with classical biomarkers is beneficial.63 The study suggested that site-specific metabolic profiles revealed impacts on osmoregulation and anaerobic metabolism, while traditional energy biomarkers were less consistent, lactate was identified as a promising health indicator.
In the Great Lakes, mussel metabolomes correspond closely with local chemical pollution profiles, underscoring the utility of combining chemical and metabolic data64 to provide a holistic view of stressor impacts, though seasonal metabolic cycles and stressor interactions must be carefully considered.
Recent multi-omics studies using Sinanodonta woodiana, commonly known as Chinese pond mussel, revealed pronounced tissue-specific adaptive responses to environmental contamination. These studies identified key metabolites, including alanine, glutamate, and 7-oxocholesterol as consistent biomarkers of exposure across multiple tissues and sampling times, effectively discriminating streams with differing pollution loads.65,66 The inclusion of antioxidant enzymes and metallothionein responses strengthened causal inference, though biomarker validation remains ongoing.
Studies using other caged macroinvertebrates further highlight the value of field deployable metabolomics frameworks. Notably, work involving Brua and co-authors has advanced the application of caged invertebrates, particularly aquatic insect nymphs and crayfish, in real-world contaminant assessment.67–69 Izral et al.69,70 demonstrated that caged crayfish exhibit clear, energy-related metabolic shifts in response to food limitation and dissolved oxygen stress, identifying metabolites in tail muscle as sensitive indicators of sublethal physiological disruption. In a complementary in situ study in the Athabasca River, transplanted dragonfly nymphs (Ophiogomphus colubrinus) housed in baskets were used to assess the combined effects of municipal sewage effluent, polycyclic aromatic hydrocarbons (PAHs) from oil-sands extraction activities, and naturally eroded bitumen-derived contaminants (Fig. 4 illustrates field-deployable cages used to house dragonfly nymphs), for a region characterised by some of the world's largest oil deposits (northeastern Alberta, Canada). Despite exposure to complex mixtures of heavy metals and PAHs, nymph survival remained high and their metabolomes did not differ significantly among sites after one week. This may reflect both species-specific tolerance to these contaminants and the potential need for longer or more sensitive exposure designs in multi-stressor environments.68 Together, these studies emphasise the promise of caged macroinvertebrates as field-ready sentinels and demonstrate how metabolomics can deepen ecological interpretation in systems influenced by interacting anthropogenic and natural stressors.
Field-based metabolomics with fish species like fathead minnows (Pimephales promelas) and yellow perch (Perca flavescens) has also shown promise for identifying contaminant-driven biological responses. In the Great Lakes Basin, liver metabolite shifts were linked to contaminants such as DEET (N,N-diethyl-meta-toluamide) and bisphenol A, though many acted as markers of wastewater effluent rather than direct toxicants.71 Despite the constraints of single-tissue analysis and the lack of temporal resolution, the integration of metabolomics with chemical analyses offered valuable evidence for prioritising contaminants of emerging concern and emphasised the need for replication, multi-tissue analyses, and consideration of confounding stressors such as temperature.
Ekman et al.72 demonstrated that targeted and untargeted metabolomics could detect both estrogenic and oxidative stress responses in P. promelas exposed to wastewater treatment plant (WWTP) effluents. Dose–response relationships were observed for estrone and several pesticides (thiabendazole, malathion, and 1,4-dichlorobenzene), reinforcing the value of untargeted metabolomics in characterising complex mixtures. The authors emphasised that such tools are critical for assessing the diverse effects of anthropogenic contaminants.
Finally, Defo et al.73 highlighted the complexity of interpreting metabolomic data in caged P. flavescens exposed to urban WWTP effluent. Differences between caged and lab-reared fish suggested a caging effect, while limited metabolomic separation between effluent-exposed and reference fish raised questions about adaptive responses and biomarker sensitivity. Differences in profiles of liver metabolites emerged by week six, but the authors noted that larger sample sizes were needed to confirm these patterns. These findings underscore that contaminant exposure does not always result in immediate or measurable metabolomic disruption, raising critical questions about the timescales of adaptive responses and the sensitivity of current biomarker approaches in complex field environments.
While field-deployable caged metabolomics has clear value for detecting sublethal stress under realistic field conditions, major challenges remain in disentangling metabolomic signals from natural variability, seasonal cycles, and multi-stressor interactions. These limitations highlight the need for more standardised designs, longer exposures, and multi-tissue, multi-omics integration. Even so, the approach offers substantial opportunity; by coupling metabolomics with chemical analyses and complementary biomarkers of effect and exposure, caged organisms can become a powerful field-ready tool for diagnosing early ecological stress and strengthening environmental assessments within a regulatory framework.
These field observations highlight both the interpretive power of metabolomics and the considerations required when translating biological signals into regulatory or monitoring contexts.
However, because few such studies currently exist using metabolomics, this section also includes mesocosm-like experiments conducted in greenhouses and aquaria, where environmental factors such as temperature and photoperiod were not fully controlled but still incorporated field-relevant matrices or stressors. It is important to note that field-deployed mesocosm studies remain rare for several reasons, including the substantial logistical effort required to transport materials and maintain experimental systems in remote or uncontrolled environments; the need for strict regulatory and permitting approvals to deploy organisms or experimental infrastructure in natural habitats; and the challenge of preserving metabolite integrity through reliable onsite cold-chain procedures. Moreover, in situ mesocosms are vulnerable to unpredictable environmental variability (e.g., storms, heatwaves, hydrological shifts) and disturbance by wildlife or humans, complicating experimental control, replication, and sample recovery. These combined constraints, along with the high analytical demands of metabolomics, mean that many researchers opt for semi-controlled mesocosm systems as a practical compromise between environmental realism and experimental feasibility.
Across these reported studies, metabolomics was used to characterise organismal responses to a range of environmental stressors. These included rhizosphere microbial communities and plants subjected to nutrient and moisture-limited soil,78,79 insects exposed to a range of copper concentrations80 and mussels exposed to oily wastewater discharges before and after biofilm membrane reactor treatment.81
The only true field-deployed mesocosm study to date was conducted by Jeppe et al.,80 who spiked reference-site sediments with copper, placed them into tub-based mesocosms, and deployed them back into a wetland to assess impacts on aquatic macroinvertebrates. Laboratory-bred snails (Potamopyrgus antipodarum and Physa acuta) and the chironomid, Chironomus tepperi, were introduced at different time points, enabling assessment of responses across multiple biological levels. Metabolites from C. tepperi were quantified using a targeted LC-MS method.80,82 Importantly, metabolite shifts were detected at 60 mg kg−1 copper concentrations, lower than the concentration that was reported to affect larvae dry weight with an EC50 of 238 mg kg−1 for copper, demonstrating the higher sensitivity of metabolomics for early detection of sediment toxicity.
Other mesocosm studies, though not field-deployed, further highlight the utility of metabolomics for disentangling organism–environment interactions. Baker et al.78 profiled metabolites from rhizosphere soil under nutrient-limited, and moisture-limited conditions over 18 weeks in a greenhouse, revealing distinct shifts in nitrogen-containing metabolites and metabolites associated with moisture stress. Cai et al.79 used a plant-wastewater mesocosm to evaluate how nitrogen speciation (NH4+–N vs. NO3−–N) influences plant metabolic responses in constructed wetlands, showing clear differences in root exudate metabolite profiles that reflected nitrogen uptake, energy metabolism, and growth. Gornati et al.81 examined mussels (Mytilus galloprovincialis) exposed to untreated and treated oily wastewater within aquaria-based mesocosms, finding tissue-specific metabolite alterations consistent with differing levels of wastewater treatment efficacy. Gyawali et al.83 performed a similar experiment with mussels exposed to human wastewater and used metabolomics to identify potential biomarkers of exposure related to Norovirus contamination in shellfish, and Nguyen et al.84 to identify acute heat stress responses in Abalone.
Collectively, these studies demonstrate that metabolomics provides sensitive and diagnostically powerful endpoints in mesocosm-based ecotoxicology, even when environmental realism differs across systems. Although relatively few true field-deployed mesocosm studies have incorporated metabolomics, transcriptomic analyses have already been successfully applied in such settings,77 indicating that the logistical and methodological barriers are not prohibitive. This gap highlights a clear opportunity to expand metabolomics into field-deployable mesocosm experiments that would allow researchers to verify whether biomarker patterns and mechanistic stress responses observed under controlled conditions persist in natural environments. Broadening this work is essential for advancing environmental metabolomics toward routine use in biomonitoring and regulatory assessment frameworks.
Acclimation to laboratory conditions prior to experimentation is standard practice with wild caught organisms (and for commercially sourced taxa), to ensure the animals are unstressed and healthy, and to increase both statistical power and the reliability of results by reducing intraspecific variation of the test population. The acclimation process involves transitioning animals to laboratory housing (including water for aquatic species), lighting, feeding and handling. On the one hand, such acclimation may be particularly important for metabolomics research since an abrupt change to environmental conditions or the animals’ experience will likely have a physiological toll. On the other hand, little attention has been paid to how the acclimation process, or the use of long-standing laboratory-cultured lineages, might reduce the ecological relevance of metabolomic responses due to the ‘de-wilding’ of wild-collected organisms. Depending on the goal of the study (i.e., unravelling mechanistic information vs. predicting environmental risk), approaches for acclimating field collected organisms prior to laboratory testing may differ.
Timeframes for acclimating field-collected organisms to laboratory conditions vary widely, spanning anywhere from a few days to several months, and although lab conditions likely differ considerably from the collection site, these are often undisclosed. In a study in the USA, wild oysters (Crassostrea virginica) were acclimated for 30 d to constant temperature, salinity, photoperiod, and a commercial food, in filtered UV-sterilised artificial sea water.85 A similar metabolomics study with pearl oysters (Pinctada martensii) collected from Lingshui harbour, China, acclimated animals to laboratory conditions for just 3 d, and used filtered natural sea water.86 Neither study reported the environmental parameters present at the collection site, limiting interpretation of pre-exposure influences. Wild-caught adult topsmelt (Atherinops affinis) collected by beach seine in the USA were acclimated for 3 m to laboratory conditions prior to exposure to crude and dispersed oil, and responses were compared to unacclimated topsmelt embryos obtained from a commercial supplier.87 The authors attributed disparities in metabolomic responses to life-stage differences but did not consider how organism origin or acclimation history may have contributed to the observed variation.
Depending on the organism, it may be unlikely or unfeasible for metabolomics studies to use organisms that have not been fully acclimated to laboratory conditions. For example, Cladocerans and Chironomids are commonly cultured in ecotoxicology labs and reared for multiple generations to obtain enough healthy individuals for exposures,88,89 sometimes for as long as a decade.90 While this may contribute towards enhanced comparability between studies over time, the use of such populations raises uncertainty about how well the metabolism of these organisms reflects that of natural populations.
Failure to disclose the age of cultures from wild-caught populations is also common. For example, Dumas et al.91 described the exposure of a laboratory culture of Mediterranean mussels (Mytilus galloprovincialis) to sewage effluent but did not provide any information about when the culture was established, its origin, or its rearing conditions. Similarly, Santos et al.92 investigated metabolomic effects in stickleback (Gasterosteus aculeatus) exposed to copper, but while the authors indicate the test population originated from a pristine field site in England, they do not describe the conditions of the collection site or details surrounding the acclimation timeframe and simply state that the fish were maintained until sexually mature. Other studies fail to disclose the source of experimental animals altogether. Tang et al.,93 for instance, reported acclimation conditions for earthworms (Eisenia fetida) to environmental chambers, but did not specify if the worms were wild-caught or laboratory-cultured. Beyond the potential for differences in water quality and environmental parameters between field and laboratory to influence the metabolome and its susceptibility to chemical insult, failure to fully characterise the environment where animals are collected could overlook pre-exposure to relevant physical or chemical stressors.
Despite these challenges, the integration of field-collected organisms, tissues and fluids into laboratory-controlled metabolomics experiments remains a promising and rapidly growing area of ecotoxicology. By combining ecological realism with mechanistic clarity, these studies help validate biomarkers, improve causal inference, and strengthen the interpretation of field-based metabolomic datasets. Overall, such studies demonstrate the value of metabolomics in ecotoxicology but highlight the need for standardised reporting of organism origin, acclimation conditions, and environmental context, along with matched tissue analyses and robust statistical frameworks to enhance reproducibility and support biomarker development.
Applications include assessing the cellular effects of pollutants, identifying metabolic signatures linked to ecological stress, and comparing metabolic phenotypes across related species. Studies in primates (Rhesus macaques),95 for example, show that metabolomics can distinguish species-specific metabolic traits, while environmental toxicology research demonstrates its value for detecting biochemical disruptions caused by contaminants. Standardised metabolomics workflows developed for mammalian cell culture also provide a methodological foundation for applying these techniques to wildlife-derived cells.96 Research groups are establishing wildlife-derived cell culture biobanks dedicated to toxicity testing,97,98 such as the Australian ARI-Tox Marine Wildlife Cell Bank99 (https://aritox.com/mwcb/). While many global wildlife biobanks mainly focus on conservation,100–104 their wildlife cell lines are available to be used in ecotoxicology, especially mechanistic toxicology and chemical mode-of-action work.
Free-ranging organisms are likely to be exposed to complex mixtures of hazardous and non-hazardous chemicals, including natural products, breakdown products, nutrients and pollutants.113 Such mixtures can cause interactive, additive and non-monotonic effects that complicate analyses and obscure comprehension of field metabolomics findings. Complex mixtures vary between sites, seasons and samples, and are challenging to characterise, leading to further between-site and sample variance and Type II statistical errors (false negatives).
Addressing these strengths and limitations in practice depends not only on study design, but also on transparent reporting and rigorous analytical quality control.
Despite major advances in analytical instrumentation, field-based environmental metabolomics continues to suffer from inconsistent reporting of essential metadata. Compliance with the MSI remains low, with many studies omitting details necessary for assessing analytical rigour and reproducibility.123 This inconsistency is common amongst metabolomics studies but is particularly consequential for field-based environmental studies, where environmental variability amplifies the need for transparent and comprehensive method reporting.
About a third of the papers considered in this review used machine learning/predictive modelling such as defining predictive accuracy (Q2) beyond validation or using methods like concentration-response modelling to determine Benchmark Doses (BMD). These approaches require large independent data sets to build and test models, which is a limitation in many field-based metabolomics studies. Consequently, while AI- and machine-learning-driven predictive modelling represents a promising future direction for the field, its widespread and reliable application in environmental monitoring is likely to remain aspirational in the near term.
The DRomics framework (which uses BMD analysis) streamlines dose-response modelling for omics datasets, like metabolomics.126 It allows users to import and process omics data, identify responsive metabolites, fit various concentration–response models, and calculate BMD values. Tailored for ecotoxicology and environmental risk assessment, DRomics manages non-replicated datasets collected from field studies and supports comparisons across transcriptomics, proteomics, and metabolomics within a unified model. Tools such as DRomics-Interpreter offer biological annotation and pathway visualization.127 Lettoof et al.11 investigated dose–response relationships between contaminant concentrations (PFAS and metal(loid)s) and metabolomic alterations, in the livers and tissues of Ranoidea moorei (motorbike frogs) collected across contamination gradients in urban wetlands. Wild-caught frogs were used, and their chemical burden linked to whole-organism health metrics derived from metabolomics datasets. Such case studies show DRomics helps identify dose-responsive metabolic pathways, distinguish early versus late molecular effects, compare omics sensitivities, and establish quantitative thresholds for AOPs and ecological risk assessments.
As computational power is increasing, untargeted metabolomics analysis has looked toward networking, such as GNPS (Global Natural Products Social Molecular Networking) to address the challenges of identifying unknown metabolites. These networks organise fragmentation spectra into molecular networks that reflect structural similarity, supporting annotation propagation and comparative analyses across datasets.128 This approach is powerful for exploring chemical space without prior knowledge, but it is constrained by incomplete spectral libraries, leaving a ‘dark metabolome’ of unannotated features. The dark metabolome, as well as the spectral libraries, can be confounded by in-source fragments and other artefactual features that form clusters unrelated to true metabolites.129 Careful feature curation, fragmentation-aware workflows, and orthogonal validation are therefore essential to avoid overinterpretation if these approaches are to be considered.
Only a quarter of studies reported using internal standards, half of that reported including pooled QC samples, and only 5% documented system suitability testing, despite MSI and Chemical Analysis Working Group (CAWG) guidance emphasising their importance.123 Reporting of sample handling—collection conditions, storage regime, freeze–thaw cycles—was similarly low (16%), even though storage conditions significantly affect metabolite stability.64,123 No reviewed studies explicitly stated that sample preparation or analysis order was randomised, a key requirement for avoiding batch or drift related artefacts.130 Data deposition in public repositories such as the Metabolomics Workbench,131 MetaboLights132 or Dryad Digital Repository80 was recorded in only 20% of papers, mirroring broader concerns that MSI guidelines are difficult to interpret and inconsistently enforced.133 The deposition requirement, defined by MSI,134 aims to make raw data and metadata available for re-analysis, but the submission process remains cumbersome.130
Importantly, these improvements are not only methodological but also critical for regulatory relevance. Regulatory decisions often carry significant ecological, societal, and financial consequences, including impacts on biodiversity, ecosystem services, human health, and economic penalties associated with environmental noncompliance. For most free-ranging wildlife, particularly large-bodied, long-lived, or disturbance-sensitive species, regulators lack laboratory-derived effects or risk data, meaning that evidence of exposure and biological impacts must necessarily be derived from organisms exposed in the field.
The use of metabolomics data that lack adequate QC controls, transparent reporting, or robust validation therefore poses a risk of misinforming management actions or misattributing environmental harm.
To meet regulatory expectations, field-based metabolomics must adhere to the same data integrity standards historically required in environmental chemistry and toxicology. Strengthening documentation of internal standards and pooled QC samples, improving transparency in storage and sample handling conditions,135–137 and implementing routine systems suitability testing would markedly improve analytical defensibility. Likewise, wider adoption of streamlined metadata templates and checklists—such as those proposed for lipidomics130 could reduce reporting burdens and improve compliance.
Increased editorial oversight is needed to ensure compliance with standards for internal protocols and quality controls. Thorough documentation supports reliable assessment of metabolite data, while guiding researchers toward best practices in interpreting pathways and biological processes connecting metabolite changes to significant outcomes. While online data repositories are valuable, the community must address the ‘inconvenience’ of metadata recording.133 Streamlined tables or checklists, such as those from McDonald et al.138 for lipidomics, can simplify metadata reporting, improve compliance, and encourage the reuse of high-quality environmental datasets. By enhancing methodological consistency, data robustness, and biological interpretability, environmental metabolomics will be better positioned to contribute to regulatory decision making, risk assessment, and the development of mechanistically grounded biomarkers in multi stressor ecological contexts.
Despite these analytical advances, several methodological and data-integration gaps continue to limit broader application and regulatory uptake of metabolomics data.
Field metabolomics often produces large, multidimensional datasets that require robust computational solutions. Efficient workflows for handling large datasets in field contexts—including mobile computing platforms, automated QC, and scalable cloud solutions—are still emerging. Research into lightweight, high-throughput data processing pipelines would accelerate field adoption.
Further improvement in bioinformatics tools, particularly those designed to operate efficiently in field contexts, is needed. Challenges include handling low-resolution or partially annotated datasets, limited internet connectivity, and the need for near-real-time feedback. Improvements to platforms such as other machine-learning-based annotation pipelines could enhance automated identification, reduce uncertainty, and streamline interpretation.
Field environments inherently introduce variability (e.g., temperature shifts, fluctuating exposure mixtures, light variability, etc.) that cannot (and should not) be fully controlled. The goal of field-adapted protocols is therefore not to recreate the laboratory-like control outdoors but to ensure that variability is transparently documented, analytically defensible, and interpretable alongside matched environmental chemistry and validation studies. Standardised field workflows (sampling kits, preservation and quenching approaches, portable extraction options, and harmonised metadata capture) should be framed as tools to strengthen reproducibility and inference, while retaining the ecological realism that motivates field metabolomics in the first place.
Metabolomic research benefits from other overlapping omics field sampling approaches, such as transcriptomics-based protocols for rapid tissue preservation that can be readily adopted by metabolomics to improve comparability and integration across datasets. Tools and technological platforms that can capture onsite field metadata via smart phones would also be advantageous.
There remains a limited understanding of dose–response relationships linking metabolomic changes to measured contaminant concentrations. Building stronger mechanistic evidence through controlled exposure studies, combined with sentinel field sampling, will support causal inference and improve relevance to risk assessment.
Importantly, field-based metabolomics should not be positioned as a stand-alone replacement for controlled testing. Regulatory decision-making will be strengthened when metabolomics is integrated with direct environmental sampling (i.e., water, sediment, soil chemistry), exposure reconstruction, and laboratory or mesocosm experiments that validate candidate pathways and establish quantitative dose–response relationships. In practice, field metabolomics is most powerful for identifying ecologically relevant exposure scenarios and early biological signals, while controlled experiments are required to confirm mechanisms, eliminate confounding drivers, and derive defensible thresholds that can be mapped to AOPs and monitoring endpoints.
Consistent with historical regulatory practice, field-derived biological signals gain evidentiary weight only when interpreted alongside the consideration of mechanisms and dose–response relationships established through controlled experimentation, reinforcing the need for paired laboratory, mesocosm, and field-based approaches.
At the same time, the discipline faces a suite of challenges that must be addressed before its full regulatory potential can be realised. Variability in field conditions, limited standardisation across sampling and analytical workflows, and the complexity of large metabolomics datasets all present barriers to reproducibility and uptake. Furthermore, the ecological interpretation of metabolomic signals—particularly in relation to contaminant exposure and harmful effects, baseline variability, and dose–response relationships—requires improved bioinformatics capacity and closer integration with established monitoring frameworks. These challenges define clear research opportunities and invite coordinated efforts to develop infrastructure, tools, and protocols that support consistent application in real-world settings.
Addressing these gaps offers significant potential for strengthening regulatory and policy decision-making. Centralised data repositories, validated field-ready protocols, and interoperable bioinformatics pipelines will enable metabolomic endpoints to be assessed alongside traditional biomarkers and ecological indicators. As the evidence base grows, metabolomics could play a vital role in effect-based monitoring programs by providing sensitive measures of sub-lethal stress, supporting cumulative impact assessment, and informing adaptive management strategies. Expanding case studies across diverse species, environments, and contaminant profiles will be key to demonstrating robustness, building confidence, and aligning research outputs with regulatory needs.
Taken together, the opportunities, challenges, and regulatory potential outlined in this review highlight a pivotal moment for the field. Realising the promise of field-based metabolomics will require sustained investment, cross-disciplinary collaboration, and active engagement between researchers, regulators, industry, and monitoring agencies. We call for coordinated national and international efforts to standardise methods, strengthen data infrastructure, and embed metabolomic indicators into environmental monitoring frameworks. By acting now, the research community can accelerate the transition of field-based metabolomics from experimental innovation to an operational, policy-ready tool for safeguarding ecosystem health.
| This journal is © The Royal Society of Chemistry 2026 |