Yongfu
Liu
ab,
Xinghua
Liu
a,
Yuting
Hu
a,
Feng
Gao
a,
Wenjun
Yu
a and
Fan
Cheng
*a
aDepartment of Rehabilitation, Taihe Hospital Affiliated to Hubei University of Medicine, Shiyan, Hubei Province 442000, China. E-mail: 15071583344@163.com
bCollege of Sports Medicine, Wuhan Sports University, Wuhan 430079, China
First published on 30th October 2025
Untargeted metabolomics has emerged as a transformative approach in sports nutrition research, offering an unbiased means to characterize the complex biochemical responses to exercise, training, and dietary interventions. Unlike targeted assays restricted to predefined metabolites, untargeted strategies capture broad metabolic perturbations across lipid, carbohydrate, amino acid, and nucleotide pathways, enabling the discovery of novel biomarkers and unanticipated physiological mechanisms. This review critically evaluates the design and application of untargeted metabolomic pipelines in the context of exercise and nutrition science, from pre-analytical sample handling and analytical platforms such as NMR, LC-MS, and GC-MS, to data processing using tools like XCMS, MZmine, and MS-DIAL, and subsequent statistical and bioinformatic interpretation. Key applications include delineating acute metabolic shocks induced by endurance exercise, identifying athlete-specific metabolic phenotypes shaped by chronic training, and assessing the impact of nutritional interventions such as fruit intake, amino acid supplementation, or polyphenol-rich foods on exercise recovery and oxidative stress. The integration of metabolomics with other omics, particularly microbiome metagenomics and lipidomics, highlights the potential for systems-level insights into host–microbe–diet interactions. Nonetheless, significant challenges remain, including the reproducibility of findings, difficulties in metabolite identification, and the translational gap between large datasets and actionable nutritional strategies. By synthesizing current strengths, limitations, and controversies, this review emphasizes that the future of sports metabolomics lies in methodological standardization, multi-omics integration, and validation of candidate biomarkers in independent cohorts. Collectively, these efforts position untargeted metabolomics as a cornerstone for advancing precision nutrition and personalized performance monitoring in athletes.
However, enthusiasm for untargeted metabolomics is tempered by critical challenges. Detractors argue that untargeted studies produce massive complex datasets with many unknown compounds7 and can yield irreproducible or hard-to-interpret results.8 Only a fraction of detected features can typically be confidently identified, due to limitations in spectral libraries and reference standards.9 Furthermore, subtle differences in sample handling, analytical protocols, and data processing pipelines may lead to divergent findings from the same raw data.10 Proponents counter that untargeted approaches are indispensable for discovery – for uncovering novel biomarkers and pathways that targeted assays (limited to preconceived metabolites) might miss.11 They point to successes like the discovery of an exercise-induced metabolic signature resembling the rare disorder hawkinsinuria, or the identification of unique gut microbiome–metabolite interactions in elite athletes, as evidence that untargeted metabolomics can yield meaningful insights into nutrition and performance.12 The core controversy centers on whether the benefits of broad discovery outweigh the drawbacks of data complexity and ambiguity in biochemical identification.
In this review, we evaluate untargeted metabolomics pipelines for sports nutrition research. We first outline the typical workflow of untargeted metabolomics – from sample collection to data analysis – and survey the bioinformatic tools available. Next, we examine key findings from recent studies applying metabolomics to exercise and nutrition interventions, highlighting both the insights gained and the recurring patterns observed. We then analyze the major pros and cons debated in the field: what are the strengths of untargeted metabolomics in advancing sports nutrition science, and what are the principal technical and interpretative challenges? The focus of these debates ranges from practical issues (such as data processing variability and metabolite identification bottlenecks) to fundamental questions about how to translate complex metabolomic readouts into actionable nutritional strategies. Fig. 1 provides a high-level view of how metabolomics fits into the broader context of systems biology in exercise, interacting with genomic, transcriptomic, and proteomic factors against the backdrop of environmental influences like training and diet. We aim to provide a balanced perspective and identify where consensus has been reached versus where controversies remain. Ultimately, improving and standardizing untargeted metabolomic pipelines – and effectively integrating them with nutritional and physiological data – will be crucial for turning sprawling metabolomic data into genuine advancements in sports nutrition and performance optimization.
![]() | ||
| Fig. 1 Metabolomics as an interface between the genome, proteome, and environment in exercise physiology. | ||
![]() | ||
| Fig. 2 Analysis workflow in untargeted metabolomic studies. This schematic outlines the key steps of an untargeted metabolomics pipeline, from raw data to biological insight. | ||
Next, the samples undergo metabolite extraction and analysis using high-throughput platforms, predominantly nuclear magnetic resonance (NMR) spectroscopy or mass spectrometry (MS) coupled to a chromatographic separation.18,19 NMR-based metabolomics is highly reproducible and non-destructive, providing rich structural information, but has relatively lower sensitivity.20 MS-based approaches, especially liquid chromatography-MS (LC-MS) and gas chromatography-MS (GC-MS), offer greater sensitivity and coverage of low-abundance metabolites.21 In practice, many sports metabolomics studies deploy LC-MS or a combination of LC-MS and GC-MS to broaden the range of metabolites detected. For example, among 24 exercise metabolomics papers reviewed by Sakaguchi et al.,22 11 used LC-MS and several others combined LC-MS with GC-MS, while a few used NMR. The choice of platform can influence the metabolite classes observed – e.g. LC-MS excels at lipids, amino acids, and many polar metabolites,23 GC-MS is strong for organic acids and derivatized compounds, and NMR provides quantitative data on abundant metabolites like lactate, glucose, and certain amino acids.24 Cutting-edge studies increasingly use multiple platforms or integrate lipidomics (focused MS analyses of complex lipids) alongside global metabolomics to ensure key pathways (like fatty acid metabolism) are comprehensively covered.25
In sports-specific matrices and settings, practical choices follow from these strengths. For plasma collected within ∼30–90 min after an acute endurance bout (e.g., cardiopulmonary exercise testing), LC-MS detects transient surges in acylcarnitines and free fatty acids that track mitochondrial β-oxidation dynamics (time-series heatmaps for these classes are reported in a controlled CPX cohort).26 For urine sampled over the 0–24 h recovery window, GC-MS with methoximation–silylation robustly quantifies TCA intermediates and short-chain organic acids (succinate, citrate, 3-hydroxybutyrate), aiding interpretation of energy metabolism, hydration, and diet effects during recovery. For low-volume, pitch-side saliva and sweat, 1H-NMR and LC-MS are complementary: NMR affords minimal preparation and reliable quantification of abundant metabolites (e.g., lactate, acetate, choline) and has been used to monitor performance in soccer players, while LC-MS—especially with chemical-isotope labeling—expands coverage of amines/phenols in sweat collected after moderate-to-intense exercise and enables practical normalization (e.g., dried-powder mass) for field-collected patches.27 Finally, during graded exercise, GC-MS/PTR-MS of exhaled breath captures rapid (≈3–5× within ∼1 min) rises in VOCs such as isoprene/methyl acetate, providing a non-invasive window into ventilatory/hemodynamic responses under load.28 Together, these examples operationalize platform selection by matrix and timing: LC-MS for immediate post-exercise plasma lipid/acylcarnitine dynamics; GC-MS for urinary organic-acid profiling during recovery; NMR or LC-MS for on-field saliva/sweat; and GC-MS/PTR-MS for breath-borne VOC kinetics during exercise.
After data acquisition, a crucial and challenging phase is spectral data processing – converting raw spectra into a list of metabolite features and their abundances.29 For LC-MS data, this typically involves peak detection, deconvolution, and alignment across samples.30 Commonly used algorithms like XCMS (an R package) detect chromatographic peaks in each sample and align them by m/z and retention time across the dataset.31 MZmine 2 is another popular open-source toolkit for LC-MS data processing, offering interactive peak picking, chromatogram building, and alignment.32 In NMR data, an older approach was simple spectral binning, but this can merge signals; in athlete cohorts, peak-based and deconvolution pipelines are preferred because pre- vs. post-exercise pH and ionic changes shift 1H resonances and intensities. Recent sports studies processing urine and plasma by high-throughput 1H-NMR have adopted peak-level quantification and rigorous alignment to resolve exercise-induced changes (e.g., winter-sport athletes around sport-related concussion; recreational athletes before/after HIIT).33 Regardless of platform, alignment is critical for time-series designs typical in exercise trials: even slight chemical-shift or retention-time drift must be corrected with algorithms such as correlation-optimized or dynamic time warping; we now explicitly point to exercise studies and sports-methods reviews that motivate these steps.34Table 1 lists some of the major software tools available for metabolomic spectral processing and data analysis, many of which are free and have been widely adopted in the metabolomics community.
| Tool | Description & features (analysis step) | Sports-relevant exemplar/context | Primary references |
|---|---|---|---|
| XCMS | LC-MS peak detection, retention-time alignment, feature quantification; supports centroid/profile data; extensive parameterization and CAMERA integration | Used for untargeted plasma LC-MS in acute hypoxic exercise interventions (pre/post/3 h) to characterize exercise-induced metabolic shifts | 33 |
| MZmine 2/3 | Comprehensive LC-MS workflow (import → detection → deconvolution → alignment → gap-filling → export); modernized in MZmine 3 with multimodal MS support and robust batch processing | Recommended in exercise/sports methods overviews as a robust alternative to XCMS for athlete time-series; MZmine 3 paper documents current best practices | 45 |
| MS-DIAL 5 | Vendor-neutral GC/LC-MS(/MS) processing with MS/MS-assisted annotation, lipidomics modules, and MSI support; large built-in libraries | Strong fit for exercise lipidomics where training remodels lipid species; also widely used in untargeted workflows relevant to anti-doping contexts | 46 |
| OpenMS 3 | Modular, reproducible LC-MS(/MS) pipelines (feature finding, alignment, FDR, quantification) with workflow engines (KNIME/Nextflow) | Scalable for longitudinal athlete studies and multi-batch intervention designs requiring reproducible reprocessing | 47 |
| MetaboAnalyst 5 | End-to-end platform: spectra processing (LC-HRMS), normalization, covariate adjustment, statistics, functional/pathway analysis, and multi-omics integration | Supports covariate handling common in sports trials (e.g., repeated-measures, diet/training covariates) and quick biomarker panel exploration | 48 |
| NOREVA 2.0 | Web tool to compare normalization strategies with multi-criteria evaluation; supports time-course and multi-class designs | Helpful for pre/post and multi-visit sport designs to select normalization that stabilizes QC CV% and preserves group effects | 42 |
| statTarget (QC-RFSC/QC-RLSC) | QC-based signal-drift correction (random forest and LOESS), batch integration, and downstream statistics in R with GUI | Addresses intra/inter-batch drift in large athlete cohorts where frequent QC injections are used | 49 |
| SIRIUS + CSI:FingerID (with CANOPUS/COSMIC) | In silico formula/structure annotation from MS/MS via fragmentation trees and fingerprint prediction; compound class assignment; confidence scoring workflows | Boosts identification levels for exercise-altered unknowns (e.g., acylcarnitines/derivatives) when spectral libraries are sparse | 50 |
| GNPS FBMN | Feature-based molecular networking linking LC-MS/MS features by spectral similarity, integrates quantification and ion-mobility; interoperates with XCMS/MZmine/MS-DIAL exports | Useful to visualize families of exercise-responsive metabolites and track isomeric series across training blocks | 51 |
| ASICS (R) | Automated 1H-NMR identification/quantification via library-based mixture modeling; includes preprocessing and statistical analysis | Appropriate for urine/plasma NMR in athlete monitoring where pH/ionic shifts require deconvolution beyond simple binning | 52 |
| BATMAN (R) | Bayesian deconvolution for 1H-NMR spectra accounting for peak shifts; outputs metabolite concentration estimates | Supports high-throughput NMR pipelines in sports cohorts by improving quantification robustness under variable hydration/pH | 53 |
To provide quantitative context, we summarize performance on the widely used MTBLS733 benchmark (Thermo Q Exactive; 836 ‘true’ LC-MS features established by targeted analysis). Using those ground-truth features as the denominator (recall for peak detection), reported recalls were XCMS 820/836 (98.1%), MS-DIAL 799/836 (95.6%), and MZmine 769/836 (92.0%). For quantification accuracy (proportion of detected true features with accurate fold-change), XCMS achieved 89.2% (731/820), MS-DIAL 81.9% (654/799), and MZmine 99.0% (761/769). As a practical precision proxy at the differential-analysis stage, the number of false discriminating markers on this dataset was 51 for XCMS, 42 for MS-DIAL, and 3 for MZmine (true markers: 45, 42, and 48, respectively). These values come from a harmonized re-analysis that directly compared the three tools under matched parameter optimization on MTBLS733.35 Peak-detection precision/recall has also been evaluated methodologically for XCMS's centWave algorithm on dilution/mixture experiments (seed/leaf extracts), showing higher F-scores and improved precision and recall versus matchedFilter (XCMS) and centroidPicker (MZmine), consistent with the recall figures above.36
Gap-filling and missingness diagnostics are critical for downstream reliability. In a recent modular comparison across common workflows, post-processing missing value fractions were on the order of 2–3% across tools (e.g., ∼2.1% for XCMS, ∼3.4% for MS-DIAL, ∼2.7% for MZmine), underscoring that aggressive gap-filling can reduce missingness but may inflate false positives if not coupled with quality filters. We therefore recommend reporting per-feature gap-filled intensity fractions and excluding features with high gap-filled proportions in athlete-monitoring datasets.37 For alignment, we now explicitly recommend reporting median ΔRT after warping and the percentage of features with ΔRT below a fixed threshold (e.g., 0.1–0.2 min), alongside the overall missingness after alignment. These diagnostics make the alignment error profile transparent and facilitate comparisons across pipelines in sports plasma/urine studies.38
To make QA/QC actionable and comparable across sports-nutrition studies, we adopt the following operational scheme supported by recent guidance and scoping analyses. QC conditioning: inject ≥5 pooled-QC samples at the start of each batch to condition the LC-MS system (in demanding matrices, up to 10).39 In-run pooled-QC frequency: inject one pooled-QC every 6–10 study samples; this reflects current practice (majority of studies) and has minimal impact on coverage while enabling robust monitoring and correction.40 Acceptance thresholds: (a) feature-level precision: retain only features with QC CV (RSD) ≤30% after preprocessing/normalization; remove features exceeding this threshold.41 (b) Blank exclusion: exclude features with mean process-blank signal ≥30% of the pooled-QC signal. (c) Internal-standard performance & system suitability: require IS CV ≤ 15% across the run, mass accuracy within ±5 ppm, and retention-time drift ≤0.1–0.2 min relative to system-suitability injections; failing IS triggers investigation/re-tuning. Drift correction and diagnostics: apply QC-sample-based correction (e.g., QC-RLSC/LOESS or SERRF) and report quantitative diagnostics before vs. after correction: (1) median QC CV% across retained features; (2) PCA of QC samples with the Euclidean distance from the QC centroid (expect tight clustering post-correction); and (3) the proportion of features meeting the CV ≤ 30% criterion post-correction. QC-RLSC and SERRF are established approaches; SERRF has reduced technical error to ∼5% RSD in large sets.42 MS/MS coverage fraction (DDA/DIA acquisition): report the fraction of features (after QC filtering) for which MS/MS spectra were acquired above intensity thresholds, alongside key acquisition settings (e.g., AGC target, NCE, dynamic exclusion). Optimizing DDA parameters materially affects coverage; thus, reporting this fraction increases transparency across studies.43 Identification-level distribution: report counts (and percentages) of features/metabolites at MSI levels 1–4 (identified → unknown) for the final result set. This provides a transparent view of annotation confidence.44 Rationale for frequencies and thresholds: the pooled-QC cadence (every 6–10 injections) aligns with a scoping review of current LC-MS untargeted practice; conditioning with multiple QC injections is recommended by recent guidance. A ≤30% QC-CV inclusion rule and ≤30% blank-to-QC ratio are widely used in untargeted workflows, while IS CV ≤ 15%, ±5 ppm mass accuracy and tight RT windows are consistent with system-suitability practice. Together, these criteria provide reproducible, auditable QA/QC suited to sports-nutrition metabolomics.
Once a feature table is generated, data analysis and interpretation can begin. Researchers typically apply a combination of univariate statistics (e.g. paired t-tests or ANOVA for each metabolite) and multivariate techniques to decipher the metabolomic shifts.54 In sports metabolomics, multivariate models like PCA and partial least squares-discriminant analysis (PLS-DA) are common to reduce data dimensionality and visualize group separation – for instance, to see if metabolite profiles post-exercise differ from pre-exercise, or if a supplemented group diverges from placebo. Supervised models (PLS-DA, OPLS-DA) have been used to classify athletes vs. sedentary controls based on metabolite patterns, or to predict performance metrics from metabolomic signatures. Caution is warranted, as overfitting is a risk with high-dimensional data; best practices include using large sample sizes, cross-validation, and external validation sets to ensure any putative biomarkers are robust. To operationalize this, PLS/OPLS models should be evaluated with repeated or k-fold cross-validation reporting both R2 and Q2, complemented by response-permutation testing (e.g., 500–2000 permutations) and CV-ANOVA to test model significance (typical criterion p < 0.05). These procedures help detect spurious class separation and optimism in predictive performance; seminal evaluations in metabolomics demonstrate that permutation and CV-ANOVA effectively flag overfitting and that naïve validation yields overly optimistic results.55 Y-Scrambling (randomly re-assigning class labels and refitting) should further be reported to demonstrate that observed discrimination exceeds that expected by chance.56
Because sports studies frequently employ cross-over or longitudinal designs with repeated measures, analysts should either (i) adopt multilevel decompositions that remove between-subject variation prior to PCA/PLS-DA (multilevel sPLS-DA; implemented in mixOmics via the within-subject variation), or (ii) perform metabolite-wise linear mixed-effects modelling with random intercepts/slopes for participants, followed by multiple-testing control and, where appropriate, multivariate aggregation of significant features. These approaches are explicitly designed for paired/repeated data and have been shown to improve feature selection and classification by accounting for subject-specific baselines—an important consideration in athlete cohorts.57 In sport-science more broadly, methodological guidance recommends mixed models for longitudinal, imbalanced datasets typical of training studies; this translates directly to metabolomics where within-athlete correlation structures otherwise inflate false positives.58
A final and formidable step is metabolite identification – figuring out what compounds the significant spectral features correspond to.59 This often requires matching MS/MS fragmentation spectra to databases (like HMDB, METLIN, or MoNA) or confirming with authentic standards.60–62 In untargeted studies, typically only a subset of features (sometimes ∼20–50%) can be confidently annotated.50 For example, in an untargeted LC-MS study on cyclists, ∼509 features were detected but 107 metabolites were “chemically identified” as known compounds.63 The rest remained putative or unknown. The reliance on database matching means that novel or unexpected metabolites may go unidentified – a key limitation of untargeted metabolomics in general. Some advanced techniques like tandem MS (MS/MS) networking, retention time prediction, or 13C-labeling can help illuminate unknowns, but these add complexity and are not yet routine in sports studies. Scholars have highlighted metabolite identification as a “bottleneck” and called for community efforts in sharing reference spectra and developing better computational identification tools.64 In nutrition studies, where many metabolites may derive from diet or gut microbiota, identification is particularly challenging because not all food-derived compounds are well-represented in standard libraries.
Despite these hurdles, untargeted pipelines have successfully catalogued a broad array of metabolic changes with exercise and diet.65,66 Recent efforts to integrate metabolomics with other “omics” (transcriptomics, proteomics, and especially microbiome metagenomics) are creating a more holistic systems biology approach to sports nutrition. Fig. 3 shows an example of such an integrative workflow from a 2025 study, where fecal metagenomics was combined with plasma metabolomics/lipidomics in elite athletes.67 By merging multi-omics data, researchers can begin to connect specific gut microbial pathways with circulating metabolite profiles (for instance, seeing how microbial metabolism of amino acids might link to host energy metabolites in endurance vs. strength athletes). These comprehensive pipelines underscore that metabolomics is not a stand-alone technique, but one piece of a larger puzzle. In summary, the untargeted metabolomics workflow – from sample prep to bioinformatics and biological interpretation – is complex, but when executed rigorously it provides an unparalleled window into the metabolic adaptations underlying sports performance and nutrition.
![]() | ||
| Fig. 3 Multi-omics integration in sports metabolomics. This diagram illustrates an end-to-end pipeline combining metagenomics and metabolomics, as applied in a recent study of elite weightlifters vs. endurance cyclists.67 (a) Samples are collected and subjected to different analytical techniques: fecal samples for metagenomic sequencing of the gut microbiota; blood (plasma) samples for untargeted metabolomics and lipidomics profiling. (b) Data processing converts raw data into features: DNA sequences are processed to identify microbial taxa and genes (using bioinformatics tools like MetaPhlAn), while chromatographic MS data are processed to extract metabolite and lipid features. (c) Data integration and analysis bring together the metagenomic and metabolomic layers. For example, multivariate models (PCA/OPLS-DA) can distinguish athlete groups based on combined omics profiles, and network analysis can link specific microbes with metabolites. | ||
| Metabolite or pathway | Acute exercise response (single session) | Training or athlete baseline | Nutritional modulation |
|---|---|---|---|
| Free fatty acids (FFA) | ↑ FFA during prolonged moderate/high-intensity exercise (2–10 fold post-exercise) as adipose lipolysis accelerates. Example: palmitate, oleate peak 1 h post-run, returning to baseline by 24 h | Endurance-trained athletes often show lower fasting FFA but more rapid FFA mobilization when exercise starts | High-carb meals before exercise attenuate FFA rise (insulin effect), whereas fasted exercise exaggerates FFA release. Fat-rich diets can elevate baseline FFA availability |
| Acylcarnitines & ketones | ↑ Acylcarnitines (medium- and long-chain) and ketone bodies (β-hydroxybutyrate, acetoacetate) after endurance exercise. Indicates enhanced β-oxidation and overflow of acetyl-CoA. In a 75 km cycling, acylcarnitine levels spiked ∼4–6× at 1.5 h post-exercise. Ketones like 3-hydroxybutyrate rose >5×, reflecting hepatic ketogenesis | Trained individuals may have higher peak ketone responses to exercise (due to metabolic flexibility) but also clear them faster | Ketogenic or low-carb diets significantly elevate exercise-induced ketones and acylcarnitines at rest and during exercise. Carbohydrate ingestion blunts ketone formation during exercise |
| Triglycerides (blood) | ↓ Glycerides (triacylglycerols) immediately post prolonged exercise, as they are consumed for fuel. In a multi-hour ski march, ∼88% of detectable di- and triacylglycerols dropped in plasma. They tend to normalize or overshoot during recovery as lipoproteins redistribute lipids | Athletes (especially endurance) generally have lower resting triglycerides than sedentary peers, reflecting training-induced improvements in lipid clearance | Certain supplements (e.g. omega-3 fatty acids) can lower resting triglycerides. High glycemic carbs post-exercise can accelerate triglyceride restoration by promoting lipid uptake into tissues |
| Glucose & lactate | ↑ Lactate (often 3–10×) with high-intensity exercise, concurrent with transient ↑ glucose (mobilization from liver glycogen). Example: after 30 min at 110% VO2max intervals, blood lactate was substantially elevated versus moderate exercise. Lactate usually clears within 1–2 h post-exercise. Glucose may dip below baseline during recovery if glycogen depleted | Trained athletes show a smaller rise in blood lactate at a given workload (higher lactate threshold). They also maintain tighter glycemic control during exercise, relying more on fat oxidation | Pre-exercise carbohydrate loading elevates starting glucose and can attenuate early lactate accumulation. Ingredients like caffeine can increase peak lactate by allowing higher exercise intensity for the same effort |
| Amino acids (BCAA, Trp) | ↓ Certain amino acids (e.g. branched-chain amino acids valine, leucine, and tryptophan) during exhaustive exercise. Believed to be taken up by muscle for oxidation or converted (tryptophan → serotonin/kynurenine). ↑ Alanine and glutamine often observed (glucose–alanine cycle, ammonia scavenging). Ammonia-related metabolites (glutamine, urea) may rise indicating amino acid catabolism | At rest, endurance athletes sometimes have lower BCAA levels than sedentary (possibly due to chronic utilization or dietary differences). Training can increase the efficiency of amino acid turnover, reflected in faster post-exercise recovery of amino acids | Protein or BCAA supplementation before exercise can increase baseline and post-exercise BCAA levels and may reduce their drop during exercise. Certain supplements (e.g. β-alanine) elevate specific amino acid-derived metabolites (carnosine) in muscle, detectable indirectly via histidine/alanine changes |
| TCA cycle & energetics | ↑ TCA intermediates (citrate, succinate, fumarate) modestly after exercise, but pattern is complex. Succinate often accumulates (incomplete oxidation in oxygen-limited muscle), then is cleared. Markers of nucleotide breakdown (inosine, hypoxanthine) ↑ with intense exercise, indicating ATP turnover and contributing to oxidative stress post-exercise | Resting levels of TCA intermediates don't differ greatly by training status, but trained muscle may have higher mitochondrial enzyme activity – metabolomics of muscle tissue can show higher baseline TCA cycle intermediate content in athletes. Chronic high-intensity training can lower resting purine breakdown products due to improved energy efficiency | Antioxidant-rich diets might modulate exercise-induced purine oxidation and reduce accumulation of urate/xanthine. Some evidence suggests β-alanine supplementation (to boost carnosine) might indirectly stabilize TCA cycle metabolism by buffering muscle pH, though metabolomic signatures are still being studied |
| References | High-intensity/long-duration exercise lipid responses;80 Ski march FFA and acylcarnitine changes;81 lactate and carb metabolism;78 amino acid shifts and ammonia;82 purine metabolism and fatigue;6 training vs. sedentary metabolite differences83 |
Besides lipids, carbohydrate metabolism exhibits expected shifts: intense exercise raises glycolytic intermediates (e.g. glucose-6-phosphate, pyruvate)76 and end-products like lactate,77 while depleting some TCA cycle intermediates and related compounds as they are rapidly cycled. In moderate 30–60 min exercises, metabolomics has detected significant lactate elevations (confirming anaerobic glycolysis) and increases in Krebs cycle metabolites such as succinate and malate.78 However, the changes in TCA cycle metabolites are often relatively modest (on the order of <2-fold) compared to lipids. This may be because these intermediates are tightly regulated and also rapidly reused, or due to their compartmentalization in muscle vs. blood. Still, even short bouts can perturb them: one study of 18 min high-intensity running found small but detectable increases in plasma succinate and citrate immediately post-exercise. Another recurring theme is amino acid metabolism: exercise tends to decrease plasma levels of certain amino acids (e.g. tryptophan, branched-chain amino acids) and increase others or their catabolites (like alanine, glutamine) as part of the alanine-glucose cycle and protein breakdown for energy. Tryptophan in particular often drops during prolonged exercise, likely due to increased uptake and oxidation or its conversion to serotonin and kynurenine; in a cycling fatigue test, plasma tryptophan fell significantly as fatigue approached. Correspondingly, metabolites of purine nucleotide breakdown (indicative of ATP turnover and fatigue), such as hypoxanthine, may rise during high-intensity efforts – one study in runners showed hypoxanthine spiking after exhaustive exercise, reflecting muscle ATP degradation and perhaps contributing to oxidative stress upon its conversion to uric acid.79
As seen above, untargeted metabolomics confirms many expected exercise-induced changes but also provides a more nuanced picture.84 For instance, it was through metabolomic analyses that researchers noted a consistent post-exercise increase in unusual dicarboxylic fatty acids (e.g. azelate, sebacate), suggesting amplified ω-oxidation of fatty acids during intense exercise beyond traditional β-oxidation.85 Metabolomics also draws attention to metabolites in the interface of multiple pathways:86 an example is 2-hydroxybutyrate, a byproduct of both amino acid catabolism and glutathione synthesis. Several studies observed a decrease in 2-hydroxybutyrate with exercise despite increased lipid use – potentially linked to its consumption in counteracting oxidative stress via glutathione.78,87 Such insights exemplify how untargeted data can generate new hypotheses about exercise metabolism (in this case, that antioxidant demands rise and consume certain intermediates during exercise).
Intervention studies on training provide dynamic information: for instance, a 6 weeks training study in cyclists (comparing two training intensity distributions) used 1H-NMR metabolomics on urine and found changes in hippurate, creatinine, and other metabolites reflecting shifts in gut-microbiome co-metabolism and energy metabolism post-training.89 Hippurate (a gut microbial cometabolite of polyphenols) decreased with certain training regimens, suggesting training might influence gut metabolism or dietary intake patterns. While the interpretations can be complex, these studies underscore metabolomics' potential to serve as a comprehensive monitoring tool for athlete conditioning, recovery status, or even overtraining – though more research is needed to establish reliable biomarkers.90 An emerging concept in personalized nutrition and exercise science is the athlete ‘metabotype’—the stratification of individuals into metabolically homogeneous subgroups based on their metabolome (and related clinical/omic data) at baseline or in response to a standardized stimulus. This concept builds on nutrition science, where metabotyping has been proposed and tested as a framework for tailoring diet at the group level and for predicting inter-individual responses to interventions.91,92 In sports cohorts, metabolic phenotyping has differentiated athlete groups and linked metabolite patterns to functional characteristics: large cross-sectional work in elite and sub-elite athletes shows discipline- and training-status-specific metabolic profiles,93 targeted panels separate endurance vs. strength-trained athletes across a training year,94 and blood metabolic phenotypes measured around a standardized exercise test discriminate sprint performance tiers, lactate responses, and subsequent illness susceptibility in highly-trained skiers.95 Moreover, specific metabolite panels correlate with aerobic capacity and race performance, supporting the construct validity of an ‘athlete metabotype’ (e.g., VO2max-linked metabolites in runners and differential acylcarnitine/arginine-related signatures across performance strata) [Shi et al., 2020; Schader et al., 2020; Kelly et al., 2020].96–98 Together, these data justify the use of ‘metabotype’ in an athlete context while highlighting that rigorous, standardized phenotyping and validation are required before routine deployment. As large datasets are analyzed, we expect clearer patterns to emerge linking specific metabolomic signatures to fitness phenotypes.
![]() | ||
| Fig. 4 Plasma 9,10-DiHOME in pistachio and water conditions (interaction effect, Q < 0.001) (mean ± SE). Adapted/reproduced from ref. 63 with permission from PLOS,63 copyright 2014. | ||
Untargeted metabolomics has also been used as a compliance or exposure measure for diets in athletes. For example, metabolomic “fingerprinting” can distinguish athletes consuming a high-protein diet vs. high-carbohydrate diet by detecting differences in urea cycle metabolites, amino acid catabolites, and even food-specific compounds (like citrus flavonoid metabolites if one diet is fruit-rich). One study of fencers supplemented with beetroot juice (rich in nitrates) used untargeted metabolomics to monitor the 4 weeks metabolic effect: it found higher circulating nitrite/nitrate and related nitrogenous metabolites in the beetroot group, confirming uptake, as well as enhanced TCA cycle and antioxidant metabolites that correlated with improved time-to-fatigue.100 In sports anti-doping research, metabolomics is being explored to detect signatures of illicit substances or extreme diets; for instance, certain steroid metabolites or xenobiotics can be picked up in untargeted urine screens and signal potential doping.101
Overall, nutritional metabolomics in sports emphasizes individual responses. There is high interest in precision nutrition: tailoring diets to an athlete's metabolic profile. Metabolomics helps define “responders” vs. “non-responders.” For example, some individuals show a strong metabolite shift after fish oil supplementation, while others show minimal changes – untargeted profiles might help explain such variability by revealing differences in absorption or baseline metabolism.102,103 In a study of green tea extract, two groups of men had different metabolic responses during exercise despite the same dose: metabolomics indicated that only one subgroup showed significant increases in catechin phase II metabolites and corresponding antioxidative metabolites, suggesting differences in gut microbiome or enzyme activity. These examples illustrate how untargeted metabolomics not only measures the average effect of a nutrition intervention but can uncover inter-individual differences in metabolism that might be crucial for personalized nutrition strategies. As metabolomics datasets grow, machine learning approaches are being tested to predict the optimal diet for a given athlete's metabolic makeup – for instance, using baseline metabolite profiles to predict who will benefit most from carbohydrate loading or from certain supplements.87
In summary, applications of untargeted metabolomics in sports nutrition have demonstrated: (1) a robust ability to catalog the biochemical “footprint” of exercise – largely confirming known pathways but also finding new candidate biomarkers of performance and fatigue; (2) the capacity to differentiate metabolic phenotypes of trained vs. untrained individuals and monitor training-induced changes; and (3) insights into how specific dietary components influence metabolism before, during, and after exercise. These advances come with the caveat that results must be carefully validated (one study's significant metabolite may not always replicate in another due to methodological differences). Nonetheless, the rich data from untargeted pipelines have undoubtedly expanded our understanding of exercise nutrition biochemistry. The next sections will critically evaluate the strengths and weaknesses of these approaches, and how the field is addressing the controversies that have emerged.
| Advantages (pros) | Challenges (cons) |
|---|---|
| Holistic, unbiased coverage: can detect hundreds of metabolites across diverse pathways in one run, enabling discovery of unexpected changes. E.g., revealed novel exercise-induced metabolites (e.g. unusual tyrosine catabolites, gut microbiome-derived compounds) beyond predefined targets | Complex data, difficult interpretation: yields large datasets with many unknowns. Metabolite identification is a major bottleneck – a significant fraction of features remain unannotated. Harder to derive clear mechanistic insight when key metabolites are “feature 123@m/z 256” with unknown identity |
| Integrative phenotype readout: metabolome reflects combined genetic and environmental influences, providing a direct readout of physiological state. Useful for assessing overall impact of training/diet on the body (closer to performance outcomes than gene or protein changes) | Reproducibility concerns: results can vary with different data processing pipelines or analytical platforms. Lack of standardized protocols means one study's biomarker may not replicate in another study. “Discovery” metabolites often fail validation due to false positives or batch effects |
| Biomarker discovery for personalized nutrition: identifies candidate biomarkers of performance, fatigue, or nutritional status that targeted approaches might miss. This can enable precision nutrition – tailoring diets based on an individual's metabolomic profile (e.g. detecting who is a high lactate producer or who has lipid oxidation deficiencies) | Data overload and multiple comparisons: hundreds of comparisons raise false discovery risk. Without rigorous stats (FDR correction), studies may report too many “significant” changes by chance. Signals can be overwhelmed by noise, requiring cautious interpretation and validation in independent cohorts |
| Insights into mechanism and recovery: provides time-resolved metabolic profiles to understand exercise recovery and overtraining. Untargeted assays have found metabolic markers of overreaching (e.g. elevated acylcarnitine accumulation) and distinct recovery trajectories for different diets. These insights can guide training load adjustments and nutritional recovery strategies | Pre-analytical variability: extremely sensitive to sample handling and timing. Circadian influences, feeding state, storage conditions can all confound results if not strictly controlled. Differences in protocols between studies (e.g. fasting vs. fed, morning vs. afternoon sampling) contribute to inconsistent findings |
| Hypothesis generation for other omics: untargeted metabolomics findings can drive new hypotheses tested by targeted assays or other omics. For instance, a spike in a specific metabolite might prompt investigation of the enzyme or gene regulating it. This cross-talk enriches the overall understanding of exercise biology | Translational gap: critics note that translating metabolomic changes into practical advice is not straightforward. Many discovered metabolites have unclear relevance to performance or health, making it hard to act on the findings until further research links them to outcomes. Targeted approaches or known markers are seen as more immediately actionable for coaches/athletes |
| Adaptable to various samples: can be applied to plasma, urine, saliva, muscle extracts, etc., giving flexibility. Non-invasive biofluids (urine, saliva) allow frequent sampling to monitor metabolic fluctuations without burdening athletes. Untargeted assays have even been used in field settings (with portable devices or rapid sampling protocols) | Cost and expertise: untargeted metabolomics experiments are costly (instrument time, data storage) and require specialized bioinformatics expertise. This can limit sample sizes or replication. High cost and complexity mean findings might not be readily validated in large trials, slowing acceptance into practice |
000 entries) and community initiatives to share MS2 spectra are underway to improve this, but it remains a limiting step.
Another major issue is data overload and processing variability. Untargeted metabolomics generates huge datasets – often thousands of features per sample. How one handles and filters this data can dramatically affect results. Different software pipelines or parameter settings can yield divergent feature lists from the same raw data. One study noted that applying various missing value imputation or normalization methods changed the set of “significant” metabolites considerably.108 Thus, there is a concern about reproducibility: are the metabolomic biomarkers reported in one paper reproducible by another lab using a different pipeline? The answer has often been “not exactly.” This fuels the controversy – skeptics argue that without standardized workflows, untargeted metabolomics findings can be cherry-picked or may contain false positives. The lack of uniform quality control across studies (e.g. how to handle batch effects, how to threshold feature detection) makes it hard to compare results or perform meta-analyses, a challenge the field is actively working to address through guidelines and shared protocols.
Pre-analytical and analytical variability further complicate matters. As discussed earlier, small differences in sample collection or handling can produce metabolite differences that swamp the biological effects of interest. For instance, if one study allowed subjects to eat a light breakfast before exercise and another required an overnight fast, their metabolomic “exercise response” profiles could differ simply due to baseline nutrition. The timing of sample collection is another debated point – some argue that many exercise metabolomics studies capture only the immediate post-exercise snapshot, missing the full kinetic picture.22 If studies are not aligned in timing, their results might conflict. These inconsistencies fuel controversy about which metabolite changes are “real” or universal.
The statistical interpretation of untargeted data is also a point of contention. With hundreds of metabolites tested, the risk of false positives is high if proper corrections (like false discovery rate control) are not applied. Early metabolomics papers sometimes reported dozens of “significant” changes without adjustment for multiple comparisons, which later might not hold up. Now, better practice is in place, but it means often only the top few changes are robust, others hover at the edge of significance. Scholars opposing over-reliance on untargeted data argue that it can generate so many hypotheses that there is a temptation to engage in post-hoc storytelling – finding some pathway among the many that changed and attributing significance to it, while ignoring others (sometimes referred to as the “Texas sharpshooter fallacy”). Prudent studies now use validation: for example, identifying a candidate biomarker in an untargeted run, then using a targeted assay in a follow-up cohort to verify it. Still, few metabolomic biomarkers discovered for exercise (e.g. specific acylcarnitine ratios or amino acid derivatives as fatigue markers) have been validated across independent populations, highlighting the challenge of moving from discovery to confirmed application.109,110
One more controversy centers on the actionability of untargeted findings in sports nutrition. While untargeted studies generate interesting lists of altered metabolites, coaches and nutritionists ask: how does this translate to improved training or diet plans? For example, discovering that endurance exercise raises metabolite A fourfold is scientifically intriguing, but does it help an athlete? If metabolite A is a marker of fatigue, perhaps yes – one could monitor it to gauge recovery. But if metabolite A's function is unknown, it's unclear what to do with that knowledge. Some critics thus view untargeted metabolomics as too far removed from practical guidance: lots of noise, hard-to-interpret signals, and only incremental gains in understanding. Proponents respond that this is a short-sighted view – today's obscure metabolite could be tomorrow's key to a new nutritional intervention (as lactate once was: once just a “byproduct,” now a focus for training strategies). They also note that metabolomics has already suggested practical angles, such as using certain metabolite ratios to detect overtraining before performance drops, or confirming that an antioxidant-rich diet mitigates specific oxidative metabolites after exercise.
Finally, a subtle but important controversy is untargeted vs. targeted metabolomics in sports research. Some experts advocate that targeted methods (measuring a set of known, hypothesis-driven metabolites, like specific hormone levels or known ergogenic biomarkers) are more reliable and easier to standardize, and thus more useful for advancing sports nutrition. Untargeted, in their view, should remain in the discovery realm and not be overinterpreted. Others argue that targeted and untargeted are complementary – the untargeted screens find new leads, and targeted follow-ups confirm and quantify them. A convergence is happening where many labs do a broad untargeted scan and then develop targeted assays for the highlights (like a short list of potential biomarkers).
The debates outlined often boil down to a fundamental question: can we trust and effectively use the complex data from untargeted metabolomics to meaningfully improve sports nutrition and performance? On one side, enthusiasts say yes – these data are revealing hidden aspects of metabolism (like microbiome contributions, novel signaling metabolites) that will form the basis of personalized nutrition and training programs. On the other side, skeptics worry that without greater rigor and standardization, untargeted metabolomics may produce more confusion than clarity, with each study finding its own set of “significant” metabolites and few consistent outcomes across studies. They also emphasize the need to validate findings in terms of actual performance outcomes: it's interesting if metabolite X increases with a supplement, but does that correlate with better endurance or recovery?
Both sides acknowledge certain needs: (1) improved standardization of workflows (from sample handling to data processing) to enhance reproducibility; (2) expanded reference libraries and sharing of metabolomic data to tackle the identification problem – for example, consortia pooling spectral data to identify unknowns that commonly appear in exercise studies; (3) integrative approaches combining metabolomics with other measurements (hormones, proteomics, genomics) to strengthen conclusions – a metabolite change linked with a gene expression change and a performance change is far more convincing than any alone. Indeed, the future may see hybrid pipelines that merge the breadth of untargeted with the focus of targeted: for example, using untargeted data to guide the creation of targeted multi-metabolite panels that can be routinely used for athlete monitoring.
Table 3 succinctly contrasts the main pros and cons with references. It is evident that while untargeted metabolomics has extraordinary exploratory power, it demands careful execution and interpretation. The controversies today are driving improvements: for instance, the argument over reproducibility has led to initiatives for more open data sharing and metabolomics quality control in multi-center trials. As methods mature, we expect the “pros” to strengthen (with more discoveries actually translating to practice) and the “cons” to be mitigated (with better reliability and clarity of data). The next section concludes with forward-looking perspectives on how these pipelines can be refined and integrated into the toolkit of sports nutrition.
Despite the challenges, there is a general optimism that most limitations can be addressed with advancing technology and better study design. For instance, comprehensive reference libraries and improved algorithms (including machine learning for spectral match) are gradually improving the metabolite ID success rate. International efforts like the Metabolomics Quality Assurance and Propagation (MQAP) have been established to improve reproducibility by standardizing protocols and providing quality control samples across labs. In sports metabolomics specifically, consortia are beginning to share data, which will help discern true biological signals from study-specific noise. The controversies have, in a sense, been productive: they highlight where we must focus (e.g. ensuring a metabolomics finding has physiological relevance and is not just statistically significant but trivial).
Pitch-side saliva and sweat sampling present matrix-specific issues. Salivary flow rate, pH, and oral contaminants influence untargeted signals; post-match studies describe both advantages and limits, emphasizing the need to record flow/pH and to normalize accordingly. For sweat, localized sweat-rate normalization (or area/time-based collection) materially changes biomarker variance structure and should be reported with device/calibration details.113 Field logistics favor capillary dried-blood-spot (DBS) or volumetric microsampling, which now support metabolomics/lipidomics and lactate-threshold determination in elite cyclists and other athletes. We therefore reference recent DBS/VAMS studies and suggest reporting validation steps (spot volume accuracy, punch location, hematocrit assessment) when using such approaches in athlete monitoring or race settings.114 Finally, biological covariates unique to or prevalent in sport can dominate untargeted readouts: menstrual-cycle phase and hormonal contraception shift substrate use and lipid metabolism, and low energy availability/REDs alters endocrine and metabolic status. We now advise pre-specifying sex-hormone status and cycle phase (or contraceptive class), or adjusting statistically, and screening for REDs where relevant to endurance cohorts.115
Moving forward, the field is poised to address these challenges head-on. Improved standardization of untargeted metabolomics workflows is underway, with community guidance consolidating best practices for LC-MS based untargeted metabolomics, including reference materials, data-quality review, and identification/annotation criteria; the 2024 mQACC/NIST workshop report outlines a living QA/QC guidance framework intended for broad adoption and continual updates.116 In parallel, an easily implementable QC protocol for routine monitoring of data quality (“QComics”) has been proposed and evaluated in 2024, facilitating transparent and reportable QC in discovery workflows.39 Collectively, these efforts enhance inter-laboratory comparability and are directly applicable to exercise and sports-nutrition studies. The advancement of analytical technology will also play a role. Beyond gains in sensitivity and mass accuracy, recent multimodal MS environments now enable structural elucidation of lipids and small molecules with electron-activated dissociation and MS imaging, improving sn-position and C
C localization and thereby reducing isomeric ambiguity in untargeted datasets. The 2024 MS-DIAL 5 platform exemplifies this shift, validating structure assignments against standards and NIST SRM 1950 plasma and explicitly supporting IM-MS and DIA-PASEF data types.117 Such tools, together with curated MS/MS libraries, directly address long-standing identification bottlenecks that have limited translation in sports metabolomics.
One of the most promising trends is the integration of metabolomics with other omes to build a comprehensive picture. This is now exemplified by the NIH MoTrPAC program, which in 2024 published a whole-organism temporal multi-omic atlas of endurance-training adaptations, spanning 19 tissues and nine molecular platforms with publicly accessible data resources. These data sets provide a rigorous substrate for validating metabolomic signals and linking them to proteomic, phospho- and epigenomic remodeling relevant to performance and recovery.118
In terms of practical applications, untargeted metabolomics is edging closer to real-world use. Recent athlete-specific resources illustrate feasibility at scale and with rigorous QC. For example, a 2025 Scientific Data descriptor provides a plasma metabolomics and lipidomics dataset in trained race-walkers sampled across four recovery time-points, reporting 859 untargeted metabolites plus >800 targeted analytes/lipids, with publicly accessible raw data and methods suitable for secondary analyses and biomarker discovery.119 Complementary pilot work continues to compare endurance and strength athletes versus sedentary controls, helping to operationalize panels relevant to training status and recovery kinetics.120 Such datasets support the development of practical, validated “wellness” panels derived from untargeted discovery and refined into targeted assays for longitudinal athlete monitoring. Recent trials also help scope personalization claims by reporting effect magnitudes and heterogeneity. In a randomized crossover study (n = 19), 2 weeks pistachio ingestion before a 75 km time trial resulted in a 4.8% slower performance and higher post-exercise 9,10-DiHOME alongside raffinose/sucrose translocation, quantitatively linking specific food components to oxylipin stress biochemistry; these oxylipins would be denoted as level 1–2 identifications depending on standard/MS/MS confirmation.121 Complementarily, a 4-arm crossover in 20 cyclists reported attenuated 9+13-HODEs and cytokines with carbohydrate intake from bananas or sugar beverages versus water, with model validation (R2Y/Q2Y as above), suggesting that recovery-panel thresholds can be anchored to observed fold-change ranges rather than qualitative trends.122 Together with field-deployable dried-blood-spot workflows, these data support cautious, validated movement toward routine athlete monitoring, while personalization should remain contingent on replicated effect sizes and explicit identification confidence reporting.114
Crucially, addressing the controversies has catalyzed a more rigorous approach in recent studies: greater use of validation cohorts and clearer separation of association versus causation, supported by field-level guidance synthesized for exercise researchers in 2024,123 which we now cite alongside the above QA/QC and software advances. For instance, metabolomics alone might show that a certain dipeptide increases after training; collaboration with biochemists and physiologists could then identify it as a marker of muscle protein turnover, giving it meaning. We see a future where metabolomic data will not exist in silos but will be part of a broader matrix including endocrinology, traditional biomarkers, and performance metrics. This will diminish the chance of overhyping any single metabolite and instead allow metabolomics to enrich our overall understanding of the athlete's adaptive landscape.
In conclusion, evaluating untargeted metabolomics pipelines – as we have done in this review – demonstrates that despite some drawbacks, these approaches are immensely valuable for sports nutrition research. The pros clearly offer transformative opportunities for personalized and optimized nutrition strategies in sport. The cons are being actively mitigated through technical, computational, and collaborative advancements. The focus of controversy has sharpened the science, pushing the community toward higher standards of evidence. Untargeted metabolomics has, in a short time, advanced from simply cataloguing “molecules that change with exercise” to generating targeted hypotheses about improving athlete health and performance – for example, suggesting nutritional interventions to influence specific metabolic pathways. Emerging evidence already quantifies the monitoring potential and begins to define where personalization may be justified. In elite cyclists, dried-blood-spot field sampling captured exertion signatures with two- to three-fold increases in lactate and succinate during graded tests, and larger fatty acid/acylcarnitine responses during long aerobic sessions; comparable signatures appeared during race sprints and climbs, supporting translational monitoring feasibility (n = 28 lab; n = 5 in-race).114 In a randomized crossover trial of 20 trained cyclists, banana or sugar beverages (vs. water) reduced post-exercise inflammatory and lipid-oxidation perturbations; multivariate modeling showed clear separation of treatment arms (OPLS-DA R2Y = 0.848; Q2Y = 0.409; permutation validation passed), with 109 metabolites increasing >2-fold and 71 decreasing <0.5-fold in the water condition, indicating quantifiable, diet-responsive recovery trajectories suitable for compliance and load tracking.122 Where named metabolites are confirmed against standards (e.g., lactate, succinate) we indicate identification at MSI/Schymanski level 1; library-matched acylcarnitines and other features are flagged as level 2, to make identification confidence explicit for any proposed monitoring panel.124 In the ongoing quest to understand how the human body responds to exercise and diet, untargeted metabolomics has proven itself as a powerful lens, and with continual refinements, it is poised to bring the blurry edges of our knowledge into ever sharper focus, benefiting both scientific insight and athletic practice.
| This journal is © The Royal Society of Chemistry 2025 |