Open Access Article
Vladyslav Dovhalyuk
and
Daniel Globisch
*
Department of Chemistry for Life Science, Science for Life Laboratory, Uppsala University, Sweden. E-mail: Daniel.globisch@kemi.uu.se
First published on 3rd June 2026
Untargeted fecal metabolomics has gained a higher scientific interest in the past decade, due to the increased importance of the gut microbiome metabolism. It is highly sensitive to preanalytical variations. However, the impact of sample collection, storage, and preparation on the metabolome composition remains insufficiently studied. In this study, we have systematically evaluated the effects of two sample preparation protocols: 5% DMSO/water solvation, followed by solvent substitution to methanol, referred to as the double-liquid extraction (DLE) protocol in this study, and methanol homogenization with FastPrep lysing matrices. Additionally, we examined long-term storage at −80 °C in a freezer, including fresh and freeze-dried samples, with and without methanol, at various stages of sample preparation. Global feature coverage and sensitivity analyses revealed that efficient mechanical homogenization is critical for maximizing metabolite recovery, particularly for intracellular and microbially derived compounds (enterolactone, allolithocholic acid). However, complete depletion of water and extraction in pure methanol resulted in a reduced feature coverage and lower signal intensities, particularly for polar metabolites such as proline and tyrosine. In the case of a freeze-drying step, reintroduction of water during sample preparation substantially improved extraction efficiency, underscoring the importance of maintaining controlled hydration of the fecal matrix to improve the solubility of the metabolite mixture.
Upon the initial breakthrough findings of the metagenomics analyses, fecal metabolomics has now emerged as a powerful tool to profile microbial metabolism to improve the understanding of the chemical interplay between the gut microbiota and their host.4 However, the collection and handling of fecal samples remain major challenges in fecal metabolomics. The fecal material undergoes biochemical changes immediately after defecation due to ongoing microbial activity, enzymatic reactions, exposure to oxygen, and temperature fluctuations. In established routine clinical workflows, delays between sample collection and laboratory processing are often unavoidable as it has been investigated in several studies.5,6 Transport conditions, patient compliance, and temperature differences during transit can substantially influence the sample composition.7 As a result, standardizing fecal sample handling from collection to laboratory analysis is critical for ensuring reproducible and biologically meaningful results.8
Storage conditions add another layer of complexity. Fecal samples can be stored freshly frozen, lyophilized, or preserved in stabilization solutions with organic solvents such as methanol. Each strategy can differentially affect microbial viability, enzymatic activity, metabolite stability, and the formation of new compounds through degradation of labile compounds.9–11 Despite widespread use of these approaches, there is currently no clear consensus on which preparation and storage conditions best preserve the original fecal metabolome.1,12,13
In metabolomics studies, sample preparation introduces additional variability. Unlike plasma, urine, or tissue, fecal material lacks a standardized extraction strategy because of its heterogeneous composition, variable water content, and high microbial load. Moreover, the optimal workflow for metagenomic microbiome profiling is not suitable for standard global metabolomics analysis. Methods developed for microbial DNA stabilization do not guarantee chemical stability of metabolites and chemicals used for DNA/RNA preservation are not compatible for mass spectrometric analysis. In here, we systematically evaluate combinations of sample preparation and storage conditions to assess their relative impact on the fecal metabolome profiles under a controlled experimental setting to provide a comparative analytical framework for understanding workflow-dependent variability.
| No. | Sample preparation | Freeze-dried after collection | Long-term storage condition | Collection days |
|---|---|---|---|---|
| 1 | Double-liquid extraction | No | Fresh storage | 1–7 |
| 2 | Homogenization (lysing matrix D) | No | Fresh storage | 1–7 |
| 3 | Double-liquid extraction | Yes | Dry storage | 1–7 |
| 4 | Homogenization (lysing matrix D) | Yes | Dry storage | 1–7 |
| 5 | Homogenization (lysing matrix D) | No | Methanol storage | 1–7 |
| 6 | Homogenization (lysing matrix D) | No | Homogenized storage | 2–3, 5–7 |
| 7 | Homogenization (lysing matrix D) | Yes | Methanol storage | 1–7 |
| 8 | Homogenization (lysing matrix D) | Yes | Homogenized storage | 1–7 |
| 9 | Homogenization (lysing matrix E) | No | Homogenized storage | 2–5 |
| 10 | Homogenization (lysing matrix D) | No | Fully prepared and dried | 2–7 |
| 11 | Double-liquid extraction | Yes | Fully prepared and dried | 1–7 |
Prior to the collection of the samples, each aliquot tube was loaded with a mixture of isotopically labeled stability standards to monitor extraction reproducibility and signal stability throughout the study. Tubes were spiked with isotopically labeled tyrosine (13C9, 15N, 5 μg mL−1), phenylalanine (13C9, 15N, 10 μg mL−1), and valine (13C5, 30 μg mL−1), followed by drying using a SpeedVac.
Each daily stool sample was divided into 11 conditions designed to evaluate the impact of the extraction method, hydration state, and storage solvent on metabolome stability. Conditions were selected based on commonly used protocols in metabolomics workflows: for “fresh storage” conditions, aliquots were processed immediately without freeze-drying. “Dry storage” conditions involved overnight lyophilization. Homogenization with lysis matrix D, lysis matrix E, and double-liquid extraction corresponded to variations in mechanical disruption and solvent ratios, as detailed below. Methanol storage conditions consisted of immediate submersion of the aliquot in cold LC-MS grade methanol (100%) to rapidly inactivate enzymatic and microbial activity. Milli-Q water and MS grade solvents were used throughout the experiment. All mixture proportions are expressed as v/v.
Two validated extraction protocols were used, depending on the condition type:
1. Double-liquid extraction protocol
Samples after freeze-drying overnight, weighed and solvated with dimethyl sulfoxide (DMSO) and water (1
:
20) to a final equivalent of 60 mg mL−1. Tubes were sonicated for 5 min and centrifuged (13
400g, 5 min). Supernatants were normalized by volume across all samples, aliquoted, and freeze-dried overnight. For the second extraction, dried pellets were reconstituted in 50 µL water, followed by the addition of 240 µL cold methanol. After a brief freezing step (−20 °C, 1 h), samples were centrifuged, supernatants collected, dried with SpeedVac, and reconstituted in 50 µL 5% acetonitrile (ACN)/water prior to LC-MS injection.
Condition 1 aliquots were frozen within 30 min after collection, and sample preparation was performed after long-term storage. Condition 3 aliquots were freeze-dried overnight, and the dry pellets were stored at −80 °C, and sample preparation was continued after long-term storage. Condition 11 aliquots were completely prepared before long-term storage, and the dry pellets before reconstitution in 5% ACN were frozen.
2. Homogenization with lysing matrices D and E (FastPrep-based protocols)
Selected conditions employed bead-beating homogenization adapted from the solid tissue extraction protocol. Aliquots were mixed with 4 µL of 100% methanol per mg of fresh fecal material. For lysing matrix D, homogenization beads were added in a 1
:
1 mass ratio to the Eppendorf tubes. For lysing matrix E, commercially available Fast-Prep tubes that already contained beads were used. Both lysing matrices were obtained from MP Biomedicals. Samples were homogenized using FastPrep at 4 m s−1 in three cycles (20 s shaking, 30 s rest). Supernatant were aliquoted to new tubes, incubated at −20 °C for 1 h, centrifuged, and dried by SpeedVac. Final reconstitution was performed in 5% ACN/water at a volume normalized to sample weight.
Conditions 2, 5, 6, 9, and 10 were treated as fresh samples. Conditions 4, 7, and 8 were freeze-dried before homogenization. Conditions 2 and 4 were frozen fresh and freeze-dried, respectively, and sample preparation was performed after long-term storage. Conditions 5 and 7 were solvated with Methanol and placed for long-term storage, with sample preparation followed before analysis. Conditions 6, 8, and 9 were homogenized with respective beads and frozen with beads in tubes. Condition 10 was completely prepared before storage and frozen as a dry pellet before final reconstitution.
Storage strategies differed by hydration state (fresh vs. dried) and solvent environment: fresh samples were kept on ice before homogenization and extraction or long-term storage at −80 °C. Dried samples underwent lyophilization and were stored dry at −80 °C before starting additional steps. Methanol-stored samples were immediately submerged in ice-cold MS-grade methanol to suppress metabolism. Non-methanol storage conditions kept the fecal material in its assigned state (fresh or dry) at −80 °C until extraction. All samples were protected from thawing, oxygen exposure, and microbial activity as much as possible. No sample underwent more than one freeze–thaw cycle. Samples were in long-term storage for at least 5 months.
Due to practical constraints during the sample preparation and storage, not all condition-day combinations were available for all sampling days (Table 1). In particular, conditions 6 and 10 were originally planned across all sampling days, but selected condition-day combinations were excluded due to incomplete replicate sets as a result from sample loss. To maintain consistency in the replicate structure, only condition-day combinations with complete triplicates were included for statistical analysis. Condition 9 (lysing matrix E) was additionally included as an exploratory condition with limited combinations and was therefore excluded from the multifactorial analysis.
:
1)] was used for internal calibration at the beginning of each run, in a segment between 0.1 and 0.3 min. The samples were injected into the UPLC-MS system in a randomized order with QC samples injected at the beginning and end of the sample list in both ionization modes, as well as after every eight samples. The standard deviation of the mass calibration for all injections was below 0.5 ppm.
000 ion count for positive ion mode and intensities > 10
000 ion count for negative ion mode, retention time >1 min and <18 min, and RSD of the QCs < 30% were selected for further statistical analysis. The intensities of the internal standards and the QC samples were plotted against the UPLC-MS/MS sample injection order to evaluate the stability and performance of the experimental set over time. An overview of the data was provided by principal component analysis (PCA).Statistical analysis for each metadata factor was performed using a linear regression model with cofactor adjustment utilizing in-house R scripts. For each feature, models of the form intensity ∼ factor_of_interest + sampling_day + cofactors were fitted, where one factor was evaluated at a time while adjusting for the remaining experimental variables. Linear models were fitted using the available observations without imputation of missing condition-day combinations. Sampling day was included as a covariate to account for temporal variability. For Fold change analysis, estimated marginal means (EMM) were calculated using emmeans R package.17 P-Values were not adjusted for multiple testing, and statistical significance was interpreted in combination with effect size thresholds and feature filtering criteria, consistent with the exploratory and method development nature of the study.
For untargeted selectivity and sensitivity analysis, stable features were filtered based on the following criteria: average intensity greater than 20
000 ion counts and a relative standard deviation (RSD) below 30% within a given sampling day and experimental condition. Selectivity was defined as the breadth of metabolite detection and quantified as the number of stable features per condition and sampling day. Sensitivity was defined as the typical signal intensity of detectable metabolites within a given condition. Sensitivity was quantified by calculating, for each sampling day and condition, the median of the average intensities of all stable features passing the filtering criteria. We used the ggplot2 R package for creating statistical plots, and Biorender for graphical illustrations.
We have used a combination of structure elucidation with our in-house chemical library of authentic standards as well as the SIRIUS software to classify metabolites into four confidence levels (CL).24
Fecal samples were separately prepared based on the 11 conditions of different sample preparation protocols and storage (Table 1) and subsequently investigated each condition via UHPLC-MS/MS analysis in a randomized sequence with quality control samples included. The mass spectrometric data were pre-processed using XCMS, that was followed by statistical analysis via in-house R scripts.
The global feature coverage and sensitivity analysis offers a complementary, systems level perspective on the performance of the different methodologies. Summarizing 15
658 of untargeted features across all conditions and sampling days provides the possibility to obtain an integrated evaluation of extraction robustness, analytical sensitivity, and temporal variability that cannot be captured by isolated factor-based analyses alone.
Across conditions, global feature coverage revealed consistent differences in the number of stable, detectable features recovered by each protocol ranging from 1000–5000 (Fig. 2A), whereas sensitivity heatmaps further discriminated conditions based on the overall signal intensity of stable metabolites (Fig. 2B). Together, these complementary metrices provide a balanced evaluation of the analytical performance, capturing both the breadth of metabolite recovery (selectivity) and the typical signal strength of extracted features (sensitivity; Fig. 2C). Protocols combining beads-based homogenization with non-lyophilized samples generally achieved higher feature coverage and greater signal intensities, indicating more efficient extraction of a broader range of metabolites. In contrast, freeze-dried samples reconstituted in pure methanol consistently yielded fewer detectable features and reduced signal intensities. Although the DLE protocol showed intermediate performance in terms of global selectivity and sensitivity, it demonstrated comparatively high robustness toward variations in sample preparation and storage conditions. Importantly, differences observed at the level of global selectivity and sensitivity were modest when compared with the pronounced day-to-day variability present in the dataset, likely reflecting biological and pre-analytical factors such as dietary intake and sampling time. However, despite this dominant temporal variability, several metabolite classes exhibited clear and reproducible preferences for specific storage and extraction strategies. These findings underscore the necessity of strictly standardized sample preparation and storage protocols within and across studies to minimize technical variability and to avoid confounding intra- and inter-individual biological differences.
Multivariate principal component analysis (PCA) revealed a clear clustering mainly based on the extraction and homogenization approach (Fig. 3A). Samples prepared using a double-liquid extraction protocol (5% DMSO/water solubilization followed by methanol extraction) formed a distinct cluster, indicating a characteristic metabolite pattern associated with the solvent-based extraction.
In contrast, beads-based homogenization protocols using lysing matrix D from MP Biomedicals have a much wider spread, while lysing matrix E clustered more closely. Lysing matrix E contains additionally smaller spheres to also homogenize microbial cells completely. Noteworthy is the fact that only four instead of seven samples were treated with matrix E. Matrix E conditions were not included in the multifactorial analysis due to the smaller sample size and the lack of combinations with other factors. Thus, matrix D and E were compared separately (Fig. S1). These differences likely stem from an improved mechanical disruption efficiency, bead composition, and physicochemical selectivity of each workflow. QC samples consistently formed a tight cluster across all analyses, confirming the high analytical reproducibility and validating that the observed variations are originated from biological and methodological factors.
Daily sampling over 10 days from one healthy volunteer enabled assessment of short-term biological variability within a single individual. The PCA plot of sampling days demonstrates the close clustering of the first five days, while quite pronounced day-specific clustering was observed for days 6 and 7 (Fig. 3B). This observation is consistent with previous reports that the individual fecal metabolome varies substantially over time due to changes in diet, microbial activity, and intestinal transit.34 Despite these differences, most days partially overlapped and only gradually shifted, reflecting the shared individual core metabolome. These observations highlight the intra-individual temporal variability, which is important when designing longitudinal fecal metabolomics studies and reinforces the necessity of multi-day sampling when feasible.
To closely investigate the differences between sample preparation protocols, we compared the DLE workflow with a procedure using bead-based homogenization using lysing matrix D. For this comparison, we applied a linear regression model adjusted for the sampling day, the different lyophilization statuses, and storage with or without methanol. The volcano plot shows a broadly balanced distribution of up- and downregulated features between homogenization and the double-liquid extraction. However, features upregulated under double-liquid extraction display a wider dispersion in both fold change and statistical significance (Fig. 4A). Pathway annotation of significantly altered features within the Sankey diagram highlights extraction preferences for specific compound classes (Fig. 4B). Amino acids including proline (CL2), aspartic acid (CL1), tyrosine (CL2), and tryptophan (CL1) were significantly upregulated in samples processed using the DLE protocol. Additionally, a series of glycine-containing dipeptides, such as glycyl-phenylalanine (CL2) and glycyl-tyrosine (CL1), were also enriched under the DLE conditions. Based on these results, amino acids and short peptides exhibited a higher extraction efficiency and broader coverage with the DLE extraction method compared to methanol-based homogenization. This effect is likely attributable to the physicochemical properties of DMSO, which is known to induce a reduction of the cell membrane and an increase of the membrane fluidity, thereby facilitating the release of intracellular and matrix-associated polar metabolites.35 Representative boxplots of five selected metabolites within sample-preparation factor are shown in Fig. 4C. Boxplots of a selection of annotated significant metabolites are provided in the SI. Due to the partially unbalanced design, estimated effects represent conditional comparisons within the observed data structure rather than fully orthogonal factor effects. As a result, partial confounding between workflow factors and sampling day cannot be fully excluded.
In contrast, methanol rapidly denatures proteins and promotes localized aggregation, which can entrap small polar metabolites and reduce their effective recovery. Consistent with this interpretation, heterocyclic nucleobases and nucleosides, including inosine (CL2), guanosine (CL2), and uracil (CL2), were also significantly upregulated in samples processed via the DLE workflow. These compounds have a high polarity and have been associated with microbial nucleotide turnover, making their recovery particularly sensitive to solvent polarity and protein precipitation.36 In contrast, methanol-based homogenization demonstrated a slightly improved extraction of more hydrophobic metabolites. Lipid-associated compounds such as hexadecasphingosine (CL2), sphingosine (CL2), and N-lauroyl glutamic acid (CL2) were upregulated in methanol-homogenized samples, consistent with the strong solubilizing capacity of methanol for amphipathic and lipid-like molecules.37 Microbial metabolites and gut microbiota metabolized dietary compounds like enterolactone (CL1) and solanidine (CL2) were also significantly affected by beads homogenization protocols compared with the DLE protocol. This suggests that their recovery depends strongly on the mechanical disruption and extraction solvents. As an example, a more detailed examination of individual sample preparation conditions revealed that the dependence of enterolactone levels on the preparation method with lower values with the double liquid extraction. One potential reason is that the aqueous conditions may promote reduced recovery and partial lactone ring opening under hydrolytic sample preparation conditions (Fig. 4D). In contrast, the highly polar and water-soluble vitamin pantothenic acid (CL2) was significantly downregulated under methanol-extraction conditions, which further supports the assumption that methanol homogenization has limited extraction efficiency for small polar metabolites.
To specifically assess the performance of lysing matrix E, we compared Condition 9 with Condition 6, which serves as its lysing matrix D counterpart, as all other experimental factors were identical. Sampling day was included as a cofactor in the statistical model. This comparison revealed more than 700 features showing significant differences (p < 0.05, fold change ≥ 1.5). Annotation and compound-class prediction suggested that lysing matrix E preferentially enhanced extraction of carbohydrates, shikimates, and alkaloids such as nicotinic acid (CL1), adenine (CL1) and guanine (CL1). These metabolite classes are commonly associated with intracellular pools or metabolites localized within microbial cells, which is consistent with the design of Matrix E beads that include smaller spheres intended to improve microbial cell disruption and release of intracellular content. In contrast, lysing matrix D showed higher abundances of several features tentatively annotated at CL 3 as indoles or compounds from the tryptophan pathway. As lysing matrix E contains additional smaller beads for optimized homogenization of microbial cells, the upregulation of these microbial metabolites is a result of an improved release of intra-cellular bacterial content. Additional targeted validation is required to confirm whether these observations reflect systematic extraction biases or compound-specific effects. Comparative highlights of lysing matrices D and E are provided in the SI (Fig. S1).
Freeze-drying is commonly employed to stabilize fecal material, normalization, simplify storage, and improve homogeneity. However, its impact on the metabolite profiles remains insufficiently studied. Comparison of fresh and freeze-dried conditions demonstrated a partial overlap with some separation in the PCA (Fig. 3C), indicating that lyophilization introduces systematic chemical shifts. Freeze-drying may alter microbial enzymatic activity and volatile compounds differently prior to drying and after thawing than with the other sample preparation processes. Despite these differences, freeze-dried samples demonstrated relatively low within-group dispersion, suggesting that lyophilization produces a chemically more stable matrix.
Application of a cofactor-adjusted linear model identified that the majority of significantly altered features were associated with extraction conditions without lyophilization were used. The volcano plot clearly shows larger numbers of 2649 upregulated features for samples that were not lyophilized compared to 147 downregulated features (Fig. 5A). Among the annotated metabolic pathways, metabolites within the Fatty acids, Alkaloids and Amino acids and short peptides were the ones of the most affected (Fig. 5B). The following metabolites from diverse compound classes, particularly pantothenic acid (CL2), spiculisporic acid (CL2), nicotinic acid (CL1), and xanthine (CL2) have been found to be significantly upregulated in samples that were prepared without prior lyophilization. Several studies have reported that lyophilization can increase the number of metabolites due to enhanced cell lysis or membrane disintegration, while volatile metabolites are at risk of being removed.38,39 In contrast, our data shows that fresh fecal samples provide a broader metabolite coverage and higher extraction efficiency in comparison to lyophilized samples. Reduced performance of freeze-dried samples extracted in pure methanol may be due to insufficient rehydration or altered solvent–matrix interactions of the dried matrix. In the DLE protocol, the samples that were reconstituted with a 5% DMSO/water solution under sonication yielded a lower homogenization effect in comparison to the beads-based homogenizer due to insufficient mechanical disruption. This is potentially limiting the release of intracellular and matrix-associated metabolites. The reason for the detection of amino acid derivatives at reduced levels in the lyophilized samples exemplified by proline (CL2), valyl-valine (CL2), ophthalmic acid (CL1), and 4-guanikdinobutanoic acid (CL1/Fig. 5C). As summary, while sufficient homogenization is achieved in the bead homogenization protocol, the pure methanol solution had a reduced efficiency for highly polar compounds.
Next, we investigated whether performing the majority of the sample preparation workflow immediately when receiving samples and before long-term storage has a positive affected on the metabolome stability or not. The PCA demonstrated partial separation between samples prepared before storage and those prepared after thawing (Fig. 3D), while the magnitude of this effect was smaller relative to freeze-drying or methanol exposure. These findings support the idea that completing as many extraction steps as possible before long-term storage can potentially reduce variability. It needs to be decided if the observed minor effect is sufficient to compensate for the increased complexity of sample handling and collection to storage timing.
Because fecal material remains enzymatically and microbially active immediately after collection, some workflows use organic solvents such as methanol to deactivate biological processes and stabilize metabolites before storage. PCA analyses indicate that methanol exposure during long-term storage produced a small shift in the global metabolite composition with a substantial overlap between conditions (Fig. 3E).
To relatively quantify these differences, we performed a linear regression model adjusting for the sampling day, extraction protocol, and freeze-dry status. Methanol storage significantly affected 381 out of 8895 metabolomic features, with both increases and decreases observed across diverse chemical classes. NPClassifier annotation of the significantly altered features revealed that several molecular families were particularly impacted by methanol storage. While methanol exposure at a global level during long-term storage resulted in only minor compositional shifts, more pronounced differences were observed for individual features, metabolites and specific metabolite classes. Within the amino acids and peptide derivatives pathways, N-acetyl-phenylalanine (CL1) and nicotinic acid (CL1) were significantly reduced under methanol storage. Similarly, nucleotide-related metabolites, like xanthine (CL2) were significantly reduced in the samples stored in methanol (Fig. 6A). These shifts may reflect methanol-induced chemical extraction biases, selective solubilization of polar metabolites or quenching of enzymatic pathways that otherwise lead to production of metabolites during the storage. While these changes are limited to a global context, they are analytically relevant for studies targeting specific metabolite classes sensitive to the investigated conditions. Notably, uracil (CL2) exhibited a distinct and informative behaviour. Although it appeared significantly altered across multiple experimental factors, the drivers of its elevated levels in condition 2 (homogenized after storage from wet sample) were not immediately evident. Closer analysis revealed that uracil was consistently upregulated under the DLE protocol, whereas it was either absent or detected only at trace levels in methanol-stored samples. One-way ANOVA followed by Tukey's HSD post-hoc test revealed no significant differences among methanol-stored conditions irrespective of when bead-beating homogenization was applied (Fig. 6B). Lyophilization combined with subsequent homogenization under methanol-based extraction resulted in a pronounced reduction in uracil abundance. Together, these observations indicate that uracil levels are primarily influenced by extraction chemistry with no significant dependence on whether extraction was performed before or after long-term storage, if methanol extraction is preferred, storage must be performed with not lyophilized or solvated sample (Fig. 6B).
The consistent suppression of these metabolites suggests that long-term methanol storage may promote degradation or reduce the extraction efficiency. This potential effect needs to be considered for the interpretation of studies targeting nucleosides, vitamins, or purine metabolism.40 In contrast, the higher levels observed in samples stored without methanol may reflect DNA degradation by microbial activity, suggesting a greater level of microbial suppression by methanol storage. Targeted comparisons of individual sample preparation factors, compound classes, and representative metabolites provide particular insights into the most significantly altered differences. Several metabolites can be identified to be significantly altered in a global perspective, however, these do not retain significance for individual factor comparison after cofactor adjustment. This suggests that their variance was distributed across multiple experimental variables rather than driven by a single preparation or storage factor. For example, methylthioadenosine sulfoxide (CL1) and 6-hydroxycoumaran (CL1) exhibited significant differences across all grouping factors, indicating that these compounds are broadly sensitive to sample handling and storage conditions. Contrary, 4-vinylphenol (CL1) was not observed with significant variations across any the tested conditions, suggesting a high degree of robustness to differences in sample preparation and storage.
Our results reveal a metabolomic profile shift associated with long-term methanol storage, particularly affecting purine and pyrimidine-related metabolites when compared with freezing of fresh material. At the global level, selectivity and sensitivity analyses revealed consistent, yet relatively modest, differences between sample preparation and storage conditions, particularly when contrasted with the pronounced day-to-day variability observed across sampling days. This temporal variability likely reflects biological and pre-analytical factors such as diet and sampling time, which highlights the inherent complexity of fecal metabolomics. As this study is based on repeated sampling from a single individual, these observations reflect workflow-dependent effects within a controlled biochemical background without capturing inter-individual variability.
Despite this dominant day-to-day variability, compound-class-specific analyses revealed reproducible preferences for particular extraction and storage strategies. Protocols incorporating beads-based homogenization of non-lyophilized samples generally provided broader metabolite coverage and higher signal intensities, supporting their suitability for untargeted applications. When freeze-drying is required, reintroduction of water prior to extraction appears important for preserving extraction efficiency, particularly for polar metabolites. While exhibiting intermediate global performance, the DLE protocol demonstrated comparatively consistent performance across replicates and sampling days, suggesting potential utility in workflows where methodological flexibility is required.
Several practical considerations can be outlined based on the data in this study. For broad untargeted metabolite coverage and higher signal intensities, protocols incorporating bead-based homogenization of non-lyophilized samples are generally preferred. For polar metabolites, extraction conditions that preserve or reintroduce water should be prioritized, while extraction in pure methanol after freeze-drying may reduce coverage. For workflows requiring long-term storage without immediate processing, methanol-based storage introduces only modest global variation but may selectively affect specific metabolite classes. For studies prioritizing consistency across preparation and storage conditions, the DLE protocol provides stable performance with moderate global coverage.
In summary, our findings emphasize that the choice of sample preparation and storage strategy should be guided by the analytical objective of global analysis and the metabolite classes of interest in targeted applications. Maintaining the same sample handling procedure for all samples remains the most essential factor to minimize technical and preanalytical variability. Overall, our results provide a comparative analytical overview of how sample preparation and storage workflows influence the fecal metabolome. Our data provides new information for method selection within controlled experimental designs. Translation of these findings to broader clinical or population-level studies requires further validation across clinical cohorts.
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d6an00237d.
| This journal is © The Royal Society of Chemistry 2026 |