Open Access Article
Laura A. E. Van Poelvoorde
*a,
Bert Bogaertsa,
Kevin Vannestea,
Bavo Verhaegenb,
Veronik Hutsec,
Sigrid C. J. De Keersmaecker
a,
François E. Dufrasned,
Koenraad Van Hoordeb and
Nancy Roosens*a
aTransversal activities in Applied Genomics, Sciensano, Brussels, Belgium. E-mail: laura.vanpoelvoorde@sciensano.be; nancy.roosens@sciensano.be
bFoodborne pathogens, Sciensano, Brussels, Belgium
cEpidemiology of infectious diseases, Sciensano, Brussels, Belgium
dNational Influenza Centre, Department of Infectious Diseases in Human, Laboratory of Viral Diseases, Sciensano, Brussels, Belgium
First published on 25th March 2026
Wastewater-based epidemiology (WBE) has emerged as a powerful tool for community monitoring, offering a non-invasive, cost-effective means of assessing public health. PCR-based methods, including digital droplet PCR (ddPCR), combined with reverse transcription (RT) for RNA viruses, are widely used for detection of known pathogens in wastewater. Recently, sequencing approaches have gained importance. Targeted amplicon sequencing enables high-sensitivity detection of specific pathogens, while metagenomic sequencing provides untargeted profiling of microbial and viral communities, supporting surveillance of novel pathogens and antimicrobial resistance (AMR). A major challenge across all genomic approaches remains the efficient concentration and extraction of nucleic acids, particularly for low-abundance targets like respiratory viruses. This study aimed to evaluate the fit-for-purpose of concentration and extraction protocols for wastewater surveillance, focussing on their suitability for genomic downstream applications: (RT-)ddPCR, amplicon sequencing and metagenomics. Both RNA (viral pathogens) and DNA (AMR markers) targets were included and experiments were performed on raw and spiked wastewater. The results revealed substantial variability in method performance. Ultrafiltration with a concentrating pipette (InnovaPrep) consistently yielded lower viral concentrations, whereas precipitation-based methods, magnetic capture and a column-based concentration/extraction kit provided higher yield and better sensitivity. PCR inhibitors reduced targeted sequencing efficiency, while metagenomics produced consistent taxonomic profiles across methods. Overall, the study highlights the importance of protocol optimization to improve sensitivity in WBE. Optimization of protocols, particularly for detecting low abundance pathogens and inhibitor removal, remains essential. Broader validation across diverse wastewater matrices and microbial species will be critical to strengthen the role of WBE in public health surveillance.
Water impactThis study evaluates the suitability of concentration and extraction protocols for genomic applications in wastewater-based epidemiology. Identifying effective methods for recovering pathogens from wastewater is critical for public health surveillance. By improving sensitivity and consistency, these optimized protocols enhance early-warning systems for infectious diseases and antimicrobial resistance. The findings support more reliable wastewater monitoring strategies, enabling communities to respond more rapidly to emerging health threats. |
WBE has previously been effective in monitoring enteric viruses such as poliovirus, norovirus and hepatitis A virus,4,5 as well as bacterial pathogens including Salmonella and Escherichia coli,6 and has more recently gained attention for its role in SARS-CoV-2 surveillance.7 PCR-based methods are the most commonly used downstream detection approaches in WBE with digital droplet PCR (ddPCR) and reverse transcription ddPCR (RT-ddPCR) offering high sensitivity and absolute quantification of nucleic acid targets. RT-ddPCR methods have been widely applied for the detection of viral pathogens such as SARS-CoV-2,8–10 respiratory syncytial virus (RSV),11 and influenza viruses,11 while ddPCR methods have been used to detect antimicrobial resistance (AMR) genes.12 In the context of wastewater surveillance, ddPCR is a powerful and widely used platform due to its absolute quantification and its reduction in the biases linked to the PCR inhibitors.13 Recently, also sequencing approaches have become more prominent in WBE. Targeted amplicon sequencing enables high-resolution monitoring of specific circulating pathogens and their variants.14 In contrast, metagenomics offers an untargeted and comprehensive view of microbial and viral communities in wastewater. While metagenomics remains primarily used in research settings, it holds great promise for the detection of novel and emerging pathogens as well as AMR markers their bacterial hosts.15 However, sensitivity remains a major challenge, as sequencing will typically large amounts of background DNA and RNA, while targets of interest may represent only a very small fraction of the total reads.
Despite significant technological advances in downstream genomic analysis, a major bottleneck remains: the efficiency of pathogen concentration and nucleic acid extraction methods. This limitation is particularly critical for detecting low-abundance targets, such as respiratory viruses or single-copy AMR genes within rare bacterial species, within complex matrices like wastewater. Enhancing these upstream processes is essential to improve the sensitivity and reliability of WBE. Indeed, pathogen detection in wastewater presents distinct challenges compared to clinical testing. Whereas clinical samples are collected directly from individuals with a known or suspected infection, and typically contain relatively high viral concentrations, wastewater samples exhibit much lower virus loads, and are further challenged by a complex matrix with numerous PCR inhibitors and diverse background DNA.16 Therefore, effective concentration and extraction methods are necessary to achieve sensitive and reliable detection without concentrating the inhibitors, especially for viruses, that are present at low concentrations in wastewater such as SARS-CoV-2, respiratory syncytial virus (RSV) and influenza viruses,17 and AMR genes that are present at low concentrations.12 Despite the widespread use of various concentration and extraction methods, there is little consensus on which one is the most efficient for specific downstream applications.18–20 Another consideration is whether to analyse the liquid or solid fraction of wastewater, or both, as different fractions may capture different pathogen populations.21,22
To address the current uncertainty regarding optimal concentration and extraction methods for wastewater-based surveillance of pathogens and AMR markers, this study aims to evaluate the fit-for-purpose of several protocols that have been applied in wastewater based epidemiology, with a focus on their suitability for typical genomic downstream applications, namely digital droplet PCR (ddPCR), reverse transcription digital droplet PCR (RT-ddPCR), amplicon sequencing, and metagenomics. In contrast to previous method-comparison studies that typically focus on one analytical approach, this study integrates (RT-)ddPCR, targeted amplicon sequencing, and untargeted metagenomics. Moreover, within the study RNA viral pathogens and DNA AMR markers are evaluated together, thus providing an assessment of concentration and extraction methods across multiple downstream genomic applications, rather than evaluating performance based on a single metric or endpoint. The assessment includes four concentration methods previously described in wastewater application, including PEG/NaCl precipitation,23 magnetic capture (Nanotrap magnetic virus particles),24 ultrafiltration with a concentrating pipette (InnovaPrep),25 and ultracentrifugation,26 in combination with two extraction kits, including QIAamp Viral RNA Mini Kit and QIAamp RNeasy PowerFecal Pro Kit. In addition, the ZymoPURE Water DNA/RNA Kit was included as it represents a commercial and integrated approach combining concentration and extraction into a single workflow. Rather than focussing on direct comparisons between individuals methods, the goal of this study was to evaluate which methods can consistently yield sufficient nucleic acid of adequate purity to support reproducible downstream results across multiple workflows. Within this study, the performance of the methods was assessed in terms of measured target concentrations, by the success of downstream sequencing, and by the consistency of results across multiple sampling dates and locations, rather than by technical replicate reproducibility or absolute nucleic acid yield alone. Both DNA (two AMR targets, namely blaCTX-M and tetM) and RNA (norovirus serotypes GI and GII, SARS-CoV-2, RSV A and B, and influenza A and B) targets were included to ensure a comprehensive evaluation. Given that some RNA targets such as respiratory viruses are typically only present at low concentration in raw wastewater samples, sensitivity and comparability may be compromised. Therefore, these methods were tested not only on raw (non-spiked) wastewater samples, but also on samples spiked with SARS-CoV-2, RSV, and influenza viruses, which are generally close to the limit of detection in raw samples. This dual approach enabled a more robust assessment of method performance under both realistic and controlled conditions.
To evaluate the concentration and extraction methods, two types of samples were used, i.e., 50 mL of raw wastewater; and 50 mL of spiked wastewater. To ensure consistency across workflows, a starting volume of 50 mL wastewater was processed for all concentration methods. The spiked wastewater was prepared by adding at each time point (n = 5) clinical samples that contain influenza A and B virus, RSV A and B virus or SARS-CoV-2 virus to 600 mL of raw wastewater sample to obtain a final concentration of either 50 or 100 cp μL−1 (Table 1). The initial concentration was measured using their respective RT-ddPCR method described below. These samples were obtained through the Belgian National Influenza Centre from the two main surveillance systems in Belgium, ILI for mild infections and SARI for moderate and severe infections. These spiked wastewater samples will serve as positive controls to verify method performance under controlled higher-input conditions for viral targets that are typically present at low concentrations in raw wastewater. These spiked samples didn't undergo any additional pre-treatment steps. The spiked wastewater was used to enable relative, method-to-method comparison of concentration and extraction performance under controlled conditions because calculating the absolute recovery efficiencies will prove challenging due to background levels of the same targets already present in the sample.
| Sample | Pathogen | Final concentration (cp μL−1) |
|---|---|---|
| H3N2_23-IS-00280 | Influenza A H3N2 | 100 |
| H1N1_23-IS-01030 | Influenza A H1N1 | 50 |
| RSVA_A2 | RSV A | 100 |
| RSVB_WV/146 | RSV B | 50 |
| B_VIC_23-IG-00115 | Influenza B (VIC) | 50 |
| SC2_23-IS-00311 | SARS-CoV-2 | 100 |
An overview of sample handling, storage conditions, concentration and extraction workflows, target types, and downstream analytical methods is provided in Table S1.
Two Qiagen kits were tested for the nucleic acid extraction: the QIAamp RNeasy PowerFecal Pro Kit, used with all concentration methods, and the QIAamp Viral RNA Mini Kit, only paired with the PEG/NaCl precipitation concentration method. Both kits were used according to the manufacturers' instructions, with slight modifications to accommodate sample volume (see SI Methods). The QIAamp RNeasy PowerFecal Pro and ZymoPURE Water DNA/RNA kit have inhibitor removal steps included in their protocol.
| Target | Name | 5′ → 3′ sequence | Ref. |
|---|---|---|---|
| blaCTX-M | blaCTX-M_FW | ACCAAYGATATYGCGGTKAT | 12 |
| blaCTX-M_RV | ACATCGCGRCGGCKYTCT | ||
| blaCTX-M_PR | FAM/TCGTGCGCCGCTG/MGB-Eclipse | ||
| tetM | tetM_FW | GGTTTCTCTTGGATACTTAAATCAATCR | 12 |
| tetM_RV | CCAACCATAYAATCCTTGTTCRC | ||
| tetM_PR | HEX/ATGCAGTTATGGARGGGATACGCTATGGY/IABkFQ | ||
| RSVA | RSVQA1 | GCTCTTAGCAAAGTCAAGTTGAATGA | 27 |
| RSVQA2 | TGCTCCGTTGGATGGTGTATT | ||
| RSVQA_PR | FAM/ACACTCAAC-ZEN-AAAGATCAACTTCTGTCATCCAGC/IABkFQ | ||
| RSVB | RSVQB1 | GATGGCTCTTAGCAAAGTCAAGTTAA | 27 |
| RSVQB2 | TGTCAATATTATCTCCTGTACTACGTTGAA | ||
| RSVQB_PR | HEX/TGATACATT-ZEN-AAATAAGGATCAGCTGCTGTCATCCA/IABkFQ | ||
| FluA | InfA_Forward | GACCRATCCTGTCACCTCTGAC | 28 |
| InfA_Reverse | AGGGCATTYTGGACAAAKCGTCTA | ||
| Infa_Probe | FAM/TGCAGTCCT-ZEN-CGCTCACTGGGCACG/IABkFQ | ||
| FluB | InflB_For | AAATACGGTGGATTAAATAAAAGCAA | 28 |
| InflB Rev | CCAGCAATAGCTCCGAAGAAA | ||
| InflB_Probe | /HEX/CACCCATAT-ZEN-TGGGCAATTTCCTATGGC/3IABkFQ | ||
| ORF1a | ORF1a-F | AGAAGATTGGTTAGATGATGATAGT | 8 |
| ORF1a-R | TTCCATCTCTAATTGAGGTTGAACC | ||
| ORF1a-P | FAM/TCCTCACTG-ZEN-CCGTCTTGTTGACCA/IABkFQ | ||
| RdRp | RdRp_IP4-F | GGTAACTGGTATGATTTCG | 8 |
| RdRp_IP4-R | CTGGTCAAGGTTAATATAGG | ||
| RdRp_IP4-P | HEX/TCATACAAA-ZEN-CCACGCCAGG/IABkFQ |
RT-ddPCR was performed using the One-Step RT-ddPCR Advanced Kit for Probes (Bio-Rad), and ddPCR was conducted using the 2× ddPCR Supermix for Probes (no dUTP, Bio-Rad), following the manufacturer's instructions (see SI Methods). Droplet generation and amplification were performed using the QX200™ Droplet Generator and T100™ Thermal Cycler (Bio-Rad), respectively, and results were read on the QX200™ Droplet Reader. Data analysis was performed using QuantaSoft software v1.7.4.0917 with manual thresholding (see SI Methods). Negative controls consist of no-template controls to monitor for contamination. The limit of detection (LOD) for each (RT-)ddPCR assay is assay specific and has been calculated previously for the SARS-CoV-2 genes9 and AMR genes.12 Although the other assays were not re-estimated, it is assumed to be consistent for the samples across all concentration and extraction methods.
For each target, generalized linear models with a Gaussian error distribution were fitted to compare concentrations obtained with different concentration and extractions methods, using the precipitation method combined with the QIAamp RNeasy PowerFecal Pro Kit as reference. The concentrations were log10-transformed prior to analysis. For spiked targets, models were fitted separately from the raw wastewater. Estimated coefficient were exponentiated to obtain fold changes relative to the reference method, and 95% confidence intervals were calculated. P-values for method comparisons were extracted from the model summaries and adjusted for multiple testing using the false discovery rate method.
000× using Seqtk sample (v1.4),29 based on total read bases versus reference genome length (NC_045512.2). The raw reads were then trimmed using fastp v0.23.4.30 Reads were aligned with BWA-MEM (v0.7.17)31 using −k 19 and −r 1.5. The trimmed paired and unpaired reads were mapped separately and merged with SAMtools merge (v1.17).32 Coverage statistics (median depth and genome coverage) were calculated using SAMtools depth v1.17 with the −a option (see SI Methods). Consensus sequences were generated through an iterative mapping approach, where reads were repeatedly aligned to an updated reference and variants were called and incorporated at each step. Iterations continued until the consensus stabilized or a set maximum was reached. Variants were filtered to ensure sufficient depth, quality, and allele frequency, resulting in high-confidence consensus sequences.
:
1 ratio of AMPure XP beads to DNA (see SI Methods).For adapter ligation, 25 μL ligation buffer, 10 μL quick T4 DNA ligase, and 5 μL ligation adapter (ONT Ligation Sequencing Kit V14) were added. After 10 min incubation at room temperature, samples were cleaned with AMPure XP beads and short fragment buffer to ensure DNA fragments from all sizes are retained. Libraries were eluted in 15 μL of elution buffer and quantified with Qubit BR to adjust to 35–50 fmol for sequencing on an ONT R10.4.1 Flow Cell run on a GridION device for 72 h using MinKNOW v24.02.16 (see SI Methods).
Reads were trimmed with SeqKit v2.3.1,34 and taxonomic classification was performed with Kraken2 v2.1.1 (ref. 35) using the in-house ‘full’ Kraken2 database (accessed 13 January 2024).36 Classification results were visualized using Krona,37 and comparisons across samples were visualized to depth level 5 via radar charts created in RStudio (v2022.02.2.485) using the fmsb package (v0.7.6) (see SI Methods).
For the respiratory viruses, including SARS-CoV-2, RSV A, RSV B, influenza A, and influenza B, the analysis of raw wastewater samples generally showed no significant differences between the concentration or extraction methods (Table 3). This likely reflects the low viral concentrations typically present in these samples, which limits statistical power to distinguish between small differences in recovery efficiency. To address this, wastewater was also spiked with respiratory viruses, which provided a clearer basis for comparing methods, as higher input levels allowed differences in recovery to become detectable.
The comparative evaluation was conducted against a reference method namely the precipitation method paired with the QIAamp RNeasy PowerFecal Pro Kit. Across both raw and spiked samples, method performance was assessed in terms of recovery efficiency, measured as the absolute concentration of each target detected by (RT-)ddPCR. While results varied per target (Table 3), most differences between methods were not statistically significant. Notably, the InnovaPrep method consistently yielded the lowest concentration across all viral targets tested. In contrast, the precipitation, Nanotrap and ZymoPURE Water DNA/RNA methods generally produced higher recoveries indicating greater suitability for downstream applications. Moreover, within the specific concentration–extraction pairings evaluated, (RT-)ddPCR results indicated that both the QIAamp RNeasy PowerFecal Pro Kit and the QIAamp Viral RNA Mini Kit performed similarly when combined with PEG precipitation (Table 3). However, no inference can be made regarding their relative performance when combined with other concentration methods. Additional pairwise comparisons between methods can be derived from the model coefficients present in Table 3 or from the mean concentrations that are included in Table S2.
Moreover, a dilution-based qPCR inhibition assessment was performed by comparing Cq values obtained from undiluted extracts and 5-fold dilutions targeting the tetM gene (Table S6). For most concentration–extraction workflows, the observed Cq shifts following dilution were close to the expected value of 2.32 cycles, indicating limited PCR inhibition. However, pronounced deviations from the expected shift were observed for specific methods, most notably PEG precipitation combined with the QIAamp Viral RNA Mini Kit, where undiluted extracts showed delayed or absent amplification that was partially or fully recovered upon dilution.
Sequencing results from the raw wastewater samples showed poor genome coverage and low median sequencing depth (Table 4), making it difficult to determine whether the suboptimal results were due to the inefficiency of the methods or simply due to the insufficient SARS-CoV-2 concentration for effective sequencing. Consequently, the spiked wastewater samples, which contained much higher SARS-CoV-2 concentrations, provided a more reliable evaluation of the effectiveness of the different protocols on the sequencing method.
| Date | Concentration method | Extraction method | ORF1a (cp μL−1) | RdRp (cp μL−1) | Median depth | Genome coverage | |
|---|---|---|---|---|---|---|---|
| Spiked wastewater | 18/03/2024 | INNOV | RN | 554.40 | 1243.00 | 8289.50 | 99.91% |
| NANO | RN | 2805.00 | 2651.00 | 10045.00 | 99.98% | ||
| Prec | QIA | 1826.00 | 2508.00 | 557.00 | 95.37% | ||
| Prec | RN | 1002.10 | 1430.00 | 9904.00 | 99.70% | ||
| UltraC | RN | 2134.00 | 2310.00 | 9849.00 | 99.71% | ||
| Zymo | Zymo | 4444.00 | 4400.00 | 10012.50 | 99.87% | ||
| 29/04/2024 | INNOV | RN | 1973.40 | 1870.00 | 10023.00 | 99.71% | |
| NANO | RN | 2301.20 | 2235.20 | 7740.50 | 99.73% | ||
| Prec | QIA | 4967.60 | 6292.00 | 7.00 | 89.84% | ||
| Prec | RN | 1036.20 | 1282.60 | 4318.00 | 99.81% | ||
| UltraC | RN | 2728.00 | 2820.40 | 10013.00 | 99.98% | ||
| Zymo | Zymo | 3256.00 | 3410.00 | 10032.00 | 99.81% |
In the spiked wastewater samples, most concentration and extraction methods yielded high genome coverage and median depth, indicating successful sequencing across workflows. Notably, the combination of PEG precipitation with the QIAamp Viral RNA Mini Kit consistently resulted in lower genome coverage and median depth compared to other methods. These results suggests that this specific pairing may be less optimal for targeted sequencing under the evaluated conditions and likely reflecting the co-concentration of PCR inhibitors, as neither the concentration nor extraction method includes an inhibitor removal step. While no internal amplification control (IAC) was included in the ddPCR assays to directly quantify inhibition, a limitation of this study, these findings provide valuable guidance for selecting concentration–extraction workflows that maximize sequencing success and support more reliable interpretation of viral detection wastewater.
The sequencing results (Table S4) show considerable variability in the total number of reads obtained per sample. This variability can likely partly be attributable to technical factors such as the number of active pores per flow cell, but also reflects the inherent heterogeneity of wastewater samples themselves, rather than being strongly associated with a specific method (Table S5). The median read quality and median read length remained consistent across all tested methods and did not show any significant associations with the tested method (Table S5).
When analysing the taxonomic profile of the non-spiked samples (Fig. 1), the Kraken2 results revealed that the community structures were highly similar between the different methods. Across all three sampling dates, bacterial sequences consistently represented the largest fraction of the classified reads. In contrast, very few viral reads were detected in both raw and spiked wastewater samples, preventing the construction of meaningful viral taxonomic profiles. This limited sensitivity can mask potential method-dependent differences, and the metagenomic results should therefore be interpreted as a consistency-base comparison rather than an evaluation of viral detection sensitivity. Fig. S1 provides an overview of the taxonomic profiles of the spiked samples, which also exhibited consistent trends.
Overall, several protocols demonstrated robust and consistent recovery across both DNA and RNA targets, particularly the Nanotrap method paired with the QIAamp RNeasy PowerFecal Pro Kit and the ZymoPURE Water DNA/RNA Kit. These methods provided reproducible detection across viral (RNA) and AMR (DNA) markers by (RT-)ddPCR and yielded reliable SARS-CoV-2 genome coverage in targeted sequencing, within the specific concentration–extraction pairing evaluated, suggesting their suitability for applications requiring high sensitivity under these tested workflows. In contrast, the InnovaPrep method consistently resulted in lower recovery across both AMR and viral targets, indicating reduced efficiency in capturing nucleic acids. For the spiked samples, recovery efficiencies were not calculated in this study because that the spiked viruses were already present at a certain concentration. Future work should incorporate dedicated process controls to enable quantitative assessment of recovery.
In Belgium, the wastewater surveillance also includes SARS-CoV-2 sequencing at some critical time points, such as the emergence of a new variant. To ensure reliable results, it is important to confirm that the methods remain effective for amplifying SARS-CoV-2. However, targeted sequencing has shown some challenges in practice. Raw wastewater samples often contain very low amounts of virus, especially during periods of low circulation, which can lead to limited genome coverage and shallow sequencing depth. Under these conditions, variant detection and genomic surveillance become less reliable. To help overcome this limitation, spiked wastewater samples were also included and analysed. It was observed that the combination of PEG precipitation and QIAamp Viral RNA Mini Kit resulted in lower genome coverage and sequencing depth. This outcome was consistent in both raw and spiked wastewater samples, suggesting that the reduced performance was not solely due to the low viral concentration. The absence of an inhibitor removal step in this protocol may negatively impact amplification efficiency, particularly within the PEG precipitation–QIAamp Viral RNA Mini Kit workflow evaluated here. In contrast, the other methods that include a PCR inhibitor removal step, performed well, especially for the spiked samples, producing high coverage and depth within their respective concentration–extraction combinations. PCR inhibition is often cited as a concern in wastewater analysis depending on both concentration and extraction methods as well as the molecular target. PEG precipitation, in particular, has been associated with co-concentration of inhibitory compounds leading to an apparent reduced recovery and interference with PCR methods.38 A limited dilution-based qPCR assessment targeting tetM suggested that most methods showed Cq shifts close to the theoretical expectation, indicating no or limited inhibition. In contrast, the PEG precipitation concentration method in combination with the QIAamp Viral RNA Mini Kit displayed strongly aberrant Cq differences or amplification failure consistent with substantial PCR inhibition (Table S6). Although this assessment was not systematic and cannot be used for quantitative comparison, the observations align with previous reports of method- and matrix-dependent inhibition in wastewater and support the importance of inhibitor mitigation strategies when interpreting molecular results.
Metagenomics sequencing was performed only on the methods that had the best performance in the (RT-)ddPCR and targeted sequencing evaluations, namely the Nanotrap method in combination with the QIAamp RNeasy PowerFecal Pro Kit, the precipitation method in combination with the QIAamp RNeasy PowerFecal Pro Kit and the ZymoPURE Water DNA/RNA Kit. Focusing on the top-performing workflows enabled a robust and informative comparison of consistency across methods and provides valuable insights into their suitability for metagenomic applications, while appropriately aligning the analysis with the intended scope of the study rather than positioning it as a sensitivity benchmark. Interestingly, the taxonomic profiles were highly consistent across the selected concentration and extraction methods carried forward to metagenomic sequencing, indicating that once an effective concentration and extraction protocol is established, reliable taxonomic profiling can be achieved, regardless of the specific method used across different methods within this top-performing subset. By focusing on higher taxonomic levels, the analysis emphasizes the stability and reproducibility of the taxonomic profiles, underscoring the robustness of the selected workflows for comparative metagenomic analyses. This consistency is crucial for the interpretation of metagenomic data, as it suggests that reliable community-level insights can be obtained across different protocols. However, the taxonomic profiles were dominated by bacterial reads with little viral reads detected. This reflects a well-recognized limitation of untargeted metagenomics in wastewater, where RNA viruses are typically too scarce to be reliably recovered without enrichment or amplification. Low-abundance species, including most viruses, may therefore remain undetected despite apparent consistency in the bacterial profiles. Future studies could investigate probe-based capture, random amplification, or nanopore adaptive sampling as possible approaches to enhance viral signal recovery.
Our findings align with previous wastewater-based epidemiology studies, which generally report that no single concentration method is universally optimal across all targets and matrices. PEG/NaCl protocols can yield relatively high and reproducible recoveries for SARS-CoV-2 and other viruses, but also often tends to co-concentrate PCR inhibitors, leading to variable assay performance across wastewater matrices.38–41 This observation aligns with our results, where PEG precipitation paired with an extraction workflow lacking inhibitor removal showed reduced sequencing performance despite adequate target detection by RT-ddPCR. Ultrafiltration methods, such as InnovaPrep, are widely applied in wastewater surveillance but have shown variable performance for enveloped viruses and can be sensitive to clogging and sample composition.18,42–47 Magnetic capture approaches, including Nanotrap, have been reported to improve RNA virus recovery and extract purity, and in our study demonstrated robust and reproducible performance across multiple downstream applications.48–50 Importantly, whereas most previous studies evaluate a single target or analytical application, our work provides a unified fit-for purpose comparison of concentration and extraction methods across multiple viruses and AMR targets in RT-ddPCR along with SARS-CoV-2 sequencing and shotgun metagenomics using shared upstream methods. It should also be noted that wastewater physicochemical characteristics, such as solids content, organic load, and conductivity, are known to influence concentration efficiency and PCR inhibition, and may therefore contribute to some of the observed method-dependent differences. Within this study, the pH and conductivity were measured. pH values were consistently near neutral across all samples, while conductivity varied across time points and locations, indicating variability in wastewater composition (Table S1).
While analytical performance is critical for selecting concentration and extraction methods, practical considerations are also important for routine wastewater surveillance. Methods differed substantially in hands-on time, scalability, equipment requirement, and relative consumable costs (Table S7). Integrated or magnetic capture-based workflows offered most favourable balance between throughput, hands-on time and scalability, highlighting their suitability for routine surveillance. In contrast, precipitation- and ultracentrifugation based methods, while cost-efficient in terms of consumables, were more time-intensive and less scalable without significant investment, which may limit their applicability for high-throughput surveillance.
While this study offers valuable insights into methodological performance, several aspects offer opportunities for further refinement. First, although consistent detection across DNA and RNA targets was observed across the methods, some target-specific variability remains and could be further characterized. Second, the study was restricted to two wastewater treatment plants, so additional evaluations across diverse matrices and seasonal conditions would help generalize these findings. Third, while inhibition was not comprehensively assess using IAC, limited dilution-based testing suggested method-dependent inhibitory effects, partially mitigating this limitation. Standardized inhibition controls should be included in future studies to further refine performance comparisons. Furthermore, while our work demonstrates fit-for-purpose comparisons across multiple methods, controlled recovery experiments with technical replicates could provide precise quantification and establish method-specific thresholds under standardized conditions. Finally, only limited physicochemical metadata such as pH and conductivity has been collected. As a result, this may also influence the variability in target recovery. Future studies incorporating should focus on understanding these effects to better generalize method performance across wastewater types.
Taken together, the results offer a strong foundation for the further optimization of these protocols. The findings highlight the importance of PCR inhibitor removal and pathogen and AMR gene concentration optimization, and demonstrate that certain protocol are particularly well-suited for high-sensitivity downstream applications. Furthermore, the study underscores the need for further optimization for RNA virus recovery and for evaluation across a wider diversity of wastewater types and pathogen and AMR genes loads. Addressing these factors will help refine methodologies for WBE, ultimately improving the detection and genomic surveillance of public health threats at the community level.
Supplementary information: Table S1: Overview of sample handling, concentration and extraction workflows, physicochemical parameters, and downstream analytical methods. Table S2: Mean concentrations obtained by (RT-)ddPCR for viral and AMR targets across the evaluated concentration and extraction methods. Table S3: Comparison of concentration and extraction methods using targeted sequencing for SARS-CoV-2 across raw and spiked wastewater samples. Table S4: Overview of sequencing results for metagenomic analysis of wastewater samples collected from the one Belgian WWTP on three time points in 2024. Table S5: Statistical analysis of sequencing metrics, evaluating potential associations between the concentration/extraction methods and sequencing output characteristics. Table S6: Dilution-based qPCR inhibition assessment on selected wastewater samples using the tetM gene. Table S7: Qualitative practical consideration for concentration and extraction methods evaluated in this study in 2024. See DOI: https://doi.org/10.1039/d5ew00930h.
| This journal is © The Royal Society of Chemistry 2026 |