Gaspar
Massiot
*ab,
Dominique
Courault
c,
Pauline
Jacob
d and
Isabelle
Albert
b
aUMR AgroParisTech/INRAE Silva, INRAE-AgroParisTech-Université de Lorraine, 54000, Nancy, France. E-mail: gaspar.massiot@agroparistech.fr
bUniversité Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France
cUMR 1114 EMMAH, INRAE, Site Agroparc, CS 40509, 84914 Avignon, France
dVeolia Recherche & Innovation, Chemin de la Digue, 78600 Maisons-Laffitte, France
First published on 24th November 2022
Reuse of reclaimed wastewater for agricultural irrigation is an expanding practice worldwide. This practice needs to be monitored, partly because of pathogens that the water may contain after treatments. More particularly, sprinkler irrigation is known to generate aerosols which may lead to severe health risks to the population close to irrigated areas in case of the presence of Legionella bacteria in the water. A pilot experiment was conducted on two corn fields in South-Western France, irrigated with wastewater undergoing two different water treatments (ultra-filtration and UV). Water analyses have shown high levels of Legionella in the water even after a standard wastewater treatment plant (WWTP) cleaning process followed by the UV treatment (up to 106 GC per L in 2019). In this context, an updated general Bayesian network (GBN), using discrete and continuous random variables, in quantitative microbial risk assessment (QMRA) is proposed to monitor the risk of Legionella infection in the vicinity of the irrigated plots. The model's originality is based on i) a graphical probabilistic model that describes the exposure pathway of Legionella from the WWTP to the population using observed and non-observed variables and ii) the model inference updating at each new available measurement. Different scenarios are simulated according to the exposure time of the persons, taking into account various distances from the emission source and a large dataset of climatic data. From the learning process included in the Bayesian principle, quantities of interest (contaminations before and after water treatments, inhaled dose, probabilities of infection) can be quantified with their uncertainty before and after the inclusion of each new data collected in situ. This approach gives a rigorous tool that allows monitoring the risks, facilitates discussions with reuse experts and progressively reduces uncertainty quantification through field data accumulation. For the two pilot treatments analyzed in this study, the median annual risk of Legionella infection did not exceed the US EPA annual infection benchmark of 10−4 for any of the population at risk during the past few months of the pilot experiment (DALYs are estimated up to 10−5). The risk still bears watching with support from the method shown in this work.
Water impactThis paper presents a model for the risk assessment of pathogen inhalation from sprinkler irrigation using treated wastewater. The approach is applied to Legionella analyzed on a pilot site where two different treatments were performed on water with numerous analyses for monitoring pathogens. |
However, wastewater reuse still remains uncommon in some countries. Several factors can explain this observation, among them being the pathogens that water may contain, which resulted in strict reuse guidelines. Adegoke et al. (2018)8 have reported the different pathogens found in wastewaters and the associated risks. Quality indicators are commonly used, but they are more and more debated on their representativeness (particularly for viruses and Legionella, e.g., they are not explicitly taken into account as indicators despite their wide occurrence in the environment). Pragmatic recommendations and guidelines were defined by the World Health Organization (WHO)9–12 on the basis of health risk considerations. The new European regulations on minimum requirements for water reuse for agricultural irrigation will take effect on 26 June 2023.13
Sprinkler irrigation with reclaimed wastewater is widely practiced throughout the world but may enhance hazards relative to the dispersion of pathogens by bioaerosols.14–16 A collective expertise appraisal has been performed by ANSES in 2012,17 which assessed various secondary risks (e.g. ingestion) and pointed out the lack of knowledge on health risks, particularly due to pathogen inhalation.
An experiment has been conducted in South-Western France to evaluate the impact of two different treatments on the quality of water reused for sprinkler irrigation of two corn fields (SmartFertiReuse project, https://www.smartfertireuse.fr/, accessed on October 4th 2022), controlling the population risks. Among the different analyzed pathogens in the water, Legionella was found at a high level both at the entrance and output of the wastewater treatment plant (WWTP) (up to 107 GC per L). In the new EU regulations, criteria have been defined for water quality. The minimum required value for Legionella has been set to <1000 CFU L−1 for a reclaimed water class A used for food culture irrigation. These rules are expected to stimulate and facilitate water reuse in the EU and shall also be accompanied by appropriate monitoring of the health risks linked to this practice.
Legionella is an opportunistic bacterium which may be inhaled by humans through water aerosols.18–20Legionella has around 50 species, such as Legionella pneumophila, which is responsible for the most severe infections such as Legionnaire's disease and pneumonia illness (Pontiac fevers).21 These infections due to Legionella pneumophila are particularly problematic for people who have a certain fragility or those who are exposed for a long period to these pathogens and may cause in some cases mortality. In France, an increasing trend of Legionellosis has been observed since 1995.‡ A lot of cases were linked to cooling towers22 but some studies have shown contaminated aerosols by Legionella pneumophila in the vicinity of WWTPs23 and associated diseases have been observed for example in the Netherlands.24
Different approaches have been proposed to quantify health risks applied to Legionella (reviewed by Hamilton et al. (2018)19 and Hamilton et al. (2016)20). Often, there are few observations and numerous authors have shown a wide variability in both space and time of punctual measurements.18,25 It is therefore important to take into account this variability and the uncertainty in risk assessment.
Quantitative microbial risk assessment (QMRA) is an established framework recommended by multiple international organizations (WHO, US EPA, EFSA) to monitor such risks of infection.26 QMRA models have been widely used for food safety management,27 and in the risk assessment of waterborne diseases28–30 due to aerosolisation14,31–34 of water reuse systems,35 and are often coupled with Monte Carlo (MC) simulation approaches which consist of drawing values for all the inputs in the model from distributions defined by external knowledge (e.g., experimental or historical data, expert knowledge) and then propagating these values to all the intermediate variables and outputs through the system in a way that takes the variability and uncertainty of the inputs into account. It is, by definition, a unidirectional approach that prevents the inversion of the dependency relationship between the variables. The Bayesian network (BN) approach, as an alternative to the MC simulation approach in QMRA, overcomes this unidirectional path among other things.36,37 While QMRA may not be able to incorporate the many factors that influence the quantification of the risks (for example, epidemiological data), models based on BNs are an effective framework that addresses this limitation. BN approaches in QMRA provide a powerful approach where the dependency relationship between variables can be inverted (‘backward calculation’, by which the states of the model's variables are updated using ‘downstream’ data).
A BN is a flexible graphical model based on a directed acyclic graph (DAG) where conditional dependencies between the variables are qualitatively represented by directed arcs on the graph38,39 and quantitatively defined with local transition probabilities or deterministic equations. BNs offer transparency and the ability to support causal reasoning based on causal connections. The visual representation in a BN of large quantities of complex information provides an information platform for improved communications with experts through their graphical representation.36,37,40,41 A BN as a system approach encompasses both holistic and modular views, enables the synthesis of knowledge of parts to help understand the whole and makes a complex system more manageable.42 BNs can be applied to a wide range of application domains such as environmental modelling43 and artificial intelligence,44 and as mentioned above to quantitative microbial risk assessment (for a review, see Beaudequin et al. (2015)45), and water reuse and QMRA.41,42,46–48 While QMRAs are often constrained by the availability of required data (for example, in dose–response assessments), system models based on BNs are an effective framework that tackle the lack of data. As all data in the network contribute to the whole, accurate predictions can be made with incomplete data or a quite small sample size.48 BNs were also used for scenario assessment,46 risk minimisation36 and work with uncertainty and variability as a probabilistic tool.41,49
One of the objectives of this paper is the combination of a general Bayesian network (GBN) and QMRA. GBNs, known also as graphical independence networks in the context of continuous and discrete random variables, are a more general case of the BN in which each variable in the data is modelled with a random variable that best suits it, rather than limiting ourselves to multinomial and normal distributions as in other more classical BNs.39,50,51 GBNs can be used as an attractive way of modelling complex systems because they most closely model expert knowledge and data. Due to their ability to incorporate diverse data types, a GBN enables a complex, multivariate statistical problem (such as QMRA) to be efficiently addressed where classical statistical methods are inept. The method allows the integration of a broad range of quantitative or semi-quantitative information, which is particularly useful in domains where traditional experimental and observational data are missing, inaccurate, sparse, or costly. In this paper, compared to other applications in the water reuse and QMRA context previously cited, there is no discretization of the continuous variables in the BN for the learning algorithm. The joint posterior distribution of all the random variables in the BN is obtained by a MCMC (Markov Chain Monte Carlo) algorithm52 as used in computational Bayesian model inference. The powerful capacity of conceptualization available by the BN approach via the graphical network is used to construct the core model36 and the knowledge is updated via computational Bayesian inference via data in the spirit of the Bayesian model approach (see Verbyla et al. (2016)53 in the water reuse QMRA context).
Another objective was the development and application of an updated GBN to monitor health risks associated with exposure to Legionella pneumophila during irrigation events with reclaimed wastewater. The impacts of two types of water treatments were analyzed (described in the following section). Keeping the steady monitoring aspect in mind, the constructed GBN is reevaluated with each arrival of new data. At each step, posterior distributions for all the variables in the model were computed using priors defined as the posterior distributions at the previous step. The final purposes were to assess different risk scenarios according to the population activities, accounting for their distance from the irrigated areas and the environmental conditions in order to get infection probabilities for each configuration.
In the following text, the Materials and methods section describes the study site, the data and the models implemented for the developed risk assessment monitoring tool. The update process of the risk quantification and three scenarios on simulated data are described in the Results section. The update can occur whenever desired and discussed with the experts. The Discussion section discusses the benefits of the developed approach for the continuous monitoring of health risks in the context of QMRA, emphasizing the necessity of informed use of such tools when few data are available.
All samples were analyzed to quantify Legionella via molecular biology (qPCR from the standard NFT90-471¶). All Legionella species are considered potentially pathogenic for humans, but Legionella pneumophila is the etiological agent responsible for most reported cases of community-acquired and nosocomial legionellosis.54,55 Additionally, some cultivations were made for samples showing high values. Fig. 2 shows the data collected for Legionella spp. presence using the qPCR method and differentiated by sampling zone. The graph presents a large variability between the different analyzed compartments and for the three studied years. As expected, the raw water at the entrance of the WWTP presented the highest values of Legionella and the groundwater had the lowest values from 2019 to 2021. The impact of the two tertiary treatments (A and B) tends to decrease the quantities compared to the values found in raw water. A large variability was also observed according to the dates in 2021 which is difficult to explain. Many factors, including climatic and water storage conditions, can lead to variability. These points will be discussed in the Results section.
In parallel to these analyses, the meteorological conditions are monitored thanks to a climatic station set up in the area, which records continuously the wind speed and direction, global radiation, air temperature and moisture. To complete this dataset, daily data acquired from the Meteo-France weather station (Tarbes (43°11′12′′N, 0°00′00′′E)) between May and September over the period from 04/29/2010 to 06/18/2020 have been used in order to better take into account the temporal variability of environmental conditions. The crop height was also measured during the irrigation events to assess the aerodynamic roughness (z0) occurring in the atmospheric transport. The values of the crop height varied from 80 cm to 3.2 m for the studied period.
![]() | ||
| Fig. 3 Overview of the exposure pathway of Legionella pneumophila in the context of agricultural irrigation of the experimental plots in Tarbes. | ||
1. Irrigation directly from the groundwater table with quality CGT [genome copies per liter (GC per L)], currently in use;
2. Irrigation from wastewater sequentially treated by standard WWTP treatments and experimental pilot A with quality CA [GC per L];
2b. Irrigation from wastewater sequentially treated by standard WWTP treatments and experimental pilot B with quality CB [GC per L].
In the process currently used (part 1 above), the groundwater is not treated in the WWTP before being used for irrigation. In the experimental processes (parts 2 and 2b above), reclaimed wastewater is used. The initial wastewater coming to the WWTP is sequentially treated by standard WWTP treatments and the experimental pilot (A or B). The treatment effects on the water quality are modelled by the following logarithmic decay formulas:
| CA = CTP/10kA, | (1) |
| CB = CTP/10kB, | (2) |
| with CTP = CWW/10kTP, | (3) |
The concentration of Legionella pneumophila CLp,P [GC per L] in each treatment train P ∈ {GT, A, B} was deduced from the concentrations CP above by the following formula:19
| CLp,P = CPfLp, | (4) |
| QLp,P = FSPp150CLp,P3600/103, | (5) |
![]() | (6) |
• passersby with a supposed average tpasserby of 1 minute of exposure per irrigation day,17
• residents with a supposed average tresident of 2 seconds of exposure over 2.27 hours spent outside per irrigation day,17,57 and
• farmers with a supposed average tfarmer of 30 minutes of exposure per irrigation day (chosen according to the local farmer behavior knowledge).
For each population, the fraction of day the population was exposed was computed and an uncertainty of half a minute around it was considered. For example, tpasserby was taken as a Gaussian distribution centered around 1/(60 × 24) = 6.94 × 10−4 with a standard deviation of
truncated on [0; 1].
The inhaled dose DA,x,P [genome copies per day] was calculated for each treatment train P, distance x and activity A taking values in {passerby; resident; farmer} as follows:
| DA,x,P = Cx,PItA, | (7) |
| Pi,A,x,P = 1 − e−riDA,x,PfCFU, | (8) |
| Pyear,i,A,x,P = 1 − (1 − Pi,A,x,P)nede, | (9) |
Fig. 4 presents the directed acyclic graph (DAG) of the QMRA model constructed with the equations above. The input distributions (or prior distributions in the Bayesian framework) for each parameter are defined in Table 1.
(μ, σ) denotes the normal distribution of mean μ and standard deviation σ, eventually truncated on the interval T(,), and
2 (M, Σ) denotes the bivariate Gaussian distribution of mean M and variance Σ. I2 denotes the identity matrix of order 2.
(a, b) denotes the uniform distribution on the interval (a, b). X ∼ Multi{a1, a2, …} stands for equiprobable sampling from the values a1, a2, … (multinomial distribution)
| Description | Symbol X | Unit | Value | Ref. |
|---|---|---|---|---|
| Water contamination | ||||
| Water quality in groundwater table in Legionella | C GT | GC per L | log10(X) ∼ (10, 1) |
Expert knowledge and literature data |
| Water quality before treatment in Leg. | C WW | GC per L | log10(X) ∼ (15, 1) |
Expert knowledge and literature data |
| Log decay in WWTP by standard treatment | k TP | log10(GC per L) |
X ∼ (8, 1) |
Project expert knowledge |
| Log decay by experimental pilot A | k A | log10(GC per L) |
X ∼ (4, 1) |
Project expert knowledge |
| Log decay by experimental pilot B | k B | log10(GC per L) |
X ∼ (3, 1) |
Project expert knowledge |
| Portion of Legionella pneumo. | f Lp | Unitless |
X ∼ (1 × 10−3, 2 × 10−4)T(0,) |
From testing lab analytical techniques |
| Air contamination | ||||
| Flow rate of sprinkler 1 | F 1 | m3 h−1 |
X ∼ (44, 1) |
Sprinkler properties59 |
| Flow rate of sprinkler 2 | F 2 | m3 h−1 |
X ∼ (42, 1) |
Sprinkler properties59 |
| Portion of aerosols in respirable range (<150 μm) | p 150 | Unitless |
X ∼ (5 × 10−4, 7 × 10−4) |
Sprinkler properties59 |
| Horizontal distance perpendicular to wind | y | m |
X ∼ (0, 2.5) |
From the literature19,60 |
| Downwind receptor breathing zone height | z | m |
X ∼ (1, 1.7) |
Height of breathing zone |
| Estimated aerodynamic rugosity on the field | z 0 | m |
X ∼ (0.1, 0.45) |
Prior knowledge on the field |
| Wind speed and insolation rate | (u, E) | m s−1 and J cm−2 | log(X) ∼ ![]() |
Vague climatic prior |
| Microbial decay coefficient | λ | s−1 |
X ∼ (1.32 × 10−4, 3.44 × 10−4) |
61, 62 |
| Exposure | ||||
| Inhalation rate | I | m3 per day |
X ∼ (20, 2) |
From the literature63,64 |
| Fract. of day a passerby is exposed | t P | Unitless |
X ∼ (6.94 × 10−4, 4.91 × 10−4)T(0, 1) |
From the literature17 |
| Fraction of day a resident is exposed | t R | Unitless |
X ∼ (3.15 × 10−3, 4.91 × 10−4)T(0, 1) |
From the literature17,57 |
| Fraction of day a farmer is exposed | t F | Unitless |
X ∼ (2.08 × 10−2, 4.91 × 10−4)T(0, 1) |
Knowledge on local farmer behavior |
| Illness | ||||
| Portion of CFU | f CFU | Unitless |
X ∼ (1 × 10−3, 2 × 10−4)T(0,) |
From testing lab analytical techniques from uncensored data |
| Dose response parameter for Legionella pneumophila infection endpoint | r inf | Unitless | log(X) ∼ (−2.934, 0.488) |
From the literature19,58,65 |
| Dose response parameter for Legionella pneumophila clinical severity infection endpoint | r csi | Unitless | log(X) ∼ (−9.688, 0.296) |
From the literature19,58,66 |
| Number of irrigation episodes per year | n e | Unitless | X ∼ Multi{3, 4, 5} | Process knowledge |
| Irrigation episode duration | d e | Days | X ∼ Multi{2, 3, 4, 5, 6, 7} | Process knowledge |
This new model takes into account the measure uncertainty for the microbiological data and the inter- and intra-annual variabilities for the climatic data. In the case of the microbiological data, let CQ,i be the ith measured concentration of Legionella (in GC per L) at step Q ∈ {WW, GT, TP, A, B}. This random variable is linked to the concentration of Legionella CQ defined in section 2.2.1. These random variables are linked through the following model:
log10(CQ,i) ∼ (log10(CQ), σQ), |
(,) is the normal distribution and σQ represents the measure uncertainty. A gamma distribution was assigned to the square of its inverse i.e. 1/σQ2 ∼ Γ(100, 1) which represents a vague prior confidence interval of [0.09; 0.11] at 95% for σQ (in GC per L).
In the case of the climatic data, a bivariate distribution was chosen to model the joint distribution of wind speed and insolation value to get the distribution that best suits these jointly observed data capturing the correlation between both quantities. Let (u, E)obsj,k denote the vector of the observed wind speed (in m s−1) and insolation value (in J cm−2) at the jth day (j = 1, …, nday) of the kth year (k = 1, …, nyear). This random vector is linked through the following model to the vector (u, E) of the QMRA model:
log[(u, E)obsj,k] ∼ 2(Mk, Σintra), | (10) |
Mk ∼ 2(Mm, Σinter), | (11) |
Myear ∼ 2(Mm, Σinter), | (12) |
and log[(u, E)] ∼ 2(Myear, Σintra), | (13) |
2(,) is the bivariate normal distribution, Σintra and Σinter are the measured variability between days of each year and the measured variability between each year respectively, and Mk and Mm are the mean of the kth year and the global mean over the years respectively. Myear is the predicted mean vector for a randomized year given by Myear ∼
2(Mm, Σinter). (u, E) is the vector of a predicted wind speed and insolation value in a randomized year. The following prior distributions were assigned to the variance and mean parameters described above:| Σintra−1 ∼ W2(I2; 24), | (14) |
| Σinter−1 ∼ W2(I2; 24), | (15) |
![]() | (16) |
The R67 package rjags68 was used to compute the Bayesian inference of the augmented model. MCMC sampling algorithms were applied as implemented in the software JAGS 4.3.0.69
400
000 simulations with a burn-in period of 1
000
000 were run. Convergence of the MCMC run was assessed by graphical inspection of the chains.
In the following, the results and the update of prior beliefs in the model are described module by module.
Fig. 6 and 7 illustrate that the groundwater (CLp,GT) and the water after treatment by the pilot A or B (CLp,A, CLp,B) show similar contamination levels (∼102 GC per L) indicating that the new irrigation process is as safe as the previous one (with groundwater without treatment). As expected, a significant log reduction between contamination of raw and treated wastewater for the posterior distributions appears (∼3
log10 reduction). Nevertheless, one observes that the parameters kTP, kA and kB are smaller in the posterior distributions than in the prior distribution indicating that the data introduced at the CTP, CA and CB levels (see Fig. 5) are more contaminated than expected. This may be due to the storage of the water between treatments in large reservoirs or poor control of the treatment chain as discussed below.
From the prior distributions (in red in Fig. 6) to the posterior ones, one observes reduced uncertainties (narrower distributions in posterior). The posterior distribution at June 2021 (in pink in Fig. 6) shows a reduction of the concentration after treatments: 1
log10 reduction (from 106 to 105 GC per L) for the wastewater after treatment in the treatment plant and treatment A and similarly for treatment B. The transition to Legionella pneumophila from Legionella measurements analysed by PCR reduces again the contamination by 3
log10.
Although a significant log reduction between raw and treated wastewater is observed, the differences between treatments A and B and groundwater are very slim compared to what would have been expected by the prior knowledge of processes A and B which should eliminate most of the bacteria. This may be due to storage issues or lack of control in the chain of experimentation and will be discussed in the following.
![]() | ||
| Fig. 8 log10 of concentrations of Legionella pneumophila in the air due to the irrigation process for groundwater (GT), and after treatments A and B at 100 m and 1000 m from the source. | ||
log10 is observed between farmers and residents for the probability of clinical severity infection (which is again the case between residents and passersby). Treatments A and B and groundwater give similar probabilities of infection with a slightly higher risk for groundwater which confirms the previous results that the new irrigation process is slightly safer than the one under use (with groundwater, without further treatment). It is also notable that the upper bound of the 95% credibility intervals never exceeds the US EPA annual infection benchmark70 of 10−4 infections per year (for drinking water) for clinical severity infections.
(15, 1) as a classical contamination as established by expert knowledge and literature data (see Table 1), log10(CTP,i) (i = 1, …, 10) data were obtained with a decay of around 8
log of log10(CWW,i) as the WWTP standard treatment is supposed to be effective (see Table 1, kTP ∼
(8, 1) as established by expert knowledge and literature data when the WWTP standard treatment is effective) and finally, log10(CA,i) (i = 1, …,10) data were obtained with around no log decay (kA ∼
(0, 1)) because in this scenario, a failure of the experimental pilot A is supposed. The annual probability of infection increased by 0.8
log which gave a median of −4.89
log and an interdecile range of 1.33
log at 100 m for the toy population. For all distances and all populations studied, the 9th decile never exceeded the US EPA threshold of −4
log annual probability of infection.
(20, 1) to simulate high contamination of the source and that the pilot treatment is supposed to be effective (see Table 1, kA ∼
(4, 1) as established by expert knowledge and literature data when the WWTP standard treatment is effective). This resulted in an increase of 0.9
log of the annual probability of infection which is comparable to the previous scenario. Once more, the 9th decile never exceeds the US EPA threshold for annual probability of infection but gets even closer with a value of −4.2
log at 100 m for the toy population.
log. This threshold was exceeded by the median for the toy population for 100 m with a value of −3.79 and by the 9th decile at 300 m with a value of −3.20. The risk for the other populations did not exceed this threshold with a maximum value of −4.88 attained for farmers at 100 m from the source.
The general Bayesian network methodology used is a very powerful approach without any discretization to quantify complex phenomena in presence of prior knowledge and scarce data. Indeed, the approach allows taking into account expert knowledge and literature data in the prior distributions of the random variables and consideration of simultaneously collected data all over the phenomenon. The uncertainty and variability of each variable are evaluated through the strength of its prior knowledge combined with the strength of the data information provided by the size of data sets, which is not the case in simulated networks, just using Monte Carlo simulations, where the data are fitted marginally to establish the prior distributions, and then their provided uncertainty is spread poorly in the whole network. Also, by contrast with non-parametric Bayesian networks, in the highly parametric approach used in this paper, the knowledge of the functional relationships between the random variables is maximized to permit the estimation of quantities of interest in the context of scarce data.
Both for annual infection and clinical severity infection, no major difference was observed between risks induced by irrigation using water treated by treatment A (ultrafine filtration, after standard treatment plant), treatment B (UV, after standard treatment plant) and groundwater. This comforts the prior belief that treatments A and B give a water quality at least comparable to the quality of the groundwater. Risks were observed to decrease with distance from the source, which was expected according to the atmospheric dispersion modelling. An approximately 1
log difference was observed between each of the studied categories of people (farmers, residents and passersby), with the passersby being the population least at risk and farmers the population most at risk according to the model (logical order according to the exposure time).
High values of Legionella in GC per L were found in the different analyzed compartments even after the two studied treatments. The impact of the storage time in large reservoirs after the two pilots can in part explain some high observed levels. The cleaning of these tank covers appears as an important point to check for operational applications. Such an observation reveals the interest of working in the first steps with GC for an operational monitoring of the water quality; GC are uncensored data and PCR analyses are less time consuming and can be done routinely. The values reported in GC were first introduced in the model until the risk characterization, and then converted in CFU applying a significant decrease of 10 × 103 between GC and UFC, as quantified from the laboratory analytical results and as often observed in the literature. Let us mention that if one applies this decrease factor (fCFU) on the contamination after the tertiary treatment A and B, i.e. on CA and CB, the two treatments are class A reclaimed water because only 0.1% are above the new European limit (<1000 CFU L−1).
Note that the US EPA criterion of annual probability of infection was used throughout the study instead of disability adjusted life years (DALYs) recommended by the WHO. Indeed, the current method of estimation of the DALY for Legionella pneumophila is still controversial to our knowledge. The DALY can be computed as 0.97 times the annual probability of infection with low precision71 and a recent study72 taking the heterogeneity of responses to contamination into account could be used to derive a better estimation of the DALY.
It is notable that the upper bound of the interdecile credibility intervals of the posterior never exceeded the target risk values for infection (10−4 annual probability of infection for the US EPA criterion and 10−6 disability adjusted life year per person per year, i.e. DALY pppy for the WHO criterion). However, a slow decrease of risk is observed with the distance to the source which is the consequence of the meteorological conditions in the vicinity of the agricultural plots. This observation is consistent with previous studies that have predicted the long-range transport of Legionella.19,22,73 In the local climatic context of Tarbes where the decrease is slow, it seems essential that the concentration has to be already low (below 104) at 100 m from the sprinklers.
The developed tool is ready to be updated especially by air measured concentrations on the plots by the irrigations with reclaimed water that will take place starting in Summer 2021. This tool will then be used to monitor the potential risks in the vicinity of the experimental plots and thus meet the public health demands of population protection.
In a more general QMRA context, the proposed model gives an operational tool and theoretically stable methodology to maintain a continuous monitoring of the risks induced by the irrigation practices on the agricultural plots under study. This methodology can adapt to new sets of data and easily update the model if given new information. This model can easily be adapted to other well-known pathogens through the enlightened adaptation of a few priors such as the ones on the decay parameter λ, the dose–response parameter r, etc.
A challenge would lie in the usage of the model to answer simultaneous multipathogen risk analysis. Also, one of the next main interests lies in the addition of modelisation of the impact of the irrigation with reclaimed wastewater on the quality of the groundwater to simultaneously quantify the inhalation induced risk and the groundwater table contamination by reclaimed wastewater transport in soil. Furthermore, it could be very interesting to link the final probability of infection over the year to epidemiological data over the region as done by Albert et al. (2008),36 linking an epidemiological study of campylobacteriosis cases to the probability of suffering from campylobacteriosis over one year in France due to broiler meat. In the same spirit, the link would necessitate here the introduction of a random variable for the water reuse attributable fraction of Legionella infection to take other routes of contaminations into account. Also, it would necessitate considering only a fraction of the cases that could be due to this irrigation zone. But for the moment, the epidemiological data at disposal are only available at a large geographical scale (the surface of the Occitanie region is approximately 70
000 km2 compared to the surface of the study zone which is approximately 6 km2) and it does not seem reasonable to link these data to the very local modeling proposed.
• The first developed general Bayesian network approach in QMRA for wastewater surveillance. The main asset is that risks are quantified with their uncertainties at each desired time taking into account new and past data.
• The QMRA model describes the exposure pathway of the water contamination from the entrance of the WWTP to the infection risk using pathogen decay models in the WWTP, a Gaussian plume model for the air contamination, an inhalation model for the population exposure and finally a dose–response model for risk characterization.
• The uncertainty of the probabilistic QMRA model is reduced by the introduction of observed data (pathogen concentrations and regional meteorological data) along the modelisation to obtain the distribution of the number of pathogens before and after treatment in the WWTP, the number of aerosolised pathogens, and the concentration of pathogens in the air at different distances, and finally the exposure and illness distributions for three different categories of population at risk and two illness endpoints.
• Legionella annual subclinical infection risk and annual clinical severity infection risk linked to the agricultural irrigation using the groundwater table or two experimental water treatment pilots are inferred to be below the 10−4 tolerable limit defined by the US EPA for farmers, passersby and residents at distances between 100 m and 1000 m away from sprinklers.
• Such a dynamic approach can be applied to various pathogens in the context of wastewater reuse.
Footnotes |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ew00311b |
| ‡ https://www.santepubliquefrance.fr/maladies-et-traumatismes/maladies-et-infections-respiratoires/legionellose (accessed on October 4th 2022). |
| § https://www.smartfertireuse.fr/les-moyens (accessed on October 4th 2022). |
| ¶ https://tinyurl.com/5n6v4y38 (accessed on October 4th 2022). |
| This journal is © The Royal Society of Chemistry 2023 |