 Open Access Article
 Open Access Article
      
        
          
            Boya 
            Xiong
          
        
       a, 
      
        
          
            Mario A. 
            Soriano
             Jr
a, 
      
        
          
            Mario A. 
            Soriano
             Jr
          
        
       b, 
      
        
          
            Kristina M. 
            Gutchess
b, 
      
        
          
            Kristina M. 
            Gutchess
          
        
       b, 
      
        
          
            Nicholas 
            Hoffman
b, 
      
        
          
            Nicholas 
            Hoffman
          
        
       a, 
      
        
          
            Cassandra J. 
            Clark
a, 
      
        
          
            Cassandra J. 
            Clark
          
        
       c, 
      
        
          
            Helen G. 
            Siegel
c, 
      
        
          
            Helen G. 
            Siegel
          
        
       b, 
      
        
          
            Glen Andrew D 
            De Vera
b, 
      
        
          
            Glen Andrew D 
            De Vera
          
        
       a, 
      
        
          
            Yunpo 
            Li
a, 
      
        
          
            Yunpo 
            Li
          
        
       a, 
      
        
          
            Rebecca J. 
            Brenneis
a, 
      
        
          
            Rebecca J. 
            Brenneis
          
        
       a, 
      
        
          
            Austin J. 
            Cox
a, 
      
        
          
            Austin J. 
            Cox
          
        
       a, 
      
        
          
            Emma C. 
            Ryan
          
        
      cd, 
      
        
          
            Andrew J. 
            Sumner
          
        
      a, 
      
        
          
            Nicole C. 
            Deziel
a, 
      
        
          
            Emma C. 
            Ryan
          
        
      cd, 
      
        
          
            Andrew J. 
            Sumner
          
        
      a, 
      
        
          
            Nicole C. 
            Deziel
          
        
       c, 
      
        
          
            James E. 
            Saiers
c, 
      
        
          
            James E. 
            Saiers
          
        
       b and 
      
        
          
            Desiree L. 
            Plata
b and 
      
        
          
            Desiree L. 
            Plata
          
        
       *a
*a
      
aDepartment of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA. E-mail: dplata@mit.edu
      
bSchool of the Environment, Yale University, New Haven, Connecticut, USA
      
cDepartment of Environmental Health Sciences, School of Public Health, Yale University, New Haven, Connecticut, USA
      
dTufts University, Department of Public Health and Community Medicine, 136 Harrison Avenue, Boston, MA 02111, USA
    
First published on 12th January 2022
Horizontal drilling with hydraulic fracturing (HDHF) relies on the use of anthropogenic organic chemicals in proximity to residential areas, raising concern for groundwater contamination. Here, we extensively characterized organic contaminants in 94 domestic groundwater sites in Northeastern Pennsylvania after ten years of activity in the region. All analyzed volatile and semi-volatile compounds were below recommended United States Environmental Protection Agency maximum contaminant levels, and integrated concentrations across two volatility ranges, gasoline range organic compounds (GRO) and diesel range organic compounds (DRO), were low (0.13 ± 0.06 to 2.2 ± 0.7 ppb and 5.2–101.6 ppb, respectively). Following dozens of correlation analyses with distance-to-well metrics and inter-chemical indicator correlations, no statistically significant correlations were found except: (1) GRO levels were higher within 2 km of violations and (2) correlation between DRO and a few inorganic species (e.g., Ba and Sr) and methane. The correlation of DRO with inorganic species suggests a potential high salinity source, whereas elevated GRO may result from nearby safety violations. Highest-concentration DRO samples contained bis-2-ethylhexyl phthalate and N,N-dimethyltetradecylamine. Nevertheless, the overall low rate of contamination for the analytes could be explained by a spatially-resolved hydrogeologic model, where estimated transport distances from gas wells over the relevant timeframes were short relative to the distance to the nearest groundwater wells. Together, the observations and modeled results suggest a low probability of systematic groundwater organic contamination in the region.
| Environmental significanceThis work illustrates that a large sampling of groundwater wells in Northeastern Pennsylvania have not been substantially contaminated with hydrophobic organic contaminants spanning a spectrum of volatilities (volatile organic, gasoline-, or diesel-range organic compounds) even after a decade of intense hydraulic fracturing activity. The reasons for this can include the relatively protective nature of groundwater flow in the area, long transport times of sorption-retarded chemicals, and a lack of systematic chemical releases from oil and gas well operations. Nevertheless, accidental chemical releases are commonly documented, albeit with limited specificity. | 
To date, studies of groundwater organic contamination by HDHF have primarily focused on groundwater quality impairment by methane,5–13 traced to failure of gas-well integrity in some cases. In contrast, data on a broad spectrum of organic contaminants remain scarce, where the number of targeted compounds is limited. In 2013, Gross et al.14 revealed groundwater contamination due to surface spills in Colorado, using publicly available data (n = 62) that included the concentration of benzene, toluene, ethylbenzene, and xylene. More recently, a search for 21 volatile organic compounds (VOCs) in the Marcellus region (50 homes) indicated 7 compounds were detected universally, but with concentrations well below EPA Maximum Contaminant Levels (MCLs).15 A similar study targeted 25 VOCs over the Eagle Ford, Haynesville, and Fayetteville regions in southern Texas (n = 116), and detected benzene in 2–13% of the samples at levels well below the MCL.16 Additional data that encompass expanded analyte lists are needed, especially considering the wide range of compounds that are used in hydraulic fracturing fluids or detected in wastewaters.17–19 For example, Llewellyn et al.20 adopted non-targeted two-dimensional gas chromatography with mass spectrometry, ultimately detecting 2-butoxyethanol associated with a single chemical release event. While providing critical information, such focused studies preclude identification of pervasive contamination pathways (such as surface spills, wastewater pond leakage, and subsurface transport in a fracture). To this end, Drollette et al.21 found select diesel and gasoline-range organic compounds in a small subset of groundwater samples (n = 64) collected in Northeastern Pennsylvania (PA) over 2012–2014. In that case, ancillary geochemical indicators22 were consistent with a surface-derived source, rather than the upward migration of formation brine over geologic or short time horizons. One caveat presented by the authors was that the industry was relatively young at the time, and groundwater contamination mechanisms via shallow groundwater routes that occur on slower timescales may have not yet evolved. Weighing this hypothesis and considering the influence of episodic events observed to date,14,20,21,23 there is a clear need to revisit areas of heavy HDHF development with broad spectrum organic compound analyses in groundwater studies of large sample size and spatiotemporal distribution to enable elucidation of any emergent, prevalent contamination mechanisms.
Although occurrences of organic chemicals in groundwater have been linked to HDHF incidents in select cases, chemical concentrations at levels exceeding health standards are rare to date.15,21 This could arise from: (1) episodic releases of HDHF-associated fluids limiting likelihood of contamination; (2) limited travel distances due to low porewater velocities and/or sorption-retarded transport; (3) natural biochemical degradation of target analytes prior to impacting drinking-water receptors; or (4) insufficient availability of samples at the necessary spatial and temporal resolution to capture transient events. As an example of these instances, Brantley et al.24 reported 20% of gas wells had at least one non-administrative notice of violation in PA, and Maloney et al.25 reported 31% of unconventional wells across 4 states had spills. Demonstrating the constraining impact of water lifetime, McMahon et al.15,16 estimated the distribution of groundwater age (quantified by 3H level) above four major HDHF active formations and found that groundwater age predated shale-gas drilling. In other words, waters were recharged prior to commencement of HDHF activities, suggesting a low likelihood of HDHF contaminating groundwater. Relatedly, Rogers et al.18 illustrated that only 15 of 659 disclosed organic fracturing fluid compounds had sufficiently fast transport times and high chemical persistence (i.e., resistance to natural degradation as measured by chemical half-lives) to survive transit from an HDHF well to a groundwater well (at least 10 of those 15 were confirmed to be within the analytical window investigated here). Finally, the low frequency of elevated organic contamination detection may result from the small number of targeted organic analytes compared to the large number of organic chemicals used in fracturing fluids or infrequent temporal–spatial sampling. This is certainly the case for the voluntary testing of private wells by homeowners in these regions, where organic contaminant analyses are rarely conducted.26 Such management practice imparts uncertainty on drinking water quality, where homeowners assume some risk of exposure to health-relevant pollutants without awareness or systematic means of detection.
To determine the degree of such risk, we undertook a broad spectrum organic chemical analysis of shallow groundwater in Northeastern PA of the largest sample size to date (n = 94). In this region, more than 1000 unconventional gas wells have been completed since 2008 and much of the rural population relies on private wells for drinking water. Specifically, we quantified volatile organic compounds, an integrated measure of gasoline- and diesel-range organic hydrocarbons, and then qualitatively evaluated a subset of those using two-dimensional gas chromatography coupled with time-of-flight mass spectrometry (GC × GC-TOF-MS). It is important to note that the gasoline- and diesel-range organic hydrocarbons (GRO and DRO, respectively) capture a volatility range shared by compounds in gasoline and diesel fuel (from volatile to semi-volatile), but GRO and DRO are not necessarily derived from any gasoline or diesel sources. Hoelzer et al.27 have previously expounded upon the composition of GRO and DRO range in a set of hydraulic fracturing flowback and produced waters. The analysis presented here does not stipulate that GRO or DRO are derived from oil and gas drilling activities; the analytical range is simply valuable to interrogate fingerprints of oil and gas activity. To elucidate possible sources of organic chemicals to nearby groundwaters, we explored relationships between their occurrence and various geochemical indicators (both conservative and non-conservative), as well as Pennsylvania Department of Environmental Protection (PA DEP) violation reports. Further, we applied a spatially explicit model of coupled groundwater flow and solute transport to estimate transport over a wide range of organic chemical behavior. These model results can be contrasted to direct distance-to-well or well-density metrics for assessing risk. The easier-to-determine metrics are readily available to most interested parties, such as well operators, homeowners, public health experts who postulate that distance and exposure are related, and to policy makers. Indeed, because upstream/downstream groundwater modeling is both costly and difficult to be conducted with the necessary resolution and speed to inform where UOG wells should be permitted, legislation for such permits often relies on setback distance—a linear distance required between a UOG well and a private home. Thus, the linear distance and other well distribution metrics we applied are most valuable from both a policy and a public health proxy perspective. We seek to evaluate if these metrics or other mechanistic indicators are valuable for projecting risk of groundwater contamination proximal to HDHF activities.
|  | ||
| Fig. 1 Groundwater well sampling sites and unconventional oil and gas (UOG) wells developed since 2000 in Bradford County, PA. The inset highlights Bradford County. Colored triangles indicate UOG wells listed by spud date; black stars show the sampled drinking water wells. Note that UOG refers to wells that were registered for use with horizontal drilling with hydraulic fracturing (HDHF) technologies. Both oil and gas are produced in this region, but gas is the dominant product.31 Spud dates are publicly available and reflect well drilling dates, but not necessarily chemical injection or production timelines. | ||
Samples for major cations were filtered with a 0.45 μm polyether sulfone filter, acidified and collected in acid-washed high-density polyethylene (HDPE) bottles. Samples for anion analysis were also filtered, but un-acidified and frozen shortly after collection until analysis. Field blanks were collected daily for each type of sample using lab, 18 MΩ MilliQ water.
Major cations, major anions, and trace metals were analyzed by inductively couple plasma optical emission spectrometry, ion chromatography, and inductively coupled mass spectrometry, respectively (see ESI†). Major anion and cation information was used to delineate water types according to four previously defined22 classifications: low-salinity waters (Cl less than 20 ppm) dominated by Ca–HCO3 (Type A) or Na–HCO3 (Type B) and high-salinity waters (Cl greater than 20 ppm) with molar-based Br/Cl less than 0.001 (Type C) or greater than 0.001 (Type D).
To calculate the distance between groundwater wells and the nearest oil and gas wells with some type of documented violation, information on wells with violations was extracted from the PA DEP violation reports35 in Bradford, Wyoming, Tioga, Sullivan Counties (to capture nearest wells on the border of Bradford County) with inspection dates ranging from 2007 to June 2018. First, violations were divided into six categories using their “violation code” outlined in Bradford compliance data: (i) spills/potential spills, (ii) erosion and potential erosion, (iii) cementing/casing failure, (iv) improper impoundments, (v) site restoration, and (vi) solid waste issue. Each category has a set of enforcements (Table S2†), which we have adapted from Rahm et al. 2015.36 Then, the remaining entries of violation that were not under the enforcement codes were manually searched for keywords within the “inspection comments” to include incorrectly indexed violations. Specific keywords included: erosion, fluid, brine, spill, contaminated, leak, and release for the spill/potential spill and erosion types. For other categories, keywords included cementing, casing, impoundment, leak, and failure. Note here that “casing” includes high casing pressure violations. Some violations fell under two or more categories. Of 32![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 980 violation entries, 1191 distinct violations were selected based on procedure described above. Of all violations, only two were associated with conventional wells, all were associated with a gas well type, and six were associated with inactive wells. In addition, we examined the possibility of relationship with distance to gas stations or leaking underground storage tanks (LUST).37 Statistical analyses were performed using built-in functions in OriginLab® Pro.
980 violation entries, 1191 distinct violations were selected based on procedure described above. Of all violations, only two were associated with conventional wells, all were associated with a gas well type, and six were associated with inactive wells. In addition, we examined the possibility of relationship with distance to gas stations or leaking underground storage tanks (LUST).37 Statistical analyses were performed using built-in functions in OriginLab® Pro.
Groundwater velocities were computed from the calibrated model and were used within a Monte Carlo framework to infer the distribution in transport length scales for weakly- and strongly-adsorbing organic contaminants (e.g., acrylamide (log![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc = 0.55) and bis-2-ethylhexyl phthalate (log
Koc = 0.55) and bis-2-ethylhexyl phthalate (log![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc = 4.99), where Koc is in L kgoc−1).43 Note that these were chosen to represent extrema that capture a range of sorptivity of the chemicals that are frequently disclosed (see ESI†). Assuming a linear, equilibrium reaction governs contaminant adsorption, the advective-transport distance of the contaminant (DA) varies proportionately with time (t), such that
Koc = 4.99), where Koc is in L kgoc−1).43 Note that these were chosen to represent extrema that capture a range of sorptivity of the chemicals that are frequently disclosed (see ESI†). Assuming a linear, equilibrium reaction governs contaminant adsorption, the advective-transport distance of the contaminant (DA) varies proportionately with time (t), such that
|  | (1) | 
|  | (2) | 
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 Monte Carlo samples from uniform distributions of ϕ, ρb, foc and from a lognormal distribution of qw, which was estimated by fitting the log-normal probability density function to the 2 × 107 calculations of qw made at each finite element within the groundwater model domain (Table S4†). Here, we underscore that the results represent a distribution of output values that capture the sensitivity to input parameter choice (i.e., the model accounts for variability of the input mean through the large number of simulations and combinations of possible parameters).
000 Monte Carlo samples from uniform distributions of ϕ, ρb, foc and from a lognormal distribution of qw, which was estimated by fitting the log-normal probability density function to the 2 × 107 calculations of qw made at each finite element within the groundwater model domain (Table S4†). Here, we underscore that the results represent a distribution of output values that capture the sensitivity to input parameter choice (i.e., the model accounts for variability of the input mean through the large number of simulations and combinations of possible parameters).
        As a thought exercise, we also used the calibrated model to simulate forward-in-time location probability from hypothetical spills or underground leaks at each well pad in the domain. Thirty well pads in the modeled area were set as source locations, where 13 of these had documented spills between 2009–2014.44 Forward location probability describes the likely future position of a solute after its release from a known source location, where a unit probability “mass” is introduced at time zero via a line source extending from the surface to the bottom of the domain.45 We assume rapid vertical transport within the unsaturated zone in order to account for a worst-case aquifer contamination event. Location probabilities at 1, 5, 10, and 25 years post-spill were illustrated with maps.
The five highest concentrations of GRO and DRO were measured in samples collected within 2 km of gas well operations (Fig. 2A and B), but were not correlated with one another (i.e., high GRO and DRO were not co-occurring; Fig. 2C). This lack of a clear GRO-to-DRO signature has been observed previously in groundwater21 and was investigated in flowback and produced water, where no consistent relationship or characteristic GRO-to-DRO was observed.27 Previously, Osborn et al.6 set a threshold value of 1 km to the nearest gas well as a critical delineator for elevated fugitive methane levels. Acknowledging that GRO or DRO compounds are less volatile and mobile than methane, we found that neither GRO nor DRO displayed statistically significant correlations with distance to nearest well (GRO: n = 94, p = 0.913, ρ = 0.011; DRO: n = 90, p = 0.433, ρ = 0.084; Spearman correlations, Fig. 2A and B). Similarly, no threshold or “cut off” distance (i.e., 1, 1.5, 2, 3 km) to nearest gas well yielded a statistically significant difference in GRO or DRO levels of samples (p > 0.05, Mann–Whitney U test). Interestingly, when exploring potential relationships between GRO or DRO levels with distance to nearest gas well with environmental health and safety (EHS) violations, we found GRO was statistically more likely to be elevated within 2 km of a gas well with a violation (p = 0.021, Mann–Whitney U test). We found no such statistically significant difference for DRO, irrespective of cut-off distance. Neither GRO nor DRO displayed a linear correlation with distance to nearest violation (GRO: p = 0.124, ρ = −0.159; DRO: p = 0.772, ρ = 0.030; Spearman Correlations, Fig. 2A and B). Note that previous relationships have been observed in this region between DRO and nearest EHS violation,21 whereas our data provides some evidence of relationship between GRO and nearest EHS violation. This may reflect the stochastic nature of spills and subsequent groundwater impingement, temporal variability between our study (2018) and the previous one (2011–2014), or result from discrepancy in the sample size and corresponding spatiotemporal distribution. Taken together, these results imply that distance to the nearest HDHF operation or currently documented violation alone cannot provide robust prediction of the occurrence of organic contamination in proximal water-supply aquifers. One might expect this result if release mechanisms are highly variable or the hydrogeological transport influences the distribution of contaminants.
|  | ||
| Fig. 3 Neither GRO nor DRO levels were significantly different between water types, but DRO was correlated to Sr/Ca ratios. This suggests that upward migration of deep formation brine was not the dominant source of low-level GRO and DRO to shallow groundwater aquifers (GRO p = 0.733; DRO p = 0.830 via Kruskal–Wallis test). Nevertheless, DRO was correlated with Sr/Ca ratios with statistical significance (Spearman correlation, p = 0.011, ρ = −0.269), suggesting the DRO originated from saline fluids. DRO stands for diesel-range organic compounds, GRO stands for gasoline range organic compounds, and water Types A–D22 reflect low-salinity waters (Cl < 20 ppm) dominated by Ca–HCO3 (Type A) or Na–HCO3 (Type B) and high-salinity waters (Cl > 20 ppm) with Br/Cl less or greater than 0.001 (Type C or D, respectively). | ||
Beyond the water typing analysis, we considered correlations with inorganic species individually and found some statistically significant relationships. Interestingly, we found significant correlation between DRO and the level of Ba (Spearman correlation, p = 0.013, ρ = 0.260), Sr (p = 0.014, ρ = 0.258), Mn (p = 0.010, ρ = 0.271), the Sr/Ca ratio (p = 0.011, ρ = −0.269), NO3 (p = 0.043, ρ = −0.213), and Fe (p = 0.027, ρ = 0.233); no such correlation was found for other inorganic species (e.g., Pb, Li, Na, Cl, and Br) or between GRO and any inorganic species. Sr/Ca ratios have been used previously to distinguish formation fluid sources,22 where higher Sr/Ca ratios (0.03–0.17) were attributed to Marcellus formation fluid compared to lower Sr/Ca ratios (0.002–0.08) attributed to Upper Devonian sources. While there is some overlap in the range between these two formations, our data dominantly reflect an Upper Devonian sources, with only 12 samples are in the previously defined22 “Marcellus range”. The two samples with highest DRO levels were in the Upper Devonian range (Sr/Ca 0.031 to 0.035). Thus, these individual ion correlations may suggest elevated DRO could derive from higher salinity fluid, through natural or augmented pathways.
Finally, DRO or GRO levels were uncorrelated with self-reported private well depth (Fig. S1†) and topographic location (e.g., hillslope or valley, Fig. S2†). This is consistent with samples collected soon after the onset (circa 2008) of HDHF activities in Northeastern PA for methane and hydrocarbon gases5 (2011) and organic compounds21 (2011–2014). Taken together, our results imply that despite over ten years of HDHF activity in the region, there remains a lack of striking evidence linking occurrence of DRO and GRO in domestic groundwater with migration of deep brines. This can be reconciled with the significant DRO and Ba, and DRO and Sr/Ca ratios, if one invokes a path where the fluid brine and DRO are rapidly impinged on a groundwater source (e.g., through a direct spill or release).
Lateral transport via leaky casing or poorly cemented well annuluses to groundwater has been demonstrated previously for both light hydrocarbons (i.e., nC1–nC3)5 and postulated for a hydraulic fracturing chemical (e.g. 2-butoxyethanol).20 Such lateral transport from faulty wells would result in the occurrence of relatively low DRO, high GRO, and elevated levels of light hydrocarbon production gasses. However, there was no obvious enrichment of GRO relative to DRO. Here, we note that the starting composition of the GRO and DRO cannot be presumed; it is widely varying in flowback fluids due to both spatial, temporal, geological source and chemical additive effects.27 There was a significant correlation between DRO and methane concentration (Spearman correlation, p = 0.029, ρ = 0.242), but no such correlation was found between the GRO and methane concentration (Fig. S3;† nor was there a relationship with well age, Fig. S4†), indicating the relationship may not have been causative. In other words, if faulty well casing gave rise to both methane and DRO, GRO would be co-occurring necessarily, unless GRO was somehow absent in casing-derived fluids.
Potential sources of DRO in the most concentrated samples (e.g., 30–100 ppb DRO) might be illuminated by compound specific analysis. First, the GC × GC-TOF-MS signatures lacked the fingerprints of gasoline, kerosene, and diesel range organic compounds (Fig. 4; no correlations with LUST or gas stations existed; Fig. S7 and S8†). Second, analyses of the top two DRO samples revealed the presence of bis-2-ethylhexyl phthalate in both samples, and N,N-dimethyltetradecylamine in the sample with the highest DRO concentration (Fig. 4). Both structures were confirmed with authentic standards. These chemicals were not found in the other samples where DROs were detected (0–100 ppb; n = 5), or in contemporary field (n = 7) and lab (n = 7) blanks (Fig. S5†), suggesting the chemical was not introduced during sampling or analysis. The detected phthalate is an anticipated human carcinogen46 and is used in drilling and hydraulic fracturing fluids.47 However, phthalates are pervasive industrial chemicals with many other documented uses. Bis-2-ethylhexyl phthalate has been reported in residential groundwater in Bradford and Dimock Pennsylvania,21,48 flowback waters from the Marcellus Shale, Barnett Shale, Fayetteville and Denver-Julesburg basin,27,47,49,50 and surface runoff adjacent to an incident of gas well spill.51N,N-dimethyltetradecylamine is speculated for use as a cationic surfactant in the oil and gas industry,52 but has not been reported in any literature or chemical use database (e.g., FracFocus.org).
|  | ||
| Fig. 4 Extracted ion chromatograms (m/z 41) illustrate detection of N,N-dimethyltetradecylamine and bis-2-ethylhexyl phthalate using GC × GC-TOF-MS in high DRO samples (1 and 2) and no detection in low DRO samples (3 and 4). Mass spectra of samples 1, 2, and the bis-2-ethylhexyl phthalate standard are given in the ESI (Fig. S5†). N,N-dimethyltetradecanamine identification was also confirmed with an authentic standard. These signals were absent in 14 field and lab blanks, and absent in two field blanks collected at the same time and two lab blanks extracted on the same date as samples 1 and 2 (Fig. S5†). The z-axis is scaled automatically to the highest intensity peak. The color map indicates the extracted ion intensity (where the maximum height is colored red) and the small peaks (colored royal blue) are close to the baseline intensity due to their small abundance. | ||
The highest-level DRO sample (101 ppb) also contained elevated salinity with a Type D signature, which could result from contamination via either transport of high salinity water leaking from a waste storage impoundment located 930 m away (violation incident documented in 2012). For the sample with the second highest DRO, a pit violation was reported 2 km away (documented in 2012). Critically, we emphasize that sorption-retarded transport times for bis-2-ethylhexyl phthalate over these length scales are prohibitively long to explain their occurrence at these distances from the wells with reported violations. As such, if these DRO compounds have a source derived from flowback or produced waters, the input to local groundwaters would have to occur either via a nearby or direct surface spill, well blowout, or some accelerated transport scheme (e.g., preferential flow path; not considered in transport model as these are not well documented in the study region). Lastly, for the sample that contains the highest GRO (2.16 ± 0.71 ppb), there was a brine spill at a gas well located 3.8 km from the domestic well (documented in 2012).
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc of 0.55 and 4.99, respectively, where Koc is in L kgoc−1) of organic carbon-water partition coefficients of disclosed compounds. This range was determined by analyzing a disclosed fracturing chemical dataset, that included all reported chemicals until 2014 in four states with major shale plays,54 where 508 out 959 chemicals have available log
Koc of 0.55 and 4.99, respectively, where Koc is in L kgoc−1) of organic carbon-water partition coefficients of disclosed compounds. This range was determined by analyzing a disclosed fracturing chemical dataset, that included all reported chemicals until 2014 in four states with major shale plays,54 where 508 out 959 chemicals have available log![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc (Fig. S10†). Here, we note that geologically-derived compounds present in flowback and produced water are excluded in this approach (but also have Koc values within the disclosed chemical space27). The retardation factor (R), was calculated using eqn (3):
Koc (Fig. S10†). Here, we note that geologically-derived compounds present in flowback and produced water are excluded in this approach (but also have Koc values within the disclosed chemical space27). The retardation factor (R), was calculated using eqn (3):|  | (3) | 
|  | ||
| Fig. 5  Contaminant transport length-scales are short relative to the distance to nearest oil and gas well over the timescale of unconventional oil and gas well development in Northeastern PA. (A) Gas well drilling age and distance to nearest groundwater well (blue circles) are shown alongside transport distance as a function of time considering a distribution of hydrological conditions in southeastern sub-region of Bradford county, PA and two representative end-members: acrylamide (red, log ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc = 0.55) and bis-2-ethylhexyl phthalate (black/gray, log ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) Koc = 4.99, where Koc is in L kgoc−1). The solid, dashed, dotted, and shaded regions refer groundwater transport velocities that bound the distribution of the results. B (acrylamide) and C (bis-2-ethylhexyl phthalate) illustrate geospatially specific transport distances about 30 gas well pads (gray circles) near 8 groundwater wells (black stars) in southeastern Bradford County after 10 years. The heat maps in B and C delineate the location probability of acrylamide and bis-2-ethylhexyl phthalate ten years after a pulse-type injection from each gas well. The low probability is always set as 10−7, and high probability is a function of the size of the plume and is defined as 10−0.8 (0.16) in B, and 10−0.004 (0.99) in C. The inset shows a scaled site for viewing. The area of gas wells is not proportional to the map scale. The evolution of the transport zones over 1, 5, and 25 year time frames are available in the ESI (Fig. S11†). | ||
Over the 10 year time horizon, the transport distance of the phthalate was always shorter than the distance between gas wells and drinking water wells (Fig. 5A), even under the maximum groundwater velocity scenario (0.0007 km over 10 years). This is also true for the case of acrylamide considering velocities at or below the 75th percentile (0.08 km at year 10); only at maximum groundwater velocities can acrylamide transport distance approach 2 km (transporting from gas wells to adjacent groundwater wells) in 4.1 years. This indicates that contamination of a groundwater via subsurface transport from a gas well could occur only for the fastest transport compounds (e.g., with the lowest Koc values) under the fastest groundwater velocities. As a point of reference, an unretarded chemical traveling with the fastest groundwaters would reach a transport distance of 2 km after approximately 3 years and 152 m (setback distance considered by PA DEP) in around 0.2 years (see Soriano et al. articles38,55 for a full analysis of unretarded transport for a “worst case” scenario). This finding is consistent with the analysis by Rogers et al. 2015.18 The 10 year transport distances for the majority of organic contaminants are too short relative to the distance between gas wells and groundwater wells to account for chemical contamination via subsurface transport mechanisms. Considering the setback distance (gas-well structure to occupied residence) of 152 m suggested by the PA DEP,56 modeling results at 90th-percentile velocity suggest that it would take 9.1 years for acrylamide and 97![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 years for phthalate to reach the nearest domestic well; a distance of 1 km increases these times to 33 and 18
000 years for phthalate to reach the nearest domestic well; a distance of 1 km increases these times to 33 and 18![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 years, respectively. Over these timescales, natural attenuation18 would further reduce the likelihood of detecting organic contaminants in groundwater. Importantly, we note that setback distance has not been established in all states despite the EPA's recommendation, and the distance restriction of some states can be as low as 50 m (e.g., Ohio).57
000 years, respectively. Over these timescales, natural attenuation18 would further reduce the likelihood of detecting organic contaminants in groundwater. Importantly, we note that setback distance has not been established in all states despite the EPA's recommendation, and the distance restriction of some states can be as low as 50 m (e.g., Ohio).57
To illustrate contaminant-transport behavior within the heterogeneous aquifer system that underlies the study area, we used the coupled flow and transport model to simulate the location probability of acrylamide and the phthalate after ten years, assuming a point source at each of the gas well pads (Fig. 5B and C). The modeled domain included all gas well pads in the region (n = 30) and sampled groundwater wells in our study (n = 8). In the case of acrylamide (Fig. 5B), only three groundwater wells spatially overlapped with a low probability (∼10−7 to 10−6) area of contaminant transport. Under no circumstance could the modeled phthalate travel from a gas well to a nearby groundwater well (Fig. 5C). An important feature is that the solute plumes never propagate from the source in radially symmetric fashion, but rather, travels in one, advection-dominant direction. Multiple groundwater wells are located “upstream” or shifted away from the direct “downstream” path of gas wells, leading to low or no probability of contamination. This holds even after 25 years of transport time has elapsed (Fig. S11†), despite gas well-to-groundwater well distances as short as 200 m. Considering these restrictive circumstances for groundwater impairment due to subsurface transport, it is perhaps unsurprising that poor correlations between organic contaminant level and distance to nearest gas wells exist in this and other studies.
McMahon et al.15 previously observed low levels of organic contamination in PA groundwater, utilizing 3H and SF6-calibrated modeled groundwater-age distribution to illustrate that groundwaters were older than HDHF activities. Thus, absent any catastrophic impacts or surface releases,58 more time was needed for water-borne contaminant transport. Our results support this finding, where we provided a fundamentally different approach to indicate that organic chemical impacts via subsurface transport to groundwater from HDHF activities require long timescales, exceeding the development timeline of HDHF.
Here, we highlight a few important features of the model and note opportunities for further improvement. First, the model only accounts for contamination at the well pads, excluding accidents such as roadside spills during fluid and waste transportation. Nevertheless, spills at well pads are probable sources of HDHF-associated water contamination59 and the groundwater physics is illustrative nonetheless. Second, this groundwater model does not account for contaminant transport in surface waters (e.g., streams or rivers) that may have orders of magnitude more rapid transport velocities.60 Generally speaking, in the model domain, groundwater tends to move toward adjacent rivers, suggesting that groundwater wells that intersect a path between a gas well and river may be more vulnerable to contamination incidents from HDHF activities. Third, the model does not account for reactive-transport within the vadose zone, which would tend to reduce the concentrations of surface-derived contaminants that reach groundwater. Fourth, the model does not consider multi-contaminant interaction, enhanced organic sorption (locally high foc), natural degradation,61 or volatilization, which would all combine to reduce the contaminant load at a particular time and place away from the pulsed input (i.e., these results are maxima).
Considering the potential importance of such events, we note that there are imprecisely documented violation entries in the PA DEP disclosed violation reports. These should be corrected and appropriately documented (e.g., EHS violation are sometimes mistakenly recorded as administrative violations). The quantity and type of chemicals spilled should be recorded systematically if any meaningful interpretations of the rates, frequencies, or chemical fates are to be inferred. In addition, we emphasize that accidents that occur away from well pads during transportation were noticeably absent from violation databases. Future vulnerability models would be able to predict the impacts from these events only with accurate reporting measures. These findings highlight that the rapid response to groundwater contamination events would require systematic documentation of HDHF operations or incidents coupled with proactive evaluation of groundwater vulnerability with geospatially-specific hydrogeological modeling tools (e.g., as part of the permitting process).
Such modeling was illustrated here and underscores two important features of contaminant transport: (1) many chemicals used in HDHF required long timescales to reach groundwater wells via subsurface transport (as shown previously18), and (2) groundwater transport tended to dominate in advection-driven directions rather than in all directions equally, as might be implied by distance-to-nearest well assessment. Thus, distance-based metrics to gauge exposure in epidemiologic studies and others must be utilized with caution. In contrast to the commonly adopted distance correlation, we provide an approach that accounts for the physics of groundwater flow and solute transport.
Estimating transport direction and distance over long timescales is not possible without spatially resolved hydrogeological modeling. Although our transport model and calculation was limited to southeastern Bradford County, the approach can identify regions with higher groundwater vulnerability and be applied to assess the impact of HDHF on groundwater quality in other shale plays in the long term. Considering that stochastic accidents (rather than deterministic deep subsurface transport processes) have been shown to give rise to groundwater contamination repeatedly,20,21,62,63 the utility of hydrogeologic modeling to evaluate groundwater vulnerability following such accidental releases becomes clear. For example, in the event of an HDHF well breach or chemical spill on or off the well pad, groundwater vulnerability modeling could be used to predict which drinking water sources could be impacted and on what timescale. This would allow for (a) more informed response to the spill, saving remediation costs and reducing ecological impacts, (b) targeted home drinking water treatments to protect public health, and (c) more accurate estimates of the timeline of the crisis and anticipated impacts, informing both response strategies as well as potential regulatory and oversight frameworks (i.e., reducing the cost of local and federal response in a way that still protects the environment and public health). To make this vision a reality and enable response to crises, near-term future work should focus on developing transport-direction related variables that are simple to calculate and alleviate the need for computationally intensive 3D models. Until that time, the use of linear distance metrics may be a de facto necessity for any policy tools designed to protect public living near UOG operations.
| Footnote | 
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/d1em00124h | 
| This journal is © The Royal Society of Chemistry 2022 |