Open Access Article
Jianfeng Su*a, Lara Williamsb, Allison Fenskec, Samuel W. Shaheenbd, Susan L. Brantleybe, Brandon Forsytheb, Nathaniel R. Warnerc, Jennifer Bakaef and Tao Wen*a
aDepartment of Earth and Environmental Sciences, Syracuse University, Syracuse, NY, USA. E-mail: jsu124@syr.edu; twen08@syr.edu
bDepartment of Geosciences, Pennsylvania State University, University Park, PA, USA
cDepartment of Civil and Environmental Engineering, Pennsylvania State University, University Park, PA, USA
dDepartment of Earth & Environmental Sciences, University of Minnesota, Minneapolis, MN, USA
eEarth and Environmental Systems Institute, Pennsylvania State University, USA
fDepartment of Geography, Pennsylvania State University, University Park, PA, USA
First published on 2nd June 2026
Unconventional oil and gas development (UOGD) in the Marcellus Shale of southwestern Pennsylvania has raised persistent concerns regarding groundwater quality. Previous work identified potential regional associations between UOGD activity and elevated salinity in heavily drilled counties based on samples collected between 2008 and 2018. Here, we collected 97 water samples from private wells, springs, and streams in 2024 to evaluate whether such associations persist and to assess whether impacts, if present, are regional or highly localized. Sampling targeted previously identified chloride hotspots, decommissioned wastewater impoundments, and recent spills, with comparisons to control locations. Major ions, trace metals, dissolved gases, isotopes, and organic compounds were analyzed and compared with drinking water standards, historical datasets, and proximity to UOGD features while accounting for geologic, hydrogeologic, and topographic controls. Most samples met drinking-water standards, and statistical comparisons revealed no robust regional differences between treatment and control samples after Benjamini–Hochberg correction. Fixed-effects models showed no evidence of widespread groundwater degradation attributable to UOGD. Among spill sites sampled, only one groundwater sample exhibited elevated salinity and brine-associated constituents. Geochemical, isotopic, and topographic evidence suggests this anomaly reflects localized influence from produced water or natural Appalachian Basin brine migrating along deeper flow paths rather than surface transport. Our results indicate that contamination patterns are best explained by highly localized rather than region-wide problems: detectable impacts within the targeted sampling framework used here were rare and localized. The results also highlight the challenge of defining appropriate controls given hydrogeologic heterogeneity.
This work emphasizes combining regional screening with targeted, hydrogeology-informed monitoring to assess UOGD-related groundwater contamination.
Environmental significance
Southwestern Pennsylvania hosts a high density of unconventional oil and gas (UOG) wells in the United States, yet the extent of associated impacts under current regulatory conditions remains poorly constrained. This study presents a hydrogeochemical dataset of 97 samples collected in 2024 from Washington and Greene Counties, targeting previously identified chloride hotspots, decommissioned impoundments, and recent spill sites. We show that widespread brine-related contamination attributable to unconventional oil and gas development (UOGD) is not detectable in this targeted dataset, and that persistent impacts are rare, highly localized, and strongly governed by hydrogeologic context. These findings demonstrate that proximity to UOGD features alone is an insufficient predictor of contamination risk. We propose a tiered monitoring framework that combines regional screening with hydrogeology-informed field investigations, offering a transferable model for protecting drinking-water resources in active shale-development regions.
Early studies in Pennsylvania documented that shallow aquifers in the Appalachian Basin can contain naturally occurring methane from both microbial methanogenesis and natural upwelling of thermogenic methane from deeper formations and, occasionally, thermogenic gas linked to well integrity failures.2–8 Although nontoxic, methane can pose explosion and asphyxiation hazards in confined spaces once dissolved concentrations exceed 10 mg L−1 in water.2,5 Beyond methane, which primarily reflects gas migration and well integrity issues, concerns related to unconventional oil and gas development also include the potential release of liquid wastes associated with hydraulic fracturing operations. In particular, flowback water, produced when injected fluid returns to the surface after hydraulic fracturing, often contains highly concentrated brine salts, reflecting a mixture of the original fracking fluid and naturally occurring formation brines.9 These wastewaters are usually enriched in naturally occurring radioactive materials and heavy metals, many of which are toxic and can cause adverse health effects even at very low levels.10 Some species uncommon in shallow groundwater (e.g., barium (Ba), strontium (Sr), bromide (Br)) can act as “fingerprints” for mixing of brines with fresher waters. These fingerprint species are sometimes naturally present in shallow groundwater, which has been attributed to natural vertical migration of basin brines through inter-formation pathways7 or infrequent flushing of connate salt water by meteoric recharge.11 However, leakage from impoundments, where the flowback water is held for unconventional oil and gas (UOG) operations, and surface spills that may occur during temporary storage, transportation and waste handling processes, pose a significant contamination risk.12
Using large publicly available groundwater datasets from the Marcellus Shale region, Shaheen et al. (2022, 2024)13,14 demonstrated that UOGD is associated with small but statistically significant regional increases in salt-related groundwater constituents, including chloride (Cl), Ba, and Sr, particularly in southwestern Pennsylvania (SWPA) where UOG activity overlaps with dense legacy coal, oil, and gas extraction. These studies further showed that any regional associations between groundwater chemistry and UOGD are likely driven by subregions with documented spill-related violations and wastewater impoundments, and that such relationships persist after accounting for other natural and anthropogenic sources of salinity. Collectively, these findings were interpreted to suggest that wastewater management issues, rather than routine UOG operations, may contribute to subtle degradation of groundwater in which a small number of localized incidents can produce a regionally detectable effect in SWPA. Shaheen et al. (2022)13 used a geospatial tool to identify “hotspots,” i.e., areas in which concentrations of Cl significantly increased with UOG well density. However, most samples underlying these interpretations were collected early in the history of UOGD in SWPA (i.e., pre-2014), leaving it unclear whether previously observed contamination persists under the current regulatory and operational framework that has evolved since early UOGD.
Here, we evaluate the regional relationships reported by Shaheen et al. (2022, 2024)13,14 using an independent groundwater dataset collected in 2024. Although our sampling targeted fewer locations than the earlier regional studies, it was designed to test whether previously reported salt-related groundwater signals remain detectable under current regulatory and operational conditions. Because the earlier regional associations were hypothesized to arise from localized wastewater-management issues, particularly leakage from impoundments or produced-water spills, we targeted sites near previously identified chloride hotspots, decommissioned wastewater impoundments and documented recent spills. Given the lack of recent spills in the area that we were able to sample, we also targeted brownfield sites (hereafter referred to as “spills” for simplicity), where spills had occurred and been remediated. By applying multivariate statistical analyses and explicitly controlling for geologic, hydrogeologic, topographic, and anthropogenic factors, we assess whether previously reported UOGD-associated groundwater signals are persistent, localized, or no longer detectable.
Our study focuses on Washington and Greene Counties in SWPA, a region with the highest density of UOG wells in the Appalachian Basin.15 Washington and Greene Counties contain thousands of horizontal gas wells (UOG wells) and a century of legacy conventional drilling and coal mining, making it a key location for assessing potential effects on groundwater. We organized the study around four related hypotheses and corresponding analyses. First, we tested whether regional brine-related contamination signals identified during the early period of UOGD remain detectable in the 2024 dataset by comparing treatment and control samples, and by evaluating results relative to historical groundwater data. Second, we tested whether previously identified chloride hotspots persist as zones of elevated salinity by characterizing groundwater chemistry within hotspot areas and comparing these samples with controls. Third, we tested whether wastewater impoundments and spills produce localized and potentially persistent groundwater contamination by sampling near decommissioned impoundments, documented spills, and brownfield sites and by assessing spatial, geologic, hydrogeologic, and topographic controls on observed chemistry. Fourth, we tested whether any detected impacts correspond to potential human-health concerns by comparing measured concentrations with established or proposed drinking-water thresholds. Together, these analyses provide a framework for determining whether UOGD-related groundwater effects in SWPA are regionally persistent, locally restricted, or not detectable under the current conditions.
In total, 97 samples were collected across 7 field campaigns between May and November 2024, including both groundwater (79 wells and 14 springs) and stream water (n = 4). Two groundwater samples strongly influenced by acid mine drainage (AMD) and one rain-dominated sample were excluded from subsequent data analyses, yielding 90 groundwater samples for analysis (Fig. 1 and Table 1). Stream samples were collected to provide local surface water context but since we focus on groundwater in this study, they were also excluded from the analyses.
Water sampling sites were chosen based on locations of hotspots and impoundments as previously reported.13–16 For example, all UOG well locations within the hotspots as previously identified were used to define an area where the outer perimeter was 2 km from every well. This buffer radius (2 km) was selected for the 31 HS samples based on previous modeling and field-based studies showing putative groundwater impairment caused by UOGD within this radius.17,18 Like the hotspot target areas, the 24 I samples were collected within zones around impoundments that were also defined with a 2 km buffer radius.
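The buffer definition above can be illustrated with a minimal sketch. It assumes the target area is the union of 2 km circles around each feature and uses a haversine great-circle distance as a stand-in for the planar distances computed in ArcGIS Pro. The coordinates and the function names `haversine_km` and `in_target_area` are hypothetical and are not part of the study's workflow.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points in km."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def in_target_area(sample, features, radius_km=2.0):
    """True if the sample lies within radius_km of at least one feature."""
    return any(haversine_km(*sample, *f) <= radius_km for f in features)

# Hypothetical coordinates (not taken from the study)
wells = [(40.000, -80.200), (40.010, -80.210)]
near_site = (40.005, -80.205)   # roughly 0.7 km from either well
far_site = (40.100, -80.200)    # roughly 10 km or more from both wells

print(in_target_area(near_site, wells))  # inside the 2 km buffer
print(in_target_area(far_site, wells))   # outside the 2 km buffer
```

Passing `radius_km=3.0` would correspond to the expanded spill buffer, and a 5 to 7 km distance check could screen candidate control sites in the same way.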
To sample the effect of spills, we targeted spill or leakage events that occurred within the past three years. A 3-year window was selected because impacts from spills were observed in previous work in Pennsylvania as well as in other shale plays within this temporal range.14,19,20 Initially, we sought to collect groundwater samples within 2 km of these locations, consistent with the buffer used for hotspot and impoundment sampling. However, because domestic wells were sparse near several spill locations and homeowner participation was voluntary, we expanded the spill-sampling buffer from 2 km to 3 km to obtain sufficient samples for exploratory comparison near spill-associated sites. This expansion was also consistent with evidence that at least one documented UOGD-related contamination incident in PA involved migration more than 2 km along fractures.18 Even within a 3 km radius of the three spills we identified in Greene County that occurred between 2021 and 2024, we collected only six samples.
To increase the potential sampling possibilities near spills, we identified twelve additional sites in Washington County using data from the PADEP Land Recycling Program. Information from that Program allowed us to find brownfield sites associated with major UOG companies. Pennsylvania brownfield sites are former industrial or commercial properties where reuse is complicated by potential contamination from hazardous substances but that are eligible for cleanup and redevelopment. We were able to collect fifteen water samples from 6 of the 12 brownfields. Two of the six brownfields were associated with spills that occurred in 2015 and 2020, respectively. Given the very small number of spills and domestic wells located near spills, samples (Sample_082 and Sample_083) collected near these sites outside the 3-year window were retained in the analysis to evaluate whether longer-term contamination signatures persist beyond the 3-year window, providing a complementary perspective on attenuation timescales. Because these older sites differ in timing from the recent spill sites, they were interpreted as supplemental cases for evaluating possible longer-term persistence rather than as direct equivalents to recent releases. We found no information about exact spillage locations for the 3 spill sites and for the 12 brownfield sites. Therefore, centroids of the wellpads were used to define the areas targeted for S water sampling.
Control samples were selected 5–7 km from the targeted UOGD features and, where possible, from broadly comparable hydrogeologic, topographic, and land-use settings. This distance range was chosen to minimize the likelihood of direct influence from the targeted UOGD features while maintaining sufficient geographic proximity to treatment areas for comparative assessment.
Quality assurance and quality control included field blanks, trip blanks, duplicate samples, and calibration verification every 10 samples. Analytical precision for major and trace elements was within ±5% (refer to SI for test procedures and details of chemical analyses).
Water chemistry data were compared to established standards including the maximum contaminant level (MCL)10 and secondary contaminant level (SMCL).21 Where EPA has not established an MCL/SMCL, alternative threshold levels22,23 were used for comparison (Table S1).
Correlations between analyte concentrations and distance to the nearest UOGD feature were evaluated using both linear (ordinary least squares [OLS]) and rank-based (Spearman, Kendall's tau, and Akritas–Theil–Sen [ATS]) regression models for SC and dissolved concentrations of chloride (Cl), sodium (Na), sulfate (SO4), barium (Ba), strontium (Sr), bromide (Br), iron (Fe), and methane (CH4). SC, Cl, and Na are indicators of groundwater salinity and serve as proxies for assessing salt impacts, a major component of UOGD wastewaters. Fe and SO4 were chosen because they are redox-sensitive and have sometimes been used to predict contamination related to UOGD.8,24 SC, Cl, Ba, Sr, Br, and CH4 are typically enriched in UOG produced waters, and have been used to identify contamination from deep formation brine, methane leakage, and putative UOG-related incidents.13,24
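As a rough illustration of two of the rank-based measures named above, the sketch below computes Kendall's tau and a Theil–Sen slope in plain Python. It is a simplification: it uses the tau-a form, which has no tie correction, and the ordinary Theil–Sen estimator, whereas the Akritas–Theil–Sen variant additionally handles censored (non-detect) values. The data and function names are hypothetical.

```python
from itertools import combinations
from statistics import median

def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    pairs = list(combinations(range(len(x)), 2))
    score = 0
    for i, j in pairs:
        s = (x[i] - x[j]) * (y[i] - y[j])
        score += (s > 0) - (s < 0)  # +1 concordant, -1 discordant, 0 tie
    return score / len(pairs)

def theil_sen_slope(x, y):
    """Median of all pairwise slopes (pairs with equal x are skipped)."""
    slopes = [(y[j] - y[i]) / (x[j] - x[i])
              for i, j in combinations(range(len(x)), 2)
              if x[j] != x[i]]
    return median(slopes)

# Hypothetical data: distance to nearest feature (km) and chloride (mg/L)
distance_km = [0.5, 1.0, 1.5, 2.0, 2.5]
chloride = [60.0, 48.0, 50.0, 30.0, 20.0]

tau = kendall_tau_a(distance_km, chloride)
slope = theil_sen_slope(distance_km, chloride)
print(tau, slope)
```

Both statistics are negative for these made-up values, which is the sign expected if concentrations rise toward the feature.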
To assess the vulnerability of each sample to contamination that is transported via the surface or shallow pathway, watersheds of collected samples were delineated from 10 m × 10 m DEM raster.25 In ArcGIS Pro, cells with values ≥100 (≥0.01 km2) were used to define stream networks, snap samples within 200 m, and delineate capture zones using the “Watershed” tool. We also examined how groundwater chemistry relates to topography: the topographic position index (TPI)26 was used to categorize locations into six terrain classes—valley, lower slope, flat slope, middle slope, upper slope, and ridge—which were grouped into “valley” (first three) and “ridge” (last three).
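Two bookkeeping steps in this paragraph can be checked with a short sketch: converting the flow-accumulation threshold from cells to contributing area, and collapsing the six TPI classes into the two groups used in the analysis. The dictionary and function names are hypothetical, and the sketch does not reproduce the ArcGIS Pro tools or the TPI classification thresholds themselves.

```python
CELL_SIZE_M = 10.0  # DEM resolution stated in the text (10 m x 10 m)

def accumulation_area_km2(n_cells, cell_size_m=CELL_SIZE_M):
    """Contributing area (km2) represented by n_cells upstream cells."""
    return n_cells * cell_size_m ** 2 / 1e6

# Grouping of the six terrain classes into "valley" and "ridge"
TERRAIN_GROUP = {
    "valley": "valley",
    "lower slope": "valley",
    "flat slope": "valley",
    "middle slope": "ridge",
    "upper slope": "ridge",
    "ridge": "ridge",
}

def terrain_group(tpi_class):
    """Map a six-class TPI label to the two-class grouping."""
    return TERRAIN_GROUP[tpi_class.lower()]

print(accumulation_area_km2(100))   # threshold of 100 cells, in km2
print(terrain_group("Flat slope"))
print(terrain_group("Upper slope"))
```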
A simplified fixed-effects regression was used to control for potential confounding factors, including proximity to coal mines (<1 km) and conventional oil and gas wells (COGD),15,27 highways,28 topographic position (valley vs. ridge), and a sampling-season indicator (November vs. May–October). These factors were tested because of their reported effects on groundwater chemistry in Pennsylvania.29,30 In particular, coal mining can increase SO4 (among other analytes), COGD can affect many of the same contaminants associated with UOGD, and highways in PA are associated with elevated Na and Cl because of road salting. Spatial distances (horizontal planar distances) were calculated using ArcGIS Pro. Statistical significance was assessed at p = 0.05. The final equation for the fixed-effects model is:
$$\log C = \beta_1\,(\text{UOG/I/S}) + \text{COG 1 km} + \text{coal mining 1 km} + \text{highway 1 km} + \text{TPI} + \text{season} + \varepsilon \tag{1}$$
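Here, C is the concentration of a brine-related species (so log C is its log-transformed concentration), UOG/I/S represents the geodesic distance metrics for all water samples, β1 is the regression coefficient of the distance variables, and ε is the error term. The remaining terms are the confounding factors listed above.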
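The structure of such a model can be sketched as an ordinary least-squares fit of log concentration on a distance variable plus indicator (dummy) variables for the confounders. The example below is a minimal sketch on made-up, noise-free data with only two confounders. It adds an intercept, which the equation above does not show explicitly, and it is not the authors' implementation. All names and values are hypothetical.

```python
def solve(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x

def ols(rows, y):
    """Least-squares coefficients via the normal equations."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
           for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve(xtx, xty)

# Hypothetical samples: (distance_km, coal mine within 1 km, valley setting)
samples = [(0.5, 1, 0), (1.0, 0, 1), (1.5, 1, 1),
           (2.0, 0, 0), (2.5, 1, 0), (3.0, 0, 1)]
# Made-up "true" coefficients: intercept, distance, coal, valley
true_b = [3.0, -0.1, 0.4, -0.2]

X = [[1.0, d, coal, valley] for d, coal, valley in samples]
log_c = [sum(b * v for b, v in zip(true_b, row)) for row in X]

beta = ols(X, log_c)
print([round(b, 6) for b in beta])  # recovers the made-up coefficients
```

Because the synthetic response is generated without noise, the fit returns the coefficients that were used to build it; with real data the distance coefficient would carry a standard error and p-value.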
The ‘other’ samples were excluded from the primary categorical treatment-control tests because they were not designed to represent either treatment or control groups. They were retained in descriptive statistics and fixed-effects regression models when they met the relevant sample type criteria, because those models used continuous distance metrics rather than categorical treatment assignment.
| Spill identifier | Incident detail | Incident date | Remediation detail | 2024 SWPA samples | # of SN samples (1 km) | # of PADEP samples (1 km) | # of PADEP samples (1.3 km) |
|---|---|---|---|---|---|---|---|
| A | Produced fluid release due to washout failure and slow leak of produced water from a dump line (19 213–22 425 barrels) | 12/2020–12/2021 | Groundwater & soil were impacted; pump and treat was done on site | Sample_003 (HS/S), Sample_013-3 (HS/S), Sample_042 (HS/S) | 80 (Sample_003); 20 (Sample_013-3); 18 (Sample_042) | 31 | 292 |
| B | Produced fluid spill during transportation (50–100 gallons) | 10/29/2020 | Limestone aggregate near the well pad was removed | Sample_082 (S) | 17 | 0 | 0 |
| C | Produced fluid seeped out of containment | 2/12/2021 | Impacted soil and fluid were removed | Sample_090 (S), Sample_038 (S) | 158 (Sample_090); 0 (Sample_038) | 0 | 0 |
| D | Reuse water spill during transportation (11.8 barrels) | 1/7/2021 | Impacted soil was removed | Sample_081 (S), Sample_084 (S) | 0 | 0 | 0 |
| E | Produced fluid release while attempting to purge air (15 gallons) | 6/6/2015 | Impacted soil was removed | Sample_083 (S) | 12 | 0 | 0 |
Most groundwater samples met EPA drinking-water standards (Tables S1–S6). Among the 90 groundwater samples analyzed, exceedances of MCLs, SMCLs, or other screening thresholds were observed primarily for Cl, Fe, Mn, TDS, and Li. Five of 90 2024 SWPA samples (5.6%) exceeded the EPA SMCL of 250 mg L−1 for chloride: two categorized as HS (286 mg L−1 and 568 mg L−1; Sample_066 and Sample_077), one as I (307 mg L−1; Sample_026), one as S (697 mg L−1; Sample_082), and one as ‘other’ (584 mg L−1; Sample_043), i.e., not in any of these categories. The S sample – Sample_082 – also exhibited the highest SC (3485 µS cm−1), Na (622 mg L−1), and elevated levels of several other analytes. Its Ba concentration (2.05 mg L−1) was at or slightly above the EPA MCL (2 mg L−1), while F (3.52 mg L−1) and Br (5.81 mg L−1) concentrations exceeded their SMCLs (2 mg L−1).
Fe and Mn surpassed SMCLs in 4 of 90 samples (4.4%) and 15 of 90 samples (16.7%), respectively. These secondary limits are based largely on aesthetic considerations. Li concentrations in 44 samples were higher than the EPA provisional toxicity value of 10 µg L−1,23 and one sample (Sample_082) exceeded the U.S. Geological Survey (USGS) drinking water benchmark (60 µg L−1 (ref. 33)). All other trace metals (Be, Ni, Cu, As, Tl, Cd, Pb, and U) were below health-based thresholds. A few low-level detections of Li, Cu, and Pb are consistent with possible analytical blank interference.
All BTEX compounds were below EPA MCLs. Dissolved methane was low except for three samples that exceeded the Department of Interior action level (10 mg L−1; Sample_086, Sample_017, Sample_055), including one (57 mg L−1; Sample_086) above the immediate action level.
Methane (CH4 or C1) was detectable in 53% of samples, whereas ethane (C2H6 or C2) and propane (C3H8 or C3) were infrequently detected and occurred only where CH4 was present. δ13C–CH4 values ranged from −77‰ to −55‰ and δD–CH4 values from −220‰ to −154‰, consistent with published ranges for methane of mixed biogenic and thermogenic origin (Fig. 2A). 87Sr/86Sr ratios (0.711–0.713) were similar across all treatment samples and matched those of UOG brines from Pennsylvania34 and of the control group (Fig. 2B). These 87Sr/86Sr ratios were also significantly lower than those of COG brines from PA (≥0.720 (ref. 34)).
Fig. 2 (A) Concentrations of methane (C1) versus ethane (C2) + propane (C3) plotted versus δ13C–CH4 and (B) 87Sr/86Sr as a function of Sr/Cl mass ratio (in mg L−1) of groundwater samples. The symbol sizes are proportional to CH4 concentration in (A) and to Cl concentration in (B). Values of production gases and produced waters from COG and UOG wells are compiled from the literature.34,43 (C) Cl/Br mass ratios for HS, I, S, and C samples. Gray areas for septic and animal waste based on Katz et al. (2011).44 Appalachian Basin brine reported for Pennsylvania based on Blondes et al. (2023).34 (D) Comparison of Cl/Br mass ratio across HS, I, S, and C samples, as well as from the produced waters in Pennsylvania from Blondes et al. (2023).34
Cl/Br mass ratios were also examined to help distinguish the potential sources of elevated chloride (Fig. 2C and D). In hydrocarbon-bearing basins such as the Appalachian, Cl/Br ratios can indicate provenance of salinity: road salt and organic waste (e.g., septic or agricultural runoff) typically contain little bromide, resulting in high Cl/Br ratios, whereas Appalachian Basin formation brines—whether naturally discharging or returning to the surface during UOGD—contain relatively high bromide and therefore much lower Cl/Br ratios.18 Calculated Cl/Br ratios for this dataset show that most samples with elevated chloride (>∼100 mg L−1) plot within the high-ratio domain characteristic of surface anthropogenic sources rather than deep formation brines. Comparisons further indicate that the majority of samples, regardless of category, exhibit Cl/Br ratios much higher than those typical of UOG wastewaters. These results support the interpretation that most elevated chloride concentrations in the study area primarily reflect surface or shallow anthropogenic inputs rather than persistent influence from UOG-derived brines. The only notable exception is Sample_082, which plots closer to the Appalachian Basin brine zone (Fig. 2C). This high Br is consistent with a non-negligible brine contribution. This anomalous signature is examined in greater detail in subsequent sections.
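For reference, the Cl/Br mass ratio is simply the chloride concentration divided by the bromide concentration in the same mass units. The sketch below applies this to the Cl and Br concentrations reported above for Sample_082 (697 and 5.81 mg L−1). The second sample is a made-up low-bromide case for contrast, and the function name is hypothetical. No source-classification thresholds are assumed here.

```python
def cl_br_mass_ratio(cl_mg_l, br_mg_l):
    """Cl/Br mass ratio from concentrations in the same units (mg/L)."""
    if br_mg_l <= 0:
        raise ValueError("bromide must be a positive, detected concentration")
    return cl_mg_l / br_mg_l

# Sample_082: Cl and Br concentrations as reported in the text
ratio_082 = cl_br_mass_ratio(697.0, 5.81)

# Hypothetical low-bromide sample (values made up for contrast)
ratio_low_br = cl_br_mass_ratio(300.0, 0.05)

print(round(ratio_082))
print(round(ratio_low_br))
```

The ratio for Sample_082 is far smaller than that of the made-up low-bromide sample, which is the direction of difference the paragraph describes between brine-influenced and surface-derived chloride.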
To reduce bias related to agricultural inputs, statistical tests were repeated for well samples only (Table S8). In this subset, five analytes—Cl, Mg, K, Pb, and Cu—were higher in treatment wells than in controls, while P and ethylbenzene were lower. Specifically, prior to FDR correction: (1) for HS samples, K, Cl, and Mg remained significantly higher; (2) for I samples, Pb was elevated; and (3) for S samples, K and Cu were higher.
Because multiple tests across 48 analytes increase the risk of false positives, we applied the Benjamini–Hochberg (BH) correction.35 We set false discovery rate (FDR) = 0.1, meaning that we accepted up to 10% false positives. After correction, no analytes were significantly different between treatment and control groups in either the full dataset or the well-only subset.
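The Benjamini–Hochberg step-up procedure can be sketched in a few lines: sort the p-values, find the largest rank k whose p-value is at most (k/m) × FDR, and reject every hypothesis up to that rank. The p-values below are hypothetical. The second example uses 48 tests only to mirror the number of analytes and does not reproduce the study's results.

```python
def benjamini_hochberg(p_values, fdr=0.1):
    """Return booleans: True where the test is rejected at the given FDR."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest rank k (1-based) with p_(k) <= (k / m) * fdr
    k_max = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * fdr:
            k_max = rank
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= k_max:
            rejected[idx] = True
    return rejected

# Hypothetical p-values from five tests
small_family = [0.001, 0.04, 0.03, 0.5, 0.9]
print(benjamini_hochberg(small_family))

# One nominally significant result among 48 tests (hypothetical)
large_family = [0.01] + [0.5] * 47
print(any(benjamini_hochberg(large_family)))
```

In the second case the smallest p-value (0.01) would need to be at most 0.1/48, about 0.002, to survive. This shows how an analyte that looks significant in an uncorrected test can drop out after correction.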
| Treatment group | Coefficient | SC | Na | Cl | Ba | SO4 | Sr | Br | Fe | CH4 |
|---|---|---|---|---|---|---|---|---|---|---|
| HS | OLS slope | 2.93 | 3.055* | 0.119 | 0.003 | −0.588 | −0.005 | 0.012 | 0.002 | 0.206* |
| HS | Spearman | −0.099 | 0.026 | −0.051 | −0.041 | −0.206 | −0.109 | 0.079 | 0.139 | 0.310* |
| HS | Kendall's tau | −0.071 | 0.022 | −0.038 | −0.029 | −0.133 | −0.069 | 0.053 | 0.104 | 0.214* |
| HS | ATS slope | −3.793 | 0.121 | −0.192 | 0.000 | −0.686 | −0.006 | 0.000 | 0.000 | 0.000* |
| I | OLS slope | 11.504 | 4.617* | 1.86 | 0.007 | 0.077 | −0.005 | 0.029* | −0.002 | 0.097 |
| I | Spearman | −0.063 | 0.05 | −0.085 | −0.129 | 0.028 | −0.225* | −0.035 | −0.156 | 0.144 |
| I | Kendall's tau | −0.034 | 0.033 | −0.05 | −0.086 | 0.03 | −0.151* | −0.026 | −0.117 | 0.095 |
| I | ATS slope | −2.310 | 0.170 | −0.236 | −0.001 | 0.160 | −0.014* | −0.000 | 0.000 | 0.000 |
| S | OLS slope | −15.816 | −5.954* | −4.165 | −0.014* | 1.072 | −0.011 | −0.026 | −0.005 | −0.286 |
| S | Spearman | 0 | −0.307* | −0.119 | −0.084 | 0.208* | −0.048 | −0.113 | 0.034 | −0.259* |
| S | Kendall's tau | −0.002 | −0.209* | −0.084 | −0.054 | 0.145* | −0.037 | −0.077 | 0.024 | −0.172* |
| S | ATS slope | −0.143 | −1.199* | −0.521 | −0.001 | 1.106* | −0.005 | −0.000 | 0.000 | −0.000* |
We also observed that topography influenced groundwater chemistry. Of the 90 samples, 26 were located in valleys and 64 on ridges. Valley samples had significantly higher Ba and CH4 but lower SC, SO4, Ca, Li, and Ni (p < 0.05 in both WMW and BM tests) (Table S10). After BH correction, only SO4 remained lower in valley settings, consistent with enhanced reducing conditions.
After log-transforming analyte concentrations, the fixed-effects linear regression model (Table 4) estimated percentage change per kilometer of distance from each UOGD feature as:
$$\%\ \text{increase} = [\exp(\beta) - 1] \times 100\% \tag{2}$$
| Analyte | UOG well | Impoundment | Spill |
|---|---|---|---|
| SC | 0.003 (p = 0.687) | 0.007 (p = 0.435) | −0.01 (p = 0.487) |
| Na | 0.044 (p = 0.091) | 0.039 (p = 0.176) | −0.105* (p = 0.02) |
| Cl | −0.001 (p = 0.974) | −0.023 (p = 0.465) | −0.054 (p = 0.279) |
| Ba | 0.003 (p = 0.272) | 0.007* (p = 0.013) | −0.008 (p = 0.071) |
| SO4 | −0.041* (p = 0.01) | −0.028 (p = 0.115) | 0.088* (p = 0.001) |
| Sr | −0.006 (p = 0.234) | −0.009 (p = 0.116) | −0.001 (p = 0.923) |
| Br | 0.01* (p = 0.017) | 0.017* (p < 0.001) | −0.001 (p = 0.859) |
| Fe | 0.003 (p = 0.309) | 0.002 (p = 0.577) | −0.012* (p = 0.021) |
| CH4 | 0.048* (p < 0.001) | 0.037* (p = 0.013) | −0.068* (p = 0.004) |
Because distance was measured from each UOGD feature, negative coefficients indicate higher concentrations closer to the feature and are therefore most relevant for evaluating potential UOGD-related enrichment. After adjustment, distance to impoundments showed no significant negative relationship with any analyte concentration. In contrast, concentrations of SO4 increased closer to UOG wells within hotspots, while Na, Fe, and CH4 increased closer to spills. These proximity relationships correspond to estimated concentration increases of 4.2% (SO4), 11.1% (Na), 1.2% (Fe), and 7.0% (CH4) per kilometer approaching the feature.
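The percentages quoted above can be reproduced from the tabulated coefficients with a short calculation. The sketch assumes a natural-log transform of concentration. Moving 1 km toward a feature changes distance by −1, so log C changes by −β and the corresponding percent change is [exp(−β) − 1] × 100%. The function name is hypothetical.

```python
import math

def pct_increase_approaching(beta):
    """Percent concentration change per km moving toward the feature.

    Moving 1 km closer changes distance by -1, so log C changes by -beta.
    """
    return (math.exp(-beta) - 1.0) * 100.0

# Significant negative distance coefficients from the fixed-effects table
coefficients = {
    "SO4 (UOG well)": -0.041,
    "Na (spill)": -0.105,
    "Fe (spill)": -0.012,
    "CH4 (spill)": -0.068,
}

for name, beta in coefficients.items():
    print(f"{name}: {pct_increase_approaching(beta):.1f}% per km")
```

Under this assumption the four values round to 4.2%, 11.1%, 1.2%, and 7.0%, matching the figures given in the text.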
Fig. 3 Scatter plots of brine-related species concentration vs. time (year) for 2024 SWPA water sample Sample_003 (collected near spill A), SN dataset samples, and PADEP samples (within 1.3 km).
Spill A released 19 213–22 425 barrels of produced fluid. We collected three groundwater samples within the buffer area of spill A as part of the 2024 SWPA dataset, but we focus on Sample_003, which was collected at the nearest point. We were able to compare Sample_003 to chemical analyses of 292 water samples collected as part of the PADEP study from 18 monitoring wells between May 2022 and December 2024 (see Text S5 for more details). Although the PADEP sampling overlapped with our sample timing, the PADEP water samples targeted more spill-adjacent locations, i.e., less than ∼700 m downgradient from spill A. In comparison, our Sample_003 was collected 1.2 km downgradient from the spill (our Sample_013-3 and Sample_042 were collected more than 2 km downgradient from spill A and from the PADEP samples).
Brine contamination from spill A is inferred for some of the PADEP water samples even at the same time point (29 months) when we collected Sample_003. In contrast, our Sample_003, collected 1.3 km from the furthest PADEP sampling sites, showed no geochemical indication of spill impact (Table S11): the sample showed markedly lower SC, Br, Cl, Ba, Fe, and Sr than PADEP samples (Fig. 3). Furthermore, Sample_003 exhibited concentrations comparable to those of the SN sample collected in 2013 at a location only 8 meters away, differing only in its higher methane content (Table S11). Regarding Sample_013-3 and Sample_042, the former showed lower concentrations of most brine-related species, whereas the latter exhibited slightly higher SC, Na, Cl, and Ba relative to its nearest SN sample. The SC, Na, Cl, and Ba concentrations in both samples are comparable to the median values of all nearby SN samples (Tables S12 and S13 and Fig. S2 and S3).
Spill B occurred on 10/29/2020 while a water truck was loading produced fluid from the production water tank, releasing an estimated 50–100 gallons onto the well pad surface. The release was contained to a small area of the compacted limestone pad, with no reported off-pad migration. Within the buffer radius of spill B, the only 2024 SWPA sample (Sample_082, 2422 m from the spill location; Fig. S4) exceeded drinking-water standards for Cl, Ba and Br and exhibited the highest salinity observed in the 2024 SWPA dataset. Sample_082 exhibited higher SC, Na, Cl, and Ba, and lower SO4, Fe, and CH4 (Table S14) compared to (i) the chemistry reported for the same site prior to the spill incident in the SN dataset, and (ii) the median of all prior SN samples within 1 km.
Spill C was described in the PADEP report as an incident wherein produced fluid (unknown volume) was found to be seeping from the pad gravel below secondary containment along the northeastern corner of a well pad. Near this site, two samples were collected in our 2024 campaign: Sample_090 (2592 m away) and Sample_038 (1966 m away). Historical SN data were available only near Sample_090. Sample_090 had much lower concentrations of most of the brine-related species compared to the closest SN sample and the median of all nearby SN samples (Table S15 and Fig. S5).
Spill D was described in the PADEP report as occurring on January 7, 2021, when a water hauler veered off the driveway apron while entering the lease road from State Route 519 near the Guyton well pad entrance. The truck overturned on the adjacent fill slope, releasing approximately 11.8 barrels of reuse water (likely brine) onto the surrounding ground. Sample_081 and Sample_084, from sites located near spill D, could not be compared to nearby SN samples because of the lack of samples within a 1 km radius.
Spill E occurred on June 6, 2015, and consisted of fifteen gallons of brine water released from containment when the company attempted to purge air from the water transfer line running across the well pad. Sample_083, sampled within ∼2800 m of this spill location, showed higher SC, Na, Cl and SO4 concentrations compared to the median of nearby SN samples (Table S16 and Fig. S6).
Previously identified chloride hotspots in southwestern Pennsylvania were interpreted as potential indicators of wastewater-related impacts associated with UOGD.13 Our 2024 sampling provides no evidence that such hotspot-related signals persist at a regional scale under current conditions. Groundwater collected within hotspot areas is geochemically similar to control locations, and no systematic enrichment of brine-associated constituents or trace metals of health concern is observed. Li exceeded its provisional health threshold (10 µg L−1) in 44 samples across all categories, including controls, suggesting that elevated Li reflects regional groundwater quality characteristics rather than hotspot-specific contamination.38 Exceedances of Mn and Fe were aesthetic and reflect regional lithologic norms, as these metals are ubiquitous in Pennsylvania bedrock and groundwater particulates.11 Although modest differences appear in uncorrected statistical tests, these do not persist after BH correction, indicating that any earlier hotspot-related impacts are not regionally detectable in the present dataset under current conditions.
Shaheen et al. (2024)14 attributed small increases in brine salts near UOGD operations to occasional, localized surface fluid releases rather than subsurface hydraulic fracturing processes. Consistent with this interpretation, we observed significant correlations between Sr (in 3 of 4 statistical tests) and proximity to impoundments, and between Na (all 4 tests), Ba (1 of 4), and CH4 (3 of 4) and proximity to spills. After applying fixed-effects regression that accounted for topographic position, coal mining, and conventional oil and gas wells, however, most of these relationships were no longer statistically significant. In the full fixed-effects model, Na, Fe, and CH4 showed significant negative relationships with distance to spills, indicating higher concentrations closer to spill locations (Table 4). These results suggest that limited enrichment of Na and CH4 may reflect residual impacts from surface or shallow subsurface wastewater releases rather than well integrity failures or deep formation leakage. Sensitivity analyses excluding Sample_082 showed that negative relationships for Na, Ba, Fe, and CH4 remained significant, while SO4 showed a significant positive relationship with distance to spills (Table S17). Thus, the proximity relationships were not driven solely by Sample_082, although Sample_082 likely contributed disproportionately to the strongest brine-like signal.
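The logic of a fixed-effects proximity model with a leave-one-out sensitivity check can be sketched in a few lines. This is a deliberately simplified stand-in, not the model specification used in the study: it uses a single categorical fixed effect (a hypothetical topographic class), estimates only the slope by removing group means, omits significance testing, and runs on synthetic data.

```python
from collections import defaultdict

def within_slope(records, x_key, y_key, group_key):
    """Slope of y on x after removing group means (one-way fixed effects)."""
    groups = defaultdict(list)
    for r in records:
        groups[r[group_key]].append(r)
    sxy = sxx = 0.0
    for members in groups.values():
        x_bar = sum(r[x_key] for r in members) / len(members)
        y_bar = sum(r[y_key] for r in members) / len(members)
        for r in members:
            sxy += (r[x_key] - x_bar) * (r[y_key] - y_bar)
            sxx += (r[x_key] - x_bar) ** 2
    return sxy / sxx

# Synthetic records, NOT study data: log10 Na falls by 0.2 per km from a
# spill, with a different baseline for each hypothetical topographic class.
baseline = {"valley": 1.8, "slope": 1.5, "ridge": 1.2}
records = [
    {"id": f"{topo}_{d}", "topo": topo, "dist_km": d, "log_na": b - 0.2 * d}
    for topo, b in baseline.items()
    for d in (0.5, 1.0, 2.0, 3.0)
]
# One anomalous near-spill sample playing the role of an influential outlier.
records.append({"id": "outlier", "topo": "valley", "dist_km": 0.3, "log_na": 3.0})

full = within_slope(records, "dist_km", "log_na", "topo")
trimmed = within_slope(
    [r for r in records if r["id"] != "outlier"], "dist_km", "log_na", "topo"
)
print(f"slope with outlier:    {full:.3f} per km")
print(f"slope without outlier: {trimmed:.3f} per km")
```

A negative slope that survives removal of the anomalous record, while becoming weaker, mirrors the qualitative pattern reported for the sensitivity analysis: the relationship is not driven solely by one sample, but that sample strengthens it.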
The most anomalous of our samples is Sample_082, collected near spill B, which exhibits distinctly elevated salinity and brine-associated constituents relative to both nearby controls and co-located historical data. Its ionic composition, Cl/Br ratio, and Sr isotopic signature indicate a non-negligible contribution from Appalachian Basin brine or produced water (Fig. 2B and C). This geochemical fingerprint is unique within the 2024 SWPA dataset and is not observed at other spill sites, underscoring the highly localized nature of this anomaly.
Comparison with historical SN data sheds some light on the interpretation. The 2024 SWPA sample collected at this site exhibits markedly higher specific conductance, Na, Cl, Ba, and Br and lower SO4 relative to both the co-located SN sample and the median of nearby SN observations, indicating a site-specific increase in brine-associated constituents over time. The decrease in SO4 is also consistent with the low solubility of barite (BaSO4), which produces inverse behavior for Ba and SO4 when Ba-containing brines contaminate aquifers.39 While the chemistry is consistent with either a natural ABB or produced water source, the change over time is more consistent with an explanation involving a localized influence from a produced-water (brine) release.
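The inverse behavior of Ba and SO4 follows from the barite solubility product: at saturation, the product of the Ba and SO4 activities is fixed, so higher Ba permits less SO4. The sketch below illustrates this under loud simplifications: concentrations are used in place of activities (a poor approximation at brine ionic strengths), the log Ksp of −9.97 is an assumed approximate value near 25 °C, and the Ba concentrations are hypothetical.

```python
LOG_KSP_BARITE = -9.97  # assumed approximate value near 25 degrees C
M_BA = 137.33           # molar mass of Ba, g/mol
M_SO4 = 96.06           # molar mass of SO4, g/mol

def so4_ceiling_mg_per_l(ba_mg_per_l):
    """Sulfate (mg/L) at barite saturation for a given dissolved Ba (mg/L).

    Simplification: concentrations stand in for activities.
    """
    ba_mol = ba_mg_per_l / 1000.0 / M_BA
    so4_mol = 10.0 ** LOG_KSP_BARITE / ba_mol
    return so4_mol * M_SO4 * 1000.0

for ba in (0.1, 1.0, 10.0):  # hypothetical Ba concentrations, mg/L
    so4 = so4_ceiling_mg_per_l(ba)
    print(f"Ba = {ba:5.1f} mg/L -> SO4 at saturation ~ {so4:7.2f} mg/L")
```

Each tenfold increase in Ba lowers the sulfate ceiling tenfold in this idealized picture; a real assessment would use a speciation code with activity corrections.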
However, the available data do not allow definitive attribution of the brine source impacting this sample without more detailed site-specific investigation. For example, natural Appalachian Basin brine cannot be fully excluded given that natural brines are known to have impacted waters in SWPA40 and temporal changes in water chemistry are relatively common. Furthermore, contamination from spill B may be unlikely given the small volume released (50–100 gallons), the reportedly prompt response, and the ∼2.4 km distance between Sample_082 and the spill location. Other explanations are also possible: at least 10 active COGD wells and associated infrastructure (e.g., compressor stations) are present within 2 km of Sample_082.
Nonetheless, to explore possible sources of contamination, the surface drainage areas for Sample_082 and all other samples in the 2024 dataset were delineated using the “Watershed” tool in ArcGIS Pro. The drainage area for Sample_082 was small (0.01 km2), as were the drainage areas for all the samples in the 2024 dataset (maximum area of 1.4 km2). For scale, when each watershed is treated as a circle of equivalent area, the largest radius is approximately 670 m (a diameter of about 1.3 km). None of the drainage areas, including the area for Sample_082, contains an impoundment or spill location. This implies that, if produced waters (brines) explain the chemistry of Sample_082 (or any of the samples in the 2024 dataset), migration likely occurs along deeper groundwater flow paths rather than surficial pathways. Such deeper migration could be facilitated by valley-focused flow convergence and favorable hydrogeologic connectivity such as fractures. This interpretation is consistent with the geologic setting at spill B, which overlies the Waynesburg Formation, a clastic bedrock unit with extremely low primary porosity because the pore spaces are filled with calcareous or siliceous cement.41 Given these characteristics, it is possible that the local aquifer system at spill B exhibits the same fracture-dominated flow regime observed at spill A (Text S5), and that local fractures serve as preferential pathways that facilitate both vertical migration and lateral transport of the brine-associated constituents.
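The equivalent-circle conversion used to put the drainage areas in perspective is simple geometry: a circle of area A has radius sqrt(A/π). The sketch below applies it to the two areas quoted in the text; the function name is hypothetical, and treating a watershed as a circle is only a scale argument, not a description of its actual shape.

```python
import math

def equivalent_circle(area_km2):
    """Radius and diameter (km) of a circle with the given area (km^2)."""
    radius = math.sqrt(area_km2 / math.pi)
    return radius, 2 * radius

for label, area in [("Sample_082", 0.01), ("largest in dataset", 1.4)]:
    r, d = equivalent_circle(area)
    print(f"{label}: area = {area} km2, "
          f"radius = {r * 1000:.0f} m, diameter = {d * 1000:.0f} m")
```

For the 1.4 km2 maximum, the equivalent radius comes out near 0.67 km, which is small compared with the kilometer-scale distances between several samples and their associated spill locations.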
The strong heterogeneity of geology, topography, land use, and legacy infrastructure in this region also complicates both statistical inference and the definition of appropriate control samples. In settings where lithology, redox conditions, agricultural inputs, road salting, coal mining, and conventional oil and gas development co-vary over short distances, background groundwater chemistry can vary as much as or more than any potential UOGD-related signal. As a result, no universal sample size guarantees detection of subtle effects, and control samples are inherently imperfect. This heterogeneity increases the sample size required to resolve small regional trends and reduces the power of broad spatial comparisons, particularly when contamination events are rare and spatially constrained.
In this study, only one of the five spills examined in detail (spill B) is associated with a groundwater sample exhibiting a clear brine-like geochemical anomaly (Sample_082). Our groundwater samples associated with the other four spills (A, C, D, and E) show no evidence of persistent salinity, brine-associated constituents, or systematic deviation from nearby historical or background conditions. This result is notable for spill A, because samples collected by the state regulator (PADEP) during the same period as our sampling, but 500 m closer to the spill, did show evidence of brine contamination. This pattern indicates that persistent impacts were not detected at most of the spill/brownfield sites examined here, and that when impacts were detected, they appeared highly localized and strongly conditioned by site-specific hydrogeologic setting rather than spill occurrence alone.
In this sense, the present study extends prior field-based work by using previously identified chloride hotspots and documented spill or impoundment locations as a targeted sampling framework for evaluating whether regional signals identified in large datasets correspond to persistent local groundwater impacts. Our findings underscore the complementary roles of regional datasets and targeted, hydrogeology-informed field investigations in evaluating groundwater impacts associated with UOGD. Large datasets are essential for identifying subtle regional patterns and for flagging areas of potential concern, but they cannot resolve rare, spatially constrained impacts. Targeted, hydrogeology-informed sampling is required to directly test hypotheses generated from regional analyses and to detect localized anomalies when they occur. In heterogeneous settings such as southwestern Pennsylvania, a tiered approach—regional screening followed by focused field investigation—provides a useful framework for distinguishing widespread trends from isolated, site-specific impacts.
Although no regional signal was detected, one of the five spill sites examined in detail showed a localized anomaly: a single groundwater sample with elevated salinity and brine-like signatures relative to nearby controls and co-located historical data, suggesting increased brine influence over time. Samples from the other four sites show no evidence of persistent salinity or enrichment in brine-associated constituents, indicating that persistent groundwater contamination from surface releases was uncommon and highly localized among the spill/brownfield sites examined here. The isolated anomaly observed at Sample_082 is consistent with localized influence from produced water or natural Appalachian Basin brine migrating along deeper or intermediate flow paths; however, definitive attribution to a specific source or event is not possible based on the available data.
Together, these results directly address the central questions motivating this study. We find no evidence that regional brine-related contamination signals identified during the early period of UOGD persist under current regulatory and operational conditions, nor that previously identified chloride hotspots remain detectably enriched relative to controls after controlling the false discovery rate (FDR). With respect to localized impacts, only one of five examined spill sites exhibited a clear brine-like geochemical anomaly, indicating that persistent groundwater contamination from surface releases is rare and highly localized. Exceedances of enforceable primary MCLs were limited. Other exceedances, including those of screening thresholds such as the provisional Li threshold, occurred across multiple sample categories and were largely unrelated to proximity to UOGD features, suggesting that present-day human health concerns associated with groundwater contamination in this region are limited and site-specific rather than regional in scale.
These results also reinforce that, within our targeted sample set, detectable groundwater impacts associated with UOGD-related features are rare, highly localized, and strongly conditioned by site-specific hydrogeologic setting rather than proximity alone. Large regional datasets are essential for identifying subtle patterns and flagging areas of potential concern, but targeted, hydrogeology-informed field investigations are required to detect and characterize localized impacts when they occur. A tiered monitoring strategy that combines regional screening with focused sampling in hydrogeologically vulnerable settings provides an effective framework for distinguishing isolated, site-specific contamination from widespread trends.
This journal is © The Royal Society of Chemistry 2026