Open Access Article
Andrew J.
Sumner
a and
Desiree L.
Plata
*ab
aDepartment of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520, USA
bDepartment of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA. E-mail: dplata@mit.edu
First published on 10th February 2020
Hydraulically fractured wells with horizontal drilling (HDHF) accounted for 69% of all oil and gas wells drilled and 670
000 of the 977
000 producing wells in 2016. However, only 238 flowback and produced water samples have been analyzed to date for specific organic chemicals. To aid the development of predictive tools, we constructed a database combining additive disclosure reports and physicochemical conditions at respective well sites with the goal of making synthesized analyses accessible. As proof-of-concept, we used this database to evaluate transformation pathways through two case studies: (1) a filter-based approach for flagging high-likelihood halogenation sites according to experimental criteria (e.g., for a model compound, cinnamaldehyde) and (2) a semi-quantitative, regionally comparative trihalomethane formation model that leverages an empirically derived equation. Study (1) highlighted 173 wells with high cinnamaldehyde halogenation likelihood based on combined criteria related to subsurface conditions and oxidant additive usage. Study (2) found that trihalomethane formation in certain wells within five specific basins may exceed regulatory limits for drinking water based on reaction-favorable subsurface conditions, albeit with wide uncertainty. While experimentation improves our understanding of subsurface reaction pathways, this database has immediate applications for informing environmental monitors and engineers about potential transformation products in residual fluids, guiding well operators' decisions to avoid unwanted transformations. In the future, we envision more robust components incorporating transformation, transport, toxicity, and other physicochemical parameters to predict subsurface interactions and flowback composition.
Environmental significanceOf the 670 000 active wells that leverage horizontal drilling with hydraulic fracturing in the United States, specific organic chemical measurements for only 238 flowback waters exist in the public domain. We built a geospatially resolved database that unifies well-specific geochemical parameters and chemical additive disclosures to prioritize future measurements, flag regions of concern or critical treatment needs, and avoid unwanted subsurface chemistries a priori. Leveraging existing transformation models that draw on these inputs, we demonstrate this framework's utility to highlight qualitative and quantitative chemical transformation estimates. While an important first step, the vision for this database is as a predictive tool that serves industry and regulatory sectors seeking to improve the safety and sustainability of the natural gas extraction industry.
|
This small number of analyses have included inorganic11–17 and organic2,18–25 components of flowback water, but the vast well-to-well diversity (spatially and temporally13,21) and comparably complex suites of disclosed additives26,27 pose massive hurdles for complete characterization at the field's current pace. In particular, a high degree of uncertainty remains regarding the nature of anthropogenic additives themselves and their capacities to persist or transform in the subsurface.28 While we acknowledge that organic chemical transformations represent only a fraction of the total risk presented by HDHF activities (e.g., induced seismicity,29 fugitive gas emissions,30,31 and increased traffic32,33 and socioeconomic impacts34,35), organic chemical transformations are among the most difficult to anticipate, monitor, and rectify. Industrial ingredient reports have been disclosed through FracFocus,36 a publicly available online database, and by the United States Congress.37 Generally, these chemicals serve a wide array of purposes to optimize hydraulic fracturing operations,26 from (1) controlling the chemical environment within wells to prevent degradation (e.g., biocides and scaling inhibitors) to (2) altering the fluid's physicochemical properties in situ.26,27 Equipped with an understanding of subsurface conditions,38 researchers have simulated the shale well parameter space in laboratory systems to interrogate and characterize subsurface reaction pathways, demonstrating conditions or additives that can either mitigate or encourage transformation. Analyses of cinnamaldehyde (a corrosion inhibitor),39 glutaraldehyde (a biocide),40 and polyacrylamide (a friction reducing polymer)41 have elucidated problematic halogenation, agglomeration, and degradation pathways, respectively. However, without predictive well models these results serve only to explain past detections in the field, ultimately a reactive response to an emerging issue.
To move the field towards predictive mitigation, modeling tools are needed to anticipate transformations ahead of additive usage or wastewater management. So far, a number of modeling tools exist to predict chemical interactions in environmental systems, including EPI Suite42 for structure-based physicochemical parameter estimation, enviPath43 for biotransformation pathway prediction, or the EPA's Chemical Transformation Simulator.44–46 However, these models do not cover all relevant functional groups or transformation types and cannot account for complex mixtures, let alone the parameter ranges of hydraulically fractured shale wells. As researchers continue to investigate the unique chemistry encouraged by the shale well parameter space, evaluation tools are important to relate laboratory simulations back to wells in the field.
In this study, we appended comprehensive well condition data to existing FracFocus disclosure reports, allowing for unified filtering and searching by ingredient disclosures and corresponding subsurface parameters on a well-specific basis. To illustrate its utility, we applied this modified database to two case studies across disclosed well reports: (1) assessing the impact and opportunity for halogenation39 using cinnamaldehyde as an example, and (2) predicting the formation of trihalomethanes47 from disinfection-like halogenation pathways. Both of these studies leveraged qualitative or quantitative criteria prescribed from laboratory results and equations and linked them to FracFocus chemical disclosures. While we note critically that these models cannot be validated en masse currently, their utility in a sample-limited space is clear. By demonstrating regions of high incidence and estimating product formations based on well conditions, we have provided a framework that can inform a sampling strategy for eventual model verification. In addition, the qualitative and quantitative evaluations are useful for environmental monitors and well operators to anticipate or mitigate HDHF additive transformations.
Prior to integration of any data sets, we performed quality control measures. From the historical FracFocus reporting, we removed all entries without a Chemical Abstracts Service (CAS) registry number, given that the primary purpose of this dataset was chemical usage and identification at reported wells (approximately 3% of entries). Furthermore, due to our case-study focus on some of the most common oxidative breakers (e.g. hypochlorite, chlorite, and persulfate salts) for analysis, we corrected “Ingredient Name” typographical errors linked to these compounds by (1) searching for “breaker” across all report “Purpose” entries (as entered by operators at the time of disclosure), (2) inspecting and manually correcting the resulting list of misspelled species, and then (3) applying those corrections to the entire database. Similarly, we filtered the NGDS database for wells containing valid “bottom hole temperature” entries, omitting those that lacked such entries. While the USGS Produced Waters Database also contained incomplete entries for variables of interest, we did not remove these, as multiple parameters could be utilized with regional spatial resolution.
000, 692, and 10 mg L−1 Cl−, Br−, and I−, respectively) while the median well's bottom hole temperature is low (37 °C). In contrast, median halide concentrations in the Permian Basin are relatively low (54
000, 230, and 9 mg L−1 Cl−, Br−, and I−, respectively) where median temperatures are high (91 °C). Thus, while conditions certainly vary from well to well, the values of nearby wells and/or the conditions of wells in the same region can provide valuable approximations.
Our approximation approach was parameter dependent due to variability in distribution and density of available data: from (i) the use of exact ingredient disclosure data, to (ii) estimation of certain subsurface conditions values using proximally-located wells, (iii) basin-wide averages when subsurface condition data was most sparse, or (iv) some hybrid of these approaches. For example, within the USGS dataset of 114
943 produced water samples, chloride values are reported in 108
646 of those (94.5%), whereas bromide and iodide values are reported in only 6548 (5.7%) and 3659 (3.2%) samples, respectively. Based on this availability, missing chloride values were added to FracFocus wells by finding the ten closest USGS wells according to calculated coordinate distances (via reported latitudes and longitudes in both datasets) and recording the average of chloride concentrations in those ten nearest wells. This method was also used to add temperature data from the NGDS database. However, this method failed in cases where the ten wells closest to a FracFocus location lacked the necessary data available (e.g., for bromide of iodide concentrations). In these cases, we averaged basin-wide bromide and iodide concentrations data instead. Notably, DOC measurements were rare (303 of USGS wells and primarily from only the Appalachian Basin). For pH measurements, data was relatively complete (86
630 wells, 75.4% of USGS data), so a hybrid approach was taken where basin-wide averaging was employed only when the ten nearest USGS wells lacked sufficient data. For all conditions, standard deviation values were also added to wells based on the respective condition source's data sample (i.e., the ten nearest wells or basin-wide data) to capture variability. Data sources and sample sizes by parameter are summarized in Table 1.
a Reflects valid Chemical Abstract Services (CAS) registry numbers, from original 4 190 137 disclosures.
b Water quality and composition measurements were extracted from the United States Geological Society (USGS) Produced Waters Database.48
c Well temperature measurements came from the National Geothermal Data System.49
|
||||||
|---|---|---|---|---|---|---|
| FracFocus | Ingredient report (with CASa) | Unique well uploads | ||||
| n, disclosures | 4 065 763a |
111 221 |
||||
| USGSb | Total | Cl− | Br− | I− | pH | DOC |
| n, samples | 114 943 |
108 646 |
6548 | 3659 | 86 630 |
307 |
| NGDSc | Total | Temperature | ||||
| n, wells | 964 521 |
133 183 |
||||
Here, we note that FracFocus includes 107
144 unique well uploads from the 670
000 HDHF wells across the United States. The model presented here includes only those wells that are indexed in FracFocus, but could be extended to include all wells where the user provides the chemical additive list for a specific well. Note that the entire available geospatial dataset was presented in Sumner & Plata 2018a.38
000, 500, and 25 mg L−1 chloride, bromide, and iodide, respectively), plus high values of persulfate breaker disclosed (i.e., masses in the top quartile of disclosures for this oxidant). For comparison purposes, we also highlighted wells where the reaction criteria were met except for high oxidant usage. In a sense, this last categorization demonstrates the opportunities for operator decision-making to mitigate transformation irrespective of well conditions.
![]() | (1) |
For this analysis, we selected wells where sodium hypochlorite or sodium chlorite were disclosed as breakers. As chlorine-based oxidants, these compounds are directly applicable to disinfecting agents60,61 but have also been shown to form THMs when applied as breakers to guar-crosslinked fracturing fluids.50 We focused this analysis around five different regions with the greatest disclosure rates of these breakers for comparison purposes: the Williston Basin, the Permian Basin, the Gulf Coast Basin, the Denver Basin, and the Anadarko Basin. Inputs included the appended pH, temperature, and bromide concentrations for each well location. When choosing a timescale to model, we opted to bridge the differing scales between expected reaction time from the model's source study (6–168 h)47 and the wide-ranging, potentially months-long, shut in times as flowback fluids return to the surface, as consistent with the temporal analyses by Rosenblum et al.13,21 and Stringfellow and Camarillo.62 This factor is further complicated by the existence of “shut-in” times, where the well is closed and water is not allowed to return to the surface for some period of time that can be widely variable. To deal with this uncertainty, we chose a time of 7 days (168 hours) for initial simulations and extended this value to 28 days (672 hours; extrapolated outside the bounds of the original model by Hong et al.) in subsequent sensitivity analyses. Thus, while the time presents the greatest uncertainty and heterogeneity across the operation space, the results are presented such that the relative effect can be gauged by model users (or specified by an operator who knows the actual time constraint). To calculate an effective chlorine dosage, we adjusted disclosed masses of sodium hypochlorite and sodium chlorite by converting according to their molecular weights and oxidation capacities to Cl2 (based on per mole half-reaction stoichiometry):
![]() | (2) |
To simulate error and incorporate the variability of subsurface conditions at each well in our model, we employed a stochastic approach. Well-by-well, we used the respective appended mean and standard deviation values to construct normal distributions (n = 10) reflecting each well's subsurface condition variability. Values less than zero for which negative numbers have no meaning were forced to zero. From those distributions, input selection was randomized to generate 50 simulated THM calculations for each well. From these 50 values, we calculated a “predicted THM concentration” mean and standard deviation for the well.
Finally, we performed a sensitivity analysis to investigate the influences of each input parameter to modeled THM formation. This analysis was conducted by re-modelling all wells (i.e., using the stochastic approach described above) while fixing all but the test variable model input. For each test input, we incrementally varied the individual parameter across relevant ranges (from their nationwide values) and recorded the subsequent change in predicted THM formation. Specifically, T was varied from 50 to 100 °C by 2 °C, pH was varied from 5.5 to 8.5 by 0.1, Br was varied from 40 to 1000 mg L−1 by 40 mg L−1, t was varied from 12 to 672 hours (four weeks) by 12 hours, and (Cl2/DOC) was varied from 0.02 to 1 by 0.02 (a unitless ratio of diatomic chlorine to carbon concentration).
190
037 chemical disclosures, 4
065
763 of which contained valid CAS registry numbers. Those disclosures included 2880 unique CAS registry numbers and 107
144 distinct well sites (of the 127
781 total on FracFocus). For context, of the 2880 unique CAS numbers, only 1200 appeared on the NIH CACTUS library64 of chemical structures, suggesting that many of the disclosed chemicals are either (1) rare and relatively unstudied, (2) proprietary, or (3) erroneous entries. We note that FracFocus does not cover all well operations, as the Energy Information Agency estimated that around 670
000 hydraulically fractured wells were producing in 2016 (ref. 1) (i.e., FracFocus reports data on approximately 16% of wells). The unified database enables searching across historical FracFocus disclosures and filtering by criteria such as ingredient usage and concentration, water quality parameters, and subsurface well conditions. In this study, we present two case studies to demonstrate the database's utility as a framework for evaluating the existing scope and future opportunity for chemical transformations in the shale well subsurface. Future work would enable one to fully develop and host this database to enable public usage and encourage other researchers in the field to bolster to its predictive capacities with more experimental results and modeling components.
175 out of 107
144 distinct wells (13.2%). To isolate wells in which reactive halogen species might form and add to cinnamaldehyde (a probe chemical for other α-unsaturated aldehydes), we screened for wells with subsurface conditions that met or exceeded the experimental “requirements” determined in Sumner and Plata:39 Cl− and Br− concentrations of at least 50
000 and 500 mg L−, respectively, and a temperature of at least 60 °C. Of the total cinnamaldehyde wells, 1726 met those reaction criteria, notably in the Williston (n = 494), Anadarko (220), Permian (513), and Gulf Coast (344) basins (Fig. 1), which tend to have elevated temperatures. These higher temperatures can activate persulfate (disclosed as a “breaker”) to sulfate radical, which is capable of oxidizing concentrated halides to reactive species that can go on to halogenate other additives.39,50,65–67 Thus, we classified co-disclosures of persulfate at the same site (n = 6352) as being of higher halogenation forming potentials. Finally, to isolate the wells with the highest likelihood of halogenation, we highlighted wells with disclosed persulfate usage in the top 25th percentile of reported concentrations and satisfied the other reaction conditions. We note that this persulfate threshold is not a precise requirement for transformation and that persulfate availability is also a function of competing species within the fluid;50,66,68 nevertheless, we viewed this final criterion as a conservative estimate for flagging wells with the greatest opportunities for halogenation.
Notably, 173 wells passed each of these filters: 75 in the Williston Basin, 12 in the Anadarko Basin, 74 in the Permian Basin, and 8 in the Gulf Coast Basin. Interestingly, in regions like the Appalachian Basin where cinnamaldehyde disclosure was high (n = 1600), zero wells were flagged for the halogenation pathway, as none of them met all of the subsurface conditions needed to promote that reaction due to relatively low well temperatures in that region. This highlights the danger of applying sweeping generalizations to flowback water chemistry. Furthermore, while a large subset of the cinnamaldehyde wells in the Gulf Coast basin met the subsurface condition criteria due to high well temperatures and elevated salinities (344 out of 1871; 18%), just 8 of those 344 (2.3%) wells had sufficient persulfate usage to merit designation as a likely transformation site. In contrast, 75 of the 494 (15%) criteria-matching wells in the Williston Basin and 74 of the 513 (14%) criteria-matching wells in the Permian Basin met the persulfate concentration threshold, indicative of more aggressive breaker usage by operators in the Bakken, Barnett-Woodford, and Avalon Bone-Spring Shales of North Dakota and west Texas.
Ultimately, the undesirable cinnamaldehyde halogenation pathway requires the coincidence of encouraging subsurface conditions along with certain high levels of breaker usage by the well operator. By highlighting wells with halogenation promoting conditions, this database and analysis can caution against aggressive oxidant usage at (or near) those sites of transformation product concern in order to mitigate hazardous product formation. Furthermore, environmental monitors and industrial operators screening for such contaminants can be made aware of the potential occurrence of halogenated species when managing wastewaters or inspecting nearby spill sites based on our conservative well “flagging”, saving time and cost associated with comprehensive analysis. By narrowing the potential transformation sites down from tens-of-thousands to less than two hundred, we have made the scale of such site analyses or sampling programs for verification markedly more feasible. Finally, we identify utility for such a well-specific predictive tool, especially in a highly complex and heterogeneous reaction parameter space, and this tool may help industry avoid a degree of trial-and-error experimentation when optimizing their own oil and gas production as a function of chemical additive choice.
For context, the “Maximum Contaminant Level” (MCL) of total THM as regulated by the Safe Drinking Water Act (SDWA) is 80 μg L−1. Across all 12
894 modeled wells, the median predicted THM concentration was 60.9 μg L−1 (Fig. 2). Within the highlighted basins, the predicted concentrations varied slightly: the Anadarko Basin had a median predicted concentration of 56.3 μg L−1 while the Denver Basin had a median predicted concentration of 69.0 μg L−1. However, even while these two basins held the largest gap between median concentrations, that difference is not significant (p > 0.05) due to their wide tailing distributions. More interestingly, 2243 wells were predicted to exceed the THMtot MCL of 80 μg L−1. Of note across the five highlighted basins: 166 wells in the Anadarko, 298 in the Denver, 666 in the Gulf Coast, 196 in the Permian, and 441 in the Williston Basin had mean THM values that exceeded the MCL. Again, we emphasize that these calculated concentrations are not precise predictors, considering the model was developed in the context of river-water disinfection rather than subsurface oxidation of fracturing fluids. Nonetheless, this model accounts for many of the important chemical parameters and governing principles identified in Sumner and Plata,39 and the application here can provide reasonable first estimates for comparing THM formation risk regionally.
To capture the influence of input value selection on THM prediction outputs and probe the effects of each input to our model, we performed a sensitivity analysis (Fig. 2b). That is, while our model already accounts for variability of input mean, we performed an exercise to determine how sensitive the basin-wide outputs were to changes in those respective input values. For each parameter, we modeled the resulting nation-wide median and standard deviations for the incremental parameter values across their relevant ranges, as determined by existing parameter reviews38 (Fig. 3).
Individual sensitivities generally reflected a parameter's relative exponentials in the model equation, yet the magnitude of effect on final predicted THMtot also depended on the total variability in the selected parameter range. Across the tested temperature range, the median predicted well concentration varied linearly by about 20 μg L−1: 49.5 vs. 70.5 μg L−1 at 50 and 100 °C, respectively. From pH 5.5 to 8.5, the median predicted well concentration spanned a larger linear range from 44.4 to 92.7 μg L−1, indicating a higher response to predicted THM formation due to higher pH, consistent with the model equation's relatively high exponent (i.e., pH1.695) and with the disinfection chemistry literature.61,69,70 The remaining model input parameters (time, bromide concentration, and the ratio of chlorine dosage to DOC) resulted in logarithmic predicted THMtot responses (Fig. 3); i.e. initial parameter increases corresponded to greater increases in THMtot but the responses diminished toward the latter end of the ranges. From t = 12 to t = 192 hours (8 days), the median predicted well concentration roughly doubled from 39.7 to 81.4 μg L−1. Similar trends resulted from the dosage/DOC and bromide sensitivity analyses. From Cl2/DOC = 0.02 to 0.2, the median predicted THMtot jumped from 42.2 to 66.1 μg L−1 but at Cl2/DOC = 1.0 saw about the same increase of about 24 μg L−1 to 90.2. From Br− = 40 to 360 mg L−1, the median predicted THMtot increased from 48.8 to 78.8 μg L−1 and then to 98 μg L−1 at the highest tested value of 1000 mg L−1. Overall, model outputs were most sensitive to pH and bromide levels (and clearly time, as the model is a kinetic one), whereas well temperature had a smaller effect. This information can be used to prioritize ancillary data collection at well sites; for example, Br− measurement should be encouraged as they are only collected at 5.7% of wells nationally, whereas pH measurements are more standard and widely available.
A major goal of this study was to meaningfully synthesize the wealth of publicly available data for assessing transformation models, seeking to bridge analyses from single-ingredient simulation studies to nation-wide trends suggesting transformation potential. While the presented predictions have not been verified, the model provides critical information needed to inform a rigorous evaluation. For example, the output of the model can now guide the collection of both positive and negative controls at strategic sites across the country, as well as a spread of quantitative measurements across the identified basins. This strategy is necessary given the logistical and analytical hurdles to comprehensive flowback sampling2,28 (e.g., current limitations of ad hoc strategies for flowback water compositional analysis only where sample access is granted or available). To date, we are limited to a handful of observations that confirm the presence of halogenated chemicals, including THMs, in basins where their formation was anticipated by the model.19,21,28,77
As of now, the two case studies investigated here offer just a limited vision for the scope of this evaluation tool to leverage known subsurface pathways and well parameters. While experimental capacities improve and the field characterizes more subsurface transformation pathways and physicochemical interactions, this framework provides a critical first step towards truly predictive modeling of flowback composition and environmental risk. As such, the approach presented here can and should be modified to enable a complete and multi-faceted suite of fate predictions. We envision this platform will stimulate researchers to populate the missing and necessary data. For example:
(i) Enhanced, geospatially resolved knowledge of mineralogy and organic-carbon content could enable ingredient-specific estimates for surface partitioning, governing whether additives or transformation products return to the surface with the flowback fluids.
(ii) Permeability and fracture measurements along the well horizon could inform subsurface transport phenomena and their effects on flowback interactions and composition (e.g., improved understanding of the physical mixing processes).
(ii) Knowledge of downhole-relevant microbial activity, proteins, and their transformation capabilities25,52,78 relative to subsurface conditions could improve existing biopathway models.43
(iv) Organic transformation models,44–46 modified to incorporate mixtures and adjust for extreme physical/chemical conditions could estimate product formation extents and kinetics.
(v) Toxicity measurements79–81 or estimates, exposure models,82 and population data could be appended, akin to the EPA's Risk Screening Environmental Indicators (RSEI) model,83 using FracFocus ingredient disclosures and input in lieu of the Toxic Release Inventory (TRI).
Ultimately, the scope and growth of hydraulic fracturing fluids as hazardous wastes for treatment, management, or remediation demands continued attention. The database framework presented here should serve as a foundation for more ambitious model components that can apply, in real time, emerging transformation studies to inform final flowback compositions. In turn, this information will inform better public health, economic, gas production, and treatment optimization for more sustainable development of this important energy industry.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c9em00505f |
| This journal is © The Royal Society of Chemistry 2020 |