Efstathios
Reppas-Chrysovitsinos
,
Anna
Sobek
and
Matthew
MacLeod
*
Department of Environmental Science and Analytical Chemistry (ACES), Stockholm University, SE-10691 Stockholm, Sweden. E-mail: matthew.macleod@aces.su.se
First published on 26th April 2016
Polymeric materials flowing through the technosphere are repositories of organic chemicals throughout their life cycle. Equilibrium partition ratios of organic chemicals between these materials and air (KMA) or water (KMW) are required for models of fate and transport, high-throughput exposure assessment and passive sampling. KMA and KMW have been measured for a growing number of chemical/material combinations, but significant data gaps still exist. We assembled a database of 363 KMA and 910 KMW measurements for 446 individual compounds and nearly 40 individual polymers and biopolymers, collected from 29 studies. We used the EPI Suite and ABSOLV software packages to estimate physicochemical properties of the compounds and we employed an empirical correlation based on Trouton's rule to adjust the measured KMA and KMW values to a standard reference temperature of 298 K. Then, we used a thermodynamic triangle with Henry's law constant to calculate a complete set of 1273 KMA and KMW values. Using simple linear regression, we developed a suite of single parameter linear free energy relationship (spLFER) models to estimate KMA from the EPI Suite-estimated octanol–air partition ratio (KOA) and KMW from the EPI Suite-estimated octanol–water (KOW) partition ratio. Similarly, using multiple linear regression, we developed a set of polyparameter linear free energy relationship (ppLFER) models to estimate KMA and KMW from ABSOLV-estimated Abraham solvation parameters. We explored the two LFER approaches to investigate (1) their performance in estimating partition ratios, and (2) uncertainties associated with treating all different polymers as a single “bulk” polymeric material compartment. The models we have developed are suitable for screening assessments of the tendency for organic chemicals to be emitted from materials, and for use in multimedia models of the fate of organic chemicals in the indoor environment. In screening applications we recommend that KMA and KMW be modeled as 0.06 × KOA and 0.06 × KOW respectively, with an uncertainty range of a factor of 15.
Environmental impactAssessments of exposure and human health risk associated with organic chemicals in the indoor environment depend on accurate knowledge of equilibrium partition ratios of the chemicals between polymeric materials found in the indoor environment and air and/or water. We compiled a database of 1273 measurements of partition ratios of 446 different chemicals between nearly 40 different polymeric materials and air or water. We developed empirical models to estimate partition ratios for other chemicals from octanol/water and octanol/air partition ratios, and from Abraham solvation parameters estimated from structure. Our screening-level models can be applied in prospective exposure and risk assessments for chemicals, and to guide the collection of additional data. |
There is growing interest in developing models to describe and quantify the flow of organic chemicals out of consumer products and materials and into the indoor6 and outdoor7 environment. A notable example is the ExpoCast program, in which the United States Environmental Protection Agency is currently developing rapid, high-throughput exposure models to provide human exposure estimates for thousands of chemicals.8 Exposure modeling for organic chemicals depends on accurate estimation of partition ratios of the chemicals.
Partition ratios describe the distribution of organic chemicals between two phases at equilibrium, and are often used to indicate the direction of diffusion and, generally, to quantitatively describe mass transport phenomena. Partition ratios between polymeric materials and air (KMA) or water (KMW) are required for models of fate and transport of organic chemicals in the indoor environment, e.g. houses, workplaces and vehicles, that have been developed to describe exposure pathways to humans and explain variability of indoor concentrations.6 In recent years there has been increasing interest in measuring partition ratios of organic chemicals for polymers such as polyurethane and polydimethylsiloxane that are present in the indoor environment and also are used as passive samplers of air and water by environmental analytical chemists. Thus, KMA and KMW have been measured for a growing number of substances and materials, but significant data gaps still exist to reliably quantify these partition ratios for the myriad of organic chemicals and polymers that humans might encounter in a typical day.
Single parameter and polyparameter linear free energy relationships (spLFERs and ppLFERs) are semi-empirical modeling approaches that can be applied to estimate partition ratios where measurements are not available. Models based on spLFERs are correlations between the logarithms of two partition ratios that have one phase in common. Octanol has often been used as a surrogate for organic phases in spLFERs, for example in widely used models that estimate partition ratios between organic carbon and water (KOC) from the octanol–water partition ratio (KOW),9 and that estimate partition ratios between aerosols and air from the octanol–air partition ratio (KOA).10
The spLFER approach relies on the embedded assumption that the free energies of transfer of chemicals between phases in the two partition ratios in the correlation are linearly related.11 The ppLFER approach is based on the less stringent assumption that the free energy of transfer between different phases can be quantified as the linear sum of energetic contributions from different types of molecular interactions of the chemical with the phases.11 A common form of ppLFER is based on five empirical chemical descriptors that represent a chemical's ability to participate in molecular interactions through hydrogen bonding acidity (A) and basicity (B), van der Waals dispersive interactions described by the logarithm of the hexadecane–air partition ratio (L) and polarity/polarizability (S), and McGowan characteristic volume (V), which accounts for energy costs to form cavities in solvents to accommodate the chemical.12 Well-calibrated models based on ppLFERs are in general expected to perform better than equally well-calibrated spLFERs, as they account for a diversity of types of molecular interactions, and they have more degrees of freedom than single parameter models.13 There are some cases, however, where the performance of ppLFERs and spLFERs is similar and the two methods can be used interchangeably.14 In any application, the information about molecular interactions and the potential for higher accuracy offered by ppLFERs must be weighed against the additional requirements for data compared to spLFERs.
Our goal in this paper is to develop a suite of alternative semi-empirical LFER models for estimating KMA and KMW, and to evaluate the performance of these models and make recommendations about their utility in different applications such as indoor models of fate and transport, selection of materials for passive sampling, and high-throughput exposure estimation. We assembled a database of 363 KMA and 910 KMW measurements for 446 individual compounds, along with their estimated physicochemical properties. The database includes measurements for nearly 40 individual polymeric materials, collected from 29 studies. Our hypothesis was that spLFER models would be more suitable for screening-level applications where (i) the polymer is treated as a bulk substance or (ii) the chemical and/or polymer are poorly characterized. Accordingly, we hypothesized that ppLFER models developed for better characterized cases of substances and materials would perform better than spLFERs in estimating KMA and KMW.
After screening the initial set of 45 peer-reviewed journal articles according to the criteria above, data from 29 articles (ref. 15–43) were included in our database.
Finally, we achieved a uniform set of “dimensionless” partition ratios by employing material densities to convert the units of partition ratios that were expressed on a volume/mass basis (e.g. mL g−1) to a volume/volume basis. In cases where information on the densities of the polymers was not reported in the studies, we used an average material density value from manufacturers' websites. Densities used in these calculations are included in the spreadsheet that is part of the ESI.†
log![]() ![]() | (1a) |
log![]() ![]() | (1b) |
log![]() | (2a) |
log![]() | (2b) |
The phase descriptors aPQ, bPQ, lPQ, sPQ and vPQ describe the differences between any two phases (P, Q) with regard to the same intermolecular interactions, and ci is the constant of the multiple linear regression (eqn (2a) and (2b)). The molecular descriptors (A, B, L, S and V) of the chemicals are listed in ESI.†
We present models for partition ratios to six categories of polymeric materials that are defined based on the 2D chemical structures of their corresponding monomers: polyethylene, polyurethane, polydimethylsiloxane, carbohydrates, polyoxymethylene and nylons. 3D structural differences among the materials associated with properties such as degree of crystallinity, density, porosity and permeability were not taken into account in classifying the polymers into different models. However, the form of our models implicitly assumes that absorption, rather than adsorption, is the dominant partitioning process. Absorptive processes occur between a volume of polymer and a volume of air or water. Therefore we expect differences in the 3D structure among polymers to be reflected in the regression parameters determined when we fit our models. As shown in other studies,47–49 the constants in both LFER approaches are partially determined by the fraction of the matrix that is available/accessible for sorption.
We also developed KMA and KMW models for a generic “bulk material”, which is based on combining all the KMA or KMW measurements in the dataset, regardless of the type of material to which the chemicals partition. The generic bulk material models include more measurements than the sum of the measurements used to develop the six categories of polymeric materials since there are polymers in the database such as polystyrene, acrylate polymer, vinyl, polyethylene terephthalate and polyhydroxyethylmethacrylate32 that are not included in any of the six categories listed above, and for which there were only a few measurements, and materials such as kitchen towel23 for which the 2D chemical structure of the polymeric material was not known. In total, the generic bulk material models contain 93 measurements that are not used to develop models in any of the six material-specific cases.
We calculated the correlation coefficient (R2) and the root mean squared error (RMSE) for each model to quantify uncertainties, assess the relative performance and make recommendations about possible applications for estimating KMA and KMW. For model development and outlier detection we employed the Python Statsmodels package. The outlier detection method we used is based on the adjusted Bonferroni p-values for the studentized residuals.
When gi = 1, eqn (1a) and (1b) can be written as
KMA = 10h1[×/÷10RMSE] × KOA | (3a) |
KMW = 10h2[×/÷10RMSE] × KOW | (3b) |
We expect the spLFER models of KMA and KMW of the same material (i.e.eqn (1a) and (1b)) to have the same intercept when we force the slope to 1, i.e. we expect h1 = h2. This expectation is a result of our data curation process since the value of either of the two partition ratios, KMA or KMW, is dependent on the other, measured value (i.e. KMW or KMA) through a thermodynamic triangle. A deviation between the intercepts of the material–air and a material–water spLFER models for a given material would indicate problems with the application of Trouton's rule or with the calculation of consistent partition coefficients in the thermodynamic triangle made up of KMW, KMA and the dimensionless Henry's law constant, KAW.
The error introduced in the spLFER models by forcing the slope to 1 is accounted for by the RMSE value of the model, which reflects the total uncertainty of each model.
Fig. 1 depicts the 446 compounds of the database in a chemical partitioning space defined by logKOA and log
KAW. The database contains chemicals spanning about 20 orders of magnitude of KAW and KOA and about 10 orders of magnitude of KOW, including hydrophobic (log
KOW > 6), volatile (log
KAW > 2), and highly water soluble (log
KAW < −8) chemicals. The range of chemical space represented in the training sets of semi-empirical models largely determines the applicability domain of the models. A broader domain of applicability renders the prospective power of the models stronger by expanding their range of applicability. Based on the 1273 measurements of the database, we derived a total of 28 different models to estimate KMA or KMW, as described below.
![]() | ||
Fig. 2 Linear free energy relationship models to estimate material–air (KMA) and material–water (KMW) partition ratios developed using all measurements in the database regardless of material type. |
Fig. 2 shows the models developed for the bulk material case, Fig. 3 shows models for the material-specific cases of polyethylene, polyurethane, polydimethylsiloxane and carbohydrates, where the number of observation (n) is over 100, and Fig. 4 shows models for polyoxymethylene and nylon where n is less than 100. This selective presentation of plots is based on a rule of thumb50 according to which multiple linear regression works in the most meaningful way for 10–20 observations per model parameter, including the regression constant. In the case of ppLFER models there are six parameters: the five Abraham solvation parameters and the regression constant. Thus models with n ≥ 100 meet the minimum data requirements for ppLFERs more strictly than models with n < 100. Measurements identified as outliers in each model are indicated in red, and a list of the compounds identified as outliers in each of the 28 models can be found in ESI.†
Our secondary hypothesis was that the ppLFERs we developed would outperform spLFERs in the material-specific models where the polymers were well characterized. In many of the material-specific models illustrated in Fig. 3 and 4 the model performance metrics for the ppLFERs are indeed better than for the spLFERs. However, the differences in most cases are small. Moreover, there are a few cases (KMA for polydimethylsiloxane, polyoxymethylene and nylon) where spLFERs have lower RMSE and higher R2 values than the corresponding ppLFER models. Therefore, our second hypothesis is weakly supported, but ppLFERs are not markedly superior to spLFERs.
It is clear from theoretical considerations that ppLFER models should outperform spLFER models for specific materials and chemicals with accurate Abraham solvation parameters.13 In studies where partition ratios are measured using a consistent methodology and ppLFERs are developed based on accurately measured Abraham parameters, correlation coefficients can be 0.99 or higher.51–53 The lower correlation coefficients and comparable performance of ppLFERs and spLFERs that we observed in this study reflects limitations of our data quality and comparability that arise from our use of estimated property data and from combining studies primarily based on having a common monomer in the polymeric material.
It is notable that ppLFER models for KMW perform better than the corresponding spLFER models according to both R2 and RMSE in nearly all cases. A possible explanation for the better performance of ppLFERs compared to spLFERs for KMW is the specific H-donor and acceptor interactions between some chemicals and water. Another possible explanation is that calculating KMA through the application of the thermodynamic triangle introduced some random errors that mitigated the advantages of the ppLFER models over the spLFER models. There were initially more experimental KMW values than KMA values, so random errors introduced by the thermodynamic triangle would affect KMA more significantly.
Variability in the polymeric material itself among the various studies that form the basis of each material-specific dataset is another possible source of variability in our material-specific models that might not be well-described by the ppLFER models. A cross-sectional examination of the material-specific models implies a positive correlation between RMSE and the number of studies used to form each polymeric material group. In two material groups that consist of comparable number of observations (polyurethane, n = 273 and polydimethylsiloxane, n = 251), the ppLFER model developed for the partition ratio that was experimentally determined (KMA or KMW – without the application of the thermodynamic triangle) in the largest number of studies (polydimethylsiloxane, 11 studies, water) has higher RMSE values (0.77) than the partition ratio measured in fewer studies (polyurethane, 2 studies, air, RMSE = 0.59).
Another aspect of material variability among the different studies is the age of the material used in the experiments and the overall handling of the material prior to the experimental procedure. In aged polymers some of the covalent bonds in the polymer structure might be degraded,54 leading to a difference in the sorption capacities of an aged and a fresh polymer.
Our database was assembled using EPI Suite's estimated values of logKAW to create a thermodynamic triangle, Trouton's rule to estimate enthalpies of phase change between material and air, and the Van't Hoff equation for temperature adjustment to 298 K. Each of these three calculations contributes to the introduction of uncertainty in the models. We expect spLFER models to be more susceptible to the introduction of random errors than ppLFERs because they have fewer degrees of freedom. EPI Suite's estimations have been found to be generally consistent with estimates of other software,57 but errors introduced by the use of EPI Suite's estimated log
KAW would be difficult to detect since they are incorporated consistently through the data curation and model development. Trouton's rule is in theory limited to nonpolar chemicals and compounds that are capable of forming strong hydrogen bonds are typically exceptions to this rule. The Van't Hoff equation is based on the assumption that enthalpy change remains constant over the temperature range under consideration. Our database contains polar compounds that form hydrogen bonds. Also, the temperature range over which the partition ratios were measured in the 29 studies we used to assemble the database spans from 277 K to 373 K. However, uncertainty for most of the models developed in this study is typically a factor of 10, while the range of partition ratios spans several orders of magnitude. This result implies that both Trouton's rule and Van't Hoff equation assumptions are not sufficiently violated to introduce large errors. In all spLFER models the intercept of the KMA and KMW relationships was identical and the RMSEs were similar for models developed for the same group of materials. The internal consistency of the KMA and KMW spLFER models indicates that there were no errors introduced in the calculation of partition ratios using the thermodynamic triangle.
Our ppLFER models are based on A, B, L, S and V, which is consistent with recommended practice for environmental applications.58 The estimation of these Abraham solvation parameters by ABSOLV is a source of uncertainty for ppLFER models. In the estimation of KMA, there are cases of material-specific spLFERs such as the ones developed for polydimethylsiloxane, polyoxymethylene and nylon that have a lower RMSE than the corresponding ppLFER models. This result contradicts our initial theoretical considerations. In theory, ppLFERs are more accurate than spLFERs – especially when the chemicals and the phases to which they partition are well defined and characterized. A possible explanation for this unexpected result lies in the accuracy (or lack thereof) of ABSOLV's estimations.
Among our spLFER models, those developed for carbohydrates have the highest RMSE, and also much higher RMSE than the corresponding ppLFER models (see Fig. 3). The relatively poor performance of the spLFER models for carbohydrates led us to re-examine this group of materials and we identified a subset of partition ratios that consistently appeared in the outliers. These partition ratios were obtained from the same study and they also clearly defined a different sub-segment of polymeric materials that we initially considered as carbohydrates. This is the sub-segment of paper; it consists of imitation parchment paper, Kraft paper and cardboard. We suspect that paper has different partitioning properties than the rest of the materials that form the carbohydrates aggregation because it has typically undergone various treatments to reach its final state. It is possible that due to crystallinity,49 the cellulose in the paper is not as available for sorptive processes as in the rest of the materials in the carbohydrates group. The models for carbohydrates without the paper sub-segment, as well as the models for the paper sub-segment are included in the ESI.† When the paper sub-segment is removed from the carbohydrates group, the R2 for the spLFER for KMA rises from 0.55 to 0.77 and the RMSE falls from 1.7 to 0.96.
As discussed above, partition ratios measured in a single study of partitioning between paper and water that were included in our models for ‘carbohydrates’ were frequently identified as outliers, especially in the spLFER models. In general, we believe that studies focused on specific polymers that are not representative of the larger group in which they are collected for modeling purposes would be a source of outliers. The performance metrics for models of the carbohydrates group are markedly better when paper is excluded, which indicates paper has different characteristics than other polymeric materials in the ‘carbohydrates’ group.
Endosulfan I (alpha) is the only compound that is an outlier for two different ppLFER models for specific polymers (polydimethylsiloxane and polyethylene) measured in two different studies, and is also an outlier in both bulk material ppLFER models. In contrast, endosulfan I (alpha) is not an outlier in any spLFER model. Thus the Abraham solvation parameters estimated by ABSOLV for endosulfan I (alpha) may be suspect.
Notable outliers are bisphenol A and 4-deoxypyridoxine in the paper sub-segment, hexane in polyurethane and styrene in crystalline polystyrene. Four out of six of the compounds that contain basic groups were detected as outliers (codeine, L-methadone, naltrexone, trimetazidine), mainly in ppLFER models. This result indicates that the partition ratios of these compounds might have been measured in their disassociated form. Our inspection of the other 44 individual compounds that were identified as outliers in various models did not lead to the identification of other specific chemicals or classes of compounds that were consistently identified as outliers.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c5em00664c |
This journal is © The Royal Society of Chemistry 2016 |