Matthew J.
Eckelman
*a,
Matthew S.
Moroney
a,
Julie B.
Zimmerman
a,
Paul T.
Anastas
a,
Eva
Thompson
b,
Paul
Scott
b,
Maryann
McKeever-Alfieri
b,
Paul F.
Cavanaugh
b and
George
Daher
b
aSustainability A to Z, LLC, Guilford, CT, USA. E-mail: matt@sustainabilityatoz.com
bThe Estée Lauder Companies, New York, NY, USA
First published on 7th December 2021
Advances in green chemistry over the past 25 years have improved sustainability in the development of new cosmetic and personal care products. Product formulators benefit from an expanding palette of “greener” natural and synthetic ingredients but need clear guidance on how to choose among options to optimize formula sustainability while also evaluating for performance. As greener can have a variety of meanings, for the purpose of this article, we define greener as being aligned with green chemistry principles. Here, we report the development of a quantitative green chemistry scoring methodology incorporating human health (HH), ecosystem health (ECO), and environmental (ENV) endpoints to specifically characterize cosmetic and personal care products. Using a hazard-based approach, a “Green Score” for cosmetic ingredients was calculated incorporating the HH, ECO, and ENV categories. Ingredient and chemical component data were obtained from manufacturers, open-source databases, or computer model estimates. There are 8 individual metrics: 3 each for HH (acute, ocular, and dermal toxicity) and ECO (bioaccumulation, persistence, and aquatic toxicity) and 2 for ENV (feedstock sourcing and greenhouse gas emissions). All metrics and data quality measures can be examined by formulators for individual endpoints, averaged by category, or further averaged to an overall Green Score. This scoring framework has been applied across ingredients and product formulations at The Estée Lauder Companies to establish Green Score baseline values, identify priority raw materials for replacement, and guide future innovation. Actual scores and statistical results are presented here at the ingredient, formula, and product subcategory levels to demonstrate the functionality of the tool as a measure of green chemistry performance.
A challenge for any organization seeking to integrate sustainability into their product development-related decision-making is the need for a standard definition for product sustainability and a comprehensive framework of metrics with which to measure progress. The field of green chemistry fortunately offers a wealth of knowledge that can be incorporated into enabling and quantifying sustainability.3 The significant progress in global green chemistry initiatives provides an expanding framework for balancing the large-scale considerations of sustainability with tangible actions and metrics that integrate those considerations into product design and business decisions across the product life cycle.2,4,5 These incremental advances are important steps for reaching specific aspects of the United Nations Sustainable Development Goals.6,7 This framework is particularly valuable for formulators within the beauty care (i.e., cosmetic) industry because of the direct application of products onto the skin and hair. Tools based in green chemistry principles can assist in harmonizing sustainability goals with the formulation design process, providing a quantitative approach to make decisions and measure progress.
A variety of bespoke tools have emerged for green chemistry and life cycle assessments for pharmaceutical and personal care products (PPCPs). Tools for green chemistry assessments of PPCPs can be categorized along 2 continuums: hazard (inherent nature of ingredients) and risk (concentration in final product) as well as cradle to cradle (life cycle) and gate to gate (internal operations). Tools within this space include those that consider a number of green chemistry principles,8–11 and help inform the design of “greener” raw materials and/or products to yield more sustainable PPCPs. Each incorporates unique environmental and toxicologic metrics, weighting, and scoring algorithms.
As the data underlying these tools evolve to be more robust and transparent, there are opportunities to improve on green chemistry measurement frameworks to account for inherent hazards, consider life cycle implications, enable the design of greener raw materials and products, and most importantly, drive innovation to more sustainable means of providing the necessary function in PPCP formulations. To be trusted, effective tools should be based on the best available evidence and be transparent in their algorithms and data sources.
Here we present a new “Green Score” tool designed for rapid assessment of PPCP ingredients and formulations based on knowledge at the fundamental molecular level, coupled with life cycle sourcing and end-of-life considerations. The 12 principles of green chemistry provide a hazard-based perspective on ingredients. While The Estée Lauder Companies (ELC) uses a risk-based safety assessment framework, our Green Score tool incorporates several of the 12 principles of green chemistry as a complement:
• Principle 4: Designing safer chemicals
• Principle 6: Design for energy efficiency
• Principle 7: Use of renewable feedstocks
• Principle 11: Real-time analysis for pollution prevention
The Green Score tool evaluates green chemistry principles and chemical hazards in 3 distinct categories: human health (HH), ecosystem health (ECO), and environmental impact (ENV). The tool has been applied to chemical components of ingredients for formulations within ELC and provides useful insights into the effect of specific ingredient choices (singly or in combination) on a product formulation's overall Green Score.
Notably, the tool includes several important features: (1) a balance between assessing inherent chemical and supply chain hazards, (2) a disincentive to use raw materials with low scores or lack of data by weighting their impact to reduce the score further, and (3) a certainty score to provide insight on the level of confidence in the Green Score for a given ingredient or chemical component.
Beyond presenting the methodology used to develop the novel Green Score tool, this report also demonstrates how results can be interpreted and applied. At the product category level, statistical analyses can provide an overall baseline and Green Score comparisons across products within a category and by product form (e.g., solid or liquid). Descriptive statistical analyses within ingredient or product groupings can also be used to identify low Green Score outliers through best Green Score performers. This can be integrated and displayed at the formulator bench level, using data dashboards allowing for rapid identification of alternative ingredients. Before and after scoring for specific reformulations allows for quantitative analyses of PPCP improvements using green chemistry–based optimization. In concert with continuing advances in green chemistry and engineering, the Green Score tool is being leveraged to prioritize innovation for individual ingredients and raw material classes.
Ingredient compositions are then adjusted to remove water, with all ingredient proportions rescaled as shown in the following equation:
Ingredient sourcing details considered are feedstock source (plant, mineral, petroleum, etc.), country of origin, and existing sustainability certifications, such as certified organic and Roundtable on Sustainable Palm Oil (RSPO). In addition, a separate GHG emissions survey is sent out to all ELC suppliers to collect data on Scopes 1 and 2 emissions (according to the GHG protocol) per kilogram of manufactured ingredient delivered to ELC.
GHG emissions are also scored as a composite metric averaged from 2 distinct data sources. Emissions performance of suppliers is represented by their reported Scopes 1 and 2 emissions per kilogram of ingredient produced, collected through supplier surveys. These data represent raw material manufacturing operations but do not account for upstream Scope 3 emissions of chemical ingredients. Emissions performance of the upstream supply chain is represented by modelling each of the 2300+ chemical ingredients using embodied cradle-to-gate GHG emissions data gathered from the ecoinvent life cycle inventory database, maintained by the Swiss Centre for Life Cycle Inventories (http://www.ecoinvent.org). GHG emissions from transporting raw materials to ELC manufacturing sites are not currently included, as production locations shift depending on demand and capacity constraints. These ingredient-based GHG emissions are then mass averaged up to the raw material level. Both GHG emissions are first recorded in absolute units of kilogram of carbon dioxide equivalents (kg CO2 eq) per kilogram of ingredient, and then rescaled to the 1 to 5 scale used for other Green Score endpoints. Because GHG emissions for chemicals can span several orders of magnitude, a logarithmic scale is applied for rescaling. Ingredients with scores that fall above a threshold of 1000 kg CO2 eq per kilogram are assigned the worst value of 1, as are ingredients from the supplier survey that are reported as having zero GHG emissions, likely indicating that the survey was not completed correctly. A conservative score of 2 is assigned if no emissions data were provided by the supplier.
Table S3† outlines the full scoring assignment rubric for the 2 ENV endpoints.
As the criteria to trigger a penalty increases, there are an increasing number of raw materials that will be penalized, particularly for the ENV endpoint, where average raw material scores tend to be higher. The effect of the penalty scheme on the overall Green Score is nonlinear as the criteria and penalty values change since the 14 formulations are composed of different raw materials at different percentages, each with their own attributes that may trigger different penalties. The selected criteria value of 1 avoids dramatic jumps associated with certain categories (e.g., 500% increase in penalties for multiple formulations in the ENV category as the criteria value increases to 2).
Penalty calculations were evaluated using 14 different formulations (7 moisturizers and 7 foundations) to simulate spread using a variety of thresholds (e.g., score of 1 or 2) and penalties (e.g., deduction of 0.1 or 0.2). The standard deviation for each simulation (i.e., penalized Green Score vector) was calculated by applying a matrix of penalties (0.0–0.25 in 0.05 increments) and thresholds (0–2 in 0.25 increments). This process was repeated separately for moisturizers and foundations. The penalty scheme of criteria ≤1 and penalty value of 0.10 per exceedance results in an increased standard deviation of ∼100% for foundations (0.24–0.5) and ∼200% for moisturizers (0.24–0.73) due to a higher number of low-scoring components in the moisturizer ingredients. For the ENV category, the scores were similar between these product categories, with foundations having a much smaller interquartile range.
Then, HH, ECO, and ENV category scores are calculated through simple averaging of the ingredient-level metrics k in each category. Finally, the overall ingredient Green Score Ij for each ingredient j is obtained by simple averaging of the category scores. For ease of interpretation by product developers, the overall Green Score is rescaled from a scale of 0 to 5 to a scale of 0 to 100 (100 being best).
Product category Green Score benchmarks are calculated by grouping all like active formulas in each product category (e.g., haircare, skincare, make-up) and subcategory (e.g., serums, waterproof mascara, conditioners, solid perfumes). The initial benchmark is set as the mean formula Green Score for that category/subcategory.
A variety of exploratory data analyses were used prior to statistical analysis in R: A language and environment for statistical computing, including visual tests (e.g., boxplots, histograms) and statistical tests. Assumptions of normality were evaluated using the Anderson-Darling test in R.14,15 A variety of transformations were applied to the Green Score vector, then reevaluated for normality, including natural log, square root, logarithm base 10, inverse, sqrt[max(x + 1) − x], log10[max(x + 1) − x], and 1/[max(x + 1) − x]. A selected statistical significance value of α = 0.05 was used for evaluation. All tested transformations failed normality tests at extremely significant P values (e.g., >10 × 10−16).
The failure of normality assumptions indicates that parametric tests, such as the t test, z score, and analysis of variance (ANOVA), should not be used to analyse the Green Score vector between grouping factors. Instead, to evaluate multiple groups, the Kruskal–Wallis test, a nonparametric equivalent of ANOVA, was used. If a significant difference between groups was found, the Dunn test of multiple pairwise comparisons, a nonparametric equivalent of the Tukey honestly significant difference test, was used.16,17
When considering Green Scores by product category (Fig. S2b†), differences in performance are observed. For example, lip care products score relatively well across endpoints, whereas haircare products tend to score well for the ENV endpoint. It is interesting to note that formulations in the haircare category have a lower mean HH score than all the other product categories. This can largely be attributed to the presence of solvents and colorants (lower-scoring ingredients) in hair dye, which affect the overall mean.
When considering product form (Fig. S2c†), it is unequivocally clear that liquids and emulsions score lower across all 3 endpoints, while sticks, solids, powders, and anhydrous products tend to score higher. Again, this can be attributed to the nature of the ingredients required for the various product forms and the need to use lower-scoring functional classes (e.g., solvents, suspending agents) to formulate liquid and emulsion product forms.
Investigating combinations of attributes is also useful. A robust approach to greener formulation would be the simultaneous consideration of a product category, such as lip care, and product form (e.g., gel, liquid, or stick; Fig. S3† and Table 1). It is interesting to note that within the lip care product category, the mean Green Score is significantly higher for sticks than for liquids and gels, suggesting that a focus on greener ingredient innovations meeting the unique technical needs of certain physical forms could be beneficial for the development of greener product lines.
Category | Product form | No. of products considered | Minimum score | Average score | Maximum score |
---|---|---|---|---|---|
Lip product | Gel | 161 | 63.9 | 71.9 | 80.5 |
Lip product | Liquid | 63 | 65.7 | 72.8 | 79.3 |
Lip product | Stick | 1356 | 55.3 | 76.3 | 85.1 |
Examining the ingredient options in Fig. 3, solid natural beeswax (row 1) received the highest score, while the wax version of the same product (row 4) received a lower score. This difference stems entirely from the ENV category, where the solid version is certified organic from a supplier with relatively low reported facility GHG emissions (and thus a higher ENV GHG score). In contrast, the wax version is not certified organic and is from a supplier with higher reported GHG emissions. This heterogeneity in classifications and Green Score values for related chemical compounds underscores the importance of using substance-specific hazard data. Exploring further how specific hazard data can influence the overall green score, synthetic beeswax (row 6) is listed by the DSL as an aquatic toxicity concern and therefore has one of the lowest scores, so is penalized in the ECO category. The other petroleum waxes, such as the microcrystalline form (rows 16–18), are faced with the challenge of a persistent classification also resulting in a lower score. Rose floral wax (row 19) contains essential oils with GHS flags that further drive a lower score relative to most other waxes.
Evaluating the data set, 10 substances with the lowest Green Scores were identified (Table 2). As a proof point for the output of the tool, it is noteworthy that of these raw materials, many are silicone and silicone-like compounds that are currently restricted for use in certain products in the European Union and Canada, while 2 are colorants that have regulatory restrictions on use in Canada. In this way, the tool can be used to elevate and prioritize raw materials based on the scoring of endpoints related to ECO, HH, and/or ENV impact.
Ingredient | Common function | Green Score | Element of Green Score | |||
---|---|---|---|---|---|---|
HH | ECO | ENV | ||||
ECO, ecosystem health; ENV, environment; HH, human health. | ||||||
1 | Cyclopentasiloxane | Emollient | 38.8 | 53.3 | 18.0 | 45.1 |
2 | Cyclopentasiloxane | Emollient | 39.3 | 56.4 | 20.8 | 40.6 |
3 | Alcohol denatured | Solvent | 39.4 | 25.3 | 45.3 | 47.5 |
4 | Cyclopentasiloxane | Emollient | 40.2 | 55.7 | 20.0 | 45.0 |
5 | Red 17 (CI 26100) | Colorant | 40.7 | 52.7 | 18.0 | 51.4 |
6 | Cyclopentasiloxane | Emollient | 41.0 | 60.3 | 20.7 | 42.0 |
7 | Phenyl trimethicone | Skin conditioning | 41.1 | 46.7 | 38.7 | 38.1 |
8 | Cyclopentasiloxane | Emollient | 42.2 | 57.0 | 21.2 | 48.3 |
9 | Zinc oxide | Skin protectant | 42.7 | 57.1 | 26.4 | 43.2 |
10 | Red 28 Lake (CI 45410) | Colorant | 43.2 | 66.4 | 16.8 | 46.4 |
In order to demonstrate the effectiveness of the Green Score tool in guiding future formulations, a case study for 3 products containing decamethylcyclopentasiloxane (D5) is presented in Table 3. This organosilicon compound has recently garnered attention because of its potential to persist and bioaccumulate in the environment.18–21 Given these concerns, many of the proposed and enacted restrictions on D5 are related to their use in wash-off product formulations. In being proactive, ELC began to reformulate wash-off products to eliminate the use of D5 and replace the functionality it provided with greener alternatives. Three products reformulated to eliminate D5, including a make-up remover, moisturizer, and liquid foundation, all yielded higher Green Scores, with score improvements ranging from 1.6 to 9.2 in absolute terms representing between 2.3% and 15.9% (Table 3). These results highlight how the Green Score tool can identify emerging chemicals of concern and guide substitution with greener alternatives.
Product type | Overview of key changes | Green Score (before) | Green Score (after) | Absolute change | Percent change |
---|---|---|---|---|---|
D5, decamethylcyclopentasiloxane. | |||||
Make-up remover | Various silicones (including D5 at 17%) replaced with combination of petroleum and plant-based emollients | 68.7 | 70.4 | 1.6 | +2.3% |
Moisturizer | Several ingredient changes made. D5 (at 5%) replaced with dimethicone (5% total) | 70.1 | 71.9 | 1.8 | +2.6% |
Liquid foundation | D5 (at 38%) replaced with alternative silicones | 57.8 | 67.0 | 9.2 | +15.9% |
Comparison | Dunn test z score | Adjusted P value | Significant |
---|---|---|---|
Mixed vs. non-petroleum | −17.16 | 2.66×10−66 | * |
Non-petroleum vs. Petroleum | 16.16 | 4.98×10−59 | * |
Mixed vs. petroleum | 1.412 | 0.079 |
Comparing across hazard categories (boxplots shown in Fig. 4), ENV endpoints demonstrated the lowest median certainty scores and among the smallest range of uncertainty compared with the ECO and HH endpoints. This is to be expected based on the scoring methodology for ECO and HH, which depends on empirically studied endpoints of inherent hazard (providing higher-quality data) from multiple data sets (providing a larger data quality range), versus the scoring methodology for ENV, which depends on self-reported raw data from suppliers and modelled results. This provides a clear indication that improved confidence in the overall Green Score could be readily achieved through a more robust process for supplier-provided data, including training for small- and medium-sized enterprises that may lack the staff to perform the necessary calculations. The presence of outliers in the boxplots for HH and ECO endpoints signals that the data quality is not universally high for these categories and should ideally be improved for some chemicals as more robust data become available.
The credibility behind the continuous improvement of formulas – as driven by the Green Score tool – can be assured, as existing and new raw materials are rigorously assessed on an ongoing basis. In addition, the approach presented here clearly indicates that certain product forms score higher than others across a variety of product categories, enabling formulators to readily focus on key innovation opportunities. With every improvement made by the Green Score, the tool itself will also be updated to further incentivize substitution by modifying default scores as well as the criteria set for penalties. By taking such a dynamic approach in evolving the tool, we can ensure that feedback loops are in place to improve scores across the entire product portfolio while staying ahead of and pre-empting reactive reformulation triggered by regulatory action.
The current framework strives for data transparency and verifiability, and so does not include all possible HH, ECO, or ENV endpoints of concern where only limited data are currently available. For the same reasons, not all of the 12 principles of green chemistry are currently accounted for in the tool. However, with improvements in testing and modelling methods, data availability, and broader regulatory review, additional HH or ECO endpoints such as endocrine disruption could be added. For supplier data, future development and standardization of supply chain reporting and frameworks may allow for inclusion of additional ENV data, such as manufacturing waste generation and use of hazardous process chemicals.
The current approach advances the organization's sustainability goals in a way that can be transparently measured, tracked, and validated. With this data-driven approach comes the opportunity to proactively guide the supply chain and strengthen green-chemistry-inspired formulation above and beyond regulations. While the Green Score will be continuously improved to incorporate new data from regulators and suppliers, the current version is a transparent and robust tool to inform formulator decision-making, communicate expectations with suppliers, and prioritize raw materials, product types, and product forms for reformulation. ELC will use the Green Score across its operations to guide future innovation for greener alternatives.
Paul Anastas, Matthew Eckelman, Matthew Moroney, and Julie Zimmerman have served as consultants to The Estée Lauder Companies.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d1gc03081g |
This journal is © The Royal Society of Chemistry 2022 |