Open Access Article
Giulia
Lo Dico
abc,
Álvaro Peña
Nuñez
c,
Verónica
Carcelén
c and
Maciej
Haranczyk
*a
aIMDEA Materials Institute, C/Eric Kandel 2, 28906 Getafe, Madrid, Spain. E-mail: maciej.haranczyk@imdea.org
bDepartment of Materials Science and Engineering and Chemical Engineering, Universidad Carlos III de Madrid, Getafe, Spain
cTolsa Group, Carretera de Madrid a Rivas Jarama, 35, Madrid, Spain
First published on 2nd June 2021
Natural porous materials such as nanoporous clays are used as green and low-cost adsorbents and catalysts. The key factors determining their performance in these applications are the pore morphology and surface activity, which are typically represented by properties such as specific surface area, pore volume, micropore content and pH. The latter may be modified and tuned to specific applications through material processing and/or chemical treatment. Characterization of the material, raw or processed, is typically performed experimentally, which can become costly especially in the context of tuning of the properties towards specific application requirements and needing numerous experiments. In this work, we present an application of tree-based machine learning methods trained on experimental datasets to accelerate the characterization of natural porous materials. The resulting models allow reliable prediction of the outcomes of experimental characterization of processed materials (R2 from 0.78 to 0.99) as well as identification of key factors contributing to those properties through feature importance analysis. Furthermore, the high throughput of the models enables exploration of processing parameter–property correlations and multiobjective optimization of prototype materials towards specific applications. We have applied these methodologies to pinpoint and rationalize optimal processing conditions for clays exploitable in acid catalysis. One of such identified materials was synthesized and tested revealing appreciable acid character improvement with respect to the pristine material. Specifically, it achieved 79% removal of chlorophyll-a in acid catalyzed degradation.
Nanoporous clays are a quite diverse family of materials whose morphology and chemical structure vary depending on the natural environments and deposits.16 The different structural growth patterns give rise to diverse particle aggregates among which, sepiolites and palygorskites exhibit a fibrous shape, and smectites are known to be layered nanosilicates.10Fig. 1A provides a photo of a powdered clay together with two structural models of fibrous sepiolite and lamellar smectite. The repeated inverted structure of fibrous nanoclays leads to higher surface area (125–195 m2 g−1 for palygorskites and 230–320 m2 g−1 for sepiolites) compared to smectites, which are 30–80 m2 g−1 for the Al-rich and 150–300 m2 g−1 for the Mg-rich ones.10 Moreover, clay morphology is critically affected by the pervasive substitutions in the lattice structure and by exchangeable cations affecting the layer charge and swelling properties.10,17
The structural dissimilarities between various clays are reflected in the surface reactivity, which can be approximated by Brønsted and Lewis acid–base characters.18–20 Generally, the active sites reside predominantly in the particle edges, in the hydroxylic termination of octahedral sheets and on the basal oxygen of tetrahedral sheets.21,22 Fibrous nanoclays possess higher hydroxyl density compared to smectites whose reactivity is increased by their swelling properties and cation exchange capacity (CEC).23,24 Furthermore, defects in the chemical structure may cause charge vacancies compensated by diverse ions influencing the catalytic behavior or the affinity with certain molecules.17 Such a modification of natural nanoclays may drastically change the morphology and surface activity, and in some cases it may also lead to loss of crystallinity.14,25,26Fig. 1B further illustrates this tunability by highlighting ranges of property values achieved for various raw clay minerals with such treatments. The properties of processed materials shift from the corresponding ones for pristine materials as a function of modification parameters such as the nature of an additive, the additive-to-clay ratio, additive concentration and the contact time, requiring control and optimization for achieving the desired outcomes.27 The latter is challenging due to the complexity of the parameter space as well as the cost of experimental work involved. Computer-aided approaches can provide solutions to potentially overcome these challenges.
A number of computational studies have investigated the effect of modification, lattice defects and impurity in specific material structures in the context of adsorption and catalysis.17,28 However, the wide variability and imprecise nature of (modified) nanoporous clays makes such modeling approaches, which rely on precise atomistic models, ineffective for material screening.17,29 Similarly, machine learning-supported property prediction approaches based on crystal structures such as the ones used in the discovery of zeolites and metal–organic frameworks cannot be easily adopted.1,30–32 Instead, advanced design of experiments strategies have been exploited to address process optimization by performing only a small number of experiments.33 For example, central composite design has been involved in the optimization of HCl treatment on montmorillonite,34 which explored experimental factors such as temperature, contact time, acid–clay ratio and acid concentration. Similarly, response surface optimization sheds light on the relationships among various experimental process parameters (time, acid concentration, and microwave heating power) and the specific surface area of montmorillonite.27 The results indicated the optimum parameters for the modification of a specific clay mineral, however, more sophisticated methods are needed to account for the high variability of properties in pristine materials coming from various deposits.
In this contribution, we incorporate machine learning algorithms into surrogate models of experimental characterization outcomes, and then employ the latter to accelerate the exploration of the morphological parameters and surface activity of clay-based materials achievable through processing of raw materials. We demonstrate how the throughput of the models can be harnessed in multiobjective optimization of materials. In particular, we focus on a design function relevant to acid catalysis applications and verify its utility by synthesis of the identified material. The promising acid nano-catalyzer was characterized and tested in terms of performance in the catalytic degradation of chlorophyll-a. The latter has scientific and industrial significance, e.g., in pharmaceutical applications and in the production of biofuels.35,36
The feature vector space for material representation was defined by rational selection and/or implementation of 41 descriptors. To facilitate the analysis, the descriptors were aggregated into three main families, namely, the properties of raw clays, the characteristics of the additives and the parameters describing the modification process (summarized in Table 1). The raw clay features were experimentally measured (details in Section S1, Fig. S1 and Table S1†). The descriptors for the additives were obtained from PubChem, and correspond to the acid dissociation constants (pKa), molecular weight (MW), molecular formula (C, H, O, S counts), number of hydrogen donor (H-donor) and acceptor (H-acceptor) groups, number of double and rotatable bonds, octanol–water partition coefficients (X
log
P) and topological polar surface area.37 In case a material datapoint corresponded to a raw material, a neutral value of 7 was assigned to the pKa, while 0 was given to the remaining additive descriptors. The parameters describing the modification process were defined by assigning unique ascending integer values for the type of mixing between additives and raw clays (activation), i.e. not activated, by spraying, or by mixing them into a dilute solution or a solid mixture.
| Raw clay | Additive | Modification process |
|---|---|---|
| a pH measured at 0 and after 24 h (pH0, pH24) and free acidity. b SiO2, Al2O3, MgO, CaO, Fe2O3, Na2O, K2O, TiO2 and Mn2O3 and loss by calcination. c Relative content of fibrous, planar phyllosilicates, dolomite, calcite, and quartz. | ||
| Cation exchange capacity (CEC) | pKa1 | Activation |
| pKa2 | Milling time | |
| Surface area (BET) | pKa3 | Additive/clay%g/g |
| Acid–base character (3 features)a | C, H, O, S counts | Additive (M) |
| # Double bonds | RH%g/g | |
| Chemical composition (10 binary features)b | Molecular weight | Final RH%g/g |
X log P |
Particle size | |
| Phyllosilicate composition (5 features)c | H-donor | |
| H-acceptor | ||
| Rotatable bond | ||
| Polar surface | ||
The milling time is classified with integer values from 0 to 2 for, respectively, after, before modification and double milling (before and after modification).
Then, the additive concentration in water and the amount with respect to dry clay (additive (M) and additive/clay (%g/g) respectively), as well as the starting (RH%g/g) and final moisture (final RH%g/g), and the particle size (%g/g of <45 μm particles) were introduced completing the vector space. The two datasets were normalized giving the same weighted contribution at all the information used. The dataset splitting was fixed as 85% of training data and 15% of testing data. Furthermore, we ensured that the datasets are a good representation of the material space captured by the above feature vectors, see the ESI, Section S2, Fig. S2 and S3.†
The targets are represented by the main morphological properties calculated by N2 physisorption isotherms at 77 K (details in the ESI, Section S1, Table S1 and Fig. S1†) and the surface activity which is herein assessed by measuring the pH.38,39Table 2 summarizes the morphological targets, with their corresponding abbreviations, which were experimentally assessed for every natural or processed clay-based material contained in the two datasets.
| Target class | Target | Abbreviations of the predicted targets |
|---|---|---|
| Morphology | Surface area (SA) | pSA |
| External surface area (ESA) | pESA | |
| Micropore content (Micro) | pMicro | |
| Main pore size (MS) | pMS | |
| Total pore volume (Vol) | pVol | |
| Surface activity | Free acidity (pH) | ppH |
| y_i = f(xraw_clay, xadditive, xmodification_process) | (1) |
| Hyperparameter | pSA | pESA | pVol | pMicro | pMS | ppH |
|---|---|---|---|---|---|---|
| n_estimators | 2000 | 2000 | 2000 | 2000 | 2000 | 2000 |
| min_samples_split | 2 | 4 | 3 | 3 | 4 | 3 |
| min_sample_leaf | 1 | 1 | 1 | 1 | 1 | 1 |
| max_features | 36 | 9 | 8 | 27 | 18 | 38 |
| max_depth | 100 | 900 | 900 | 600 | 300 | 900 |
| DF = pSA × pMicro−1 × ppH−1 | (2) |
The design function involves predicted properties of BET surface area (pSA), micropore content (pMicro), and pH (ppH). Sulfuric acid was chosen as a promising additive for acid modification. The search for the optimum value of the design function was performed into the entire feature vector space corresponding to the modification process features (7 descriptors). We then simplified the space to two-most important descriptors, i.e. H2SO4/clay% and RH%, while fixing the other five modification process descriptors to the average of single optimum value for the five material classes. In particular, the models were run on a set of proposed materials considering the 5 clay classes under different starting moisture contents (6–30%) and additive/clay ratios for H2SO4. The other parameters were fixed to: 16% of the final moisture, additive–clay activation being achieved using a spray diffuser and double milling (before and after the activation), and the amount of particles smaller then 45 μm was fixed to 75%. The DF improvements with respect to the starting values were calculated as (DFproc − DFraw)/DFraw × 100, where DFproc and DFraw are the design functions of the processed and raw nanomaterials, respectively. The promising acid nano-catalyzer extracted using DF was prepared and tested in the catalytic degradation of chlorophyll-a (experimental details in Section S5†).
| Assessment | pSA | pESA | pMS | pVol | pMicro | ppH |
|---|---|---|---|---|---|---|
| R 2 | 0.943 | 0.93 | 0.77 | 0.986 | 0.954 | 0.959 |
| MAE | 11 | 6.09 | 4.09 | 0.006 | 0.96 | 0.33 |
| MSE | 276 | 89.6 | 38.7 | 0.00012 | 1.6 | 0.19 |
![]() | ||
| Fig. 3 Summed up importance scores of the features which were grouped according to Table 1. | ||
Furthermore, Fig. S8† indicates that X
log
P (as an indirect measure of the affinity with the polar clay surfaces), the molecular weight with its chemical formula as well as the acid strength are key features of the additive, while the additive/clay ratio, and the starting moisture content and activation are the most important features of the modification process. Surprisingly, the final particle size makes low contribution. Normally, the surface area of natural porous solids increases significantly with the decrease of grain size; nevertheless, when a chemical treatment is involved, this parameter loses importance.45 The low contribution of the final moisture content is attributed to the sample degassing pre-treatment before recording the N2 physisorption isotherms.38 Phyllosilicate composition in terms of fibrous and planar clay content is the key descriptor in the raw material group displaying the diverse intrinsic behaviour of the two morphological classes.10,43 The other contributions mark the correlation between the target properties and the chemical structures in terms of substitutions in the theoretical layer composition as well as reversible exchangeable cations. Those substitutions may naturally occur in both octahedral and tetrahedral sheets, even if more energetically costly, complicating the assignment of their roles in the resulting properties.10,17,19
log
P, polar surface, double bond count, and hydrogen bound donor and acceptor counts) have the biggest impact on ppH. Besides that, the molecular weight of the additive participates with a high importance score due to the fact that it is not completely removed after treatments. These findings establish a competitive effect between the additive's ability to modify the surface and the tendency to cover it. As expected, the raw clay group of features is dominant in the model by virtue of the intrinsic acid–base character (pH0, pH24 and free acidity). Despite the low importance of the cation exchange capacity, the type of ions (K+, Na+) has a significant impact, indicating that these cations may interact with the modifier tuning its effectiveness. Finally, additive/clay and activation are the most correlated with the process treatment group.
![]() | ||
| Fig. 4 Effect of the starting moisture content and H2SO4/clays% on the predicted target pSA, pESA, pVol, pMicro, pMS and ppH. | ||
Five natural materials, corresponding to high-purity palygorskite, sepiolite, montmorillonite, saponite and stevensite were explored. Fig. 5 highlights the optimal improvement areas with respect to the DF of the raw material. An improvement of 125% of DF at low RH% can be noted for saponite. Although RH% plays a minor role, palygorskite and sepiolite are successfully modified at lower RH values of up to 15%. The optimal limit of additive/clay is identified as 5% for sepiolite but can be extended to 8% for palygorskite. In contrast, montmorillonites find their best conditions at high values of additive/clay. The DF of stevensite is improved until up to 6% of additive/clay, independent of the RH.
![]() | ||
| Fig. 5 Trend of design function improvements (%) with different additives/clays (%g/g) and starting moisture contents (RH%g/g). | ||
The impact of clay impurity on the design function is highlighted in Fig. S9.† Herein, the starting materials were stevensites provided by different deposits and exhibiting a diverse grade of purity, i.e. the phyllosilicate content varying from 80% (stevensite 1) to 62% (stevensite 5). The figure indicates that the DF can be generalized to the materials with impurities as the optimal regions for low-purity stevensites are located in the same regions as the high-purity ones. However, differences in the improvement scores can be appreciated following a non-linear trend with the stevensite content. In fact, in many cases natural clays present traces of other phyllosilicates that contribute to facilitating or preventing the modification. Our machine learning-supported multiobjective optimization represents an accelerated and low cost strategy for finding the optimal solutions for such impure materials. Furthermore, DF provides a rapid guide for experimentation and initial material selection by quickly identifying the optimal conditions for each considered clay. Fig. 6 displays the optimized DF values of nano-catalyzers based on five different clays. The plot identifies palygorskite as a promising starting system for the preparation of high-performing acid catalyzer.
Table S5 and Fig. S10† summarize the predicted and experimentally measured properties of P1 and P2. Overall, there is a very good agreement between the predicted and measured properties providing further validation of our models.
Similarly, the DF values calculated using the experimentally measured properties of P1 and P2 closely match the corresponding model prediction, Table S5.† Furthermore, the DF value being lower for P2 than P1 confirms that sepiolite does not get the benefit from the modification optimized for palygorskite. In fact, although the two fibrous nanoclays have similar morphologies, sepiolite exhibits a strong reduction of the available surface area together with non-acid character after the treatment.
The catalytic activities of P1 and P2 were tested in the degradation of chlorophyll-a in lipidic media following the protocol outlined in Section S5.† A generally accepted reaction mechanism involves replacing Mg2+ of the chlorophyll structure with H+ at the clay surface.49 The fragmented molecules are eliminated by the high temperature and vacuum conditions or remain entrapped by physisorption in the porous clay and removed by filtration.49,50 Generally, a desirable nano-catalyzer removes around 60–75% of the pigments depending on diverse variables (temperature, vacuum and dose).51 The performances of P1 and P2 as well as the corresponding raw materials are collected in Table 5. P1 was able to remove 78.6% of chlorophyll-a, 40% of improvement with respect to the pristine material. The remaining pigments are then removed under extreme conditions of high temperature and vacuum forcing the elimination of volatile compounds and allowing the stripping of 88% of chlorophyll-a. Additionally, the catalytical activity of P2 indicates no improvement with respect to the raw sepiolite and that the conditions in which P1 and P2 were prepared are likely non-transferable between different clay types. This observation is in favor of our approach, which allows high-throughput search for optimum conditions for various clays and cutting times and experimental costs. Furthermore, the quality of the underlying property models opens the possibility of smart design of other DFs tailored to specific applications.
| Material | Remaining chlorophyll-a (ppm) | Chlorophyll-a removal (%) |
|---|---|---|
| P1 | 0.651 | 79 |
| Raw palygorskite | 1.84 | 39 |
| P2 | 1.959 | 35.5 |
| Raw sepiolite | 1.91 | 37.1 |
Furthermore, a design function was proposed to investigate ideal material processing scenarios to achieve high performing hierarchical acidic nano-catalysts. The results suggest that the investigated raw nanoclays are not equally qualified precursors, i.e. palygorskite is preferred as a starting natural porous material. By screening through the space of important synthesis and processing parameters, we identified and then synthesized a nano-catalyzer for effective removal of chlorophyll-a from lipidic media. This palygorskite-based material achieves 79% degradation of chlorophyll-a in an acid catalyzed degradation reaction, outperforming natural minerals with 40% of improvement.
Footnote |
| † Electronic supplementary information (ESI) available: Experimental methods, hyperparameter optimization and validation. See DOI: 10.1039/d1sc00816a |
| This journal is © The Royal Society of Chemistry 2021 |