Rationalization of liquid assisted grinding intercalation yields of organic molecules into layered double hydroxides by multivariate analysis

E. Conterosito, M. Milanesio, L. Palin and V. Gianotti*
Università del Piemonte Orientale “A. Avogadro” Dipartimento di Scienze e Innovazione Tecnologica, Viale T. Michel 11, I-15121 Alessandria, Italy. E-mail: valentina.gianotti@uniupo.it

Received 12th July 2016 , Accepted 7th November 2016

First published on 8th November 2016


Abstract

The Liquid Assisted Grinding (LAG) method for the fast and facile preparation of organic-intercalated Layered Double Hydroxide (LDH) nanocomposites allowing the production of low cost, stable and efficient functional materials, is here employed to rationalize the features of the organic compounds that most likely undergo easy intercalation. LAG method was exploited to determine in a short time which molecules can be successfully intercalated into LDH. A straightforward rationalization of the intercalation yield results was not possible since no individual feature (such as bulkiness or pKa) could alone describe the intercalation behaviour of the whole set of molecules. Therefore, Principal Component Analysis (PCA) together with the use of molecular descriptors to classify molecules, were mutuated from the chemometric approach, widely used in analytical chemistry and applied successfully, for the first time, to a novel area of materials science. A set of molecular descriptors were chosen to cover different features of the molecule (physicochemical, topological, geometrical etc.) and then screened by statistical methods to understand which descriptors affected the intercalation yield. Then PCA allowed us to highlight the presence of various mechanisms, involved in the LAG intercalation and to separate the samples along PC3 as a function of yield. Finally, the classification tree method allowed us to understand the various mechanisms of intercalation and to classify molecules in groups, related to their yield. These groups can be used to estimate the expected yield as a function of the molecular descriptors. The molecules more apt to LAG have medium–low molecular weight, high flexibility and low refractivity. Conversely large and hydrophobic molecules and, surprisingly, small but rigid molecules have a small success rate concerning LAG intercalation. The behaviour of this last class of molecules, that should be in principle easily intercalated by LAG but which was identified by the present study as a difficult case, was thus tested using two molecules and the prediction of the chemometric study was confirmed.


1 Introduction

Hydrotalcites (anionic clays) are inorganic natural or synthetic layered hydroxides containing exchangeable anions. These compounds are layered solids1 whose structure is based on metal hydroxide layers constituted by bivalent and trivalent metals, generally in a ratio M(II)/M(III) equal to 2, octahedrally coordinated to six hydroxyl groups. The trivalent cation introduces a positive charge compensated by interlayer anions. These anions are exchangeable by other inorganic or organic anions via anionic exchange or, for instance, by hydrothermal methods.1 The increasing interest for these materials is due to their numerous applications in many fields such as catalysis,2 preparation of pigments,3 removal of waste agents from water,4–8 pharmaceutical and cosmetic formulations,9,10 stabilizers and rheology modifiers for polymers,11–13 symptomatic treatment of peptic ulcer14,15 and therapy of digestive disorders,3,16 drug and biomolecule hosts.17–19 Recently, to meet the need of wide production and testing of new materials based on Layered Double Hydroxides (LDHs), a fast and easy way to obtain the intercalation of organic anions into hydrotalcite20 based on Liquid Assisted Grinding (LAG) was developed in our laboratories. In fact, LAG synthesis allows the high throughput testing of the intercalability of molecules, as the preparation of the hybrid sample can be carried out in about half an hour, almost nullifying the reaction times of days that are necessary to perform it via the commonly and widely used hydrothermal synthesis and/or ionic exchange.13,21 The applicability of the LAG method was tested in several works and the set of molecules intercalated by LAG has been enlarged including organic anions with different structural characteristics and physicochemical properties. The tested molecules were chosen from different classes of compounds that gain interesting technological properties when intercalated.22,23 In details, the intercalation of UV-absorbers and dyes is an efficient way to obtain photo-stabilization and enhancement in light management, energy transfer efficiency and tunability of the photoactive assembled LDHs systems.24–27 On the other hand, non-steroidal anti-inflammatory drugs (NSAID) gain controlled release properties and in some cases improved photostabilization, coupled to the intrinsic gastro-protective effect of hydrotalcite itself. Noteworthy, SDS and NSA (sulfonic acids/sulfates in Fig. 1) can be used to co-intercalate other larger and also neutral molecules.28–30
image file: c6ra17769g-f1.tif
Fig. 1 Chemical structure formula and short name of the studied compounds.

A total of 17 molecules were tested and seven were successfully intercalated, while a partial intercalation was observed for six of them and in four cases the intercalation did not succeed. The reasons why some molecules can be easily intercalated while for other molecules LAG is unfeasible needed to be analysed in more detail, as no one of the features (such as bulkiness or pKa) could alone explain the intercalation behaviours of the whole set of molecules. In fact, the nature and characteristics of the chemical interactions between organic species and mineral surfaces and interlayers at the molecular scale is quite complex and depends upon the characteristics of the guest molecules, layer composition, synthesis conditions and hydration.31 Moreover, the intercalation reaction itself is affected by a number of variables.32 Given the complexity of the information obtained from the experiments on intercalation, trends are very difficult to be found so we looked for a method to seek for patterns in the obtained data. Principal Component Analysis (PCA) is a pattern recognition method that is used to obtain an effective representation of the system under investigation with a lower number of variables (called Principal Components – PCs) than in the original case.33,34 The loadings (weights of the original variables along the PCs) and the scores (projection of the samples on the space given by the PCs) allow the identification of relationships between the variables and the existence of groups of samples respectively. Despite chemometric methods are widely used in analytical,35,36 organic and pharmaceutical chemistry37 and “materials informatics” has been proposed recently and its use encouraged,38–40 there are still few examples of application of multivariate methods to materials science in the literature. These are mainly devoted to the handling of large amount of data to seek structure/properties relationships,41–45 for material optimization,46 for the prediction of crystal structures47 and for crystallographic data analysis.48,49 In fact, in the field of kinetic analysis of in situ XRPD data collected during a solid-state reaction,50 PCA resulted a valid and efficient alternative of Fourier analysis methods known as Phase Sensitive Detection (PSD).51

We here present the results obtained by applying multivariate analysis on the yields obtained by LAG intercalation, in order to rationalize the LAG efficiency versus the features of the organic molecules. The aim of the present work is to derive an “index of intercalability” that can be calculated a priori for each molecule to estimate the possibility to perform the intercalation successfully. Moreover it gives an hint on how to tailor the solution used for the intercalation to achieve the best yield or to make the intercalation possible before applying other strategies, including multivariate approach.52

2 Experimental

2.1 Materials and methods

The experiments were performed using the Mg/Al_LDH_NO3 or the Zn/Al_LDH_NO3 (the more investigated at academic level and already employed in commercial formulations in polymer and drugs, as detailed in the Introduction). Since no relevant difference in the intercalation yield was observed when performing the intercalation of the same compound in both kinds of LDH, this issue was not further explored, thus focusing the investigations on the guest molecules.

Guests A,53 B, C, VG1-C2, VG1-C8,54 VG1-C10 (ref. 55) were synthesized in laboratory as described in the literature (Fig. 1). The other guests (also in Fig. 1) are commercially available and were purchased from Sigma Aldrich (Milano, Italy).

All the intercalations were obtained by adding 5 ml of NaOH 0.5 M and EtOH solutions (mixed in a 2[thin space (1/6-em)]:[thin space (1/6-em)]1 ratio) for each gram of LDH and grinding the obtained blend ex situ in a mortar or mixing in situ in a capillary as described in ref. 23 The samples intercalated in situ in a capillary were given the suffix “_c”, as detailed in Fig. 3. From our preliminary tests, the duration of the grinding does not affect the result. The grinding has mostly the role of homogenizing the mixture and favouring contact between the reactants. The grinding time was although standardized to avoid crystallinity degradation. The amount of liquid employed was optimized in previous experiments20,23,52 since we found that an excessive amount of liquid has a negative impact on the yield, probably as it induces an excessive swelling of the layers. The amount and kind of impurities generated with the LAG method are similar to that obtained performing ionic exchange in solution. The final LDH product contains impurities of sodium nitrate, derived from the removal of nitrate from the layers, and possible excess of non-intercalated organic compound. These impurities can be easily removed, washing the LDH with water to remove the soluble salts and eventually with another solvent more apt to remove eventual residues of non-intercalated organic guest. Static X-ray measurements were performed on a ThermoARL powder diffractometer XTRA, equipped with a solid state Peltier cooled detector. All powder diffraction spectra were measured in continuous mode using the following conditions: 2θ angular range 2–50°; tube power 45 kV and 40 mA, step size 0.02° 2θ.


image file: c6ra17769g-f2.tif
Fig. 2 Results of the data mining procedure to determine the significant (black) and discarded (red) variables.

image file: c6ra17769g-f3.tif
Fig. 3 Values of the molecular descriptors for the intercalated guest molecules used in the PCA analysis, highlighted by a colour scale (green = high values, red = low values). The colour of the sample names (cases) depends on their yield of intercalation (green = high yield, yellow = medium yield, red = low yield), the variable names colour depends on the category (red = topological descriptors, blue = geometrical descriptors, black = polarizability/charge and yield of intercalation).

TOPAS56 academic 4.1 was used to calculate the yield from the XRPD patterns of the samples. The peak area was calculated using a single peak fitting approach. All the XRPD patterns were already presented in previous works by some of us20,23,52,53,57,58 except those of LDH_COUM,59 LDH_FLUR ex situ, LDH_TIAP ex situ, LDH_PABA and LDH_PTA. The XRPD pattern and fit of the samples not already present in the literature are reported in the ESI file (Fig. SI2–SI4, SI9 and SI10). The ratio between the area of the basal peak of the starting nitrate LDH and that of the hybrid compound allowed to calculate the reaction yield (reported in Fig. 3) as described by Toson et al.52

Marvin60 calculator plugins, running on an Intel core i7 vPro processor, were employed to calculate the values of the molecular descriptors that, unlike for example charge and molecular weight, are not directly evaluable from the chemical structure of the compounds. All molecules were drawn in Marvin Sketch60 version 15.6.29.0 and the calculations carried out on the lowest energy conformer, calculated employing molecular mechanics force field method MMFF94 as implemented in Chemaxon plugin cxcalc.60 For the calculation of the lowest energy conformer, a strict limit was imposed together with pre-hydrogenize and hyperfine options.

All statistical treatments, Principal Component Analysis (PCA), classification tree and graphical representations were carried out by Statistica61 version 8 and Microsoft Excel (Microsoft Corporation, USA).

3 Results and discussion

3.1 Fast screening of photoactive molecules for LDH intercalation and their characterization

The set of examined guest molecules, reported in Fig. 1, comprises 13 carboxylic or di-carboxylic acids that are NSAID drugs, dyes or photoactive molecules, 3 sulfonic acids and a sulphate. Compounds with different bulkiness, molecular weight ranging from 190 g mol−1 (COUM) to 737 g mol−1 (VG1-C10), flexibility and different physicochemical properties such as solubility and acidity were chosen to test how these features can influence LAG success rate, measured by the reaction yield.

3.2 Molecular descriptors calculation

PCA best performs when working on large datasets of numerical data describing the studied system in the most detailed way as possible. For the application of PCA the molecular features (also those that cannot be directly expressed as numbers, such as steric hindrance, size, shape, flexibility, reactivity etc.) must be described by numerical values. This can be achieved using the so called “molecular descriptors” already largely employed in QSAR analysis.62 The descriptors were chosen, following criteria diffused in drug design, to cover different features of the molecule (physicochemical, topological, geometrical etc.) to describe the molecules from the view point of the geometry, bulkiness, flexibility and chemical reactivity because these are, in our opinion, the features that could affect the intercalation behaviour. We explored the variables related to lipophilicity and partition in a range of pH values to evaluate its influence on the intercalation, being the intercalation a sort of acid/base reaction between the organic molecule in acid form and the hydroxide layer. The values of molecular descriptors (Fig. SI1) for each molecule were calculated using Chemaxon calculator plugin.60 In general, the variables affected by pH were calculated in a series of pH values and at pH 13.3 that is the pH of the solution used to perform the intercalation.
List of the descriptors. The following physic-chemical and structural descriptors, grouped in categories, were considered:
Topological descriptors. Aromatic rings number (ring num.) and number of rotatable bonds (rot. bonds) were chosen to give an indication of the molecule flexibility. Refractivity index (refractivity) calculation is performed by Marvin plugin based on the atomic method proposed by Viswanadhan et al.63 and is strongly related to the volume of the molecule and to London dispersive forces.
Geometrical descriptors. van der Waals volume (volume), minimal and maximal projection area [Å2] (min. proj. area, max. proj. area), minimal and maximal projection size [Å] (min. proj. size, max. proj. size) were chosen to describe the steric hindrance of the molecule.
Chemical descriptors. Molecular weight (mol. weight) describes the bulkiness of the molecule. Solvent accessible surface area (ASA) was calculated using the radius of 1.4 Å for the solvent (water) as a probe. It was calculated on the mayor micro-species (as defined by the Marvin software) at pH 13.3 (corresponding to the reaction pH) and 7.4 (the defaults pH in Marvin) as reference.
Polarizability and charge descriptors. Only the surface of polar atoms in a molecule contributes to the polar surface area (PSA). This descriptor is calculated by Marvin plugins as described in the article by Ertl et al.64 The calculation of the PSA was performed at pH 13.3 (corresponding to the reaction pH) and sulphur atoms were included. Average polarizability (avg. mol. pol.) is the average polarizability component of the major microspecies taken at pH 7.4. Anion charge (anion ch.) is the number of negative charges on the molecule at pH 13.3. log[thin space (1/6-em)]P is the octanol/water partition coefficient, which is used in QSAR analysis and rational drug design as a measure of molecular hydrophobicity. The calculation method is based on the publication of Viswanadhan et al.63 log[thin space (1/6-em)]P is calculated by default at the pH in which molecules are neutral or at their isoelectric point if they are zwitterions. We performed also a calculation forcing the pH at 13.3 (log[thin space (1/6-em)]P pH 13.3) (corresponding to the reaction pH). Both calculations were run in presence of electrolytes (Cl 0.25 mol L−1 and Na+ K+ 0.25 mol L−1) to better approximate the reaction conditions. log[thin space (1/6-em)]D is the octanol/water distribution coefficient and was calculated at different pH values from 5 to 13.3 to consider acid, neutral and basic environments.

3.3 Rationalization of the success of the intercalation as a function of chemical descriptors by PCA multivariate analysis

The full dataset collecting the tested guest molecules, the obtained intercalation yields and 22 molecular descriptors is reported in Fig. SI1. A preliminary data mining procedure was performed on the full dataset by features selection and variable screening STATISTICA's module, to understand which variables, in Fig. SI1 are correlated to the yield (i.e. the parameter we want to control and predict). Yield, was therefore used as dependent variable to determine the significant variables, influencing more strongly the yield, to be used for the subsequent analysis. The result is reported in Fig. 2. The variables were considered significant when an F-value higher than 2.5 was obtained. Given this cut-off, standard in chemometric analysis, the first 14 molecular descriptors, in black Fig. 2, plus the yield were used for the PCA analysis. It is worth noting that pKa, a parameter in principle important in an acid (organic molecule) base (LDH) reaction, is rejected by this preliminary analysis. It can be inferred that the basic conditions of the reaction are able to deprotonate the organic molecules, irrespective of their pKa. PCA analysis was conducted on this subset, reported in detail in Fig. 3, after the auto-scaling procedure to eliminate the scale inhomogeneity.

The variance explained by the obtained PCs is reported in the scree plot displaying the eigenvalues associated with a component (or factor) in descending order versus the number of the component (Fig. 4). The first Principal Component (PC1) explains approximately the 74%, PC2 the 15%, PC3 the 4% and PC4 the 3% of the total variance contained in the original data set. Since the total variance explained by the first 4 PCs was about 97%, the successive PCs were considered not statistically significant. Fig. 5a reports the score plot of PC1 versus PC2. It is useful to obtain the graphical representation of how the tested molecules are located in the new system of coordinates (PCs) and eventually how they are grouped by the new variables. The samples are reported in different colours corresponding to the intercalation yield: in red, the compounds intercalated with low yield, in yellow those with medium yield and in green those with high yield. It is possible to highlight that there are some partial groupings of the samples and by observing the corresponding loading plot of Fig. 5b it is possible to individuate which variables are responsible for the samples grouping. In fact, the loading plot reports the projection of each variable on the new reference axes (PC1 and PC2 in this case). By observing the loading of the variables (see Fig. 5b), it can be seen that the yield is a significant variable on both PC1 and PC2. In more details, on PC1 the yield has a quite high weight and proves to be anti-correlated to all the other variables being on opposite sides with respect to the PC1 axis. Regarding PC2, yield shows large positive weights together with log[thin space (1/6-em)]P at pH 13.3 and log[thin space (1/6-em)]D at pH 5 and 7. Anion charge and ring number are anti-correlated to the above reported variables, since they show a negative high weight. It can be noted that on PC2 the geometrical descriptors, describing sterical hindrance and molecular dimensions (labelled in blue in Fig. 5b) are grouped near to the zero and so they have low weights. The two topological descriptors, ring number and rotatable bonds (labelled in red), have medium weights and are anti-correlated.


image file: c6ra17769g-f4.tif
Fig. 4 Scree plot reporting the variance (y axis) explained by each PC (x axis).

image file: c6ra17769g-f5.tif
Fig. 5 (a) Score plot reporting the samples on the PC1 vs. PC2 axes. (b) Loading plot reporting variables projection on PC1 vs. PC2. Sample color codes were used to highlight the fully intercalated ones (green) the partially intercalated (yellow) and the poorly or not intercalated (red).

In fact, rotatable bonds give flexibility to the molecule while aromatic rings are rigid. Finally, physicochemical descriptors such as log[thin space (1/6-em)]P, log[thin space (1/6-em)]D and anion charge have the highest weights on PC2. Unfortunately, the new variables PC1 and PC2 alone are not able to group the samples from the viewpoint of the yield, but it is already possible to individuate some partial separation. The most well separated group (Fig. 5a) collects the NSAID drugs (IBU, FLUR, TIAP, KET) except INDO (which has, in fact, a rather different and bulky structure) and is placed in the top right part of the graph, i.e. at positive value on PC1 and PC2. Its position on PC1 reveals, by comparison with Fig. 5b, that it is characterized by a high value of the yield and low value of all the other variables that are placed on the opposite part with respect to the PC1 axis.

On PC2 the NSAIDs are again grouped on the same side of the yield but this time also other variables (log[thin space (1/6-em)]P pH 13.3, log[thin space (1/6-em)]D pH 7 and log[thin space (1/6-em)]D pH 5) contribute to their location. Moreover on the opposite side (at high negative value on PC2) are located two variables that indicate that NSAIDs are characterized by a low value of ring number and anion charge. All the samples intercalated with lower yields are scattered instead. The VG series, grouped on the left side of Fig. 5a, is of particular interest. In fact, these compounds are all large molecules with many rings, two deprotonated groups and rather bulky. Two compounds of this series, VG1-C8 and VG1-C10, are not intercalated while the similar compound VG1-C2 is intercalated with high yield but is isolated from the other well-intercalated compounds. A similar behaviour is observed for compound A. On the contrary, according to the molecule volume, number of charges and rings PFBS, 2-NSA and COUM should be easy to intercalate while SDS should be more difficult to intercalate, but we found from the experiments that this is not true. Therefore, from the analysis of the first two PC's it is evident that different mechanisms are involved in the intercalation process. In addition, some variables could be involved with non-linear synergistic or antagonistic effects. The yield information is not the predominant one since it does not emerge in the first two PCs that, as expected, account mostly for the structural information since it is the predominant information of the data set. It is important to examine the successive PC's that collect the residual information without the overlapping of the huge, but not informative, structural data. All the score and loading plots deriving from the PCA analysis are reported in ESI (Fig. SI5–SI8) for completeness purpose. Here we describe in detail the results obtained for PC3. In fact, as evidenced in Fig. 6a that reports the score plot of PC3 versus PC2, the separation among the samples according to the intercalation yield was obtained. Three groups corresponding to the different amount of intercalation yield are well defined. The samples with high yield are all placed at negative value of PC3 and the not intercalated compounds are all placed on the opposite side on PC3 (high positive values). Fig. 6b reports the projection of the variables on the two new axes PC2 and PC3. The loading plot indicates that log[thin space (1/6-em)]P pH 13.3, log[thin space (1/6-em)]D pH 5 and log[thin space (1/6-em)]D pH 7 are variables that drive the good intercalation of the molecules. In details as the yield is at the opposite side on PC3 a high yield can be obtained when the molecules have low values of log[thin space (1/6-em)]P pH 13.3, log[thin space (1/6-em)]D pH 5 and log[thin space (1/6-em)]D pH 7. Moreover, also ASA pH 13.3 and 7.4, volume and avg. mol. pol. affect the intercalation yield. In particular, since they are on the same side of yield on PC3, an increase of the yield is expected when an increase of such variables is observed. By looking at the loading plot it can be seen that the descriptors regarding charge and polarizability (log[thin space (1/6-em)]P, log[thin space (1/6-em)]D, avg. mol. pol. and anion ch.) are the most influent in separating the scores as they are the most distant from the origin of the axes and thus have high weights on PC2 and/or PC3. Looking at the scores in Fig. 6a only one sample is out of the group, that of molecule C, which it is the only compound with anion ch. = 3 and negative log[thin space (1/6-em)]P pH 13.3. The difference in the yield of intercalation between VG1-C2 and the other two squaraines, VG1-C8 and VG1-C10, appears to be due, beside to the number of rotatable bonds, to the different log[thin space (1/6-em)]P pH 13.3, which is strongly negative for VG1-C2 and positive for VG1-C8 and VG1-C10. Affinity towards water seems to be in this case the driving force that allowed the intercalation of VG1-C2, which has shorter aliphatic chains (thus lower log[thin space (1/6-em)]D), so a better affinity with water and lower molecular weight with respect to VG1-C8 and VG1-C10.


image file: c6ra17769g-f6.tif
Fig. 6 (a) Score plot reporting the samples on the PC2 vs. PC3 axes. (b) Loading plot reporting variables projection on PC2 vs. PC3. Sample color codes were used to highlight the fully intercalated ones (green) the partially intercalated (yellow) and the poorly or not intercalated (red).

Therefore, the features that seems to drive the intercalation yields are principally the partition coefficient log[thin space (1/6-em)]P pH 13.3 and the number of charges. The moieties with two negative charges must have a negative log[thin space (1/6-em)]P pH 13.3 (hydrophilic molecules) in order to enter the layers. In particular, the molecules with two COOH groups that can be deprotonated are generally more difficult to intercalate, probably because the molecule has to “fit” the distances imposed by the charge distribution of the layers, which is quite dense. Carbonate, for example, has a double charge and it is very stable when intercalated because is a small anion and can interact with both layers but if the double charge anion is a large molecule it is much difficult to accommodate the two charges to fit the layer charges. In the case of VG1-C2 the accommodation in the layers is probably allowed by the fact that the molecule is large but quite flat and the charges are disposed symmetrically on the two ends of the molecule so it can interact with the top and bottom layers. Smaller molecules less flat and/or less symmetrical with multiple charges appear to be more difficult to be intercalated.

With the information gathered by the PCA analysis, we are able to point out that different mechanisms are involved and to obtain a first grouping of the samples based on the mechanism involved in the intercalation. Anyway, it is still difficult to discriminate how the variables involved in the different mechanisms affect the yield, to foresee it. To obtain a model that could predict the yield we performed a multivariate regression but no model has an agreement factor better than 60%. Therefore, this approach was abandoned.

Finally, we performed a data mining procedure using general classification tree models in order to try to better understand how many mechanisms are present and the variables affecting each mechanism. The result of the tree classification is shown in Fig. 7. At each level of the tree is indicated the classification variable and the threshold value. In the boxes at the end of the branches, the expected yield is reported and we added, for sake of clarity, the names of the samples falling in that branch were added. We employed the same colour scheme to highlight the yield values. The tree first separates the molecules according to the molecular weight, and then the heavy molecules are separated according to the number of rotatable bonds.


image file: c6ra17769g-f7.tif
Fig. 7 Classification tree chart. Mu is the expected yield.

The heavy flexible molecules are then separated according to the refractivity index and then by log[thin space (1/6-em)]D pH 7. In the same way, the other branches can be read in Fig. 7, leading to the six groups with their ID number indicated in the boxes in Fig. 7.

Starting from the left of the tree group 4 contains light (with mol. weight ≤ 282) and rigid (rot. bonds ≤ 2) molecules that gave low to medium yields. When we looked at the molecules contained in this group, we found they were the small molecules that surprisingly did not give good results. The next two groups (8 and 9) instead contain the molecules that achieved the highest yields. These molecules are all light, flexible and with low refractivity. One last distinction is made between those having log[thin space (1/6-em)]D pH 7 below or above 1.56 giving a predicted yield of 0.92 and 0.99 respectively. If the refractivity is high instead, as in the case of KET, the yield is lower. On the other side of the tree, the molecules with a mol. weight higher than 282 g mol−1 are separated into two groups according to their log[thin space (1/6-em)]P pH 13.3. In group 11 the molecules having a log[thin space (1/6-em)]P pH 13.3 higher than −0.5 gave very low yield. In group no. 10 instead, the molecules with a lower log[thin space (1/6-em)]P pH 13.3 gave mixed results. In this group, three molecules with more than one negative charge are contained. One with three charges and yield 0% (C), and two with two charges and medium/high yield (A and VG1-C2).

We concluded that multi-charged molecules might have a significant yield only if log[thin space (1/6-em)]P pH 13.3 is lower than −0.5. However, the case of C suggests that this class of molecules must be analysed case by case because of their high density of negative charges, when packed in a layer, which difficultly fits the fixed disposition of positive charges of LDH layers.

As a proof of principle and to confirm the unexpected results, among the various possible cases in Fig. 7 the method was used to predict feasibility of intercalation by LAG of two molecules that given their features should fall in group 4. This group, in fact, collects small molecules, in principle more easy to intercalate than bulky molecules. In fact, they request only a limited swelling of the LDH layers and are easy to pack. Conversely, following the classification tree according to their features, summarized in Table 1, a low yield is expected. p-Aminobenzoic acid (PABA) and p-toluic acid (PTA) were chosen since PABA is a small molecule easy to intercalate in LDH by standard methods,65 while PTA, to our knowledge, was never intercalated into LDH (Fig. 8).

Table 1 Values of the molecular descriptors calculated for PABA and PTA
  Refract. Mol. weight Volume Ring num. Max. proj. area Min. proj. size Rot. bonds
PABA 38.01 138.15 120.03 1 49.17 3.39 1
PTA 38.36 136.15 125.39 1 49.55 3.53 1

  Anion char. Avg. mol. pol. ASA pH 7.4 ASA pH 13.3 log[thin space (1/6-em)]P pH 13.3 log[thin space (1/6-em)]D pH 5 log[thin space (1/6-em)]D pH 7
PABA 1 13.81 286.15 286.15 −2.34 0.35 −1.8
PTA 1 14.42 318.85 318.85 −0.99 1.22 −0.84



image file: c6ra17769g-f8.tif
Fig. 8 Chemical structure formula and name of the two test molecules (PABA and PTA).

As foreseen by the classification tree, the intercalation of PABA and PTA with the standard recipe gave a low yield of 6% and 4% respectively (see XRPD pattern and fit in Fig. SI9 and SI10). Moreover, in the XRPD pattern (Fig. SI9 and SI10) the peaks of the crystalline PABA and PTA are still visible indicating scarce solubility or recrystallization outside the layers.

4 Conclusions

Different chemometric tools (PCA, Data Mining and multivariate calibration) widely employed in analytical chemistry, were applied for the first time to rationalize a preparation method in materials science, to predict the possibility of using LAG and to optimize the recipe to obtain high intercalation yields. PCA allowed shedding light on one hand on the particular features that made some molecules more prone to intercalation into LDH and on the other hand, on the reaction mechanisms involved. PCA, together with the preparation strategy proposed in Fig. 11 of ref. 52 allows rational design and fast screening of new LDH-based materials.

PCA analysis does not group all together the molecules that can be intercalated by LAG into LDHs with good yield results, suggesting that there is not a single mechanism involved in the intercalation. In fact, NSAID drugs are grouped in the cases plot suggesting a common reaction mechanism, while the others are spread suggesting different mechanisms related to their different chemical features. In fact, surprisingly pKa resulted not important in defining the yield of the intercalation, probably because in the basic conditions all molecules are deprotonated. Conversely, log[thin space (1/6-em)]P pH 13.3, log[thin space (1/6-em)]D pH 5 and log[thin space (1/6-em)]D pH 7 variables, related to a good solubility in water and easy diffusion, drive the good intercalation of the molecules. According to the classification tree (Fig. 7) the molecules where grouped into six groups showing different chemical features and different intercalation yield. These groups can be used to estimate the expected yield as a function of the molecular descriptors. The molecules most apt to be intercalated by LAG have medium-low molecular weight, high flexibility and low refractivity. Conversely, large and hydrophobic molecules and, surprisingly, small but rigid molecules have a small success rate in LAG intercalation.

The predictive ability of the method was successfully tested.

The chemometry-assisted LAG approach, applied in this article to LDH intercalation, can be easily extended to other solid-state preparations.

Moreover, by a dedicated selection of the molecular descriptors, the proposed procedure can become a useful tool also for syntheses design and optimization in Materials Science.

Notes and references

  1. Q. He, S. Yin and T. Sato, J. Phys. Chem. Solids, 2004, 65, 395–402 CrossRef CAS.
  2. F. Figueras, Top. Catal., 2004, 29, 189–196 CrossRef CAS.
  3. U. Costantino, N. Coletti, M. Nocchetti, G. G. Aloisi and F. Elisei, Langmuir, 1999, 15, 4454–4460 CrossRef CAS.
  4. D. Mohan and C. U. Pittman, J. Hazard. Mater., 2007, 142, 1–53 CrossRef CAS PubMed.
  5. V. Gianotti, M. Benzi, G. Croce, P. Frascarolo, F. Gosetti, E. Mazzucco, M. Bottaro and M. C. Gennaro, Chemosphere, 2008, 73, 1731–1736 CrossRef CAS PubMed.
  6. S. Angioi, S. Polati, M. Roz, C. Rinaudo, V. Gianotti and M. C. Gennaro, Environ. Pollut., 2005, 134, 35–43 CrossRef CAS PubMed.
  7. S. Polati, S. Angioi, V. Gianotti, F. Gosetti and M. C. Gennaro, J. Environ. Sci. Health, Part B, 2006, 41, 333–344 CrossRef CAS PubMed.
  8. S. Polati, F. Gosetti, V. Gianotti and M. C. Gennaro, J. Environ. Sci. Health, Part B, 2006, 41, 765–779 CrossRef CAS PubMed.
  9. C. Del Hoyo, Appl. Clay Sci., 2007, 36, 103–121 CrossRef CAS.
  10. M. I. Carretero and M. Pozo, Appl. Clay Sci., 2010, 47, 171–181 CrossRef CAS.
  11. X. Yang and Q. Zhang, Polym. Int., 2004, 53, 698–707 CrossRef.
  12. Y. Li, W. G. Hou and S. Shen, Colloids Surf., A, 2009, 350, 109–113 CrossRef CAS.
  13. L. A. Utracki, M. Sepehr and E. Boccaleri, Polym. Adv. Technol., 2007, 18, 1–37 CrossRef CAS.
  14. D. Mendelson and L. Mendelson, S. Afr. Med. J., 1975, 49, 1011–1014 Search PubMed.
  15. U. Costantino and M. Nocchetti, in Layered Double Hydroxides: Present and Future, ed. V. Rives, Nova Science Publishers, Inc., New York, 2001, pp. 403–404 Search PubMed.
  16. C. Aguzzi, P. Cerezo, C. Viseras and C. Caramella, Appl. Clay Sci., 2007, 36, 22–36 CrossRef CAS.
  17. H. Tamura, J. Ciba, M. Ito, T. Takeda and S. Kikkawa, Solid State Ionics, 2004, 172, 607–609 CrossRef CAS.
  18. Z. P. (Gordon) Xu and G. Q. M. G. Q. (Max) Lu, Pure Appl. Chem., 2006, 78, 1771–1779 Search PubMed.
  19. A. Burzlaff, S. Brethauer, C. Kasper, B. O. Jackisch and T. Scheper, Cytometry, Part A, 2004, 62, 65–69 CrossRef PubMed.
  20. M. Milanesio, E. Conterosito, D. Viterbo, L. Perioli and G. Croce, Cryst. Growth Des., 2010, 10, 4710–4712 CAS.
  21. U. Costantino, V. Ambrogi, M. Nocchetti and L. Perioli, Microporous Mesoporous Mater., 2008, 107, 149–160 CrossRef CAS.
  22. D. Yan, R. Gao, M. Wei, S. Li, J. Lu, D. G. Evans and X. Duan, J. Mater. Chem. C, 2013, 1, 997–1004 RSC.
  23. E. Conterosito, W. Van Beek, L. Palin, G. Croce, L. Perioli, D. Viterbo, G. Gatti, M. Milanesio and W. Van Beek, Cryst. Growth Des., 2013, 13, 1162–1169 CAS.
  24. G. Wang, S. Xu, C. Xia, D. Yan, Y. Lin and M. Wei, RSC Adv., 2015, 5, 23708–23714 RSC.
  25. D. Yan, Y. Zhao, M. Wei, R. Liang, J. Lu, D. G. Evans and X. Duan, RSC Adv., 2013, 3, 4303–4310 RSC.
  26. Y. Zhao, H. Lin, M. Chen and D. Yan, Ind. Eng. Chem. Res., 2014, 53, 3140–3147 CrossRef CAS.
  27. R. Gao and D. Yan, Chem. Sci., 2016 10.1039/c6sc03515a.
  28. E. L. Crepaldi, P. C. Pavan and J. B. Valim, Chem. Commun., 1999, 155–156 RSC.
  29. P. C. Pavan, G. D. A. Gomes and J. B. Valim, Microporous Mesoporous Mater., 1998, 21, 659–665 CrossRef CAS.
  30. P. C. Pavan, E. L. Crepaldi, G. de A. Gomes and J. B. Valim, Colloids Surf., A, 1999, 154, 399–410 CrossRef CAS.
  31. Q. Li and R. J. Kirkpatrick, Am. Mineral., 2007, 92, 397–402 CrossRef CAS.
  32. P. M. Murray, S. N. G. Tyler and J. D. Moseley, Org. Process Res. Dev., 2013, 17, 40–46 CrossRef CAS.
  33. D. L. Massart, B. G. M. Vandeginste, L. M. C. Buydens, S. De Jong, P. J. Lewi and J. Smeyers-Verbeke, Handbook of Chemometrics and Qualimetrics, Volume Part A (Data Handling in Science and Technology), Elsevier, 1998 Search PubMed.
  34. D. L. Massart, B. G. M. Vandeginste, S. M. N. Deming, Y. Michotte and L. Kaufman, Chemometrics: A Textbook (Data Handling in Science and Technology, 2), Elsevier, 1988, vol. 2 Search PubMed.
  35. C. Cassino, V. Gianotti, F. Bonello, C. Tsolakis, M. C. Cravero and D. Osella, J. Chem., 2016, 2016, 1–8 CrossRef.
  36. T. Schilirò, S. Bonetta, L. Alessandria, V. Gianotti, E. Carraro and G. Gilli, Environ. Toxicol. Pharmacol., 2015, 39, 833–844 CrossRef PubMed.
  37. S. Z. Kovacevic, L. R. Jevric, S. O. Podunavac Kuzmanovic, N. D. Kalajdzija and E. S. Loncar, Acta Chim. Slov., 2013, 60, 420–428 CAS.
  38. K. Rajan, Mater. Today, 2005, 8, 38–45 CrossRef CAS.
  39. K. Rajan, A. Rajagopalan and C. Suh, Molten Salts, 2002, 74, 2 Search PubMed.
  40. K. Rajan, in TMS/ASM Materials Informatics Workshop, Cincinnati, 2006 Search PubMed.
  41. N. Bonnet, J. Microsc., 1998, 190, 2–18 CrossRef CAS.
  42. W. J. Brouwer, J. D. Kubicki, J. O. Sofo and C. L. Giles, Condens. Matter, 2014, 1–21 Search PubMed.
  43. M. Savonnet, E. Kockrick, A. Camarata, D. Bazer-Bachi, N. Bats, V. Lecocq, C. Pinel and D. Farrusseng, New J. Chem., 2011, 35, 1892 RSC.
  44. G. Øye, J. Sjöblom and M. Stöcker, Microporous Mesoporous Mater., 2000, 34, 291–299 CrossRef.
  45. C. Ortiz, O. Eriksson and M. Klintenberg, Comput. Mater. Sci., 2009, 44, 1042–1049 CrossRef CAS.
  46. I. A. Starovoitova, V. G. Khozin, L. A. Abdrakhmanova, O. Y. Rodionova and A. L. Pomerantsev, Chemom. Intell. Lab. Syst., 2009, 97, 46–51 CrossRef CAS.
  47. C. C. Fischer, K. J. Tibbetts, D. Morgan and G. Ceder, Nat. Mater., 2006, 5, 641–646 CrossRef CAS PubMed.
  48. L. Palin, R. Caliandro, D. Viterbo and M. Milanesio, Phys. Chem. Chem. Phys., 2015, 17, 17480–17493 RSC.
  49. J. C. Burley, D. O'Hare and G. R. Williams, Anal. Methods, 2011, 3, 814–821 RSC.
  50. L. Palin, E. Conterosito, R. Caliandro, E. Boccaleri, G. Croce, S. Kumar, W. van Beek and M. Milanesio, CrystEngComm, 2016, 18, 5930–5939 RSC.
  51. D. Chernyshov, W. van Beek, H. Emerich, M. Milanesio, A. Urakawa, D. Viterbo, L. Palin and R. Caliandro, Acta Crystallogr., Sect. A: Found. Crystallogr., 2011, 67, 327–335 CrossRef CAS PubMed.
  52. V. Toson, E. Conterosito, L. Palin, E. Boccaleri, M. Milanesio and V. Gianotti, Cryst. Growth Des., 2015, 15, 5368–5374 CAS.
  53. E. Conterosito, I. Benesperi, V. Toson, D. Saccone, N. Barbero, L. Palin, C. Barolo, V. Gianotti and M. Milanesio, ChemSusChem, 2016, 9, 1279–1289 CrossRef CAS PubMed.
  54. J. Park, C. Barolo, F. Sauvage, N. Barbero, C. Benzi, P. Quagliotto, S. Coluccia, D. Di Censo, M. Grätzel, M. K. M. K. Nazeeruddin and G. Viscardi, Chem. Commun., 2012, 48, 2782–2784 RSC.
  55. L. Etgar, J. Park, C. Barolo, V. Lesnyak, S. K. Panda, P. Quagliotto, S. G. Hickey, M. K. M. K. Nazeeruddin, A. Eychmüller, G. Viscardi and M. Grätzel, RSC Adv., 2012, 2, 2748–2752 RSC.
  56. A. A. Coelho, Topas-Academic. General Profile and Structure Analysis Software for Powder Diffraction Data, version, 5, 2012 Search PubMed.
  57. E. Conterosito, G. Croce, L. Palin, C. Pagano, L. Perioli, D. Viterbo, E. Boccaleri, G. Paul and M. Milanesio, Phys. Chem. Chem. Phys., 2013, 15, 13418–13433 RSC.
  58. E. Conterosito, L. Palin, D. Antonioli, D. Viterbo, E. Mugnaioli, U. Kolb, L. Perioli, M. Milanesio and V. Gianotti, Chem.–Eur. J., 2015, 21, 14975–14986 CrossRef CAS PubMed.
  59. S. Kumar, M. Milanesio, L. Marchese and E. Boccaleri, Phys. Status Solidi A, 2009, 206, 2171–2176 CrossRef CAS.
  60. Marvin 15.6.29.0, Chemaxon, 2015, http://www.chemaxon.com Search PubMed.
  61. C. H. Weiß, AStA Advances in Statistical Analysis, 2007, 91, 339–341 CrossRef.
  62. V. Consonni and R. Todeschini, in Statistical Modelling of Molecular Descriptors in QSAR/QSPR Volume 2, ed. M. Dehmer, K. Varmuza and D. Bonchev, Wiley, 2012 Search PubMed.
  63. V. N. Viswanadhan, A. K. Ghose, G. R. Revankar and R. K. Robins, J. Chem. Inf. Comput. Sci., 1989, 29, 163–172 CrossRef CAS.
  64. P. Ertl, B. Rohde and P. Selzer, J. Med. Chem., 2000, 43, 3714–3717 CrossRef CAS PubMed.
  65. L. Perioli, V. Ambrogi, B. Bertini, M. Ricci, M. Nocchetti, L. Latterini and C. Rossi, Eur. J. Pharm. Biopharm., 2006, 62, 185–193 CrossRef CAS PubMed.

Footnote

Electronic supplementary information (ESI) available: Complete dataset, XRPD patterns and fit, complete PCA analysis. See DOI: 10.1039/c6ra17769g

This journal is © The Royal Society of Chemistry 2016
Click here to see how this site uses Cookies. View our privacy policy here.