J. S. Torrecillaa, J. Palomar*b, J. Lemusb and F. Rodrígueza
aDepartamento de Ingeniería Química, Universidad Complutense de Madrid, 28040, Madrid, Spain
bSección de Ingeniería Química, Universidad Autónoma de Madrid, Cantoblanco, 28049, Madrid, Spain. E-mail: pepe.palomar@uam.es; Fax: +34 91 497 35 6
First published on 28th October 2009
A COSMO-RS descriptor (Sσ-profile) has been used in quantitative structure–activity relationship studies (QSARs) based on a neural network for the prediction of the toxicological effect of ionic liquids (ILs) on a leukemia rat cell line (LogEC50 IPC-81) for a wide variety of compounds including imidazolium, pyridinium, ammonium, phosphonium, pyrrolidinium and quinolinium ILs. Sσ-profile is a two-dimensional quantum-chemical parameter capable of characterising the electronic structure and molecular size of cations and anions. By using a COSMO-RS descriptor for a training set of 105 compounds (96 ILs and 9 closely related salts) with known biological activities (experimental Log
EC50 IPC-81 values), a reliable neural network was designed for the systematic analysis of the influence of structural IL elements (cation side chain, head group, anion type and the presence of functional groups) on the cytotoxicity of ∼450 IL compounds. The Quantitative Structure–Activity Map (QSAM), a new concept developed here, was proposed as a valuable tool for (i) the molecular understanding of IL toxicity, by relating Log EC50 IPC-81 parameters to the electronic structure of compounds given by quantum-chemical calculations; and (ii) the sustainable design of IL products with low toxicity, by linking the chemical structure of counterions to the predictions of IL cytotoxicity in handy contour plots. As a principal contribution, quantum-chemical-based QSAM guides allow the analysis/quantification of the non-linear mixture effects of the toxicophores constituting the IL structures. Based on these favorable results, the QSAR model was applied to estimate IL cytotoxicities in order to screen commercially available compounds with comparatively low toxicities.
A few quantitative structure–activity relationships (QSARs) to predict IL toxicity have been reported in literature. Ranke et al. found a linear correlation between the toxicity of 74 ILs (based on imidazolium, pyrrolidinium, pyridinium, quinolinium, quaternary phosphonium and quaternary ammonium cations) and a empirical HPLC-derived lipophilicity parameter (correlation coefficient of estimated vs. real values, R2 = 0.78).21 A different approach by Luis et al.25 designed an algorithm based on group contribution methods to estimate the aquatic toxicity of 43 imidazolium, pyridinium and pyrrolidinium ILs (R2 = 0.92). Garcia-Lorenzo et al.26 built a QSAR model, according to the Topological Sub-Structural Molecular Design (TOP-MODE) approach, which uses graph-based molecular descriptors to predict the cytotoxicity of 15 imidazolium-derived ILs in Caco-2 cells (R2 = 0.98). Recently, Torrecilla et al.27 used the empirical formulas (elemental composition) and molecular weights of 153 ILs (ammonium, imidazolium, morpholinium, phosphonium, piperidinium, pyridinium, pyrrolidinium and quinolinium salts) to estimate their toxicity (Log EC50) in a leukemia rat cell line (IPC-81) and acetylcholinesterase (AChE) by multiple linear regression (R2 = 0.87 and 0.81, respectively) and neural network (NN) models (R2 = 0.98 and 0.97, respectively).
Molecular simulation based on physically-based models, such as quantum mechanics, molecular dynamics or COSMO-RS, have been much applied as valuable tools to predict physical and thermodynamic properties of ILs, with the main advantage of providing fundamental understanding at the molecular level.28 However, to our knowledge, these reliable methods have hardly been used for the analysis of toxicological effects of ILs. Only one early attempt by Couling et al.29 developed a QSAR model for Vibrio fischeri toxicity of 25 ILs using molecular descriptors, calculated at low semi-empirical computational level, and genetic function approximations (R2 = 0.78). In this work, we will use a recently proposed COSMO-RS molecular descriptor (Sσ-profile)30 in a non-linear neural network analysis for the prediction of IL cytotoxicities. The Sσ-profile descriptor is an a priori quantum-chemical parameter, which quantifies the distribution of the polar electronic charge of a molecular structure on the polarity σ scale, obtained from the histogram function σ-Profile given by COSMO-RS methodology.31 We have recently shown its capability to describe qualitatively and quantitatively the electronic structure and molecular size of a wide variety of ionic liquids (IL).32 Specifically, the Sσ-profile molecular descriptor has been successfully related by NN to three solvent properties of neat ILs and their mixtures (density, IL solubility in hydrocarbons and partition coefficient of aromatic compounds between aliphatic and IL solvents),33 properties which depend on the molecular size and the intermolecular interactions of the mixture. In addition, the Sσ-profile descriptor has been used in quantitative structure–property relationships (QSPR) based on NN for the prediction of polarity/polarizability scales of pure solvents and their binary and ternary mixtures.34 This study represented the first a priori computational approach for a reliable prediction of the non-ideal behavior of solvent effects in mixtures. A remarkable advantage of the Sσ-profile parameter is its additive character, since Sσ-profile values result from the contribution of the different atomic groups on a σ-range.32,33 Thus, the Sσ-profile descriptor of a pure IL or a mixture of ILs can be successfully defined as the sum of those Sσ-profile values of their independent ions.
In the current study, we will develop a QSAR model to establish the toxicological effects of ILs based on the integration of the powerful Sσ-profile descriptor of the cations and anions on NN. In a preliminary section, Sσ-profile descriptor was validated by multilinear regression (MLR) relationships to the cytotoxicity of ILs in Leukemia Rat Cell Line (Log EC50 IPC-81).35 For this analysis, the available experimental Log EC50 IPC-81 data for 86 ILs and 9 closely related salts were used,7 including 35 different cations (ammonium, imidazolium, phosphonium, pyridinium, quinolinium, and sodium) and 18 different anions. Subsequently, a neural network was designed and optimized to estimate the LogEC50 IPC-81 values of ILs as a function of a restricted number of ten Sσ-profile input values (4 for cations and 6 for anions), revealed as statistically significant descriptors in previous MLR analysis. For the development of NN model the Log
EC50 IPC-81 sample data were extended to 105 compounds (96 ILs and 9 salts), including additional cations and anions with very high and non-additive cytotoxicological effects. Then, the developed QSAR approach was used for the reliable Log
EC50 IPC-81 predictions of ∼450 ILs, obtained using 53 cations – with 6 different head groups, different side chains (from 1 to 16 carbon atoms), and functional groups (ether, hydroxyl, benzyl and amino, among others) – and 20 different anions. As a result of this study, the new concept of a Quantitative Structure–Activity Map (QSAM) was established with the following main objectives: (i) the Quantitative Electronic Structure–Activity Map (QSAM-E) approach relates the estimated Log
EC50 IPC-81 values to the molecular electronic information of IL compounds given by the COSMO-RS method. As a consequence, QSAM-E allows a deeper insight into the chemical features of toxicophore substructures and the toxicological impacts of ILs; and (ii) the Quantitative Chemical Structure–Activity Map (QSAM-C) approach relates the estimated Log
EC50 IPC-81 values to the chemical structure of both cation and anion constituting the IL. Considering the non-linear toxicophore mixture effects,22 which prevent the use of additive predictive models for IL toxicity, QSAM-C is proposed as a useful guide for the prospective design of safer ILs.
Recently, we reported an a priori computational design tool to obtain ILs with suitable processing features for specific applications.33 QSAM approaches were presented as a complementary tool to also take into account the toxicological hazards in the product design of ILs. In this sense, the developed QSAR model was finally applied in this work to screen commercially available ILs in order to select those with comparatively low cytotoxicities.
In order to guarantee the reliability of the estimations calculated by these models, the applicability domain has been evaluated selecting the compounds with cross-validated standardized residuals greater than three standard deviations.40–42 As the highest cross-validated standardized residual is 2.62 standard deviations (1-hexadecyl-3-methylimidazolium chloride), no set was considered as an outlier.40,41
The MLP model used consists of three layers (input, hidden and output), a topology widely used to deal with several problems.27,32–34 The input layer has as many nodes as Sσ-Profile used (10 input nodes) and one output neuron to estimate the LogEC50 IPC-81. The hidden neuron number (HNN) should be fixed by optimization techniques (vide infra). Given that the range of most Sσ-Profiles and the sigmoid function are between 0 and 1, the sigmoid function has been used as the MLP transfer function.38,39
The BP algorithm is based on the Bayesian Regularization (trainBR) training function. It was selected because its generalization power is higher than other training functions and avoids overfitting and overtraining when a small learning sample is used.39 Their specific parameters are learning coefficient (Lc), learning coefficient decrease, (Lcd), learning coefficient increase (Lci). The Lc parameter is similar to “h” in Newton's method (often called the Newton–Raphson method). Lcd and Lci control the value of Lc depending on the MLP model performance. To avoid overfitting of the NN model, learning and verification samples were used (vide supra) and the learning process was stopped when the error of prediction (mean square error, MSE) for the verification sample, defined by eqn (1), began to increase. A detailed description of the calculation process is described in the literature.43–46
![]() | (1) |
In eqn 1, N, yk and rk are the number of datasets of the data-base, the response of the output neuron and the corresponding real output response, respectively.
The HNN and NN parameters are optimized by an experimental design based on the Box-Wilson Central Composite Design 24 + Star Points. The experimental factors analyzed were Lc, Lcd (between 1 and 0.001) and Lci (between 2 and 100).47 Taking into account the learning sample size, the HNN range was selected between 3 and 10.48 The responses of the experimental design were the Mean Prediction Error (MPE), eqn 2, and correlation coefficient (predicted vs. real values, R2). Both indexes are easily computed and provide a good description of the predictive performance of the NN model.49 As the main objective is to have an NN which predicts results with the highest possible accuracy, the considerations taking into account to analyze the experimental design were the need to achieve the least MPE with the highest values of R2.
![]() | (2) |
![]() | ||
Fig. 1 Surface polarization charge density and σ-Profile of some representative cations (A) and anions (B) of ionic liquids. |
The polarisable surface and the σ-Profiles of the imidazolium, ammonium and quinolinium cations with a common butyl side chain are shown in Fig. 1A. As can be seen, the surface of the cations is mainly non-polar, i.e., green, with a tendency to blue-green on the more polarized fragments. The σ-Profiles were dominated by a main peak with the charge distribution around zero (−0.0082 < σ < 0.0082 e Å−2), corresponding to the aliphatic groups of the cationic alkyl chains. In addition, some unresolved peaks along with low peaks were observed at lower values than the cutoff −0.0082 e Å−2. These peaks are related to the hydrogen atoms of the aromatic ring for the case of imidazolium and quinolinium cations and those of the ammonium cation closest to its nitrogen atom. According to COSMO-RS theory,31 these cationic fragments may contribute to hydrogen bonds as donors. These examples manifest how the σ-Profile qualitatively describes the different electronic nature of cations and anions. In fact, we have demonstrated that the charge distribution area (Sσ-profile) below σ-Profile can be used as a suitable molecular descriptor of solvent properties, with the remarkable advantages of being a quantum-chemical-derived parameter defined in a restricted scale of polarity (±0.025 e Å−2 for most compounds) and presenting an additive character (the σ-Profile of a IL can be reasonably defined as sum of cation and anion contributions).33
As the first step to develop a QSAR model for establishing IL toxicity based on COSMO-RS information, multilinear regression (MLR) models were constructed for prediction of LogEC50 IPC-81 values of ILs. We initially considered the 61 levels of charge distribution [px(σ)] defining the σ-Profile of IL ionic species in the range ±0.03 e Å−2, i.e., 61 Sσ-profile values. In order to design the best MLR model, the regression model selection (RMS) analysis was used (SPSS software version 15.0.1). The RMS analysis ranked the best subsets of the introduced explanatory variables, using the criteria of best adjusted correlation coefficient and MPE for all possible linear regression models (estimated vs. real Log
EC50 IPC-81 values). Since Ranke et al.17,18 found a good correlation between the alkyl chain lengths in imidazolium-based ILs and their Log
EC50 IPC-81 cytotoxicities, we initiated the RMS analysis introducing Sσ-profile descriptors of imidazolium cations for a sample data of 60 ILs (MLR Model 1 in Table 1). The RMS analysis revealed that only three Sσ-profile of cation were needed to obtain a good MLR model [eqn 3 in Table 1], located at −0.002, −0.008 and −0.011 e Å−2. The good quality of these simple relationships was indicated by a correlation coefficient R2 = 0.90 and a standard deviation of 0.31 (these being parameters based on a logarithmic scale). The results of Log
EC50 IPC-81 predictions by eqn 3 are shown in Fig. 2A. As can be seen in Fig. 2C, the Sσ-profile descriptors used in eqn 3 were located at the peak maximum for alkyl fragment (−0.002 e Å−2) and hydrogen groups linked to head groups (−0.008 and −0.011 e Å−2). Interestingly, eqn 3 presents a negative coefficient for the two cationic Sσ-profile descriptors in the non polar range (−0.0082 < σ < 0.0082 e Å−2), i.e., the corresponding molecular fragments increase IL toxicity (lower Log
EC50 IPC-81 values). In contrast, the coefficient of the Sσ-profile descriptor corresponding to the most polarized hydrogen group (−0.011 e Å−2) presents a positive signal in eqn 3, i.e. the more acidic groups of cation contribute to the reduction of the IL toxicity. As second step on COSMO-RS descriptor validation, we performed a RMS analysis for the Log
EC50 ICP-81 values of 15 ILs with the common cation 1-butyl-3-methylimidazolium, using in this case the 61 Sσ-profile values of anions. A good MLR model (eqn 4 in Table 1) to describe anion effects was constructed using 6 statistically relevant Sσ-profile descriptors of anions, located at −0.002, 0.009, 0.011, 0.015, 0.017 and 0.019 e Å−2 on their σ-Profile (Fig. 2C). The analysis of eqn 4 indicated that higher polarized anionic fragments result in less toxic ILs. However, the presence of non-polar groups in the anion (see NTf2− in Fig. 2C) confers a toxicological effect on ILs, based on the negative coefficient in eqn 4.
The next step was to achieve a general QSAR approach to analyze both cation and anion effects on IL cytotoxicity. For this purpose, we proposed a different MLR, Model 2 (Table 1), which estimates separately the contributions of cation and anion to LogEC50 IPC-81 value. The sample data included 65 imidazolium-based ILs, which incorporated anions with considerable toxic effects such as (C2F5)3PF3)−, N(CF3)2− and Co(CO)4−.19 Firstly, the main cationic effects were described for the 65 ILs by MLR Model 1 (eqn 5 in Table 1) using 3 Sσ-profile descriptors of cations (located at −0.002, −0.008 and −0.011 e Å−2). Then, the Log
EC50 IPC-81 contribution from cation (eqn 5) was included as input in the MLR Model 2 (eqn 6 in Table 1), together with the 6 Sσ-profile descriptors of the above-mentioned anion. The predicted Log
EC50 IPC-81 values by MLR Model 2 (eqn 6) were compared with the experimental values in Fig. 2B. Finally, the scope of the proposed MLR Model 2 was extended to 95 ILs with different head groups (imidazolium, pyridinium, phosphonium, ammonium, pyrrolidinium and sodium, as reference). The RMS analysis, introducing the 61 Sσ-profile descriptor of the cation, provided an optimized MLR Model 1 (eqn 7 in Table 1) with four Sσ-profile inputs, corresponding to those used in eqn 5, plus an additional input at −0.006 e Å−2, which presented a significant statistical influence in some head groups such as quinolinium. Then, MLR Model 2 (eqn 8 in Table 1) was constructed to predict both cation and anion effects by including the Log
EC50 IPC-81 contribution from cation (eqn 7) and the 6 Sσ-profile descriptors of anions used in eqn 6.
MLR - Model 1 | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Aim/Sample | Eq. | C0 | Ccation−0.011 | Ccation−0.008 | Ccation−0.006 | Ccation−0.002 | Ccation−0.002 | Ccation0.009 | Ccation0.011 | Ccation−0.015 | Ccation−0.015 | Ccation−0.015 | R2 | σ | N |
Cation effect /Imidazolium ILs | 3 | 4.8±0.8 | 0.2±0.1 | −0.15±0.03 | — | −0.085±0.004 | — | — | — | — | — | — | 0.90 | 0.31 | 60 |
Anion effect /Imidazolium ILs | 4 | 2.3±0.1 | — | — | — | — | −0.012±0.006 | 0.015±0.006 | 0.025±0.008 | 0.047±0.009 | 0.038±0.007 | 0.047±0.009 | 0.91 | 0.19 | 15 |
Cation effect /Imidazolium ILs + toxic anions | 5 | 3.8±0.9 | 0.3±0.2 | −0.13±0.04 | — | −0.081±0.005 | — | — | — | — | — | — | 0.81 | 0.42 | 65 |
Anion effect /5 head group types of ILs | 7 | 4.9±0.4 | −0.03±0.01 | −0.12±0.03 | 0.06±0.02 | −0.077±0.004 | — | — | — | — | — | — | 0.81 | 0.43 | 95 |
MLR - Model 2 | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
a R2: Square correlation coefficient; σ = Standard deviation; N = number of IL in sample data. | |||||||||||||||
Aim/Sample | Eq. | Ccation | IPC-81cationMLR![]() ![]() | Canion−0.002 | Canion0.009 | Canion0.011 | Canion−0.015 | Canion−0.015 | Canion−0.015 | R2 | σ | N | |||
Cation + Anioneffects /Imidazolium ILs | 6 | 0.96±0.04 | Eq. 5 | −0.025±0.006 | 0.005±0.003 | 0.002±0.005 | 0.03±0.01 | 0.007±0.006 | 0.013±0.005 | 0.91 | 0.31 | 65 | |||
Cation + Anion effects / 5 head group types of ILs | 8 | 0.94±0.03 | Eq. 7 | −0.022±0.005 | 0.006±0.003 | 0.004±0.004 | 0.034±0.009 | 0.008±0.004 | 0.012±0.004 | 0.90 | 0.34 | 95 |
![]() | ||
Fig. 2 Comparison of experimental Log![]() |
As can be seen in Fig. 3A, the proposed QSAR approach provided a reasonable prediction of LogEC50 IPC-81 values for a wide variety of ILs including 6 different head groups and 19 types of anions. In addition, MLR Model 2 (eqn 8) can be used for a simplistic estimation of cation and anion contribution to IL toxicity. Thus, Fig. 3B showed that the dominant contribution of cation to IL toxicity increases with the side chain length, in agreement with experimental evidence.17,18 On the other hand, current results indicate that most of anions (as Cl− or HSO4−) contribute to the reduction of the IL toxicity, whereas other anions (such as (C2F5)3PF3−, N(CF3)2− or Co(CO)4−) were described as strong toxicophores by eqn 8. Similarly, Fig. 3C presented the effect of head group (with common butyl side) on toxicity for some representative IL examples. Quinolinium and pyrrolidinium head groups exhibited, respectively, the highest and lowest hazardous effects, whereas imidazolium and pyridinium cations presented intermediate toxicities, in line with experimental trends with small anions [Experimental Log
EC50 IPC-81 values: BquinBr (2.32) < BmimBr (3.43) < BpyrrBr (3.77)].7 However, the QSAR approach proposed by MLR Model 2 presents significant limitations to predict accurately IL cytotoxicity. For example, it failed to describe the non-linear behaviour of Log
EC50 IPC-81 values with the alkyl chain length for highly hydrophobic ILs.21 In addition, the observed non-additive toxicophore effects for different cation and anion mixtures19,23 cannot be described by the QSAR model based on MLR approximation. Therefore, a main aim of this work was the generation of a non-linear model based on NN for the prediction of Log
EC50 IPC-81 values of ILs using 4 Sσ-profile descriptors of cation and 6 Sσ-profile descriptors of anions. The sample data contained 105 different compounds, including ILs with very large cations (TetraDmim+ or P66614+) or anions (BBDB−). Following the aforementioned optimization process, the NN model was designed using the learning sample. The optimum values of HNN, Lc, Lcd and Lci were 9, 0.001, 1 and 2 respectively (see Table 2). Then, using these parameter values, the model was verified. In this process, the model was tested against a verification sample that had not been included in the neural network learning sample. Fig. 4A compared the Log
EC50 IPC-81 estimations for learning and verification samples to the experimental values. The correlation coefficient R2 was obtained higher than 0.996 and MPE and σ were less than 0.47% and 0.12, respectively (these being statistical parameters based on a logarithmic scale of Log
EC50 IPC-81). Finally, in order to carry out an external validation process of the optimized model,40,50,51 a new validation sample based on experimental data available in literature was employed. Taking into account that the external validation sample range must be within the learning sample range and belong to the same application domain (vide supra), the datasets used in the external validation of the MLP model were selected. The mathematical procedure followed was similar to the verification process described above. As can be seen in Fig. 4B, the MLP model presents an acceptable goodness of fit, with R2 > 0.96 and MPE < 5.7%. In terms of external explained variance (Q2ext), given that this parameter is close to 1 (Q2ext >0.94), the NN model is stable and predictive.41,50Fig. 4C reported the residuals from the NN model. Given that the R2 between residuals and Log
EC50 IPC-81 values is less than 0.07, no mathematical dependence between them can be found. Therefore, in the light of the statistical results, the non-linear NN model (MLP) was adequate to estimate Log
EC50 IPC-81 values of ILs, by only using 10 Sσ-profile descriptor values of counterion components, which are derived solely from quantum-chemical COSMO-RS calculations. The diversity of cations (>40) and anions (>20) used to design the NN model, together with the high variability of Log
EC50 IPC-81 values (between −0.19 and 4.30), guaranteed the generality of the developed non-linear QSAR model based on NN, validating the a priori Sσ-profile parameter for the description of IL cytotoxicities.
Parameters | |
---|---|
Transfer function | Sigmoid |
Input nodes number | 10 |
Hidden neurons number | 9 |
Output neuron number | 1 |
Learning coefficient, Lc | 0.001 |
Learning coefficient decrease, Lcd | 1 |
Learning coefficient increase, Lci | 2 |
![]() | ||
Fig. 3 (A) Comparison of experimental Log![]() ![]() |
![]() | ||
Fig. 4 Comparison of experimental Log![]() |
![]() | ||
Fig. 5 (A) 3D Graph of σ-Profile for the 1-alkyl-3-methylimidazolium cation series; (B) Quantitative Electronic Structure–Activity Map (QSAM-E) of the alkyl chain effect for the 1-alkyl-3-methylimidazolium tetrafluoroborate series; and (C) QSAM-E of the head-group effect for IL series with a common BF4− anion and butyl side chain. Experimental and estimated Log![]() |
![]() | ||
Fig. 6 QSAM-E of the anion effect for the 1-butyl-3-methylimidazolium series. Experimental and estimated Log![]() |
These results were basically consistent with a previous QSAR model to establish IL toxicity mainly driven by the lipophilicity of the compound.19,22 However, the authors reported some cases with significant deviations from the general linear trend between cytotoxicity and lipophilicity: (i) ILs containing anions with clear intrinsic toxicities (as NTf2− or BBDB−); (ii) ILs based on quinolinium or 4-(dimethylamino)pyridinium cations; and (iii) ILs containing very lipophilic cations with very long alkyl chains. In this work, the lipophilicity parameter logk0, derived from reverse-phase gradient HPLC retention times,21 was found to be directly related to the Sσ-profile descriptor of the cation located at the non-polar region σ = −0.002 e Å−2, as it is shown in Fig. 7A for 32 ILs including 7 different head groups. The amount of charge estimated by Sσ-profile (−0.002 e Å−2) was mainly provided by aliphatic fragments of the structure. This result was relevant because it indicated that the lipophilicity estimations by the log
k0 parameter of ILs would be assigned to those aliphatic fragments of the cation. As a consequence, the deviations from the lipophilicity–cytotoxicity relationship19–23 must be attributed to the interactions of other toxicophore fragments with biological systems, which seem to be underestimated by lipophilicity parameter log
k0 in HPLC. For example, QSAM-E in Fig. 7B demonstrated that the different behaviour evidenced for the quinolinium head group21 must be assigned to the toxicological effects of the aromatic fragment, which presents a strongly delocalized charge in the negative region of low polarity (−0.0082 < σ < −0.002 e Å−2). In addition, the greater toxicity of 4-(dimethylamino)pyridinium with respect to the imidazolium head group22,23 can be ascribed not only to the higher amount of non-polar groups at −0.002 e Å−2 region but also to the presence of polarized charge density at the σ region between −0.0082 and −0.002 e Å−2. For its part, the intrinsic cytotoxic effects of NTf2−, compared to the reference chloride anion,19,22 must be ascribed to the presence of non-polar groups in its structure (Fig. 7C and 7D). On the other hand, the observed significant reduction of imidazolium IL toxicity by the introduction of an oxygenated side chain23 was explained by QSAM-E on the basis of the charge shift of cation structure towards more polarized regions (Fig. 7E), resulting in less toxicophore features.
![]() | ||
Fig. 7 (A) Comparison of the values for Sσ-profile descriptor of cations at −0.002 e Å−2 against lipophilicity parameter log![]() ![]() |
As noted in the previous section, the current QSAR study confirmed non-additive mixture effects of toxicophores constituting the IL structures. This is shown in Fig. 8, where the general agreement between the experimental and NN estimated values of LogEC50 IPC-81 of ILs contrasted with the results of MLR Model 2. Thus, the increasing side chain lengths generally imply a decrease of Log
EC50 IPC-81 values of ILs (higher toxicity), being the non-linear effects due to the well-known biological cut-off effect correctly described by NN model. It is noteworthy, however, that the scope of the alkyl chain effect is strongly modified by the anion nature (Fig. 8A). For example, (C2F5)3PF3− clearly behaves as stronger toxicophore than Cl− (the latter being used as a non-toxic and bio-compatible reference for anions52) for IL with short cationic alkyl chains;however, it presents the opposite behaviour with very long side chains. Mixture effects were also found to be drastically different for different head groups (Fig. 8B). Thus, the contribution to toxicity of different anions is significantly modified by the cationic head group (see the case of CF3SO3− in Fig. 8B). In conclusion, current results show the difficulty of assigning an intrinsic toxicity to an anion, since its toxicophore behaviour is heavily dependent on cationic structure. In other words, a reliable additive treatment based on the intrinsic quantification of cation and anion toxicities is no longer suitable. As a suitable alternative, we proposed the QSAM-C approach to consider the IL cytotoxicity in terms of mixture toxicity. QSAM-C relates the chemical structures of both cation and anion of ILs to the Log
EC50 IPC-81 estimations in a handy contour plot, as shown Fig. 9 for the families of ILs based on imidazolium (A), pyridinium (B), ammonium (C) and phosphonium (D). The QSAM-C graphics allow a visual quantitative evaluation of IL cytotoxicity in function of the non-linear cation and anion interactions. Firstly, it was clear that the three main structural elements of an IL (cation side chain, cation head group and the kind of anion) have a significant impact on its toxicity, even when the influence of cationic moiety was found to dominate. Thus, the range of Log
EC50 IPC-81 for a commonly considered low toxic type of IL, such as 1-alkyl-3-methylimidazolium methylsulfate,22 spans from 3.9 to 1.2 by increasing the alkyl side in 7 carbon atoms (Fig. 9A). The non-linear mixture effects were also evident: ILs with intermediate polar anions, such as CF3SO3− or DCN−, presented the highest (0.6) and lowest (3.9) toxicity for, respectively, the longest and shortest alkyl imidazolium chains (Fig. 9A). On the other hand, a small anion such as Cl− seemed to increase its toxicological effects with the length of alkyl chains, whereas a large anion such as BBDB− presented stronger toxicophore effects in low toxic imidazolium or pyridinium cations, i.e., cations with short side chains (Fig. 9A and 9B). As a general contribution, current QSAM-C revealed that the ammonium-based ILs present lower cytotoxicity than those based on imidazolium, pyridinium or phosphonium head groups, particularly those synthesized with short alkyl chains and alkylsulfate anions (estimated Log
EC50 IPC-81 values ∼4.2).
![]() | ||
Fig. 8 Non-additive toxicophore effects described by NN estimated (black symbols), MLR estimated (grey symbols) and experimentally available (white symbols) Log![]() |
![]() | ||
Fig. 9 Quantitative Chemical Structure–Activity Map (QSAM-C) of IL cytotoxicity for (A) 1-alkyl-3-methylimidazolium; (B) 1-alkylpyridinium; (C) tetraalkylammonium and (D) tetraalkylphosphonium series. Estimated Log![]() |
Finally, in order to carry out a useful application, the developed QSAR model was easily used in this work to filter environmentally benign ILs by screening the commercially available compounds, using the criteria of low IL cytotoxicity given in Fig. 6 (LogEC50 IPC-81 > 3.4). Table 3 reports 30 ILs with comparatively low hazardous effects, together with their CAS reference number. In fact, 10 commercial ILs in Table 3 were estimated to have lower toxicity than 1-ethyl-3-methylimidazolium ethylsulfate, a compound used in the literature as a reference for low IL toxicity. It should be noted, however, that the examined commercial ILs still have cytotoxicities higher than those of common solvents [Log
EC50 IPC-81 of methyl tert-butyl ether (5.6), methanol (6.2) or acetone (6.8)].17 As a result, further QSAM analysis, based on quantum-chemical data, is currently being performed to select the toxicologically favourable structural elements for designing more sustainable ILs.
CAS | Cation | Anion | Log![]() | |
---|---|---|---|---|
Estimated | Experimental | |||
a Ionic liquid in the Io-Li-Tec catalogue, presented as new without CAS number | ||||
479500-35-1 | Bpyrr+ | Cl− | 4.30 | — |
35895-69-3 | E4N+ | CF3SO3− | 4.19 | — |
145022-44-2 | Emim+ | CF3SO3− | 4.08 | 4.09 |
145022-45-3 | Emim+ | CH3SO3− | 4.03 | — |
331717-63-6 | Emim+ | SCN− | 4.00 | — |
1906-79-2 | Epyr+ | Br− | 3.98 | — |
370865-89-7 | Emim+ | DCN− | 3.94 | — |
103173-73-5 | Epyr+ | PF6− | 3.94 | — |
342573-75-5 | Emim+ | MeSO4− | 3.91 | 3.93 |
412009-61-1 | Emim+ | HSO4− | 3.92 | 3.99 |
65039-08-9 | Emim+ | Br− | 3.89 | — |
155371-19-0 | Emim+ | PF6− | 3.86 | 3.92 |
93457-69-3 | Bpyrr+ | Br− | 3.63 | 3.77 |
874-80-6 | Bpyr+ | Br− | 3.73 | — |
85100-76-1 | Prmim+ | Br− | 3.68 | — |
712354-97-7 | Epyr+ | NTf2− | 3.68 | — |
186088-50-6 | Bpyr+ | PF6− | 3.60 | — |
3878-80-6 | Epyr+ | CF3SO3− | 3.57 | — |
342789-81-5 | Bmim+ | CH3SO3− | 3.51 | 3.56 |
65039-09-0 | Emim+ | Cl− | 3.56 | — |
174899-82-2 | Emim+ | NTf2− | 3.54 | — |
79917-90-1 | Bmim+ | Cl− | 3.53 | 3.55 |
258273-75-5 | B1M3N+ | NTf2− | 3.52 | 3.61 |
85100-78-3 | Bmim+ | Br− | 3.52 | 3.43 |
216300-12-8 | Prmim+ | PF6− | 3.51 | — |
—a | Prpyr+ | NTf2− | 3.50 | — |
143314-16-3 | Emim+ | BF4− | 3.49 | 3.44 |
344790-87-0 | Bmim+ | SCN− | 3.46 | 3.42 |
350-48-1 | Epyr+ | BF4− | 3.42 | — |
216299-72-8 | Prmim+ | NTf2− | 3.40 | — |
Footnote |
† Electronic supplementary information (ESI) available: Nomenclature of ionic liquid units and Sσ-profile descriptors for cations and anions. See DOI: 10.1039/b919806g |
This journal is © The Royal Society of Chemistry 2010 |