Open Access Article
Jinzheng Zhao,
Guohui Zhou*,
Timing Fang
,
Shengzhe Ying
and
Xiaomin Liu
*
School of Chemistry and Chemical Engineering, Qingdao University, 308 Ningxia Road Shinan District, Qingdao, Shandong 266071, P. R. China. E-mail: liuxiaomin@qdu.edu.cn; zhouguohui@126.com
First published on 6th June 2022
The utilization of biomass resources has attracted more and more attention due to the consumption of non-renewable resources. Compared with cellulose and lignin, hemicellulose has been less studied. Some ionic liquids (ILs) have been proved to be excellent solvents for lignocellulosic pretreatment. However, screening of more efficient ILs is difficult due to numerous possible ILs. Computational chemistry has been proved effective in solvent screening, but a precise model is indispensable. In this work, we focused on building several appropriate models and selected the most suitable one. According to the structure of hemicellulose, six hemicellulose models were constructed and the mid-dimer of the xylan chain hemicellulose (MDXC) model was proved to be the best compared with the reported experimental results. Based on the MDXC model, 1368 ILs were screened to evaluate their ability to dissolve hemicellulose by Conductor-like Screening Model for Real Solvents (COSMO-RS). The activity coefficient (γ), excess enthalpy (HE), and σ-profile indicated that the hydrogen-bond (H-bond) played a vital role in the dissolution of hemicellulose. Anions played a more critical role than cations, where small anions with H-bond acceptor groups could enhance the molecular interactions with hemicellulose. This work provided a thermodynamic understanding of hemicellulose and IL solvent systems. It highlights the importance of building appropriate solute models, which may be necessary to predict of the other thermodynamic properties in the future.
Ionic liquids (ILs) have several remarkable properties such as having low vapor pressure, excellent chemical and thermal stability, and being nonflammable.11–15 As a new type of green solvents, ILs can dissolve carbohydrates, swell the cell walls, and decrease the crystallinity of cellulose.9 It have been reported16 that imidazole-based ILs with anions of Ac−, Cl−, Br−, I− could dissolve hemicellulose efficiently at temperatures ranging from 80 to 150 °C. Peng17 found that two hydroxyl groups of the xylan chain in hemicellulose can form electron donor-electron acceptant (EDA) with [Bmim]Cl, which could accelerate the hemicellulose dissolution process. Morais18 studied the interaction between ILs and hemicellulose through NMR. They concluded that the anion alkalinity was the key for hemicellulose dissolution. Cheng19 showed the advantages of conducting a suitable model compound to explain the dissolution mechanism of the experimental results. Up to now, hundreds of ILs with excellent solubility to cellulose or lignocellulose have been found,20–23 but hemicelluloses were relatively few investigated. Due to the complexity of hemicellulose, its dissolution in ILs and its interaction with ILs were still unclear.2
As the numerous potential ILs with different cations and anions bring difficulty to screen, it is important to use a high-efficiency method to predict hemicellulose solubility in ILs.24 Conductor-like Screening Model for Real Solvents (COSMO-RS) is a model based on the continuum solvation theory for a specific problem in the molecular structures.25 H-bonds, van der Waals, and misfits interactions in the solvent systems are combined to predict the logarithmic activity coefficient (ln
γ) at infinite dilution.15 The ln
γ can reflect the solubility of solute in ILs. Combining statistical thermodynamics and quantum chemistry, COSMO-RS is considered25 to be one of the most high-efficiency methods for prediction and screening of ILs.
In recent years, several studies26–29 have demonstrated that COSMO-RS could predict ILs dissolving capabilities. It has been proved that low activity coefficient and low numerical HE of lignocellulose in ILs were related to high solubility.26 Kahlen et al.30 screened more than 2000 ILs for dissolving cellulose and showed that the anion was the major determinant in the dissolving process based on COSMO-RS calculation. Liu15 emphasized the importance of suitable models in investigating the dissolving process, the mid-monomer part of cellotriose was proved to be the most consistent with the experimental results. Moreover, different conformations of the molecules in the dissolved system led to different termodynamic properties predicted by COSMO-RS.31 He et al. 32 developed an automated computational framework MoDoop based on COMSO-RS to predict the lignin solubility in ILs. In Mohans' studies,33,34 the researchers used cellobiose as a model for cellulose, whereas hemicellulose is represented with an equimolar combination of both glucose and xylose. The ln
γ of cellulose and hemicellulose in thousands of ILs have been predicted by COSMO at room temperature, and the interaction between solute and solvent has been analyzed using a variety of computational methods combined with experiments, which is a relatively complete and systematic work. However, the solute model has great influence on the calculation results, a suitable solute model determines the accuracy of the prediction, and is essential for future hemicellulose studies. The fact that different solute model can drastically change the thermodynamic estimations using COSMO-RS is of significant concern.
This study focuses on the construction and selection of suitable hemicellulose models, aiming to obtain the most appropriate hemicellulose model and screen a large amount of ILs. Understand the thermodynamic properties of hemicellulose dissolution in ILs. ILs with different cations, anions (bounded with several types of functional groups) were considered due to their accessibility and eco-friendly properties. We developed six different models, and the reported experimental data16,35 were used to verify the predictions of several hemicellulose models' ln
γ at infinite dilution. The mid-dimer of xylan chain (MDXC, Model 3) was proved to be the best model. This study also summarizes the dissolution rules of ILs for hemicellulose from the perspectives of cation type and alkyl-chain length, and complementarity of the σ-profile between anion-solute molecules was quantified and the rationality of this screening method was confirmed.
To calculate excess enthalpies in the COSMO-RS program, the molar fraction of hemicellulose, IL cation, and IL anion was set to 0.5, 0.25, 0.25, respectively. The method to calculate ln
γ at infinite dilution and HE was described in previous studies.31,36
![]() | (1) |
![]() | ||
| Fig. 1 Schematic representation of hemicellulose.37 Inter- and intramolecular hydrogen bonds are also shown. | ||
The ln
γ at infinite dilution of solute is often considered as a qualitative measurement of solute solubility. According to Formula 1, the reciprocal of the activity coefficient represents the solubility of hemicellulose in their respective ILs.30 In this formula, xLi represents the mole fraction of dissolved i versus undissolved i in a saturated solution. Δhm,i, Tmi represents the enthalpy of melting and the melting temperature of component i, respectively. In this work, the ln
γ at infinite dilution of each hemicellulose model in ILs were calculated using COSMO-RS. These values served as quantitative measurements of solubility in ILs. Description of the activity coefficient is the key of this work. Linear regression was performed on the experimental solubility of hemicellulose and the values of ln
γ. The performance of each hemicellulose model was measured by R-squared (R2, it measures the goodness fit of the data regression equation) and the residual sum of squares (RSS), which reflected the prediction accuracy. The structures and charge surface files(σ-surface) of the six models are shown in Fig. 2, generated with the procedures in Section 2.1.
In COSMO-RS, σ-potential25 measures the system's affinity to a surface of polarity σ. Three regions defined based on σ-potential: H-bond donor region (σ <−0.0082 e Å−2), H-bond acceptor region (σ >+0.0082 e Å−2), and non-polar region (−0.0082< σ <+0.0082 e Å−2). The σ-potential lines of Models 3 to 6 almost overlap in the whole region (Fig. 3), which means that their representation of hemicellulose is almost identical in the solution system based on COSMO-RS calculation. The σ-potential of Model 1 is more negative than that of other models in the H-bond acceptor region, which means that Model 1 has more affinity to H-bond acceptor surfaces than other models. Model 3 has the strongest hydrogen-bonding acceptor affinity due to a more negative σ-potential in the H-bond donor region and a more positive σ-potential in the H-bond acceptor region. The performance of Model 2 is somehow in between.
The ln
γ of the six hemicellulose models by COSMOtherm along with the experimental bamboo solubilitie16,35 is in the ten ILs were listed in Table 1. Linear regression was performed between ln
γ and experimental solubilities. R2, RSS, and optimization time, which characterize the performance of the models were listed in Fig. 4. R2 of the longer xylan chain (R2 = 0.82) models were higher among the six hemicellulose models. R2 of the intermediate trimer, tetramer, and pentamer models were the same as those of the mid-dimer models. However, the RSS values of the mid-dimer of xylan chain model (MDXC, Model 3) in the ten ILs was smaller. With the increase of chain length, the molecular structure's optimizing time of more complex models increased sharply. It showed that Model 3 was a good representation of the long-chain structure of hemicellulose. Combined with the calculation time of different models, Model 3 was the best among the six models.
| ILs | Solubility (g·100 g−1 IL) | ln γ prediction results by COSMO-RS |
|||||
|---|---|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | Model 4 | Model 5 | Model 6 | ||
| a Data from Hu et al., 2020;35b Data from Xia et al., 2020.16 | |||||||
| EmimAc | 13.51a | −6.03 | −0.86 | −1.63 | −2.77 | −3.96 | −5.18 |
| BmimAc | 12.46b | −5.58 | −0.73 | −1.29 | −2.21 | −3.19 | −4.19 |
| BmimCl | 11.03b | −2.83 | −0.67 | −1.57 | −2.67 | −3.82 | −5.00 |
| BmimAcr | 4.02b | −4.48 | −0.60 | −1.01 | −1.75 | −2.55 | −3.38 |
| BmimBr | 2.61b | −1.12 | −0.34 | −0.95 | −1.71 | −2.51 | −3.35 |
| HoemimAc | 0.78a | −3.98 | −0.39 | −0.63 | −1.24 | −1.90 | −2.60 |
| BmimBut | 0.21b | −5.13 | −0.65 | −0.96 | −1.66 | −2.42 | −3.20 |
| BmimGly | 0.30b | −3.54 | −0.45 | −0.85 | −1.51 | −2.20 | −2.94 |
| BmimBen | 2.10b | −4.07 | −0.51 | −0.64 | −1.15 | −1.70 | −2.29 |
| BmimEtHOCOO | 0.36b | −3.38 | −0.44 | −0.73 | −1.29 | −1.90 | −2.55 |
![]() | ||
| Fig. 4 Optimization time and R2 of six hemicellulose models. The area of the blue circle indicates the value of the RSS, a smaller RSS represents a better goodness of fit. The data in this figure are shown in ESI.† | ||
γ (methods described in Section 2.2) of Model 3 in 1368 ILs were depicted in Fig. 5. The cations and anions were listed in the order of the calculated logarithmic activity coefficient values. The ILs with high solubility were shown in the upper-left corner of the image, and the ILs with low solubility were shown in the down-right corner. The red and blue dotted lines roughly divided 1368 ILs into three regions according to the ln
γ. The ln
γ varies notably between different kinds of anions (Fig. 5). However, little difference was found between different cations, indicating that the dissolution of hemicellulose in ILs was mainly attributed to the anion. It could be explained that ILs containing the anions including Cl−, Ac−, HCOO−, Gly−, Br− had a strong ability to dissolve hemicellulose. It is consistent with the experimental results of Santo et al.,38 where they indicated the relative interaction strength between smaller anions and cations were stronger,38,39 and could promote the dissolution of xylan chain.
It has been widely recognized that anions played a significant role in the dissolution of lignocellulose. But the cations cannot be ignored in the dissolution of hemicellulose. In Mohan's calculation34 of the ln
γ at room temperature, the cation species had little influence on the ln
γ. In this work, it can be seen that cation type and cation branched-chain type has a more significant effect on hemicellulose in ILs at 130 °C (Fig. 5), which may be caused by the temperature effect. High-effective ILs can be obtained by designing ILs cations with different functional groups. The hemicellulose has a relative high solubility in pyrrolidinium-based ILs than other ILs (Fig. 6a). The second highest are the ILs with ethyl-morpholine, and followed by methyl-morpholine, imidazolium, and pyridinium ILs. Fig. 6b showed the average ln
γ of 30 cations with five different R-group, where the order of solubility is methyl > allyl > methoxyethyl > hydroxyethyl > ethoxy-2-oxoethyl. The presence of alkaline oxygen atoms in the side chain could interfere with hydrogen-bond formation between the anions and the hydroxyl groups in the xylan chain, for example, cations with alkoxy or alkoxyalkyl groups.40
In this work, we compared the methylmorpholinium-based and ethyl-morpholinium-based ILs. We found that cation with longer alkyl chain had a negative effect on the dissolution of hemicellulose compared with shorter alkyl-chain. To investigate the effect of the cation alkyl chain length on hemicellulose dissolution, methylimidazolyl-based and methylpyrrolidinium-based ILs with alkyl chain of different lengths were selected from the COSMOtherm ILs database. After combining with the 38 anions, the ln
γ of those ILs were calculated. The results were shown in Fig. 7, which verifies the inhibition long alkyl chain on hemicellulose dissolution, which is consistent with the experimental phenomenon.41 Pyrrolidinium ILs were more affected by the length of alkyl chain than imidazole ILs (Fig. 7c).
![]() | ||
| Fig. 8 Excess enthalpies of Model 3 and 16 ILs from different regions in Fig. 5. | ||
γ and HE showed that the dissolution of hemicellulose is strongly dependent on the anion. In addition, the σ-profile curve of COSMO-RS theory can reflect the relative number of environmental polar fragments on the molecular surface. The complementarity of the curve peak in the H-bond acceptor region can be used to describe the interaction ability between ions and solute molecules.31 The ILs anions were classified into categories I, II and III according to their ability to dissolve hemicellulose (Fig. 9a). The σ-profile curves of the category I anions had better complementarity with the curve of hemicellulose model in the polar region (+0.015–+0.022). Then, we tried to find the most appropriate way to quantify this feature. After analyzing the anions' σ-profile curves in the polar region, we found that the integral area under the σ-profile curves of the anions can reflect the complementarity between anions and the hemicellulose model better (Fig. 9b). We calculated the integral areas under the σ-profile curve of 38 anions in the polar region (+0.015–+0.023) and correlated them with the average ln
γ of each anion's ILs. In order to determine the most appropriate complementary region, we took multiple regions for calculation and comparison. We found that data points of the 38 anions had the best correlation (R2 = 0.779) in the range of (+0.016–+0.021) (Fig. 9c, other interval data detailed in ESI, Table S3†). The method can quantify the complementarity of the σ-profiles curves between compounds in the vital interval of the polarity region. The anions with the largest integral area within the region were highly consistent with the anions with better solubility. In addition, the curve of σ-profiles in the H-bond acceptor region also showed that the interaction between the anions and the hydroxyl group of the hemicellulose plays an essential role in the dissolution process. The solubility followed the hydrogen bonding acceptor affinity or alkalinity42 of the anion. The σ-profile method for ILs was a new and more intuitive method for the high throughput screening of ILs.
![]() | ||
Fig. 9 A new screen method of ILs. (a) σ-profile curves of hemicellulose and 38 anions. The red curve is the σ-profile of the hemicellulose model. The anions are divided into three groups according to the order of activity coefficients in Fig. 5. (Due to the limitation of the size of the figure, the detail annotated figure of the 38 anions is presented in ESI†). (b) The integral area of three anions in the polar receptor region. The red curve is the σ-profile of the hemicellulose model. (c) The integral area in the polar receptor region (+0.016–+0.021) and corresponding ln γ of 38 anions. | ||
To further verify the reliability of the σ-profile, we used molecular dynamics (MD) simulation for three ILs with different abilities in dissolving hemicellulose. The cation is [Bmim]+, and the three anions are [Cl]−, [BEN]− and [TF2N]−, respectively. The simulations were performed with GROMACS software package.43,44 Each system contains 1000 ILs and 15 molecules (Model 3), and the content of the molecule is 1.48 mol%. The initial configuration was generated by randomly placing Model 3 in the ILs by Packmol package.45 Then energy was minimized, followed by NVT and NPT simulation. The equilibration simulation was extended to 50 ns, and production phase lasted for another 50 ns at 130 °C. The cutoff distance between LJ and Coulomb interaction is 1.2 nm. Temperature control using Nosé–Hoover thermostat46 with a coupling time of 0.2 ps, and pressure (1.0 bar) controlled by Parrinello–Rahman constant47 with a coupling time of 2.0 ps.
Fig. 10a shows the radial distribution functions (RDFs) between the hydroxyl H on the two middle rings of hemicellulose and the anion. By comparison, it is found that the interaction between H atom and oxygen atom in [TF2N]− is much smaller than the interaction for [Cl]− or carbonyl O in [BEN]−. This is consistent with the poor solubility of [Bmim][TF2N]. The spatial distribution function (SDF) reflects the density distribution of components from a three-dimensional perspective intuitively, which represents the interaction strength of all components. We calculated the SDF of anions around hemicellulose, as shown in Fig. 10b–d. The anions are mainly distributed around the hydroxyl of the hemicellulose molecules, because their interaction is mainly the H-bond between the hemicellulose and the negatively charged atoms on the anion. The difference between the three anions is obvious, the H-bond interactions from strong to weak are [Cl]−, [BEN]− to [TF2N]−, which is consistent with the result of the above σ-profile. To sum up, the σ-profile prediction method is effective and reliable.
γ is a reliable character reflecting the solubility in ILs with different cations and anions. Especially, ILs with anions of Ac−, Cl−, HCOO−, Gly− and DMSO− were predicted to have a promising advantage for the hemicellulose dissolution. Compared with the reported work, cations have a greater influence on the dissolution of hemicellulose by increasing the temperature. In terms of cations, pyrrolidinium-based and quaternary ammonium-based ILs have strong H-bond formation ability with hemicellulose, which accelerate the dissolution process. The order of R-groups contribute to solubility is methyl > allyl > methoxyethyl > hydroxyethyl > ethoxy-2-oxoethyl. The solubility of ILs for hemicellulose is decreased with the extend of alkyl chain length in cation. The H-bond interaction is the primary contributor in the dissolution of hemicellulose, with misfits' force and van der Waals force being subsequent ones. Further, the σ-profile method is effective for large-scale ILs screening. In all, this work highlighted the importance of selecting an appropriate solute model, and it provided an understand of the thermodynamic behavior of hemicellulose and ILs solvent systems, which is necessary for predicting the other thermodynamic properties.
Footnote |
| † Electronic supplementary information (ESI) available. See https://doi.org/10.1039/d2ra02001g |
| This journal is © The Royal Society of Chemistry 2022 |