Open Access Article
Sharib Khan
a,
Tormi Lillerand
a,
Veerapandian Ponnuchamyb,
A. G. M. Zaman
a,
Daniel Rauber
cd,
Udayakumar Veerabagu
a,
Jüri Olt
a,
Markus Gallei
*cd,
Sabarathinam Shanmugam*a and
Timo Kikas
*a
aChair of Biosystems Engineering, Institute of Forestry and Engineering, Estonian University of Life Sciences, Tartu, Estonia. E-mail: sabarathinam.shanmugam@emu.ee; timo.kikas@emu.ee
bInnoRenew CoE, Slovenia and University of Primorska, Andrej Marušič Institute; Koper, Livade 6, 6310 Izola, Slovenia
cPolymer Chemistry, Saarland University, Campus C4.2, 66123 Saarbrücken, Germany. E-mail: markus.gallei@uni-saarland.de
dSaarene, Saarland Center for Energy Materials and Sustainability, Saarland University, 66123 Saarbrücken, Germany
First published on 5th June 2026
The transition to a circular bioeconomy depends on the sustainable use of renewable resources, with forest-based biomass playing a critical role. Lignin, in particular, is a promising feedstock for replacing fossil-derived aromatics due to its unique chemical structure. However, lignin obtained from conventional pulp mills often contains impurities and structural modifications that limit its potential for producing high-value chemicals and materials. In contrast, emerging biorefinery pretreatment strategies enable a “lignin-focused” approach, where every major biomass component, cellulose, hemicelluloses, and lignin, is valorized for intended applications, allowing the tailored production of bio-based products with improved structural quality. Traditionally, the development of biorefining processes requires extensive experimentation guided largely by empirical advances. In this study, we demonstrate how Bayesian optimization can accelerate the development of a sustainable biomass fractionation process based on the protic ionic liquid (PIL) triethylammonium hydrogen sulfate ([TEA][HSO4]). Open-source Python-based tools were employed to optimize lignin extraction from softwood and evaluate the influence of key processing parameters, including temperature and process severity, on lignin yield. Molecular dynamics simulations and in-depth literature analyses were adopted to identify the optimal trade-offs between lignin recovery and structural properties. Notably, the optimized PIL fractionation process achieved 82% delignification with lignin yields of up to 73%, while consistently producing lignin with low molecular weights because of in situ depolymerization during pretreatment. Furthermore, advanced qualitative characterization was conducted to assess the lignin structure and evaluate the relationship between lignin yield, structural quality, and potential downstream applications.
Green foundation1. We advanced a protic ionic liquid (PIL)-based biomass fractionation process with a strong focus on lignin recovery by integrating Bayesian optimization and molecular dynamics simulations. This integrated strategy enables computationally driven efficiency, reduces the need for extensive experimentation, and clarifies the role of carbohydrate impurities in lignin extraction.2. The key achievements were the production of low-molecular-weight lignin and the identification of an optimal balance between lignin yield and its quality. 3. To further enhance the sustainability of this approach, future research should focus on full life cycle assessments, PIL recycling, and the development of fully bio-derived PILs, thereby enabling a closed-loop, zero-waste biorefining process. |
Ionic liquid (IL)-based biomass fractionation, commonly known as the IonoSolv process, has recently gained significant attention as a promising strategy for LCB valorization.14 The appeal of the IonoSolv process lies in the ability of ILs, particularly protic ionic liquids (PILs), to selectively dissolve and recover the primary components of LCB through several key features: intrinsic hydrogen-bonding capability essential for biomass processing; facile, cost-effective, and atom-efficient synthesis via simple acid–base neutralization, which is well-suited for large-scale applications; catalytic properties at elevated temperatures, including reverse reactions that reform the acid; and ease of handling and processing due to the absence of pressure build-up at higher temperatures.15 On top of this, certain PILs can effectively extract and enrich cellulose, thereby enhancing its subsequent saccharification.16,17 In addition, PILs can be designed to exhibit high selectivity toward lignin, improving extraction efficiency and facilitating potential in situ structural modification during pretreatment.17,18 Compared to conventional methods, including mechanical,19 chemical,20 thermochemical,21 and kraft pulping processes,22 IL-based fractionation offers several advantages. Depending on the feedstock, the IonoSolv process can also provide improved component separation in a potentially more cost-effective and sustainable manner.23 Furthermore, the tuning properties of PILs towards different biomass components, particularly cellulose and lignin, make them relevant for several high-value applications, such as emulsions, antioxidants, and thermoplastic formulations.16,24 Nevertheless, tailoring the properties of LCB components or renewable materials traditionally requires extensive experimental screening, substantial resources, and detailed characterization. These challenges can be overcome by emerging machine learning (ML) methods, which can significantly speed up the process and make the IonoSolv approach more sustainable and economically relevant.
Recent studies have increasingly explored the application of ML to advance biomass fractionation, particularly for optimizing pretreatment conditions,25 modeling representative compounds,26 and tailoring the LCB side-stream properties for valorization. For instance, Löfgren et al.27 demonstrated the potential of ML to accelerate the development of sustainable chemical processing strategies for targeted lignin extraction and its properties. Similarly, Diment et al.28 and Chrząstowska et al.29 applied ML approaches to improve the scalable production of lignin-carbohydrate complexes (LCCs) and pulp extractive analysis. Rummukainen et al.30 introduced an ML-based framework for pilot-scale comparison of wood delignification processes. In addition, Gisperg et al.31 highlighted the applicability of Bayesian optimization (BO) in bioprocess engineering, whereas Bertelsen et al.32 developed an open-source Python package to facilitate real-world optimization using BO. Despite these advances, current research lacks a consolidated focus on lignin yield and its corresponding molecular weight (Mw), which is a key parameter influencing downstream valorization. To the best of our knowledge, no study has investigated the integration of IL-based fractionation, particularly PILs, within a BO-driven ML framework to optimize lignin extraction. The absence of a general-purpose optimization strategy capable of controlling the Mw of lignin limits the development of standardized lignin valorization pathways. This challenge is further compounded by the fact that different fractionation methods produce lignin with varying Mw,33,34 complicating process modeling, comparison, and refinement.35–37 Additionally, process optimization is often hindered by variability in biomass feedstocks and solvent systems. Conventional optimization approaches, such as linear programming, dynamic programming, and search algorithms, often struggle with the highly nonlinear and discontinuous nature of LCB fractionation, resulting in high computational costs.38,39
Molecular dynamics (MD) simulations provide a powerful computational approach for understanding the dynamic behavior of materials at the molecular scale. These methods have been widely applied to investigate the interactions between lignin and various solvents to optimize the dissolution process and nanoparticle synthesis. In particular, MD simulations provide valuable insights into the movement of particles and molecules, which can further elucidate the complex interactions between lignin and ILs during fractionation.40–42 Previous studies have also indicated that basic anions, such as acetate and glycinate, function as strong hydrogen-bond acceptors capable of disrupting both inter- and intra-molecular interactions within the lignin matrix.40,42 MD simulations further demonstrated that selective lignin dissolution relative to cellulose can be optimized by tuning the anion basicity. By quantifying the binding energies and diffusion coefficients, MD simulations provide a thermodynamic framework for the rational design of task-specific ILs.
Keeping these considerations and challenges in mind, our study developed a BO-guided PIL-based lignin extraction methodology that integrates MD simulations with lignin yield predictions to evaluate the feasibility of targeted extraction conditions at the early stages of the process design, thereby reducing reliance on trial-and-error experimentation. In addition, the study proposes integrating BO with extensive literature datasets to generate surrogate models that correlate key PIL properties with lignin characteristics while minimizing processing constraints and satisfying multiple experimental objectives. This approach enables the simultaneous maximization of lignin yield and optimization of structural properties, facilitating the prediction of suitable downstream applications of lignin. By standardizing the lignin yield and structural properties within an ML framework that incorporates PIL selectivity and optimized extraction conditions, this study aims to advance predictive process design and significantly enhance lignin valorization strategies.
| Summary | Ref. |
|---|---|
| Rapid [TEA][HSO4] 15 min pretreatment at 180 °C enabled highly efficient enzymatic saccharification. A saccharification yield exceeding 75% of the theoretical maximum was reported, demonstrating the effectiveness and rapidity of the IonoSolv pretreatment | 44 |
| [TEA][HSO4] to remove heavy metals from contaminated biomass (HMCBs) before subjecting it to thermochemical or biological conversion processes | 45 |
| [TEA][HSO4] was evaluated using a combined methodology of detailed process simulation and life cycle assessment (LCA) to determine its economic viability and environmental footprint | 46 |
| A hybrid pretreatment process was developed by combining IonoSolv and organosolv fractionation. Compared to the standard IonoSolv process, this hybrid method demonstrated superior performance. It produced a cellulose-rich pulp with higher enzymatic accessibility and achieved more extensive lignin removal | 47 |
| A complete process was developed to transform oil palm empty fruit bunches (OPEFBs) using [TEA][HSO4] as a pretreatment agent. Systematic optimization of key pretreatment parameters, including the PIL composition and temperature. Finally, a method for the recovery and recycling of PIL was established, forming a comprehensive and sustainable processing loop for biomass | 48 |
| The biofuel potential of the perennial grass Pennisetum polystachion was evaluated using a pretreatment process with [TEA][HSO4]. An optimal pretreatment condition was established using an 80% concentration of PIL at 140 °C for 45 min with a 10% solid load. This resulted in a high delignification rate of 65.8%, effectively breaking down the biomass structure for subsequent conversion. Furthermore, the process demonstrated strong sustainability through PIL recycling, with recovery rates reaching 90% while maintaining significant delignification efficiency across multiple cycles | 49 |
| A microwave-assisted fractionation process using [TEA][HSO4] was developed for corn stover. This strategy effectively deconstructed the biomass to facilitate the production of monomeric sugars and support downstream acetone–butanol–ethanol (ABE) fermentation. A mass balance analysis of the integrated process demonstrated that from 100 g of raw corn stover, this approach could generate 8.1 g of ABE solvents and 16.61 g of technical lignin | 50 |
| This study designed and commissioned a versatile semi-batch reactor with inherent flexibility for reconfiguration into continuous-flow operation. The reactor was successfully implemented for production, demonstrating the capacity to generate ([TEA][HSO4]) at a rate of 0.8 kg per hour and a concentration of 80% (w/w) | 51 |
| This study demonstrated the molecular simulations of ionic liquids for lignin solvation and in situ depolymerization | 52 |
The synthesis was carried out as follows: Triethylamine (75.9 g, 750 mmol) was cooled in an ice bath in a round-bottom flask. Sulfuric acid (5 M solution) was added dropwise with stirring. Water was removed under reduced pressure using a rotary evaporator, and the product was dried overnight at 40 °C in a Schlenk line. The resulting PIL, [TEA][HSO4], was obtained as a white hygroscopic solid.
A simulation box containing one lignin molecule dissolved in 2792 molecules of [TEA][HSO4] at approximately 1.5 wt% lignin was constructed as shown in Fig. 2. After energy minimization by the steepest descent method, the system was equilibrated for 5 ns at 298.15 K. Production simulations were then carried out for 100 ns in the NPT ensemble at 1 bar. The temperature was controlled using a velocity-rescaling thermostat, and the pressure was controlled using a Berendsen barostat. The leapfrog algorithm was used with a 2 fs time step, and hydrogen-containing bonds were constrained using the LINCS method.63 Lennard-Jones interactions were switched off between 1.0 and 1.2 nm with a dispersion correction applied, and long-range electrostatics were treated using the particle mesh Ewald method. Structural and thermodynamic analyses were performed over the final 70 ns of the trajectory and saved every 2 ps. GROMACS built-in compiled codes were used for all analyses performed in this study.
:
3 was added, vortexed, and incubated in a preheated oven. The samples were treated in triplicate at different temperatures (120–210 °C) in batches iteratively (0, 1, 2 and 3) for 1.5 h as the incubation time. Following pretreatment, ethanol was added to the pressure tube to separate the cellulosic pulp from the PIL–lignin mixture via centrifugation (4000 rpm for 10 min). Later, PIL lignin dissolved in ethanol was fractionated using a pressure filtration system equipped with Whatman™ nylon membrane filters (pore size 0.8 μm) to extract lignin rich fractions. Finally, ethanol was recycled using a rotary evaporator (Buchi Rotavapor R-200, Buchi, Switzerland) and after evaporation, Milli-Q water was added to the concentrated ionic liquid solubilized lignin as an antisolvent to precipitate lignin fractions from the PIL; the precipitated lignin was subsequently washed with Milli-Q water (3×) to ensure complete removal of the PIL, which was then recovered through centrifugation, as shown in Fig. 3. Furthermore, this study does not evaluate the enzymatic digestibility or fermentability of the cellulosic residue, nor does it provide PIL recycling data. These aspects remain the subjects of ongoing studies and are necessary for the development of a fully integrated biorefining process.
![]() | (1) |
For elemental analysis, approximately 100 mg of lignin was encapsulated in tin foil. The carbon, hydrogen, nitrogen, and sulfur contents were determined using an Elementar Vario Macro Cube based on EVS-EN ISO 16948:2015. The oxygen content was calculated as the difference (O = 100 − (C + H + N + S)).
000 Å), and a pre-column (8 mm × 50 mm) (Polymer Standards Service (PSS), GmbH, Mainz, Germany) with a UV detector (280 nm). Lignin samples were dissolved in 0.1 M NaOH (5 mg mL−1), and an isocratic flow was maintained with 0.1 M NaOH solution at a flow rate of 0.5 mL min−1 with an injection volume of 20 µL. The relative molecular weight of lignin was determined using polystyrene sulfonate sodium salt standards (PSS, GmbH, Mainz, Germany) ranging in size from 1100 to 100
000 Da.The sugar content was quantified using 10 mg of lignin samples via acid methanolysis. Following acid methanolysis and silylation, carbohydrate analysis was performed by gas chromatography using an HP-1 GC column (Shimadzu GC-2010 AF) equipped with a flame ionization detector (Shimadzu, Kyoto, Japan), as previously described.67 After neutralization with pyridine, the sample was derivatized overnight with HMDS/THMS, followed by the addition of 1.0 mL of internal standard (0.1 mg mL−1 resorcinol in methanol). The silylated analytes were separated using hydrogen as the carrier gas at a flow rate of 1 mL min−1.
Notably, the P-factor was originally developed to model hemicellulose removal, and its application to direct PIL-based lignin extraction remains largely unexplored. To the best of our knowledge, this study represents the first attempt to correlate the P-factor with lignin yield in a PIL solvent system. Given the limited availability of prior work in this area, an iterative batch-based validation framework was adopted (detailed analysis and datasets are provided in the SI). The Bayesian optimization progression is shown in Fig. 4A and B for the two lignin-yield datasets. Batch 0 represents the initial GP model trained using the first four observations, which are shown as white circles to distinguish them from BO-selected experiments. In subsequent batches, all observations available before the current acquisition step are shown as black circles, including the initial observations after batch 0, while the newly selected BO acquisitions are shown as lime-green circles. For each batch, the top row presents the GP-predicted lignin yield across the temperature P-factor space, with warmer colors indicating higher predicted yield and cooler colors indicating lower predicted yield. The bottom row presents the corresponding predictive uncertainty, expressed as σ in lignin-yield units. Regions with higher σ indicate areas where the model has less information, whereas lower σ regions occur near sampled experimental conditions. As additional BO-selected observations are incorporated, the GP model is updated, uncertainty decreases around sampled regions, and the predicted yield landscape becomes progressively refined. The final column represents the final model after all observations have been incorporated, rather than an additional BO acquisition batch. This sequential process demonstrates how BO can guide the selection of informative process conditions while reducing the number of experiments required to identify promising lignin-yield regions.
The theoretical data presented in Fig. 4A were compiled from recent literature and illustrate the effects of the P-factor and temperature on lignin recovery. According to the results, the optimal pretreatment temperature for maximizing lignin yield during subsequent organosolv extraction using solvents such as ethanol, acetone, or aqueous solvent mixtures was 200 °C, with a P-factor of 2000.
However, these results are not directly comparable to the results obtained with a PIL, as shown in Fig. 4B. In the PIL process, the ionic liquid serves as a catalyst in the pretreatment; thus the optimal pretreatment temperature range for lignin extraction is between 170 and 180 °C. Moreover, when higher temperatures are employed with the PIL, the reported lignin yield exceeds 100%, which is attributable to the co-extraction and condensation of hemicellulose and cellulose with lignin, as previously documented.17
A major challenge associated with the literature data is heterogeneity, as illustrated in Fig. 4A. The collected P-factor and temperature values, along with their corresponding lignin yields, vary considerably due to differences in solvent systems, biomass types, and extraction conditions across studies. This variability complicates the establishment of direct correlations. Nonetheless, the literature data clearly indicate the importance of temperature and its role in lignin extraction.
To address this variability and gain a better understanding of the temperature dependence, the model simplifies the analysis by focusing on the fundamental relationship between lignin yield and the two primary controllable variables, P-factor and temperature, within the PIL extraction framework.
An in-depth iterative experimental series was subsequently conducted to focus on quantitative lignin yield across ten experimental observations (see SI Table S2), in which time was held constant while temperature was increased iteratively up to 210 °C. The input variables examined were temperature (120–210 °C) and P-factor (12–15
055), with lignin yield (11–120%) serving as the response variable. The relationship between these process parameters and lignin yield, along with the corresponding model predictions derived from the literature dataset, is presented in Fig. 4B. The results identified a critical processing region near a P-factor of 2000. A notable lignin yield (approaching 73%) with minimal carbohydrate impurities (discussed later) was achieved at a P-factor of approximately 1902.
Although higher lignin yields (up to 78%) were also observed at elevated P-factors, these conditions were not employed in this work for two primary reasons. First, experimental results indicated that increased recovery levels led to higher residual PIL and saccharide contamination in the extracted lignin, resulting in greater filter residue and reduced lignin purity, as shown in Fig. 5. Specifically, at a lignin yield of 73%, a notable increase in the filter residue was observed during ethanol-based lignin recovery, a trend that became more pronounced at higher yields. This issue is further addressed later in the text. Second, consistent with previous studies, elevated temperatures are known to promote lignin condensation reactions, which can alter the native lignin structure and limit its potential for downstream valorization.17,19,35,73 In particular, elevated temperatures during biomass fractionation can induce lignin condensation through the formation of LCCs, where lignin forms covalent linkages with hemicellulose, especially xylan, generating recalcitrant structures.74–76 Within the PIL fractionation process, which solubilizes both lignin and hemicelluloses during lignin recovery, this presents a specific challenge. Higher temperatures enhance delignification but also promote LCC formation and extensive cleavage of β-O-4 linkages in native lignin.67 Consequently, increased delignification results in the co-extraction of lignin and LCCs. As this work focuses on extracting lignin with targeted properties, a balance must be struck between the degree of delignification and the purity and recoverability of lignin in subsequent stages. Henceforth, a delignification level of 82% with a lignin recovery of 73% was selected for detailed characterization of pretreatment-related impurities and lignin properties relevant to its efficient applications.
In summary, the influence of the P-factor and temperature on the structural properties of lignin can be explained, both quantitatively and qualitatively, by well-established principles of lignin chemistry and relevant literature data. However, the experimentally optimized results reported in this study require advanced characterization to enable detailed qualitative analysis, particularly regarding the role of the PIL in lignin solubilization during fractionation. Similarly, as a next step, optimization of time and the PIL/biomass ratio as independent variables will be required to establish correlations between the lignin yield, cellulose retention and sidestream valorization. Nevertheless, challenges remain in defining an optimal lignin yield, as the preferred outcome “whether lower molecular weight lignin with minimal carbohydrate impurities or higher molecular weight lignin with higher yield” depends on the specific application (discussed later in the text). In this context, our surrogate model landscapes provide the quantitative predictions necessary for large-scale lignin valorization, particularly for producing lower molecular weight lignin with reduced carbohydrate content.
Accordingly, this study represents a significant step in applying BO to elucidate the influence of temperature on the PIL-based fractionation of pine wood biomass. BO enabled systematic exploration of the trade-off between maximizing lignin yield and preserving lignin quality. The results further confirm the applicability of the P-factor concept to PIL-based lignin extraction.
However, conventional compositional analysis techniques do not distinguish between native lignin and pseudo-lignin. Therefore, pseudo-lignin formation is strongly suspected under conditions where the amount of precipitated lignin exceeds the measured delignification. Under these conditions, the apparent yield exceeded 100% (see the SI), particularly for PIL-based pretreatments conducted above 180 °C.17,44,78 This reprecipitation of pseudo-lignin onto the cellulose surface is known to negatively affect saccharification yields and kinetics by reducing the accessibility of the cellulose substrate.19,73 Altogether, to better understand the effects of temperature and pretreatment associated impurities, several analytical techniques including GPC, ATR-FTIR, HSQC, 13C and 31P NMR, and TGA were employed to characterize lignin properties and its tailored applications.17,44
GPC analysis of the optimized PIL-extracted lignin revealed lower average molecular weight (Mw) and polydispersity index (PDI) values compared to commercial kraft lignin, as summarized in Table 2 and Fig. 7A. These differences are likely attributable to the PIL fractionation and lignin recovery process, particularly the ethanol washing step, which may bias the recovered lignin fraction toward lower molecular weight components rather than directly altering the lignin structure at higher pretreatment temperatures.79
| Sample | Mn (g mol−1) | Mw (g mol−1) | Đ |
|---|---|---|---|
| Mn – number average molecular weight, Mw – weight average molecular weight, Đ – polydispersity index (Mw/Mn). | |||
| Kraft lignin | 601 | 4585 | 7.63 |
| PIL extracted lignin | 850 | 2760 | 3.25 |
The GPC results exhibited trends like those described in the ML optimization and were consistent with the delignification selection criteria discussed previously. This interpretation is further supported by studies demonstrating that sulfate-based PILs including the PIL used in the present study (Table 1) can dissolve up to 70 wt% of lignin prior to ethanol extraction. Moreover, ethanol has considerably lower lignin solubility compared to acetone and solvent–water mixtures, which may also contribute to the lower molecular weight observed in lignin from the PIL process.79 Similarly, pseudo-lignin formation may be associated with the washing step, which promotes the precipitation of larger, ethanol-insoluble fragments, leading to pseudo-lignin deposition on the cellulose surface solution. This observation aligns with the apparent trend of increased ether cleavage at higher pretreatment temperatures. Overall, the decrease in Đ and Mw is attributed to the increase in hydrogen bonding between ethanol and lignin, combined with optimum PIL pretreatment conditions.80 Therefore, lower molecular weight lignin fractions are more soluble in ethanol while higher molecular weight lignin fractions remain in the filtrate, as observed in this study (Fig. 5), resulting in a narrower molecular weight distribution in the isolated lignin. Moreover, the sugar content in the optimized lignin was found to be below 2%, whereas the filter residue contained over 7% sugars, with xylose as the predominant component, accounting for almost 40% of the total sugars. Additionally, approximately 3% of the impurities in the lignin were found to originate directly from the PIL insertion process due to the higher pretreatment temperature, specifically from triethylamine.67 Taken together, these findings suggest that while delignification efficiency and the intrinsic structure of the extracted lignin are largely affected at higher temperatures, both lignin yield and the molecular weight of the isolated lignin are also influenced by the recovery process.
ATR-FTIR spectroscopy was employed to identify the functional groups present in optimized lignin extracted using a PIL and to compare them with those in commercial kraft lignin, as shown in Fig. 7C. Absorption bands in the range of 3550–3250 cm−1 were attributed to the O–H stretching vibrations of aromatic and aliphatic groups in lignin.81 Shoulder peaks were observed at approximately 2920 and 1453 cm−1, corresponding to the symmetric and asymmetric C–H vibrations of the methylene and methyl groups, respectively.48 The intense peak at 1604 cm−1 in the PIL-extracted lignin is attributed to carbonyl (C
O) vibrations conjugated with the aromatic skeletal structure.23 The broad peak at 1250 cm−1 was assigned to the C–O stretching of the guaiacyl unit, whereas in kraft lignin, the presence of syringyl units was confirmed at 1220 cm−1.23 The peaks at 1080 cm−1 and 916 cm−1 are attributed to C–O–C stretching with aromatic C–H in-plane deformation and aromatic C–H out-of-plane bending vibrations, respectively.82 These observations indicated that the extracted lignin primarily contained guaiacyl units, whereas the absence of condensed guaiacyl at 1362 cm−1 suggested that the extracted lignin was free from condensation.
TGA revealed an initial weight loss (stage I) between 100 and 150 °C (Fig. 7B), corresponding to approximately 5% of PIL lignin, which is attributed to the evaporation of physically bound water molecules.83 In the second pyrolysis stage, PIL lignin exhibited more significant mass loss and reduced thermal stability. Similarly, PIL lignin underwent gradual pyrolysis between 200 and 435 °C (stages II and III), reaching a maximum weight loss of 76%. At temperatures above 450 °C (stage IV), no significant loss was found. The greater degradation observed at stages II and III for PIL lignin resulted in a small solid residue, indicating fewer unreacted by-products,84,85 which is advantageous for reducing waste and improving process efficiency. Similarly, the lower stability of PIL lignin can be attributed to its lower molecular weight. TGA results therefore suggest that the PIL fractionation produces lignin with lower molecular weights and higher volatility, as reflected by its lower decomposition temperature of 288 °C and greater overall mass loss (∼76%). Consistent with GPC, ATR-FTIR, and TGA observations, elemental analysis showed a carbon content exceeding 70% in PIL lignin compared with kraft lignin, as presented in Fig. 7D.
HSQC NMR analysis revealed minor changes in the chemical functionality of recovered lignin compared with the conventional lignin (detailed in Tables S3, S4 and Fig. S1 in the SI). In the aromatic region of HSQC spectra (Fig. 8B), the inter-unit linkages at δ(13C), δ(1H) 110.9/6.91, 114.9/6.86, and 115/6.8 ppm correspond to the C2–H2, C5–H5, and C6–H6 of guaiacyl (G) units, respectively. Correlations at δ(13C), δ(1H) 110.5/7.2 and 124/7.1 ppm were assigned to the C2–H2 and C6–H6 of oxidized guaiacyl (G′) units. The β-5 (SB5) linked trans-stilbene exhibited an α signal at δ(13C), δ(1H) 128.2/7.20 ppm and a β signal at δ(13C), δ(1H) 120.1/7.22 ppm; in contrast, the trans-stilbene with a β-1 linkage (SB1) exhibits only its α signal at δ(13C), δ(1H) 125.6/6.97 ppm. Similarly, in the aliphatic region, the methoxy groups of guaiacyl units were observed at δ(13C), δ(1H) 56/3.8 ppm, whereas signals at δ(13C), δ(1H) 8–52.5/0.5–4.5 correspond to the alkyl side chains in the extracted lignin.86 A decrease in ether linkage content (Table 3) was observed in the isolated lignin, accompanied by an increase in carbohydrate (C) peaks assigned to C1 (δ(13C), δ(1H) 102.3/4.30), C2 (δ(13C), δ(1H) 73.2/3.08), C3 (δ(13C), δ(1H) 74.6/3.29), C4 (δ(13C), δ(1H) 75.9/3.54), and C5 (δ(13C), δ(1H) 63.8/3.92 and 3.23).28,87 Comparison of the assigned peaks showed that C2, C3, and C5 were clearly visible in the HSQC spectra, suggesting that pseudo-lignin formation began at this yield, a finding consistent with experimental quantification. The appearance of carbohydrate peaks may result from an increased concentration of dissolved hemicellulose fragments in the PIL–lignin solution.
| Properties | Lignin backbone composition | |||||
|---|---|---|---|---|---|---|
| α/β/γ represent assignments using 13C and 1H.a Quantification using 31P-NMR.b Calculated using a quantitative combination of HSQC and 13C-NMR. | ||||||
| Hydroxyl contenta | Aliphatic OH (mmol g−1) | Phenolic OH (mmol g−1) | Total OH (mmol g−1) | |||
| 0.46 | C5substituted | Guaiacyl | Total phenolics | 3.03 | ||
| 0.62 | 1.95 | 2.37 | ||||
| Interunit linkagesb (abundance/100 aromatic units) | Phenylcoumaran (β-5′) | Pinoresinolα(β–β′) | Pinoresinolα′ (β–β′) | |||
| 0.03 | 0.01 | 0.01 | ||||
| Aryl-vinyl moietiesb (abundance/100 aromatic units) | Stilbeneα(β-5′) | Stilbeneβ(β-5′) | Stilbeneα(β-1′) | cis-Enol etherα | cis-Enol etherβ | |
| 0.02 | 6.41 | 1.81 | 2.16 | 0.05 | ||
| Side chain structure in end groupsb (abundance/100 aromatic units) | Dihydrocinnamyl alcoholα | Dihydrocinnamyl alcoholγ | Aryl-glycerolγ | |||
| 0.02 | 0.08 | 0.02 | ||||
| Lignin polysaccharide complex linkageb (abundance/100 aromatic units) | Benzyl etherα | |||||
| 0.01 | ||||||
To further understand the reactions occurring during extraction, the isolated lignin was analyzed for subunit composition and interlinkage distribution, where 31P NMR analysis identified aliphatic and phenolic OH regions (Table 3). Signals in the range δ(31P) 145.8–149 ppm correspond to aliphatic hydroxyl (R–OH) groups, whereas phenolic OH groups appeared within δ(31P) 137.5–145 ppm, indicating β-O-4 bond cleavage and limited condensation through Cα–Caryl linkages. These reactions lead to the formation of free phenolic structures, particularly guaiacyl OH at δ(31P) 137.5–142 ppm, which showed high signal intensity in the PIL-extracted lignin. In addition, 31P NMR (Fig. 8A) revealed carboxylic acid groups (–COOH) within δ(31P) 134–136.6 ppm, likely formed through oxidation reactions of alkyl side chains.35,37
Altogether, the 31P and 13C NMR results show that a substantial fraction of β-O-4 ether linkages, the most abundant linkages in native pine wood, was cleaved after pretreatment. In contrast, ether bonds associated with the less abundant β–β′ and β-5′ linkages described in Table 3 degraded more slowly, consistent with previous observations. Higher pretreatment temperatures increased the in situ depolymerization of lignin, as evidenced by reduced G6 and G2 signal intensities. Furthermore, when comparing the 31P and 13C NMR results with the HSQC spectra, it becomes clear that increasing lignin yield beyond this point leads to extensive lignin modifications, which in turn complicates lignin characterization and the understanding of its properties.
The combination of HSQC and 31P NMR analyses, therefore, provides detailed structural insight into the lignin units and inter-unit linkages present in lignin extracted from pine biomass. This critical analysis advances lignin–carbohydrate characterization and provides valid information on how PIL-soluble lignin depolymerizes in situ (via β-O-4 bond cleavage) during pretreatment. Henceforth, to further advance the understanding of PIL–lignin interactions and support these findings, molecular dynamics simulations were performed to relate the results to lignin applications.
| Model | Softwood lignin |
|---|---|
| H/G/S composition | 0–5/95–100/0 |
| Molecular weight | ∼8.5 kDa |
| Linkage composition | |
| β-O-4 | 21 (∼46%) |
| β-5 | 7 |
| 5–5′ | 6 |
| β-1 | 5 |
| α-O-4 | 4 |
| 4-O-5 | 2 |
| β–β | 1 |
Effective anions typically exhibit negative σ-potentials in regions of positive charge density, facilitating strong interactions with the lignin surface. A key principle derived from these simulations is that the interaction energy between the anion and the cation of the IL must be weaker than the interaction energy between the anion and lignin, thereby enabling the ions to solvate the polymer.52
To elucidate lignin behavior in a PIL environment, we analyzed time-dependent structural stability, conformational sampling, hydrogen-bonding patterns, and energy contributions. These descriptors link molecular interactions with polymer expansion and solvation behavior. The RMSD profile (Fig. 10A) gradually increases and stabilizes at approximately 0.55–0.65 nm, indicating substantial conformational rearrangement of lignin before reaching a dynamically stable configuration in the PIL. This behavior reflects adaptation of the polymer to the ionic environment rather than its structural instability. Concurrently, the radius of gyration (Rg) increases steadily from 1.45 to approximately 1.60–1.65 nm throughout the trajectory, indicating that lignin adopts a more expanded conformation in [TEA][HSO4]. These values are consistent with those observed in effective solvents, where lignin typically exhibits Rg values between 15.75 Å and 18.85 Å, whereas in poorer solvents such as water, Rg values are generally below 13 Å.80,91
The 2D projection obtained from principal component analysis (PCA) revealed a broad U-shaped conformational landscape (Fig. 10B). This indicates that the lignin polymer explores multiple conformational states during the simulation rather than remaining trapped in a single basin. Such extensive sampling suggests that lignin remains flexible in the PIL and is not kinetically trapped in a rigid configuration. Continuous migration across the phase space indicates stabilization through solvent interactions rather than by intrachain hydrogen bonding, consistent with the increasing RMSD and Rg values shown in Fig. 10A.
Hydrogen-bond analysis revealed a clear predominance of intermolecular hydrogen bonds between lignin and PIL ions, fluctuating between approximately 55 and 70 throughout the simulation trajectory. In contrast, the intramolecular lignin–lignin hydrogen bonds remained significantly weaker (∼12–20). This imbalance demonstrates that lignin preferentially forms hydrogen bonds with the ionic liquid rather than with itself. Consequently, the dissolution process is characterized by the disruption of intramolecular (intrachain) hydrogen bonds and their replacement with lignin–PIL interactions. These transitions explain the polymer expansion observed in the RMSD and Rg profiles, as internal hydrogen bonds are disrupted, the polymer structure opens, and it becomes more accessible to the solvent.
Energy decomposition analysis shows that coulombic interactions dominate the lignin–PIL system, particularly interactions between lignin and HSO4− (HS) anions. The total interaction energy for lignin–HS is substantially more negative than that for lignin–TEA+, indicating that the anion plays a primary role in stabilizing lignin in solution. Although van der Waals interactions contribute to the overall stabilization, their effect is secondary compared with electrostatic and hydrogen-bonding interactions. The strongly negative total interaction energy for the combined lignin–TEA–HS system confirms that solvation is energetically favorable, consistent with the hydrogen-bonding trends observed in Fig. 10C.
In the Radial Distribution Function (RDF) profiles (Fig. 10E and F) for lignin–TEA hydrogen bonding, a small but distinct first peak appears near ∼2 Å, corresponding to interactions between lignin oxygen atoms and the ammonium hydrogen of TEA. However, this peak is less pronounced than that observed for anionic interactions; the aliphatic oxygen (O) of lignin shows an intensity of approximately 0.28, while interactions between aromatic O in lignin with H in TEA are negligible. A second maximum at approximately 5.5 Å, indicates that solvation is distributed along the lignin backbone rather than localized at specific sites. In contrast, RDFs for lignin–HSO4− interactions display sharper and stronger maxima at shorter distances (∼1.9–2.0 Å), indicating the presence of a well-defined coordination shell around lignin oxygen atoms. These results are consistent with previous observations that PIL anions form a dense hydrogen-bond network around lignin functional groups. The presence of this short-range, highly ordered solvation shell explains the dominance of the coulombic interactions observed in Fig. 10D and the high contact probability between lignin and PIL ions. In an effective PIL system, this contact probability can be up to one order of magnitude greater than that observed in weaker solvents, thereby driving lignin solubility.
In summary, MD simulations primarily represent the early-stage PIL–lignin interaction, and limitations of using a static native model for the full extraction process would be needed to precisely acknowledge the lignin modifications in dynamic environments. Altogether, the findings propose that the anion plays a key role in lignin solubilization, making its selection a cornerstone of the process. Similarly, drawing conclusive evidence from these results is challenging because under dynamic and optimized pretreatment conditions, lignin undergoes multiple simultaneous reactions. These include reactions with ethanol during the recovery process and reactions with impurities derived from hemicellulose and extractives. Furthermore, the severity of the process and the complex biomass components that drive condensation reactions obscure the specific interactions between lignin and the PIL. Therefore, future studies should eliminate impurities such as those from pre-hydrolysis (which removes hemicellulose) and residual extractives since simulating the full complexity of biomass with all impurities is theoretically prohibitive. Our results further demonstrate that lignin isolated through PIL-based biomass fractionation closely resembles lignin obtained via conventional organosolv processes. A substantial body of literature supports the applications of this PIL-extracted, lower-molecular-weight lignin, as discussed later in the text.
PIL-separated lignin (IonoSolv lignin), such as that obtained in this study, often undergoes in situ depolymerization during pretreatment, thereby reducing the need for additional chemical modification.43 This structural transformation during extraction positions PIL-derived lignin as a promising candidate for integrated downstream applications, a concept recently highlighted by Yang et al.92 Similarly, Zhang et al.93 proposed a multiple-freezing strategy to incorporate lignin/ionic liquid into a dual-cross-linked network of polyvinyl alcohol and gelatin, enabling the preparation of a multifunctional green composite hydrogel. Furthermore, technical lignins can serve as feedstock for three principal categories of chemical modification: fragmentation (depolymerization), synthesis of new chemically active sites, and functionalization of existing hydroxyl groups for targeted applications.94 However, the economic feasibility of modified lignins95 and their applications is a concerning fact which needs market acceptability.96,97
Therefore, we compile and compare different lignin applications in Table 5. Interestingly, most technical lignin-based applications require modification strategies. For example, fragmentation aims to depolymerize lignin into low-molecular-weight aromatics such as vanillin, which is currently the only lignin-derived chemical produced on a significant industrial scale.98 Another strategy involves the introduction of new reactive sites that impart functionality not present in native lignin.99 One such example is hydroxyalkylation, enhancing lignin reactivity and enabling partial substitution for phenol in phenol–formaldehyde resins.100 Lastly, the functionalization of existing hydroxyl groups represents one of the most versatile routes for lignin valorization.94,101
| Lignins | Modifications | Applications | Ref. |
|---|---|---|---|
| Kraft lignin | Lignin nanoparticles (LNPs) | Digital light processing (DLP) 3D printing of lignin–polymer nanocomposites | 102 |
| Epoxidized lignin, colloidal lignin particles | Fire and water-resistant composite adhesives | 103 | |
| Oxypropylation | Biocomposites for packaging applications | 104 | |
| Esterification and nanoparticles | Targeted chemical delivery in plant protection | 105 | |
| Hydroxymethylation and nanoparticles | Organic battery electrodes | 106 | |
| Hydroxyethyl modification, esterification and fractionation | Polyethylene terephthalate | 107 | |
| Aminopropyl/methyl silsesquioxane (WAPMSS) | Wood coating | 108 | |
| Soda, kraft, organosolv and aldehyde-protected lignin | Acid phenolation | Antiviral coating material | 109 |
| Alkaline lignin | Preoxidation | Lignin-derived hard carbon anode | 110 |
| Polymerized lignin formulation | Barrier coating in packaging | 111 | |
| Organosolv and milled wood lignin | Formulation | Natural sunscreens | 112 |
| Depolymerized lignin | Lignin hydrogenolysis oil chlorohydrin, glycidyl ether, cyclic carbonate | Nonisocyanate polyurethane/epoxy thermoset materials | 113 |
| RCF lignin oil | Molybdenum carbide to deoxygenate lignin | Jet-range aromatic hydrocarbons | 114 |
| Hydrolysis lignin | Depolymerization and glycidylation of lignin | Epoxy resin for metal coating applications | 115 |
| IonoSolv lignin | Spinning dope | Carbon fibres | 92 |
| Sodium lignosulfonate and kraft lignin | Emulsions | Hair conditioning | 116 |
| Aldehyde-assisted fractionated lignin | Formulation | Thermal paper | 117 |
| Aminated lignin | Amino-nanoscale or micron-scale solid particles | CO2 capture | 118 |
Based on the applications described in Table 5, for PIL-separated lignin, integrating lower-molecular-weight lignin and esterification with a modifier as described by Nelis et al.117 and Babaeipour et al.,119 where the resulting formulations are directly applicable as coatings on fiber-based substrates for packaging applications, is promising. Overall, the proposed applications require further research not only on minimum modifications but also on the effects of such modifications on biodegradability. Similarly, based on a comparative analysis of the literature and the results reported in this study, future research would focus on minimizing modification steps, as shown in Fig. 11 and on applying green chemistry strategies such as esterification for lignin-based industrial coatings, thereby advancing sustainable IonoSolv lignin valorization.
![]() | ||
| Fig. 11 Traditional kraft lignin extraction and valorization vs. the lignin-focused protic ionic liquid (PIL)-biorefining approach. | ||
| This journal is © The Royal Society of Chemistry 2026 |