A machine-learning framework for interpretable prediction of cellulose degree of polymerization retention in green solvents

Abstract

Cellulose is one of the most abundant natural renewable polymers. Its wide availability, biocompatibility, and biodegradability enable broad applications in papermaking, textiles, biomedical materials, and biofuels. However, the strong inherent network of intermolecular hydrogen bonds in cellulose impedes its dissolution in most organic solvents and water, thereby limiting its high-value utilization. In recent years, green solvents such as ionic liquids (ILs), deep eutectic solvents (DESs), and molten salt hydrates (MSHs) have emerged as effective dissolution media. However, they invariably induce cellulose depolymerization to some extent. Moreover, experimental determination of the degree of polymerization (DP) is laborious, and conventional prediction methods lack accuracy. Here, we propose an interpretable machine learning (ML) framework that uniquely integrates raw material properties, process parameters, and microscale physiochemical descriptors of solvent components from computational chemistry. Solvents were disassociated into cations, anions, and auxiliary components, with microscale physicochemical descriptors calculated via computational chemistry. Seven feature combinations were designed to train six ML models, and model interpretability was enhanced using SHapley Additive exPlanations (SHAP) and partial dependence plots (PDP). The optimal model (group 5-RF) achieved a test set R2 of 0.9555, demonstrating excellent generalization and robustness, as confirmed by leave-one-solvent-family-out cross validation. The key influential factors (temperature, time, Cation_ESPmax, Cation_BalabanJ, and Other_NBO(H+)) and their nonlinear synergistic/antagonistic effects were revealed. A user-friendly graphical user interface (GUI) was developed to facilitate rational solvent design and process optimization. This study establishes a new approach to intelligent prediction of cross-solvent material processing and provides a theoretical and technical basis for targeted preparation of high-performance cellulose materials.

Graphical abstract: A machine-learning framework for interpretable prediction of cellulose degree of polymerization retention in green solvents

Supplementary files

Article information

Article type
Paper
Submitted
09 Dec 2025
Accepted
23 Mar 2026
First published
27 Mar 2026

Green Chem., 2026, Advance Article

A machine-learning framework for interpretable prediction of cellulose degree of polymerization retention in green solvents

C. Wei, Y. Liu, H. Cui, Y. Xie, F. Song, Y. Zhang and H. Tan, Green Chem., 2026, Advance Article , DOI: 10.1039/D5GC06642E

To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page.

If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.

If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page.

Read more about how to correctly acknowledge RSC content.

Social activity

Spotlight

Advertisements