Open Access Article
Yunhong Che
*ab,
Joachim Schaeffer
ac,
Jinwook Rhyu
a,
Liang Wua,
Patrick A. Asingera,
Minsu Kima,
Jacob Sassa,
Rolf Findeisenc,
Martin Z. Bazant
ad,
William C. Chueh*ef and
Richard D. Braatz
*a
aDepartment of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. E-mail: yunhche@mit.edu; braatz@mit.edu
bDepartment of Energy, Aalborg University, Aalborg 9220, Denmark
cControl and Cyber-Physical Systems Laboratory, Technical University of Darmstadt, Darmstadt 64289, Germany
dDepartment of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
eDepartment of Energy Science and Engineering, Stanford University, Stanford, CA 94305, USA. E-mail: wchueh@stanford.edu
fApplied Energy Division, SLAC National Accelerator Laboratory, Menlo Park, CA 94025, USA
First published on 11th February 2026
Reliable battery health diagnosis and cycle life prediction remain a central challenge for energy storage systems. This work first provides a systematic analysis of key factors for battery health diagnosis, highlighting previously overlooked yet critical elements that affect health assessments. Building on these insights, a rate-adaptive transformation model converts high C-rate features into low C-rate equivalents, enabling rapid diagnostics of battery aging modes without time-consuming testing using a low C-rate. To address fitting inaccuracies caused by aging, blended materials, and kinetic effects, an interpretable residual learning model corrects voltage mismatches, which also enables low C-rate fitting by using high C-rate data. Leveraging mechanistic-informed features, early cycle life prediction achieves mean errors of less than 70 cycles using data from fewer than 30 equivalent full cycles across complex and unseen aging conditions. This interpretable and generalizable framework bridges electrochemical understanding with practical diagnosis and offers a fast and reliable path toward mechanism-informed battery prognostics.
Broader contextReliable and rapid battery health assessment is essential for ensuring safety, extending lifetime, and supporting large-scale electrification in transportation and renewable energy storage. Yet, accurate diagnostics often rely on slow laboratory tests and complex electrochemical modeling that are difficult to apply under real operating conditions. This work bridges the gap between mechanistic understanding and data-driven modeling to accelerate and improve battery health diagnosis. By systematically identifying key factors that influence diagnostic accuracy, the study introduces a rate-adaptive transformation to translate high-rate measurements into equivalent low-rate data and an interpretable residual learning model to correct degradation-induced voltage errors. Together, these methods allow fast, accurate, and explainable cycle life prediction using only limited early-cycle data. This framework not only enhances the reliability and interpretability of battery diagnostics but also provides a practical pathway toward efficient health management for large-scale battery systems, contributing to a more sustainable and circular energy future. |
In practical applications, battery aging is typically characterized by cell-level indicators such as capacity and power fade. However, the underlying degradation stems from intricate electrochemical reactions that induce varying overpotentials due to transport limitations and reaction kinetics. These effects are further exacerbated under dynamic operating conditions, complicating both performance analysis and aging interpretation.8–13 As a result, aging mode and electrode health diagnosis become increasingly important for in-depth aging evaluation, which can be achieved via physical modeling or data-driven approaches.14–21
Differential voltage fitting (DVF) has emerged as a widely adopted diagnostic tool for such purposes. By fitting the full-cell voltage profile to a reference half-cell model, DVF effectively scales and shifts the open-circuit potentials (OCPs) of the anode and cathode, enabling the quantification of key degradation modes such as loss of lithium inventory (LLI) and loss of active material (LAM).13,22–27 To minimize kinetic distortion and accurately capture equilibrium behavior, DVF is ideally performed at extremely low current rates (e.g., C/40). However, such conditions are relatively hard for routine diagnostics due to their excessive time requirements, downtime requirements during operations, and may perturb aging trajectories. Higher C-rate measurements offer improved efficiency but introduce kinetic artifacts that degrade diagnostic accuracy. Recent advances have sought to overcome this challenge using machine learning models to infer low C-rate behavior from high C-rate data.28–30 Nevertheless, these data-driven approaches often lack physical interpretability and struggle to generalize across different battery chemistries and aging scenarios, thereby limiting their robustness and scalability. Physics-constrained models can improve interpretability, but it is still hard to identify key intrinsic relations between different C-rate measurements.31,32
The accuracy of DVF can deteriorate over time due to reference OCP shifts, voltage hysteresis, and the evolving nature of aging mechanisms. These factors introduce systematic fitting errors that compromise the reliability of electrode-level health indicators. To address this, previous studies have introduced additional fitting parameters, such as the silicon ratio in blended anodes, to adjust the anode's reference OCP and improve fitting accuracy.33–37 However, battery aging is a coupled, multi-physics process, especially in hybrid electrodes with multiple active materials, and changes in the cathode OCP can also play a significant role. These observations highlight that DVF fitting errors during aging are not isolated artifacts, but manifestations of complex, interacting degradation pathways. Therefore, compensating for these effects requires a more holistic and adaptive modeling framework that can evolve with the battery's aging.
Beyond current health diagnostics, predicting battery cycle life remains a critical yet challenging task, particularly under complex aging conditions. Cycle life prediction has become a critical enabler in battery research, accelerating material discovery, guiding cell design, facilitating predictive maintenance, and informing second-life deployment strategies.21,38–40 Early prediction is even more difficult, as batteries typically exhibit minimal degradation during the initial cycles. Nevertheless, early cycle life prediction offers significant benefits, including accelerated battery R&D at lower cost and improved maintenance strategies, making it a topic of global research interest. While recent advances in data-driven approaches have demonstrated strong predictive performance, many models depend on features that are highly sensitive to specific usage conditions, which limits their ability to generalize across diverse aging scenarios.41–44 In parallel, deep learning techniques offer enhanced accuracy but often do so at the cost of interpretability, hindering trust and obscuring physical insights.45–47 To address these challenges, models grounded in physically meaningful features and interpretable structures provide a compelling pathway toward accurate, robust, and generalizable cycle life prediction.24,38,48,49
In this work, we first conduct a systematic evaluation of DVF performance using OCV measurements at different C-rates, revealing critical limitations and offering actionable insights for more reliable and robust health diagnostics. Guided by these findings, we develop a simple yet interpretable transformation model that accurately predicts low C-rate electrode behavior from high C-rate measurements, enabling faster and more reliable mechanism interpretation. To further address residual fitting errors, particularly those exacerbated by aging, we introduce an interpretable residual learning model that corrects degradation-induced deviations. This hybrid framework combines mechanistic understanding with data-driven learning, enabling robust extraction of electrode health under diverse aging conditions. Notably, the residual model also allows reconstruction of low C-rate OCV curves with high C-rate measurements, substantially reducing diagnostic time while ensuring reliable health mechanism diagnosis in practical settings. Finally, leveraging the extracted mechanistic features, we demonstrate accurate early cycle life prediction using only the first 30 equivalent full cycles (EFCs), achieving a mean absolute error of fewer than 69 EFCs across a wide range of previously unseen protocols. Overall, this study advances hybrid DVF modeling with C-rate adaptability and establishes a unified, interpretable framework for accurate and explainable battery health diagnosis and cycle life prediction.
Firstly, we use Dataset 1 for DVF fitting with discussions of key remarks and issues influencing fitting performance, and demonstrate the correlation between the electrode features extracted from C/5 and C/40, showing the possibility and reasonability of transforming low C-rate RPTs to high C-rate RPTs in battery health diagnosis. Then, we discuss the challenge of DVF to track the aging mechanisms accurately in different aging conditions and how the effectiveness of DVF deteriorates during aging for both C/5 and C/40 OCV fitting, quantifying the reliability of DVF during aging and the dependency on C-rates. Building on these results, we use all three datasets for the evaluation of the proposed learning-based residual compensation. Finally, Dataset 2, which involves various aging patterns and loading protocols, is used for the interpretable and early cycle life predictions with respect to unseen aging conditions.
Different degrees of freedom (DOF) can be defined in DVF-based battery health diagnosis depending on assumptions regarding half-cell OCP variations. Here, we compare fitting results under 2-, 3-, and 4-DOF settings and discuss the implications of each approach (see Methods for implementation details).22,24,50 Under the 2-DOF configuration, only the capacity ratio between the cathode and anode and their relative offset are considered. While this yields good fits to full-cell OCV curves and enables fast computation, it lacks the ability to decouple electrode-specific degradation. As a result, the method implicitly assumes that aging occurs in only one electrode, limiting its diagnostic interpretability. The 3-DOF model introduces an additional degree of freedom by allowing one electrode (e.g., the anode) to scale independently. This method enables the identification of degradation in a specific electrode while maintaining computational efficiency. Compared to the full 4-DOF model, which accounts for independent shifts in both electrodes, the 3-DOF setting achieves similar fitting performance of accuracy for the tested cells (Fig. 1C) with high robustness, as the cathode exhibits minimal shift across cycling. Given the full-rank observability of mechanistic parameters confirmed via Jacobian analysis (Fig. S4), and the lower computational cost in Fig. 1B, we adopt the 3-DOF model throughout this work. However, this choice may need to be revisited for other cell chemistries where more significant shifts in both electrodes occur. In addition, the loss variations with respect to pairwise parameter perturbations are in Fig. 1D–F, while the loss evolution over iterations is in Fig. S4. These results collectively reflect the identifiability of the estimated parameters. The results reveal strong interdependence among the three parameters, with the offset parameter being particularly challenging to identify. The iterative process also reveals the presence of multiple local minima during parameter estimation, further highlighting the importance of conducting multiple rounds of optimization to ensure robust mechanistic parameter estimations.
The third critical factor in DVF performance lies in the definition of the objective function. The most direct approach evaluates the difference between the fitted and measured OCV curves, typically using the mean squared error (mse) of the voltage values, as the measured capacity is fixed and used as input. However, DVF models also aim to preserve the fidelity of the differential voltage curve, which captures key degradation signatures. To this end, we consider another loss function based on the difference in differential voltage (dvf), which promotes accurate aging mode identification and increases the sensitivity/weight to voltage changes due to phase transformations. Beyond direct voltage- or derivative-based losses, we introduce a Euclidean distance loss (eucl) that constrains the overall curve shape, providing a global regularization to maintain physical consistency. As shown in Fig. 1G, different objective functions yield distinct fitting results for the same cell. The Euclidean loss, in particular, preserves curve morphology more faithfully and shows higher stability across conditions (see also Fig. S1–S3). The detailed formulations are described in Methods.
Besides the three key factors discussed above, the selection of reference half-cell OCP curves for scaling and shifting is also crucial, as the shape of the half-cell OCP directly determines the underlying form of the full-cell OCV curve. Ideally, the C-rate used for testing the half-cell OCP should match that of the full-cell OCV. This alignment ensures that the scaled and shifted half-cell OCP curves accurately represent the full-cell OCV shape under the same C-rate condition. To illustrate the impact of mismatched C-rates, we present fitting results in Fig. 1H using inconsistent half-cell OCPs for full-cell OCV reconstruction—specifically, applying C/5 OCP curves to fit C/40 OCV data, and vice versa. The results show clear deviations between the fitted and measured curves in both scenarios. When low C-rate (C/40) half-cell OCPs are used to fit high C-rate (C/5) full-cell OCVs, the resulting curves display additional features not observed in the measurements. Conversely, using high C-rate OCPs to fit low C-rate OCVs fails to capture some mechanistic features present in the measurements. The implications of these mechanistic distortions will be further explored in the next subsection.
There are some other key issues that limit the fitting effectiveness in DVF. One major assumption is that the full-cell OCV can be reconstructed solely by scaling and shifting the pristine electrode OCP curves. However, this neglects electrode aging, which can significantly alter electrode kinetics, capacity ratios, and active material utilization. Moreover, half cells have to be physically harvested and constructed from full cells, introducing potential experimental artifacts. Such changes distort the shapes of the reference half-cell OCP curves, particularly for hybrid electrodes with multiple, multiphase active materials, such as silicon–graphite anodes,51 and are also observed in various cathode chemistries. Materials such as lithium iron phosphate (LFP) exhibit strong voltage hysteresis between charge and discharge, related to reaction-controlled phase separation,52–55 complicating consistent curve fitting across cycles in diagnostic tests. Additionally, temperature variations, kinetic overpotentials, and C-rate-dependent effects related to these nonlinear phenomena introduce further discrepancies between the fitted and actual differential voltage curves. Measurement noise and the non-uniqueness of the solution space also make it challenging to identify the underlying aging mechanisms reliably. In practice, filtering techniques are often applied to the raw differential voltage curves to reduce noise and extract key features such as peak and valley positions and magnitudes. However, excessive smoothing may compromise the fidelity of aging-relevant features, requiring a careful balance between denoising and feature preservation. Moreover, cell-to-cell variability and mechanical degradation, such as electrode cracking or separator deformation, can also introduce subtle but impactful differences in the DVF behavior, especially over long-term cycling. These effects are difficult to capture with purely physics-based models. To address these limitations, we propose a machine learning-based compensation approach for the unknown or unmodeled mechanisms in DVF, which is discussed in detail later in this article.
To enable fast and interpretable electrode state estimation and understanding of the degradation mechanism, we employ a sparse multi-task model that maps high C-rate OCV-derived features to their low C-rate counterparts. Unlike traditional approaches that rely on multiple independent regressors (e.g., Lasso and Elastic Net), we adopt a unified multi-task Lasso to simultaneously predict multiple mechanistic states, including full-cell capacity (Cq) and electrode-specific descriptors (Cp, Cn, Qli). The model is trained and validated on 94 cells using 50 randomized train-validation-test splits (60%–20%–20%), with hyperparameters optimized via cross-validation. Representative results for moderate prediction accuracy are in Fig. 2A–F, with best and worst cases detailed in Fig. S6 and S7. The model accurately recovers low C-rate states from C/5 features, achieving mean absolute errors (MAE) and root mean squared errors (RMSE) below 2.85% and 2.52%, respectively (Fig. 2C and Table S1). In addition to its predictive performance, the model is inherently interpretable: the learned feature weights reveal the relative importance of each input, offering insight into dominant degradation pathways. Capacity from the high C-rate has the overall highest impact; electrode capacity and lithium inventory show significant impact, indicating the importance of considering both LAM on cathode and anode, as well as LLI for the health diagnosis. This sparse structure facilitates efficient aging mode identification under high C-rate protocols, enabling the fast diagnosis of both LAM and LLI for interpretation of degradation patterns.
Despite the mismatched OCP curves, as discussed in the above section, the mechanistic states obtained from DVF remain highly consistent with those derived from matched fittings. As shown in Fig. 2G–I, the positive electrode capacity (Cp) extracted from the C/40 full-cell OCV curve using the C/5 half-cell OCP profiles exhibits a strong linear correlation (R > 0.9) with the value obtained using the matched C/40 OCP curves. This high degree of linearity is preserved across all three aging modes, demonstrating that mismatched OCP inputs can still yield reliable mechanistic interpretations. These results suggest that a simple linear transformation suffices to bridge the discrepancy introduced by OCP mismatches. Therefore, DVF can be flexibly implemented in practical scenarios, including under higher C-rate conditions or when only unmatched OCP data are available.
Using this model, we can reconstruct more realistic degradation patterns that would otherwise require low C-rate (C/40) diagnostic tests to reveal. Fig. 3B presents the reconstructed aging trajectories for all cells in Dataset 2, tracking the capacity loss of the cathode and anode (reflecting LAM for each electrode), the loss of active lithium ions (LLI), and the resulting full-cell capacity fade under diverse and complex aging patterns. Different cycling conditions result in different aging patterns containing different contributions from LAM and LLI, which are interpretable through our model. Additional representative cells with varying cycle lives and degradation patterns, along with their original C/5-based fitting results, are in Fig. S8. Generally, the C/5 fitting results give an overestimation of the aging conditions, i.e., LLI and LAM, while the cathode degradation (representing LAMp) may be underestimated in early aging stages. This discrepancy is further illustrated in Fig. 3C and D, where the reconstructed OCV and differential voltage (dV/dQ) curves under C/40 conditions are compared to C/5-based results for a representative cell exhibiting super-linear aging. The observed differences highlight how kinetic limitations and internal resistances at higher C-rates can obscure the true thermodynamic degradation, leading to a misjudgment of battery health. Through the variations of the key features, like the peaks and valleys of the reconstructed differential voltage curves, battery aging mechanisms can be interpreted. Notably, the widening gaps between the RPT cycle in the OCV and differential voltage curves accurately reflect the accelerated aging after the knee onsite. Nevertheless, the predictor still shows effective boundaries on the mechanism reconstruction, especially for mechanisms influenced by the blended electrodes and aging, which distorted the shapes of the OCV curves. In addition, the predictor still shows some mismatches in some cases. In such cases, even small fitting mismatches may lead to disproportionately large deviations in the inferred aging modes (e.g., LLI/LAM decomposition) and peak-related mechanistic interpretation. These limitations motivate the residual compensation introduced in the following section to correct systematic deviations and improve the fidelity of mechanism reconstruction.
We introduce a machine learning-based residual compensation strategy that enhances the fitting accuracy of mechanistic models by correcting their systematic errors. This machine learning residual makes the compensation originate from different sources during aging and is informed by mechanistic states as model input. Although the half-cell DVF model alone exhibits limited fitting accuracy, the mechanistic features that it extracts remain stable across different aging and C-rate conditions. These physically meaningful features representing the electrode health condition and the amount of active lithium, derived from either low or high C-rate data, provide a robust foundation for the learning model, thereby improving both its interpretability and extrapolation capability. As illustrated in Fig. 4C, the extracted mechanistic features Φ, which are obtained from the classical DVF framework, are fed into a residual learner, which predicts the pointwise residuals of the fitted OCV curve. These predicted residuals are then added to the baseline DVF model to yield an enhanced overall fit. Details of the model structure are provided in Methods. When applying the model across different C-rates, the rate-adaptive predictor introduced earlier is used to estimate the mechanistic features at the target C-rate. These predicted features are subsequently passed to the residual learner for residual compensation. Both the predictor and the residual learner adopt sparse and interpretable formulations, based on Lasso and multi-task Lasso. Using Dataset 1 with random data split with ratios of 6
:
2
:
2 for the training, validation 1, and validation 2 (testing) for both C/40 and C/5 OCV curves, the model achieves high accuracy and generalizability. As shown in Fig. 4D, E and Table 1, the MAE is reduced from over 17 mV to below 1.5 mV. Notably, the maximum absolute error decreases dramatically from 361.6 mV to under 10.1 mV substantially mitigating previously observed high-error regions (Fig. 4B).
| Objective | Method | RMSE | MAE | MaxAE | R2 |
|---|---|---|---|---|---|
| Dataset 1 C/5 | Residual compensation | 1.21 mV | 0.89 mV | 4.97 mV | 1.000 |
| Original fitting | 22.94 mV | 17.10 mV | 93.61 mV | 0.997 | |
| Dataset 1 C/40 | Residual compensation | 2.06 mV | 1.48 mV | 10.01 mV | 1.000 |
| Original fitting | 18.72 mV | 8.40 mV | 361.65 mV | 0.997 | |
| Dataset 2 C/5 | Residual compensation | 3.64 mV | 2.45 mV | 30.61 mV | 1.000 |
| Original fitting | 26.76 mV | 18.64 mV | 131.86 mV | 0.996 | |
| Compensation (retrain) | 1.37 mV | 0.95 mV | 21.70 mV | 1.000 | |
| Dataset 3 C/40 | Residual compensation | 3.01 mV | 2.01 mV | 131.35 mV | 1.000 |
| Original fitting | 30.58 mV | 18.27 mV | 240.99 mV | 0.992 | |
| Compensation | 1.87 mV | 0.96 mV | 99.94 mV | 1.000 | |
| Dataset 1C/5 to C/40 | Voltage compensation | 4.56 mV | 3.32 mV | 17.53 mV | 0.991 |
| Capacity compensation | 21.55 mAh | 16.49 mAh | 67.23 mAh | 0.877 |
We further applied the residual compensation model to enhance health diagnostics during battery aging, enabling aging mechanisms interpretation during applications. Trained on cells with varying SOHs from Dataset 1 and tested on Dataset 2, the model substantially improves the fit of differential voltage curves, as shown in Fig. S9. To validate the generalization capability, we retrained the model on a subset of cells and evaluated it on cells exhibiting unseen aging conditions.31 The resulting performance, illustrated in Fig. 5A, shows prediction errors consistently below 21.7 mV, with most errors tightly clustered around zero. Beyond improved fitting accuracy, the proposed hybrid model preserves and reconstructs key physical features that are critical for aging assessment, including indicators associated with LLI and LAM. As shown in Fig. 5B for a representative cell tracked across its full aging trajectory, conventional DVF increasingly fails to reproduce aging-induced changes in the voltage profile, particularly in deep depth-of-discharge (DOD) regions. This limitation arises from the underlying assumption of DVF that degradation effects can be fully represented by scaling and shifting of reference OCP curves, which becomes invalid as aging introduces heterogeneous kinetics, transport limitations, and interfacial resistance growth.
By contrast, the proposed hybrid framework explicitly compensates DVF residuals while retaining its physically interpretable backbone, enabling the recovery of both the global evolution of the differential voltage response and localized aging signatures. In particular, the model successfully reconstructs the gradual disappearance of low-SOC shoulder and inflection features with cycling. These features are commonly associated with phase-transition-related processes and stoichiometric plateaus, whose voltage signatures become progressively obscured as degradation advances. From a mechanistic perspective, such suppression is consistent with degradation pathways widely reported in blended anodes, where repeated lithiation-induced volume expansion of silicon phases leads to particle cracking, electrical isolation, and the formation of electrochemically inactive domains. This process couples LAM with LLI through the trapping of lithium in isolated regions and interfacial layers, thereby reducing electrochemical accessibility prior to the voltage cutoff. Concurrently, accumulated interfacial degradation amplifies polarization effects, causing the voltage response in deep DOD regions to become increasingly dominated by kinetic limitations. Importantly, the hybrid model captures the systematic shift of peak positions and the monotonic attenuation of peak magnitudes with cycling, as summarized in Fig. 5C. These coordinated trends reflect the coupled evolution of lithium redistribution and active material degradation, providing physically interpretable markers of aging progression rather than fitting artifacts. Together, these results demonstrate that the hybrid framework enables mechanistic insight into battery aging pathways under realistic operating conditions, beyond what is accessible with conventional DVF analysis alone. As summarized in Table 1, the improved model achieves the MAE of less than 2.45 mV across all testing cells in Dataset 2.
We then applied our model to Dataset 3, which includes a different commercial battery type and provides OCV curves during dynamic aging measured at C/40. The original fitted electrode states are plotted in Fig. S10. The error distributions before and after using the residual compensation are plotted in Fig. 5D, and the numerical results are listed in Table 1, where the errors are significantly reduced by introducing the proposed residual model. The model's ability to reconstruct aging patterns is demonstrated in Fig. 5E–G for three representative cells subjected to highway, urban, and real city driving profiles with differential voltage curves reconstruction in Fig. S10. In each case, the magnitude of the first voltage peak and the corresponding capacity are highlighted as the two key features characterizing the differential voltage curve. While the fitted curves capture the general trend of the peak shift during aging, they still deviate notably from the ground truth, typically overestimating the peak magnitude. The residual model effectively corrects these discrepancies, thereby enhancing the accuracy (with MAE of less than 0.97 mV) of battery aging diagnosis. Note that the filtering is needed in Dataset 3 since the original curves are noisy for feature extraction, as demonstrated in Fig. S10.
The reconstruction of aging degradation from high C-rate measurements to low C-rate expectations can be further improved by incorporating the residual learning model. Fig. S12 illustrates the variation across the original C/5 curves, the reconstructed C/40 curves from the rate-adaptive predictor, and the final enhanced curves with residual correction for a representative cell. While the rate-adaptive model captures the dominant rate-dependent trends, the residual correction further refines the curve morphology, particularly in low-SOC regions where voltage responses are strongly influenced by aging-induced polarization and resistance growth. Overall, by integrating the DVF framework with rate-adaptive prediction and machine learning-based residual compensation, we achieve more accurate and efficient battery health diagnostics. This approach enables deeper insights into battery degradation under complex aging scenarios, while significantly reducing testing time and resource consumption.
The above results demonstrate that the proposed hybrid and interpretable framework enables enhanced battery health diagnosis under complex cycling conditions, with improved modeling of residuals arising from aging-related kinetics, polarization, and degradation-induced parameter evolution. Other battery chemistries may exhibit distinct electrochemical characteristics that introduce additional challenges. For example, LFP cells show flat open-circuit voltage profiles and pronounced hysteresis behavior, which can affect observability and residual structures and are not explicitly addressed here. Nevertheless, the proposed framework is not restricted to any specific battery chemistry, as it systematically combines explicit mechanistic models with data-driven compensation and can be extended to alternative chemistries by incorporating appropriate mechanistic representations and targeted data.
| SOH threshold | Training/validation | RMSE (EFC) | MAE (EFC) | R2 |
|---|---|---|---|---|
| 95% SOH | Training | 6.95 | 4.18 | 0.78 |
| Validation 1 | 8.89 | 5.05 | 0.75 | |
| Validation 2 | 5.00 | 3.44 | 0.88 | |
| 90% SOH | Training | 26.15 | 17.57 | 0.86 |
| Validation 1 | 30.48 | 19.60 | 0.88 | |
| Validation 2 | 27.29 | 20.22 | 0.83 | |
| 85% SOH | Training | 58.45 | 40.40 | 0.84 |
| Validation 1 | 68.97 | 48.62 | 0.86 | |
| Validation 2 | 56.17 | 40.76 | 0.84 | |
| 82.5% SOH | Training | 75.13 | 50.42 | 0.83 |
| Validation 1 | 78.79 | 56.67 | 0.87 | |
| Validation 2 | 75.93 | 54.55 | 0.82 | |
| 80% SOH | Training | 94.45 | 64.80 | 0.81 |
| Validation 1 | 92.62 | 64.29 | 0.87 | |
| Validation 2 | 101.70 | 68.50 | 0.80 |
The MAE of early cycle life prediction until 80% SOH remains below 65 and 69 cycles for validation sets 1 and 2, respectively, with corresponding R2 values exceeding 0.8. The model accurately predicts cells with diverse aging trajectories, with both training and validation data closely aligning with the ideal parity line. However, predictive performance diminishes for cells with longer cycle lives, primarily due to data imbalance, as the model is more influenced by the abundant short-life data. Feature importance analysis of the interpretable model reveals that variations in positive electrode capacity and features derived from differential voltage curves play dominant roles. Notably, features from the negative electrode become increasingly significant when predicting cycle life at higher SOH levels. The peak positions and overall shape of differential voltage curves consistently contribute across different health stages. More prediction results are provided in Fig. S13. Note that the interpretation of the model reflects the feature contribution for the prediction, indicating the numerical data pattern learned from the machine learning model that is not aware of physics. The intrinsic relationship between the feature contribution and the actual dominating aging mechanisms needs further investigation. This is exemplified by predictions from later stages (around 85% SOH), where features linked to the negative electrode become more influential (Fig. S14).
In addition, the model is intrinsically interpretable, eliminating the need for post hoc explanation techniques such as SHAP (SHapley Additive exPlanations). To validate the robustness of feature importance, we conducted an additional analysis using XGBoost regression with SHAP-based interpretation. Comparable key features emerge, supporting the consistency of feature contributions across different model architectures (Fig. S15). By employing inherently interpretable models for rate-adaptive prediction, residual compensation, and cycle life forecasting, the underlying predictive mechanisms become more transparent. Overall, features related to electrode health and differential voltage curves play a significant role in early-stage cycle life prediction, and the improved diagnostic insight ensures both the reliability and interpretability of the proposed approach.
To address the challenges of DVF degradation caused by kinetic limitations, material blending, and especially battery aging, we introduce an additional interpretable residual model. This model successfully compensates for fitting errors and reconstructs more realistic degradation trajectories while preserving the critical aging mechanisms. The combination of the C-rate adaptive prediction model and residual compensation model allows a more accurate reconstruction of degradation curves under low C-rate conditions, which are often impractical to measure directly. The combined model achieves a mean absolute voltage error of less than 3.4 mV. Notably, the residual model remains interpretable and robust across batteries subjected to various aging protocols.
Lastly, early-stage cycle life prediction is accomplished using another interpretable model that leverages mechanistic features extracted from both half-cell and differential voltage curves. The model provides accurate and explainable predictions of battery lifespan under different SOH thresholds. When using data from only the first 30 cycles, the model achieves a mean absolute EFC error of less than 70 even under complex and previously unseen aging scenarios. This article concludes with the key insights for battery assessment and proposes interpretable models for systematically enhanced diagnosis and prognosis. Future efforts could focus on further aligning machine learning-based insights with underlying physical aging mechanisms, together with the mechanistic modeling of the DVF residuals.
| OCV(z) = OCPp(y) − OCPn(x), | (1) |
![]() | (2) |
| C = Cp(y0−y100) = Cn(x100−x0), | (3) |
![]() | (4) |
![]() | (5) |
![]() | (6) |
![]() | (7) |
![]() | (8) |
For the machine learning models, we employ (multi-task) Lasso (Least Absolute Shrinkage and Selection Operator) regression to capture the underlying mappings between features and targets.63–65 The objective function is formulated as
![]() | (9) |
, represents the empirical loss aggregated across tasks via the Frobenius norm. The second term,
, is a structured regularization penalty that encourages group sparsity across tasks. Specifically, the term applies the ℓ2 norm across each feature's coefficients (i.e., across tasks), and then sums over all features, which is equivalent to the ℓ2,1 norm of B. As a result, if a particular feature is not informative for any task, its entire row in B will be driven to zero. The regularization parameter λ > 0 controls the trade-off between minimizing prediction error and enforcing sparsity.
Model interpretability arises naturally through the learned coefficient matrix. For each feature j, its overall contribution across tasks is measured by the ℓ2 norm of the corresponding row, i.e., ||Bj,||2. A larger norm indicates that the feature has a stronger influence on at least one task. This allows for intuitive ranking of feature contributions and identification of shared predictors across multiple outputs, making the model inherently interpretable in multi-task settings.
| This journal is © The Royal Society of Chemistry 2026 |