Shengyu Taoab,
Ruohan Guoc,
Jaewoong Lee
b,
Scott Mourab,
Lluc Canals Casalsd,
Shida Jiangb,
Junzhe Shib,
Stephen Harris
e,
Tongda Zhangf,
Chi Yung Chungc,
Guangmin Zhou
*a,
Jinpeng Tian
*c and
Xuan Zhang
*a
aTsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, China. E-mail: guangminzhou@sz.tsinghua.edu.cn; xuanzhang@sz.tsinghua.edu.cn
bEnergy, Controls, and Applications Lab (eCAL), Department of Civil and Environmental Engineering, University of California, California, USA
cDepartment of Electrical and Electronic Engineering and Research Centre for Grid Modernisation, The Hong Kong Polytechnic University, Hong Kong, China. E-mail: jinpeng.tian@polyu.edu.hk
dEnvironmental Engineering Research Group (ENMA), Department of Project and Construction Engineering, Universitat Politècnica de Catalunya·BarcelonaTech (UPC), Barcelona, Spain
eEnergy Storage and Distributed Resources Division, Lawrence Berkeley National Laboratory, California, USA
fSystems Optimization Laboratory, Stanford University, California, USA
First published on 20th June 2025
The reuse of second-life lithium-ion batteries (LIBs) retired from electric vehicles is critical for energy storage in underdeveloped regions, where power infrastructures are weak or absent. However, estimating the relative remaining capacity (RRC) of second-life batteries using field-accessible data stream remains challenging due to its scarcity and heterogeneity, despite efforts in battery passports and other initiatives to secure data integrity. This study proposes a deep generative transfer learning framework to address these two-fold challenges by generating voltage dynamics across state-of-charge (SOC) and using deep correlation alignment (CORAL) to align heterogeneities resulting from different aging patterns (domains) of second-life LIBs. We generate voltage response dynamics data across various SOC conditions from 20160 samples under 10 SOC values, demonstrating high statistical similarities and confidence. The model estimates the RRC with minimal field data availability, specifically 2% of the full sample size, achieving a mean absolute percentage error of 7.2% and 3.6% for second-life batteries with different degradation behaviors, respectively. The model preserves established knowledge in the available domain while reducing RRC estimation risks in new domains where data availability is limited. The maximum RRC estimation risk is reduced by 49% at a 95% confidence level. This unified data generation and transfer learning paradigm outperforms the state-of-the-art machine learning and equivalent circuit model-method across all data availability conditions. The “generate and transfer” paradigm enlightens many potential applications in other predictive management tasks by preferentially generalizing in-distribution data and then adapting to out-of-distribution conditions under guidance of limited field data.
Broader contextAs the global transition to electric mobility accelerates, large volumes of lithium-ion batteries are approaching retirement. Repurposing these batteries for second-life applications offers a critical opportunity to extend their value, reduce environmental waste, and enable cost-effective energy storage, particularly in under-resourced regions lacking stable grid infrastructure. However, assessing the health and usability of retired batteries remains a major barrier due to the absence of usage history and significant variability in design and degradation. This study introduces a deep generative transfer learning framework that enables immediate estimation of remaining capacity using only minimally available field data. By generating synthetic voltage responses across charge conditions and aligning heterogeneous battery types via domain adaptation, the method circumvents the need for extensive historical records or laboratory calibration. Scientifically, it advances the integration of generative modeling and transfer learning in battery diagnostics, offering a principled way to handle out-of-distribution uncertainty. The approach enables a scalable solution to battery reuse by reducing testing time, minimizing deployment risks, and enhancing decision-making under data constraints. More broadly, it demonstrates how data-efficient, adaptive learning models can support sustainable energy systems by unlocking the latent value of discarded resources, providing both technological and environmental benefits in the circular energy economy. |
EV battery diagnosis and prognosis play critical roles in informing manufacturers and users of current and future degradation expectations, which have been well-studied even before the release of the CCR.4–8 These efforts focus on prior-retired EV batteries, assuming considerable homogeneity, i.e., fixed material types, physical formats, capacity designs, observable state of charge (SOC), and state of health (SOH), even though driving profiles might vary significantly. These different battery states, referred to as domains, can be transferred from the trained model and updated for cross-operation-condition tasks.9–11 However, this typically involves an infeasible assumption that historical information is available. For post-retired EV batteries, historical data are not guaranteed since access to the battery management system (BMS) can be denied due to privacy restrictions due to ownership changes,12 and the information might even be destroyed by poorly managed battery transportation, leading to significant data scarcity.13–15 As a key interest of second-life usage, immediate knowledge of remaining capacity is a bottleneck due to the lack of historical information. Pulse tests have been documented as effective, rapid, and non-destructive SOH estimation methods with a core idea of exciting battery dynamics for voltage signal responses from pulse current that was injected into batteries, where voltage response signals could be mapped to SOH.16–18 However, these “voltage-SOH” pairs are sensitive to second-life heterogeneities, i.e., varied SOC, material types, physical formats, and capacity designs of batteries.17–22
Instead of spanning physical experiments at scale, deliberate data generation is promising to address the twofold challenge of data scarcity and heterogeneity. Broadly, this has been brought to the attention of the researchers in many energy-related tasks, such as thermal runaway prediction,23 energy consumption estimation,24 state of energy estimation,25 SOC estimation,26 SOH estimation,27 remaining useful life prediction,28 and degradation path prediction.29 However, it remains questionable whether generative models can adequately deal with the data heterogeneity, especially for out-of-distribution (OOD) observations in a second-life context. Indeed, it is fundamentally challenging to the battery modeling community that the generative model can hardly estimate OOD batteries unless prior knowledge is secured to guide model generalization and increase transferability. Such priors are never easy to obtain from second-life batteries, which restricts the fidelity of data generation. Thus, a careful evaluation of the accessibility of the priors from the field test site is critical to a successful, more importantly affordable, domain knowledge transfer from established models.30–33 Rasheed et al. proposed a single pulse test set method that enables generalized remaining useful life and battery classification ignoring prior usage, validated through broad datasets.34 While the idea of using immediately available pulse test for another critical task, such as remaining capacity estimation, holds promises, the pulse injection at consecutive SOC is challenging in real-world settings due to the randomly distributed SOC of second-life batteries at the collection site. To date, there has been no unified framework for estimating the immediate remaining capacity of heterogeneous second-life LIBs that can address data scarcity and heterogeneity simultaneously.
This study demonstrates a deep generative transfer learning approach to address the two-fold data scarcity and heterogeneity challenges in estimating the remaining capacity of second-life batteries, which is sensitive to SOC. To do so, a variational autoencoder (VAE net) is introduced to generate “voltage-SOH” pairs across SOCs to mitigate data scarcity by learning the voltage response dynamics that are subject to the same current inputs, reducing the number of experiments under different SOC levels. The VAE net takes the already-tested voltage dynamics and the measurement condition (SOC) as input for training the encoder. The decoder reconstructs voltage response dynamics from noise signals representing untested SOCs, which are free of test time and cost. The purpose of the variational encoder is to learn a continuous latent representation of the input data and generate new data samples that resemble the training data. Using generated data, the immediate remaining capacity estimation of second-life lithium-ion batteries can be achieved by prioritizing the prediction of SOC states, which are utilized as conditional information for the remaining capacity prediction. Otherwise, the SOC information requires a long-term charge and discharge test to obtain. To align with practical usage and address data heterogeneities, the generated data were split by physical formats and capacity designs into distinct domains. Deep correlation alignment (CORAL), a transfer learning method, was employed to transfer knowledge from any known domain to new domains with minimal field data and reduced uncertainties.
Fig. 1 provides a detailed workflow. The proposed approach is promising in terms of saving intensive data curation time and showing flexibility in updating trained models to new domains without retraining from scratch. The proposed unified data generation and knowledge transfer paradigm is promising for many other energy predictive tasks by preferentially generalizing non-OOD data and then adapting to OOD conditions under the guidance of available field data.
![]() | (1) |
Qreal is determined with a standard constant current constant voltage (CCCV) charging and discharging procedure. First, the second-life batteries are discharged to a lower cut-off voltage (2.7 V) using a 1C constant current. Second, the second-life batteries are charged to the upper cut-off voltage (4.2 V) using a 1C constant current, then charged using constant voltage until the current drops to 0.05C. Third, the second-life batteries are then discharged to the lower cut-off voltage (2.7 V) using a 1C constant current. C stands for charge (discharge) rate when a 1 hour of charge (discharge) is performed, which is a dimensionless scalar number.
Domain | Q (A h) | Physical format | Battery quantity (simulated quantity) | Data entry | RRC range | RRC mean (deviation) |
---|---|---|---|---|---|---|
#1 | 2.1 | Cylinder | 12 (67) | 670 | 0.61–0.92 | 0.81 (0.07) |
#2 | 3.1 | Pouch | 2 (14) | 140 | 0.68–1.01 | 0.87 (0.08) |
#3 | 5.2 | Pouch | 3 (15) | 150 | 0.50–1.02 | 0.83 (0.16) |
In each accelerated aging cycle, all batteries were charged with a 2C constant current followed by a constant voltage at 4.2V with C/200 or 30-minute cut-off condition followed by a 5-minute rest. The same constant discharge current was applied with a cut-off voltage of 3 V followed by a 5-minute rest. Note that C stands for rate of charge (discharge) relative to that for a 1 hour of charge (discharge). After every 100 accelerated aging cycles, the batteries were rested for 30 minutes to reach a steady state, preparing for pulse injection. The pulse injection is performed at 10 SOC levels, i.e., from 5% to 50%, with an increment of 0.05, as the second-life batteries are typically stored with relatively low SOC before redeployment. The current used to adjust SOC was 1C. Thus, the time for moving across SOC levels is 3 minutes. We used identical rest time, i.e., 25s, between all injected pulses. The pulse width was 5s and the amplitude was ±0.5C, ±1C, and +2C. The +2C pulse is the final pulse of each experiment, then the SOC adjustment is continued until the 50% SOC level is hit. The ambient temperature was set at 25 °C. Note that positive and negative pulses alternate to cancel the electric charge injection, so that no SOC calibration is required between different pulse injections. Since there are 10 SOC levels tested, the data entry in each domain is ten times that of the simulated battery quantity. The tested SOC are taken as the true SOC label in this paper. Specifically, data entry numbers for domains 1–3 are 670, 140, and 150, respectively, thus a total of 960 data entries were generated. The detailed workflow is explained in Fig. S1 (ESI†). The alternating setting between the accelerated aging test and the pulse test demonstrates that the safety of the pulse test within the wide observed RRC ranges, i.e., from 0.5 to fresh cells, even under 2C harsh cycling conditions.
![]() | (2) |
The voltage dynamics have a clear physical meaning: second-life batteries exhibit a higher polarization resistance, resulting in a higher voltage response, when subjected to an identical current input. The pulse injections were performed at ten different SOC levels instead of 5% SOC, resulting in a total pulse injection dataset with 960 observations. Thus, the dataset used in this study encompasses 20160 voltage dynamics feature points, including different capacity designs, physical formats, and SOC distributions. Fig. 2b illustrates that when physical formats and capacities differ, the batteries can respond significantly differently. Specifically, the same physical formats do not necessarily guarantee a uniform voltage response behavior, as shown in voltage dynamics for Pouch31 and Pouch52, where divergences can even be more pronounced than those of Cylind21 and Pouch52. Noticeably, domain divergences can be more than 0.2 V, which accounts for the polarization increase from fresh cells to around 80% relative remaining capacity within a single domain. Therefore, knowledge transfer between domains is necessary to ensure real-world deployment, where domain divergence exists but cannot be known as prior information at the test site.
In Fig. 4a, the Cylind21, Pouch31, and Pouch52 second-life battery groups exhibit linear, non-linear, and sub-linear aging, respectively, with distinct degradation behaviors noticeable. This implies a phenomenon where even if the data scarcity issue can be resolved within one domain, it is insufficient to generalize the model for OOD RRC estimations. Thus, it is necessary to train separate generative models for different domains to facilitate learning distinct degradation mechanisms and then design transferring mechanisms if the data scarcity challenge can be resolved.
Using the trained VAE net, a uniform data observation enrichment for continuous SOC levels is observed, as depicted in Fig. 4a. Multiple test points at the same SOC refer to the fact that batteries with different RRC are tested. In another way, one physical battery with one specific RRC is tested under multiple SOC, where an arrow was used to indicate the aging direction. Essentially, the VAE net learns polarization behaviors across SOC in one specific domain characterized by distinct degradation behaviors, i.e., linear, sublinear and accelerated aging. In Fig. 4b, the generative capability of the VAE net is quantified, finding that the tested and generated data distributions are similar to each other with low KL-divergence (KLD) values of 0.68, 0.63, and 0.25 for Pouch52, Pouch31, and Cylind21, respectively. Note that KLD is a statistical measure used to quantify the difference between two probability distributions. There is no universal threshold for a “good” KLD value because it depends on the scale and nature of the distributions. Instead of aiming for an absolute KLD, it is preferable to compare KLD values across models. Despite different degradation mechanisms, physical formats, and capacity designs, the data generation performance is consistent across all 21 voltage dynamics dimensions, with minor KLD values, as demonstrated in Fig. 4c. Thus, the data scarcity issue is addressed by generating extensive data observations across different SOCs, which are not easily covered by separate physical measurements.
In Fig. 5a, the predictive performance of the SOC net is presented. Data collected from Cylind21 is the source domain, assuming the data are available due to cumulative data collection over time. The SOC net can immediately infer SOC values from voltage dynamics without any capacity test. Again, this SOC information is otherwise obtained by long-term charge and discharge tests. A 9.1% MAPE for SOC prediction of the Pouch52 second-life batteries is observed when using the full size of Cylind21 data and only 2% of the Pouch52 data, equivalent to 42 second-life battery samples collected at the test field. The Pearson correlation of 0.96 indicates a strong linear correlation between the predicted and actual SOC values, indicating that the prediction is highly reliable. More importantly, the SOC net does not compromise the source domain's predictive capability, as evidenced by a 3.6% MAPE and a 0.96 Pearson correlation. In Fig. 5b, for another task with larger domain divergences, as shown in the voltage dynamics differences between Cylind21 and Pouch31 in Fig. 2b, the SOC net still performs robustly. With only 2% of the Pouch31 data, the SOC net can immediately infer SOC levels with a 6.4% MAPE and a 0.97 Pearson correlation. We observe that the SOC prediction in the higher SOC region of the Clind21 → Pouch31 is slightly divergent. This observation can be rationalized by increased domain divergence resulting from accelerated aging patterns in Pouch31 (see the aging direction of Pouch31 in Fig. 4b), which makes transfer between Clind21 and Pouch31 more challenging.
Having access to the SOC information predicted by the SOC net, the voltage dynamics are helpful to predict the RRC of the second-life batteries; otherwise, the representability of voltage dynamics to battery degradation is complicated by the mixed SOC distributions, as shown in Fig. 3b. The predicted SOC condition is appended to the voltage dynamics values immediately obtained from the test field for RRC prediction. Fig. 5c shows an RRC prediction with a 7.2% MAPE and a 0.84 Pearson correlation, which requires only 2% of the data availability. The predicted RRC points exhibit a noticeable concentration along the perfect prediction line (the red diagonal line), regardless of the mixed SOC levels, indicating that CORAL successfully transforms heterogeneities resulting from physical formats and capacity designs into a shared statistical space. In Fig. 5d, we observe a robust result for Pouch52 second-life batteries, with a 3.6% MAPE and a 0.72 Pearson correlation. Since the batteries with the same RRC are tested multiple times across different SOC levels, we note that some prediction points are located far from the diagonal line. However, the number of these outlier samples is relatively small, which does not significantly impact the overall RRC prediction error.
We perform absolute error frequency analysis to examine prediction uncertainties and risks since second-life batteries are more safety-conscious. In Fig. 5e, a concentrated absolute error distribution is observed, where the error frequency is calculated as the ratio of the number of instances within a certain error range to the total number. When considering a 95% accumulated error frequency, the target domain prediction is robust, with a 4.6% MAPE. In Fig. 5f, when transferring to a challenging condition with a larger domain difference, the prediction result remains stable with a 3.6% MAPE.
We summarize error values reported in Fig. 5a–f in Table 2 for a clear presentation and discussion. In Table 2, both the backward transfer and forward transfer are evaluated. The − and + symbol refer to backward transfer and forward transfer, respectively. The backward transfer refers to the scenario where the transferred model is used to make predictions in the domain from which it was initially transferred, i.e., the source domain. While the forward transfer means that the transferred model was used to make predictions in target domain that guided the transfer. The backward transfer and forward transfer evaluate the model's preservation of established knowledge and its transferability to new knowledge, respectively. It can be seen from Table 2 that both SOC predictions and RRC predictions demonstrate good knowledge preservability in the original domain, even if the difficulties vary upon different transfer scenarios. Thus, we conclude that the proposed CORAL transformation finds a domain-invariant representation of both source (Cylind21) and target (Pouch52, Pouch31) domain tasks. It is noted that the maximum prediction risk also decreases when transferring knowledge from the source domain to the target domain, which is preferable to second-life applications since they can be relevant to improved estimation reliability, and robust second-life battery warranty strategy designs. The observed maximum RRC estimation risk given a 95% confidence level is reduced by 49% averagely (45.23% and 53.25% reduction for Cylind21 → Pouch52, and Cylind21 → Pouch31 transfer scenarios, respectively).
Transfer scenario | SOC prediction | RRC prediction | |||
---|---|---|---|---|---|
ρ | MAPE (%) | ρ | MAPE (%) | Max risk (%) | |
Cylind21 → Pouch52 (−) | 0.98 | 4.7 | 0.84 | 3.6 | 8.4 |
Cylind21 → Pouch52 (+) | 0.96 | 9.1 | 0.84 | 7.2 | 4.6 |
Cylind21 → Pouch31 (−) | 0.98 | 5.7 | 0.83 | 3.8 | 7.7 |
Cylind21 → Pouch31 (+) | 0.97 | 6.4 | 0.72 | 3.6 | 3.6 |
For the practical usage of the proposed deep generative transfer learning approach, one may be concerned about the impact of the field data availability that is critical to guide the model to generalize, which has significant relevance to time and cost feasibility. In Fig. 5g and h, we carefully examine the impact of field data availability on target and source domain tasks. It perfectly aligns with the intuition that predictive performance increases with the availability of field data for all tasks. However, in turn, data scarcity challenges the model performance, especially when faced with changing data distributions resulting from different physical formats and capacity designs. For 33.3% of the field data availability, the model shows 1.7% and 3.4% MAPEs for Cylind21 → Pouch31 and Cylind21 → Pouch52 tasks, respectively. Interestingly, when field data availability decreases to 10%, the model performance is better than that of its counterparts with even more data availability, with 1.6% and 3.0% MAPEs for Cylind21 → Pouch31 and Cylind21 → Pouch52 tasks, respectively. This phenomenon inspires us to carefully examine the trade-off between the scale of the spanning dataset and the improvement in predictive performance. More importantly, seeing that MAPE for source domain tasks fluctuates with 1% even though the field data availability decreases drastically from 33.3% to 2%, as shown in Fig. 5g, the model hardly forgets about the previous domain tasks, which is challenging for transfer learning and continual learning tasks. For the source domain of Pouch52, this stability is as low as 0.2% MAPE, given a considerable change in field data availability from 33.3% to 2%. In Fig. 5h, similar stability is observed for the source domains of Pouch52 and Pouch31 tasks, respectively. However, it has been brought to our attention that the performance in terms of MAPE is higher for Pouch52 than that of Pouch31; counterintuitively, the Pearson correlation is higher for the Pouch52 task. This abnormality can be rationalized by the lower bound of RRC distribution reaching 0.50 while that of Pouch31 is 0.68, as shown in Table 1, where lower RRC values can lead to higher MAPEs given the same absolute prediction error. Among the two domain adaptation scenarios studied, the transfer from Cylind21 to Pouch31 is more challenging than to Pouch52 as illustrated by the domain divergence. This can be attributed to both distributional and degradation behavior differences. As observed in Fig. 2b, Pouch31 exhibits voltage dynamics that deviate more substantially from the source domain (Cylind21), and Fig. 4a further reveals a nonlinear, accelerated degradation trajectory unique to Pouch31. In contrast, Pouch52 shows a relatively consistent aging pattern, resulting in a smaller domain gap. These differences make CORAL alignment more difficult in the Cylind21 → Pouch31 case, as the feature distribution shift includes both covariate and conditional shifts, which is evidenced by the lower Pearson correlation coefficient as compared to the Cylind21 → Pouch52 case.
Note that the minimal field data availability is set at 2%, as no knowledge of the target domain can result in large errors due to domain divergence, as shown in Fig. 2b. In Fig. 6a and b, the deep generative transfer learning approach consistently shows lower MAPE across various field data availability and knowledge transfer tasks. In the linear regression, the prediction presents two orders of higher errors than our methods due to insufficiency in handling data scarcity and heterogeneity. When the field data availability increases from 2% to 5%, the deep neural network method approaches the deep generative transfer learning approach in an accelerated manner.
![]() | ||
Fig. 6 Benchmarking performance of (a) and (b) mean absolute percentage error (MAPE) in log space and (c) and (d) Pearson correlation coefficient under limited field data availability scenarios. The benchmarking settings are practical scenarios in which field data are limited but can be collected over time. All models only use the field data except for the proposed generative transfer learning method using data from other domains to learn a knowledge transfer mapping. See the Methods section for detailed experimental settings. The experimental data can be found in Tables S1–S4 (ESI†). |
Fig. 6c shows that the deep neural network lags behind the deep generative transfer learning approach by 0.2 and 0.15 Pearson correlation values for Pouch31 and Pouch52, respectively. This phenomenon indicates that data size is crucial for making accurate predictions, although it is still hindered by data heterogeneity. Fig. 6d illustrates a better linearity of prediction when transferring from Cylind21 to Pouch52, which is understandable because the transferability between Cylind21 and Pouch52 is superior to that between Cylind21 and Pouch31, as seen in Fig. 2b. This demonstrates a delicate trade-off between the necessity of knowledge transfer to correct for domain divergences and the extent to which these divergences can be transferred. Unfortunately, it is a significant challenge that we often face due to insufficient data access for new domains, forcing us to generate high-fidelity data from existing datasets and transfer established model knowledge from old domains into new ones. In short, this paper establishes a pulse test-based method that is fast, non-intrusive, generalizable, and transferable from previously measured voltage features within one domain to target domains. The data volume requirement for such target domains is demonstrated to be as low as 2% compared to that in the established domain. Compared to other state-of-the-art machine learning techniques, our approach stands out as a stable method with low prediction error and minimized field data requirements.
Since the main claim of the proposed method is that one can immediately estimate the residual capacity of the retired batteries with minimal field data, it is particularly relevant to compare the performance with non-learning-based methodologies, i.e., parameter identification from an equivalent circuit model.35,36 These model-based methods require no historical data to train, see Note S1 (ESI†) for detailed problem formulation. To maximize the generalizability of such comparison, typical begin-, middle-, and end-of-life batteries from the Cylind21 dataset are selected, with their RRC values being 0.91, 0.88, and 0.66, respectively. The MAPE values of these estimations are 4.9%, 11.9%, and 51.9% compared to their ground truth, as shown in Table S5 (ESI†). The result can be rationalized by selecting an open-circuit-voltage (OCV)–SOC relationship at the beginning-of-life, which is a fundamental assumption in the equivalent circuit model-based parameter identification paradigm. However, this OCV–SOC relationship cannot be immediately obtained for second-life batteries, which leads to an increased estimation error caused by divergences between begin-, and end-of-life batteries. The proposed deep generative transfer learning method fundamentally reduces the need for considerable data and facilitates immediate residual value estimation of retired batteries.
The limitation of this work lies in its handling of OOD data distribution, as it focuses on studying battery formats and capacity designs. However, heterogeneities can also arise from cathode chemistry types and historical usage patterns. Demonstrating its applicability to cells with different cathode chemistries (e.g., lithium iron phosphate, lithium manganese oxide, lithium nickel cobalt aluminum oxide, lithium cobalt oxide-based cells) would significantly reinforce the perceived utility and robustness of the framework. Although the pulse test has been demonstrated to be a safe technique in the observational data presented in this work, more in-depth degradation patterns should be further investigated to ensure a comprehensive understanding of the safety margin for applying pulse tests. We have focused on the SOC dependence of voltage dynamics, which reflects degradation. However, other degradation patterns, such as loss of active material, loss of lithium inventory, and their mixtures, should be explicitly included. Future studies could integrate pulse current injection with various pulse durations, magnitudes, and combinations to validate its effectiveness. Second-life battery packs with cell-to-cell variabilities, rather than disassembled individual cells, should be studied to engage practical interest.37 Another limitation of this work is that second-life batteries are simulated based on accelerated aging tests, where calendar aging is not included. However, these batteries are typically stored in the warehouse for a few months before they are given a “second life”. Despite the calendar aging itself being challenging to decouple from cycling aging, future studies should consider the joint impact of cycling and calendar aging. Thus, sophisticated internal state monitoring of the second-life battery states is recommended to enable the inclusion of calendar aging effects by establishing the empirical models.38
The observed backward transfer phenomenon is likely attributed to fundamental differences in degradation mechanisms between domains, such as varied polarization behavior or nonlinear degradation dynamics. Interestingly, such backward transfer, i.e., performance decrease with increased target-domain data, can offer practical insights by implicitly revealing domain-specific degradation pathways, thereby providing a valuable signal for sorting and classification strategies in second-life applications. From a deployment perspective, when backward transfer occurs, rapid correction through minimal target domain sampling emerges as a promising direction for future research. This also highlights a key operational insight: more data does not always yield better predictive performance. Instead, it is recommended to develop a data-efficiency-aware approach that seeks a balance between the sampling cost and model utility. On a theoretical basis, future work should aim to define transferability metrics to help practitioners determine whether a task in a new domain can benefit from existing models or requires retraining with extra domain-specific knowledge.
In conclusion, the proposed deep generative transfer learning approach addresses the challenges of data scarcity and heterogeneity in joint SOC and RRC estimation problems for second-life batteries, which often have minimal field data availability. The results demonstrate that the predictions are stable and have low statistical risk. The proposed method suggests that one could preferentially generalize non-OOD data (already measured) and then transfer to OOD conditions (test field site) under the guidance of available field data, as data curation time and cost are significant challenges for many other energy-related control and management tasks. Broadly, the proposed approach is promising in reducing environmental and financial costs associated with large-scale data curation, offering a transformative impact on second-life battery applications, especially in underdeveloped regions, where reliable energy storage is vital to addressing the challenges posed by weak or non-existent power grid infrastructure.
Two separate parameters, i.e., the mean and log-variance
are produced from the encoder. The sampling layer is defined as:
![]() | (3) |
![]() | (4) |
KL divergence loss LossKL, i.e., the KL divergence between originally tested and generated data, is given by:
![]() | (5) |
LossVAE_net = ωxent·lxentMSE + ωKL·lKL | (6) |
The training epoch is 1000, the batch size is 32, the latent dimension (size of sampling layer) is 2, and the sampling intensity is 10. Sampling intensity refers to the ratio of the number of generated and the originally tested observations, which is implemented by regulating the number of points taken from . Sampling intensity is set as 10 in this work.
![]() | (7) |
The loss function of the CORAL net is designed as:
![]() | (8) |
![]() | (9) |
![]() | (10) |
![]() | (11) |
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5ee02217g |
This journal is © The Royal Society of Chemistry 2025 |