Guijun
Ma
a,
Songpei
Xu
b,
Benben
Jiang
c,
Cheng
Cheng
b,
Xin
Yang
d,
Yue
Shen
e,
Tao
Yang
f,
Yunhui
Huang
e,
Han
Ding
a and
Ye
Yuan
*b
aSchool of Mechanical Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
bSchool of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan 430074, China. E-mail: yye@hust.edu.cn
cDepartment of Automation, Tsinghua University, Beijing 100084, China
dCollege of Electrical and Information Engineering, Hunan University, Changsha 410082, China
eSchool of Materials Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
fState Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, China
First published on 30th July 2022
Real-time and personalized lithium-ion battery health management is conducive to safety improvement for end-users. However, personalized prognostic of the battery health status is still challenging due to diverse usage interests, dynamic operational patterns and limited historical data. We generate a comprehensive dataset consisting of 77 commercial cells (77 discharge protocols) with over 140000 charge–discharge cycles—the largest dataset to our knowledge of its kind, and develop a transfer learning framework to realize real-time personalized health status prediction for unseen battery discharge protocols, at any charge–discharge cycle. Our method can achieve mean testing errors of 0.176% and 8.72% for capacity estimation and remaining useful life (RUL) prediction, respectively. Additionally, the proposed framework can leverage the knowledge from two other well-known battery datasets, with a variety of charge configurations and a different battery chemistry respectively, to reliably estimate the capacity (0.328%/0.193%) and predict the RUL (9.80%/9.90%) of our cells. This study allows end users to tailor battery consumption plans and motivates manufacturers to improve battery designs.
Broader contextWith the rapid development of lithium-ion batteries, battery health status management has attracted more and more attention in recent years. For safety and sustainability considerations, each end-user would like to know the current battery health status (e.g., the state of health and remaining useful life) of electronic devices (e.g., electrified vehicles). However, due to personalized battery usage preferences, building a unified and reliable health status prediction model is hindered by cell-to-cell and cycle-to-cycle discharge variabilities. To solve this realistic issue, we develop a transfer learning framework to customize personalized battery health status prediction for each end-user. Using this framework, each end-user could accurately obtain the current battery health status by inputting the cycling data in the recent 30 cycles and then tailor battery consumption plans accordingly. Our approach has been demonstrated by LFP/graphite cells and NMC/graphite cells, and experimental results showcase the effectiveness and generalizability of the proposed approach. |
Two significant challenges stand in the way of developing a reliable real-time personalized health status prediction strategy: variability of discharge protocols and real-time requirements.23–25 First, due to the high usage variability, as well as different battery chemistries and ambient temperatures, the same discharge protocols and degradation patterns rarely exist in the historical database. Thus, if the health status of a new discharge protocol is predicted by other historical protocols directly, there is a possibility of bias.26 Previous studies focused on either performing general prediction by extracting common high-performance features from early-cycle battery data,6,27 or proposing case-by-case prediction using at least 25% degradation data of each battery.28–33 On the one hand, general prediction approaches using common features produced conceivable prediction errors due to the neglect of the personalized degradation pattern of each battery. On the other hand, the case-by-case prediction approaches had a contradiction between the amount of data and the prediction accuracy. Second, health status prediction is required to be real-time, so that end-users are continuously informed about the current battery health condition and may schedule maintenance or replacements accordingly.34 Previous research focused on predicting the battery health status at pre-defined cycle(s),6,27–33 which does not match real-time requirements.
Recent research interest in transfer learning aims to improve the performance of specific learners by transferring the knowledge from other different but related learners.35,36 As an important branch of transfer learning, deep transfer learning requires no manual feature engineering.37 Deep transfer learning methods have been successfully applied to various applications, such as medical diagnosis,38,39 chemistry,40,41 natural language processing,42,43 and smart manufacturing.44,45 An increasing number of deep transfer learning technologies have been employed for health status prediction of lithium-ion batteries. However, these studies require at least 50% prior degradation data of the target battery to perform a curve matching from the historical database, without considering the necessity of real-time prediction along with battery degradation.46–48 There still exists a considerable opportunity to fill the gap between deep transfer learning and real-time personalized health status prediction for a wide range of usage patterns.
In this work, we design a transferable deep network to achieve personalized and real-time health status prediction of lithium-ion batteries, using partial cycling data of the recent 30 cycles at any interested cycle (Fig. 1). To this end, we construct an experimental platform containing 77 lithium-iron-phosphate (LFP)/graphite cells with cycle lives ranging from 1100 to 2700 cycles. The cells undergo different multi-stage discharge protocols to approximate usage variability, thus acquiring 146122 charge–discharge cycles in total. Our method achieves mean testing errors of 0.176% and 8.72% for capacity estimation and RUL prediction respectively, at all charge–discharge cycles. In addition, we also perform two further tasks to predict the health status of the testing cells in our dataset by transferring the degradation knowledge from two other datasets, which have different charge–discharge conditions and a different battery chemistry from our dataset, respectively. Mean testing errors of 0.328% (0.193%) and 9.80% (9.9%) for capacity estimation and RUL prediction are achieved in two further tasks. These results illustrate the effectiveness and generalizability of the deep transfer learning framework on health status prediction according to the personalized usage patterns.
![]() | ||
Fig. 2 (a) Discharge capacity of 77 LFP/graphite cells with completely different discharge protocols. The colour of each curve is scaled by the battery cycle life. (b) The joint distribution of cycle life and initial capacity, where the cycle life and initial capacity both have large standard deviations. Two handcrafted features have poor predictive performance: (c) cycle life as a function of a common handcrafted feature6 (i.e., mean of square of ΔV3−2(Q); ΔV3−2(Q) is the difference of the charge voltage curves between the 3rd cycle and the 2nd cycle), (d) RUL as a function of mean of square of ΔV30−2(Q) (ΔV30−2(Q) represents the difference of the charge voltage curves between the 30th cycle and the 2nd cycle in every 30 cycles). Four important feature curves are collected from 80% SOC to the first 3.6 V for a representative cell: (e) charge voltage curves, (f) charge capacity curves, (g) difference of charge voltage curves between each cycle and the 10th cycle and (h) difference of charge capacity curves between each cycle and the 10th cycle. The feature curves are interpolated to a fixed length of 100 points and the colour of each curve is scaled by the cycle number in the representative cell. |
We also employ two other well-known public datasets which contain different charge–discharge configurations, ambient temperatures and battery chemistries from our dataset. Among them, the first public dataset comprises more than 100 LFP/graphite cells (see Fig. S3a, ESI† for degradation curves) from Severson et al.27 and includes 72 different charge protocols, but an identical discharge protocol, which focuses on charge variability other than discharge reflected in our dataset. The second public dataset developed by Preger et al.49 contains 22 lithium–manganese–cobalt-oxide (NMC)/graphite cells (see Fig. S3b, ESI† for degradation curves), where the NMC/graphite cells were cycled at three ambient temperatures (15 °C, 25 °C and 35 °C) with an identical charge protocol, but four different one-stage discharge protocols (0.5C, 1C, 2C and 3C). It could be found that the second public dataset and our dataset have different battery chemistries, cycling ambient temperature and discharge configurations. More details of the two public datasets are described in the Methods section. In this work, cross-scenario and cross-chemistry battery health status predictions are investigated when training data are limited. To be specific, the two public datasets are used for training two deep models, which will then be applied to our testing cells for real-time personalized health status prediction, respectively.
To alleviate the dependence on the battery charge ranges, partial charge data with a commonly-used charge range from 80% SOC to the first 3.6 V is used as input for the deep learning model that could automatically extract valuable features from charge data without complicated feature engineering. Four partial feature curves in the charging process are employed, i.e., charge voltage curve (V, Fig. 2e), charge capacity curve (Q, Fig. 2f), the difference of the charge voltage curve between each cycle and the 10th cycle (ΔVi−10 = Vi − V10, Fig. 2g) and the difference of the charge capacity curve between each cycle and the 10th cycle (ΔQi−10 = Qi − Q10, Fig. 2h). The four feature curves are fitted as a function of time and linearly interpolated to a fixed length of 100 for the convenience of model manipulations. Among the four feature curves, ΔQi−10 is inspired by Severson et al.,27 who proved that a high-performance feature (log variance of ΔQ100−10(V), i.e., the difference of the discharge capacity curves between the 100th cycle and the 10th cycle) had a strong correlation with the battery cycle lives (the absolute value of correlation coefficient is 0.92). ΔVi−10 is another important feature curve inspired by Jiang et al.,6 who proved that ΔVb−a(Q) (difference of the charge voltage curves between the bth cycle and the ath cycle) also contained high-performance features (e.g., mean of square of ΔV3−2(Q)) related to battery cycle life. However, the features proposed by previous studies are not suitable for our work due to the following three reasons: (1) the correlation is weak between the two-cycle features (e.g., mean of square of ΔV3−2(Q)) and cycle life (ρ = 0.1120, Fig. 2c; see Fig. S4, ESI† for the performance of different statistical features); (2) the correlation is weak between the two-cycle features (e.g., mean of square of ΔV30−2(Q), the 30th cycle and the 2nd cycle in every 30 cycles) and RUL (ρ = −0.0308, Fig. 2d; see Fig. S5, ESI† for the performance of different statistical features); and (3) the discharge current frequently changes during the battery discharge process (Fig. S2b, ESI†), thus the discharge feature curves in shared ranges are hard to be collected. Thus, we enrich the model input information using four charge feature curves at the current cycle combined with the previous 29 cycles (a total of 30 cycles). In particular, 10 cycles are taken from 30 cycles (sampling 1 cycle every 3 cycles in 30 cycles) for the ease of calculation. Thus, each input sample has a shape of 1 × 4 × 100 × 10 (i.e., 1 sample, 4 feature curves, 100 sampling points and 10 cycles) and its label consists of 10 battery discharge capacities at the corresponding 10 cycles and 1 RUL value at the current cycle.
As shown in Fig. 3, we design a deep learning model which consists of one convolutional module, one recurrent module and two fully connected modules. Among them, the convolutional module is used as a feature extractor to automatically extract valuable intra-cycle features from four feature curves and the recurrent module is used as an encoder to learn cycle-to-cycle knowledge. Additionally, two fully connected modules are designed for capacity estimation and RUL prediction, respectively. Two stages, the model pre-training stage and the model transfer stage, are designed in the transfer learning model. At the model pre-training stage, the deep learning model is pre-trained using input feature curves and labeled health status of training cells where each cell has thousands of charge–discharge cycles. Then, the model transfer stage aims to realize a real-time personalized health status prediction at any cycle of any new discharge protocol, through inputting feature curves in the recent 10 charge–discharge cycles (after resampling). In particular, input feature curves and labeled discharge capacities in the first 9 cycles are employed to fine-tune the recurrent module online by freezing other modules (see Note S1, ESI†). Then, the capacity and RUL at the current charge–discharge cycle are estimated and predicted online by the fine-tuned model, respectively. More computational details are described in the Methods section. With the proposed deep transfer learning model, any end-user with a personalized discharge pattern could obtain the battery health status at any charge–discharge cycle and make new usage plans accordingly.
![]() | ||
Fig. 3 The proposed model comprises an offline model pre-training stage and an online model transfer stage. The offline model pre-training stage aims to mine the implicit relationship between four partial charge curves and health status using a deep learning model that consists of a convolutional module, a recurrent module and two fully connected modules. The convolutional module is used to extract intra-cycle features from each cycle using a stack of three convolutional layers and max-pooling layers, and the recurrent module is used to learn the cycle-to-cycle knowledge using two long short-term memory (LSTM) layers. The output of the recurrent module is then mapped to 10 capacities using the first fully connected module. The mapped capacities are finally used to predict the RUL at the current cycle using another fully connected module. The model parameters configuration and the output sizes of each layer are detailed in Table S2 (ESI†). The online model transfer stage aims to real-time predict the capacity and RUL at any cycle of any discharge protocol, by fine-tuning the recurrent module while freezing other modules using four partial charge curves and capacities at the previous 9 charge–discharge cycles. |
Based on the proposed deep transfer learning approach and three datasets, we investigate three different tasks for real-time and personalized prediction of 22 unseen discharge protocols via transferring degradation knowledge from (1) 55 cells with different discharge protocols in our training dataset (denoted by Task A); (2) cells with different charge–discharge configurations in the first public dataset (denoted by Task B); and (3) cells with a different battery chemistry (NMC/graphite) in the second public dataset (denoted by Task C). In all cases, the 22 unseen discharge protocols are taken from our dataset. Three evaluation metrics, i.e., root mean squared error (RMSE, see eqn (4)), coefficient of determination (R2, see eqn (5)) and mean absolute percentage error (MAPE, see eqn (6)), are employed to evaluate the prediction performance of battery health status.
Capacity | RMSE (mA h) | R 2 | MAPE (%) |
---|---|---|---|
a Results for 22 testing protocols by transferring degradation knowledge from cells in our dataset. b Results for 22 testing protocols by transferring the degradation knowledge from cells in a well-known dataset27 with different charge–discharge configurations. c Results for 22 testing protocols by transferring the degradation knowledge from cells in another well-known dataset49 with a different battery chemistry. | |||
Task Aa | 2.57 | 0.999 | 0.176 |
Task Bb | 4.65 | 0.997 | 0.328 |
Task Cc | 3.08 | 0.999 | 0.193 |
RUL | RMSE (cycles) | R 2 | MAPE (%) |
---|---|---|---|
Task A | 186 | 0.804 | 8.72 |
Task B | 240 | 0.770 | 9.80 |
Task C | 216 | 0.794 | 9.90 |
It should be noted that the cycling data in the recent 30 cycles and a sampling rate of 3 cycles are used to perform health status prediction by giving consideration to a smaller number of input cycles, smaller computation costs and smaller prediction errors. To investigate the possibility and robustness of the proposed model using different numbers of input cycles, we perform additional analysis to compare the testing performance on different numbers of input cycles after a sparse sampling. Fig. 5 shows the prediction errors using different numbers of sparse input cycles for capacity estimation and RUL prediction and the results reveal that our choice of the number of input cycles in this work is general, but not the best, and the proposed model with a smaller number of input cycles is also feasible (for instance, i = 20 and j = 10 achieve RMSEs of 2.19 mA h and 160 cycles for capacity estimation and RUL prediction, respectively); the proposed model is insensitive to the number of input cycles as well as the sampling rates (average 1.65 mA h with a standard deviation of 0.50 mA h for capacity estimation; average 171 cycles with a standard deviation of 11 cycles for RUL prediction), thus demonstrating the robustness of the proposed deep transfer learning model.
In summary, the above results illustrate the predictive ability of our approach for unseen discharge protocols, showcasing the high flexibility and effectiveness to customize personalized battery health management for each end-user.
It is worth mentioning that the model transfer stage fine-tunes the recurrent module to adapt to the personalized battery usage conditions. A seemingly natural question is whether we can fine-tune the convolutional module at the model transfer stage. To further evaluate our proposed transfer strategy, we investigate the performance of personalized battery health status prediction by fine-tuning the convolutional module rather than the recurrent module at the model transfer stage (see Table S9, ESI†). The results for the three tasks consistently show that fine-tuning the convolutional module yields performance degradation in terms of both capacity estimation and RUL prediction, demonstrating that the cycle-to-cycle knowledge is strongly related to the battery health status and that the recurrent module could effectively learn new cycle-to-cycle knowledge after fine-tuning the model.
The above results present the capability of the proposed approach to LFP/graphite cells. Beyond LFP/graphite cells, our approach can be applied to NMC/graphite cells.49 To this end, we perform two additional experiments for personalized health status prediction of NMC/graphite cells by transferring degradation knowledge from NMC/graphite cells and LFP/graphite cells, respectively. On the one hand, we use a cross-validation strategy to predict the health status for four discharge protocols (constant-current discharge rates of 0.5C, 1C, 2C and 3C) of NMC/graphite cells, where one protocol is used for validation while the other three protocols are used to pre-train the deep model each time. We achieve overall RMSEs of 12.7 mA h and 98 cycles for capacity estimation and RUL prediction, respectively (Fig. S15, ESI†). On the other hand, we also predict the health status for the same four discharge protocols of NMC/graphite cells by transferring degradation knowledge from the 55 training LFP/graphite cells in our dataset. Overall RMSEs of 19.38 mA h and 103 cycles are achieved for capacity estimation and RUL prediction, respectively (Fig. S16, ESI†). The results further showcase the effectiveness of our approach for personalized battery health status prediction to the NMC/graphite chemistry other than LFP/graphite chemistry.
While the approach has achieved satisfactory performance in this work, there are still some limitations. First, the approach works when cycling data in a fixed charge range are available, but the stochastic charge process (random charge ranges) which approximates real application is not investigated. Then, the approach requires the recent 30 cycles to predict the cell health status, with the result that the health status in the first several cycles cannot be predicted. Also, our approach relies on the quality of cycling data in recent cycles, where some singular measurement data will certainly influence the prediction results. In addition, our work simplifies the operating conditions in real applications, where more practical operating scenarios such as different temperatures, depth of discharges and discharge powers are not included in our dataset, thus further investigations are still required. Nevertheless, this approach obtains considerable generalizability in a variety of usage scenarios and has the potential to be extended to real applications.
All cells in the first dataset were tested in an 80-channel CT-4008 Neware battery tester at a constant temperature of 30 °C (Fig. S1, ESI†). The cells were cycled with an identical fast-charging protocol, but completely different multi-stage discharge protocols (Fig. S2 and Table S1, ESI†), resulting in different cycle lives ranging from 1100 to 2700 cycles. The charge process consisted of a constant-current (CC) charge of 5C from 0% to 80% state of charge (SOC), a CC charge of 1C from 80% SOC to 3.6 V and a constant–voltage (CV) charge until 100% SOC with a current cutoff of C/20. The discharge stage comprised four CC discharge steps, where the first three steps were intended to discharge from 100% to 60% SOC, from 60% to 40% SOC and from 40% to 20% SOC, respectively; the last step was intended to discharge from 20% to 0% SOC using a CC of 1C with a voltage cutoff of 2 V. For example, discharge protocol with a 5C–4C–3C–1C protocol consisted of a 5C discharge from 100% to 60% SOC, a 4C discharge from 60% to 40% SOC, a 3C discharge from 40% to 20% SOC and a 1C discharge from 20% to 0% SOC. It should be noted that 1C is 1.1 A in this dataset, or the current is required to fully (dis)charge the nominal capacity (1.1 A h) in 1 h. The rest time between two arbitrary steps was 30 s. Voltage, current and capacity were collected along with battery cycling until the maximum capacity first reached 80% of nominal capacity (i.e., failure threshold). A total of 80 cells were cycled with 80 different discharge protocols and 3 of 80 cells were excluded from the dataset due to sudden faults before degradation failure. In this work, 55 of 77 cells are used to pre-train a benchmark deep learning model and the other 22 cells are used to validate the effectiveness of the proposed method.
The second dataset is a famous public dataset, where cells in the dataset were cycled with 72 different fast-charging protocols, but with an identical discharge protocol at a constant temperature of 30 °C. The cells were charged with one-step or two-step CC protocols from 0% to 80% SOC, a uniform CC of 1C charge step to 3.6 V and a uniform CV charge with a current cutoff of C/50. Then, the cells were discharged with a uniform CC discharge of 4C to 2 V. The cells have cycle lives ranging from 150 to 2,300 cycles, and more details of the dataset could be found in the study proposed by Severson et al.27 In this work, all cells in the dataset are used to pre-train a deep learning model, then the knowledge would be transferred to predict the health status of 22 testing cells in the first dataset.
The third dataset developed by Preger et al.49 is also a public dataset. A total of 22 NMC/graphite cells were cycled with an identical charge protocol, but with four different discharge protocols (0.5C, 1C, 2C and 3C) at three different ambient temperatures (15 °C, 25 °C and 35 °C). In each charge–discharge cycle, the cells were charged with the same CC charge rate of 0.5C from 0% SOC to 4.2 V, followed by a CV charge. Then, the cells were CC discharged with four different discharge protocols from 100% SOC to a voltage cutoff of 2 V. The cells have cycle lives ranging from 380 cycles to 1321 cycles, and more details of the dataset could be found in the study proposed by Preger et al.49 In the personalized health status prediction experiment for LFP/graphite cells by transferring degradation knowledge from NMC/graphite cells, we pre-train a deep learning model using all the 22 NMC/graphite cells and then transfer it to our 22 testing LFP/graphite cells for personalized health status prediction. In the personalized health status prediction experiment for NMC/graphite cells by virtue of other NMC/graphite cells, we use a cross-validation strategy to predict the health status by four discharge protocols (four NMC/graphite cells), where one protocol is used for validation, while the other three protocols are used to pre-train the deep model each time. Additionally, in the personalized health status prediction experiment for NMC/graphite cells by virtue of LFP/graphite cells, the 55 training LFP/graphite cells in our dataset are used to pre-train the deep model, which would then be deployed to the same four NMC/graphite cells.
Each battery could last for thousands of cycles before failure, which would waste much data storage space if voltage/current sensors continuously collect cycling data. Compared with all-time cycling data, partial cycling data is more practical and economical in real applications, i.e., voltage/current sensors only collect partial data corresponding to a specific range of SOC. To reduce the dependence on the data sampling time, charge voltage and charge capacity collected from a commonly-used range (80% SOC to the first 3.6 V (4.2 V for NMC/graphite cells)) are utilized as the model input in this work. To unify the length of sampling data, all charge curves from 80% SOC to the first 3.6 V are fitted as a function of sampling time and evaluated at 100 linearly spaced points. These uniformly sized samples enable direct model input and data manipulations like voltage/capacity subtraction.
![]() | (1) |
![]() | (2) |
At the model training stage, M training samples (each sample has p actual discharge capacities and one RUL yiRUL) are used to compute the mean squared error (MSE) loss between model outputs and actual measurements
![]() | (3) |
At the fine-tuning stage, the capacity and RUL at the current cycle are required to be estimated and predicted using the pre-trained model. To implement personalized prediction for the testing cells, random d actual discharge capacities in the previous p−1 cycles are used to fine-tune the model weights in RE, and the other p−1−d actual capacities are used to validate the performance after fine-tuning. Only weights in RE are updated because RULs are determined by multi-cycle knowledge. We also find that the model weights in FC2 are general for all discharge protocols, so the pre-trained weights in FC2 are also frozen while fine-tuning the model.
Once the estimated capacities and predicted RULs are obtained, three standard metrics, i.e., RMSE, R2 and MAPE, are chosen to evaluate the model performance. RMSE for capacity estimation or RUL prediction is defined as
![]() | (4) |
![]() | (5) |
![]() | (6) |
The operating system for algorithm implementation is Ubuntu 18.04; NVIDIA GeForce RTX 3090 GPU with 24 GB memory is used to accelerate the model training; CUDA 11.1 is deployed for the GPU-accelerated applications; Python 3.8.8 is used to process data and analyse data, where the pandas, Numpy and scikit-learn packages are employed to load, manipulate and process data; the PyTorch framework is used to construct the deep learning model.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2ee01676a |
This journal is © The Royal Society of Chemistry 2022 |