 Open Access Article
 Open Access Article
      
        
          
            Haiping 
            Su‡
          
        
        
       a, 
      
        
          
            Sen 
            Lin‡
          
        
      b, 
      
        
          
            Shengwei 
            Deng
a, 
      
        
          
            Sen 
            Lin‡
          
        
      b, 
      
        
          
            Shengwei 
            Deng
          
        
       c, 
      
        
          
            Cheng 
            Lian
c, 
      
        
          
            Cheng 
            Lian
          
        
       *a, 
      
        
          
            Yazhuo 
            Shang
*a, 
      
        
          
            Yazhuo 
            Shang
          
        
       a and 
      
        
          
            Honglai 
            Liu
a and 
      
        
          
            Honglai 
            Liu
          
        
       *a
*a
      
aState Key Laboratory of Chemical Engineering, Shanghai Engineering Research Center of Hierarchical Nanomaterials, School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai 200237, PR China. E-mail: hlliu@ecust.edu.cn; liancheng@ecust.edu.cn
      
bNational Engineering Research Center for Integrated Utilization of Salt Lake Resources, East China University of Science and Technology, Shanghai 200237, China
      
cCollege of Chemical Engineering, Zhejiang University of Technology, Hangzhou 310014, China
    
First published on 25th April 2019
Machine learning (ML) methods were applied to predict the capacitance of carbon-based supercapacitors. Hundreds of published experimental datasets are collected for training ML models to identify the relative importance of seven electrode features. This present method could be used to predict and screen better carbon electrode materials.
Carbon-based materials, including activated carbon, carbon nanotubes, templated carbons, carbon aerogels, and graphene, are widely used as electrodes because of their desirable physical and chemical properties.12 These properties include easy processability, low density, non-toxicity, high chemical stability and conductivity, large specific surface area, and relatively low cost. During the past few decades, many experimental efforts have been devoted to the practical applications of EDLC devices by utilizing a wide variety of carbon-based electrodes with diverse pore size distributions,13–17 morphologies,18 architectures19,20 and surface chemistries.21–23 Meanwhile, test conditions have been optimized to improve the performance of supercapacitors. However, the effects of the pore structures, surface properties, and testing conditions on the EDLC performance remain poorly understood. Traditional EDL theory and models only give a mathematical equation for EDLs on a flat surface. It is hard to capture the pore network effects on the capacitive performances. Molecular simulation tools are good for studying and screening porous electrodes, but there is a big challenge to obtain the relations between the microscopic structures and the macroscopic performances because molecular simulation is very time-consuming. There are no general rules to guide the synthesis of porous carbon electrodes, and it is urgent but necessary to find the quantitative structure–activity relationship for predicting the capacitance of carbon-based supercapacitors.
In this work, machine learning (ML) is introduced to study and predict the capacitance of carbon-based supercapacitors. ML is a data-based method, which has been treated as an alternative tool to solve many practical problems. Recently, several chemical engineering issues have been studied by ML methods efficiently,24,25 which inspired us to investigate the influences of carbon variables on the EDL capacitance thorough a ML method. As a kind of ML method, an artificial neural network (ANN) has been applied to predict the capacitance of carbon-based supercapacitors. Zhu et al. studied the impact of five variables (specific surface area, calculated pore size, ID/IG ratio, N-doping level and voltage window) by the ANN method.26 The authors compared the ANN method with two different machine learning models (linear regression and Lasso), demonstrating that the ANN method is more accurate with a higher correlation coefficient (R2 = 0.91). However, the ANN method can't explain the impact of each feature separately. Deep learning networks rely on layers of artificial neural networks, which are often treated as a black box, and the outputs are unexplainable. On the other hand, machine learning algorithms are built to learn to do things by understanding labeled data, which could generate easy rules, making the outputs more explainable.27
To predict the EDL capacitance, a sequential procedure is used, as shown in Fig. 1. The first step is to collect the experimental data. Then, the experimental data are applied to assess the performance of different ML models to predict the EDL capacitance. Herein, 121 sets of carbon-based supercapacitors with seven variables reported (potential window, specific surface area, pore volume, pore size, ratio of ID/IG, N-doping percentage, and O-doping percentage) were obtained for ML models from different papers (part of the data source was from ref. 26). All of these data are available in the ESI.†
|  | ||
| Fig. 1 Illustration of the sequential approach used for predicting EDL capacitance by using machine learning models. | ||
Fig. 2 shows the EDL capacitance in relation to different variables. Six features including the physical and chemical properties of porous carbon electrode materials and the test potential windows are summarized in Table 1. The minimum, maximum, mean and standard deviation (SD) number of these data are also presented in the table. For better comparison, the highest capacitances reported in the papers were selected, which were tested at a low current density (≤1 A g−1). The capacitance from different papers are shown in Fig. 2(a), and the capacities range from tens to several hundreds of F g−1. It is notable from the conventional energy density equation E = CV2/2 that the testing potential windows have significant influence on the performance of supercapacitors, while the potential windows are mainly determined by the electrolyte types. Therefore, we mainly focused on the EDL capacitances tested in aqueous electrolytes, and the PW was around 1 V. Fig. 2(b) shows the relationship between the capacitance and potential window. According to the Helmholtz model (the capacitance C = Aεrε0/d, where A is the specific surface area, ε0 is the vacuum dielectric constant and εr is the dielectric constant of the electrolyte), the capacitance of EDLCs should be linear to the specific surface area. However, no linear relationship is found in Fig. 2(c). Fig. 2(d) and (e) show the relationship between the capacitance, pore volume and average pore size. The porous structures (pore volume and pore size) could be optimized to enhance the capacitive performance, while the relationships in Fig. 2(d) and (e) are not enough to determine the influence of pore volume and pore size on the capacitance. The intensity ratio of the D-band and G-band (ID/IG) from Raman spectroscopy can present the defect condition and the crystallization degree of carbon materials. Fig. 2(f) displays how the crystallinity of carbon materials affects the capacitance. Apart from optimizing the microstructures, heteroatom doping is a useful strategy to boost the capacitance. The influences of nitrogen and oxygen doping on the capacitance are shown in Fig. 2(g) and (h).
| Minimum | Maximum | Mean | SD | |
|---|---|---|---|---|
| a 121 sets of capacitances for different carbon-based electrodes. PW: potential window (V); SSA: specific surface area (m2 mg−1); PV: pore volume (cm3 g−1); PS: pore size (nm); ID/IG: the ratio of ID/IG; N%: N-doping percentage (at%); and O%: O-doping percentage (at%). | ||||
| PW | 0.8 | 1.2 | 0.971 | 0.064 | 
| SSA | 0.001 | 4.07 | 1.272 | 0.858 | 
| PV | 0.002 | 5.3 | 1.110 | 0.763 | 
| PS | 0.48 | 35.34 | 3.169 | 4.578 | 
| I D/IG | 0.5 | 4 | 1.13 | 0.508 | 
| N% | 0 | 20.55 | 3.756 | 4.33 | 
| O% | 0 | 25.07 | 8.589 | 5.216 | 
With selected data and variables, four different ML methods were used (linear regression, support vector regression, multilayer perceptron and regression tree models). Linear regression (LR) has predictors that are linear in the model parameters, which is easy to interpret, and fast in making predictions. However, the highly constrained form of LR models means that they often have low predictive accuracy. Support vector regression (SVR) is a regression technique with excellent performances in regression and time series prediction application, allowing categorising the input data using separating lines or planes. Multilayer perceptron (MLP) is a widely used artificial neural network model, which generally includes an input layer, hidden layers and an output layer, and each layer consists of nodes connected with a certain weight to all nodes in the next layer. Except for the input nodes, each node is a processing element with a nonlinear activation function such as a sigmoid function that enables the network to compute complex nonlinear problems. Regression trees (RT) are a way to split the input space into areas (sub-spaces) and build in each of them a local specialized linear regression model, and a RT is easy to interpret, fast for fitting and prediction, and low on memory usage. More details for these four machine learning models could be found in our previous work.25
The algorithms of the four ML models were conducted in the machine learning open source package WEKA. Each method has its own parameters that need to be specified, and parameters for the SVR and MLP were obtained by using the CVParameterSelection module in WEKA. The learning rate and the number of nodes in the hidden layer for MLP are 0.2 and 20, and the complexity parameter in SVR is optimized to be 200. For RT and LR, WEKA could generate parameters itself. A 10-fold cross-validation methodology was used to generate a statistical result which is independent of the data set. Specifically, the datasets were evenly split into 10 folds. The instances from 9 folds were used for training while the remaining fold was used for testing. The calculation process was repeated 10 times using a different fold for testing in each cycle. The performance of each model was given by the average of the accuracies of all the folds. The correlation coefficient (R), mean absolute error (MAE), and root mean square error (RMSE) were used to evaluate the accuracies of different ML models in predicting EDL capacitance.
|  | (1) | 
|  | (2) | 
|  | (3) | 
The predicted EDL capacitance by the ML models is compared with input experimental capacitances, as shown in Fig. 3. The estimated R, MAE and RMSE were also listed in the figures, supporting an accurate prediction of this model. It is easy to find that both MLP and RT have better performance than SVR and LR, with larger R and smaller RMSE. Based on the RMSE, the overall performance of the algorithms for predicting carbon electrode effects on EDL capacitance can be ranked as follows: RT > MLP > SVR > LR.
From the above discussion, the two better models were MLP and RT with a RMSE of 67.62 and 68.45, respectively. The RT model does not need to find the optimal parameters by trial and error, and it generates easier rules which we could understand easily:
[Rule 1]: If SSA > 0.405, then the capacitance = 241.8616 × PW + 43.5775 × SSA − 34.8074 × ID/IG + 6.0818 × N% + 2.8217 × O% − 34.3229.
[Rule 2]: Else the capacitance = 99.7593 × PW + 22.6152 × SSA + 74.6316 × ID/IG + 20.7572 × N% + 13.5279 × O% − 152.8567.
Based on the rules generated by RT, we find that 96 of the 121 sets of data meet Rule 1 and 25 sets of data meet Rule 2. The variables of the specific surface area, operating potential window, carbon crystallization degree and heteroatom doping are all shown in both of the rules, indicating that the EDL capacitance could easily be changed/enhanced by changing these parameters. Although the specific surface area seems to be the most important parameters to the EDL performance, the pore volume and pore size were not covered in the rules. Since the specific surface area, pore volume and pore size are always related to each other, a more specific study may be needed to verify the contributions. Anyway, this simple RT model still could be used for a quick material screening.
The MLP model only shows a little bit lower accuracy than the RT mode, but the relative contribution of different variables to the EDL capacitance could be obtained. As shown in Fig. 4, the weight of the seven variables is clearly described in a pie graph. The specific surface area (SSA) and the pore volume (PV) are the two most important variables, with a weight of 30.1% and 24.1%, respectively. The pore size (PS), potential window (PW) and ratio of ID/IG contribute with lower weights (11.4%, 11.2% and 9.9%, respectively). The N-doping percentage (N%) and O-doping percentage (O%) with a weight of 7.6% and 5.7% show the smallest contribution to the capacitive performances.
| Footnotes | 
| † Electronic supplementary information (ESI) available. See DOI: 10.1039/c9na00105k | 
| ‡ These authors made equal contribution to this work. | 
| This journal is © The Royal Society of Chemistry 2019 |