Li Mengshan
*ab,
Wu Weia,
Chen Bingshenga,
Wu Yana and
Huang Xingyuanb
aCollege of Physics and Electronic Information, Gannan Normal University, Ganzhou, Jiangxi, China 341000. E-mail: jcimsli@163.com
bCollege of Mechanical and Electric Engineering, Nanchang University, Nanchang, Jiangxi, China 330031
First published on 13th July 2017
As an important physical chemistry property, solubility is still a popular research topic. Its theoretical calculation method has developed rapidly. In particular, the artificial neural network (ANN) has attracted the attention of researchers because of its unique nonlinear processing ability. This review provides a brief explanation of the ANN approaches that are most commonly applied to predict gas solubility in polymers, and states the implementation principle, progress, and performance analysis of hybrid ANNs based on the intelligence algorithm. The prospect of solubility prediction based on current research trends is then proposed. This review attempts to analyze the solubility calculation method and provides an insight into and reference for the application of the artificial intelligence method in chemistry and material fields, and can expand in the future because of the increasing number of solubility prediction approaches being introduced.
Many solubility factors, such as system temperature, pressure, polarity, and system density, show strong non-linear mapping relationships.5 Mutually restricted nonlinear characteristics exist among the influencing factors, especially in the supercritical high temperature and pressure conditions; the traditional thermodynamics research method is relatively difficult, and the simulation precision and efficiency cannot satisfy the demand.6–8 Artificial neural network (ANN) has many characteristics, such as self-organization and self-learning ability, suitable fault tolerance, and strong non-linear processing power; hence, it exhibits superior performance when calculating solubility problems.9 The most commonly employed methods to predict solubility are back propagation ANN (BP ANN) and radial basis function ANN (RBF ANN). Scientists are working to explore computing methods that are more efficient, accurate, and adaptive to obtain a reliable solubility model.10,11
Therefore, the solubility of gases in different polymers based on many varieties of ANN is taken as an example in this review. The basic principles, research status, advantages, and disadvantages of different types of ANNs in solubility calculation are reviewed. Combined with our previous studies in recent years, this review mainly states the progress of the solubility prediction of ANNs based on the swarm intelligence algorithm. The solubility calculation is then summarized and prospected based on current research trends. This review attempts to analyze the solubility calculation method and provide an insight into and reference for the application of the artificial intelligence method in chemistry and material fields.
Yi = f(Ui) |
The ANN was trained according to the experimental data and optimized to obtain the parameter matrix in the network model. Thus, a mathematical model that reflects the inherent complex rules between the input and output of the experiment was developed. The basic structure of the model is shown in Fig. 1. The influence of system temperature, pressure, density, and other factors on solubility is generally analyzed according to the solubility of the supercritical carbon dioxide in polymer. The node number in the input layer is determined by the influence factors, such as temperature, pressure, and density. The output layer nodes are generally soluble, and the number of hidden layers and nodes is commonly determined by the empirical formula method or the heuristic method.
Many scholars have successfully applied ANN to correlate and predict solubility under supercritical conditions. Gharagheizi F. et al.13 utilized ANN to predict the solubility of solid complexes in supercritical carbon dioxide. Eslamimanesh A. et al.14 utilized ANN to predict solubility of supercritical carbon dioxide in ionic liquids, and showed the capability of the presented model. Modarress H. et al.15 also proposed a solubility prediction model with ANN, and showed that the presented model can predict the gas solubility satisfactorily. Khajeh A. et al.16 proposed the use of ANN to predict the solubility of carbon dioxide in polymers, and indicated that the presented model is an effective method. Bakhbakhi Y.17 and Lashkarbolooki M. et al.18–22 compared the solubility prediction of ANN with several state equations Mehdizadeh B. et al.23,24 compared the solubility predictions of ANNs and semi-empirical equations. Hussain M. A. et al.25 utilized the Kent–Eisenberg model in conjunction with ANN to predict dissolution. Torrecilla J. S. et al.26 proposed the solubility model based on multilayer ANN and mathematical regression methods. Their results show that the performance of ANN is suitable for correlating and predicting solubility, and showed that the ANN model is a superior technique with high accuracy.
The reliability and accuracy of ANN prediction are better than the traditional thermodynamics method. Researchers also reported that the performance of ANN relies heavily on its training algorithm, and the commonly utilized BP algorithm can easily fall into the local search and other deficiencies. Meanwhile, the center of the base function of the RBF ANN model and expansion constant and the network weights have a more significant impact on the model performance. Khajeh A. et al.27 proposed the use of RBF ANN and the adaptive fuzzy neural system method to predict gas solubility in polymers and determined that the adaptive fuzzy neural system method has superior performance. Khayamian T. et al.28 utilized wavelet ANN to establish the solubility model, which largely improved the performance. The training algorithm of the ANN model is currently attracting significant attention. Thus, many researchers have attempted to train ANN using the intelligent optimization method and have applied different chemicals and materials in different fields.
Abbreviation | Hybrid method | Reference |
---|---|---|
SA-ANN | ANN trained by simulated annealing (SA) | 29 and 30 |
GA-ANN | ANN with genetic algorithms (GA) | 31–34 |
AC-A-RBF ANN | RBF ANN with an adaptive Ant Colony Algorithm (AC-A) | 35 and 36 |
AC-AR-RBF ANN | Adaptive regulation (AR) ant colony algorithm and RBF ANN | 37 |
ANFIS | Adaptive neuro-fuzzy inference system | 38 |
SVM-ANN | Support vector machine algorithm and ANN | 39–42 |
BA-MPL-ANN | ANN with Bees algorithm and multi-layer perceptron (MLP) method | 43 |
MLR, MQR, MLP, RBF ANN | RBF ANN based on Multiple Linear Regression (MLR), Multiple Quadratic Regression (MQR), and Multi Layer Perceptron (MLP) | 26 |
SVR-ANN CI-ANN | ANN based on Support Vector Regression (SVR), and generalized regression ANN based Computational Intelligent (CI) | 44 |
RS–BP ANN | Rough sets and BP ANN | 45 |
KE-ANN | The Kent–Eisenberg (KE) model in combination with ANN | 25 |
BP-NL ANN | Back-propagation (BP) multi-layer (ML) ANN | 46 |
WANN | Wavelet artificial neural network (WNN) | 47 and 48 |
FANN | Fuzzy artificial neural network (FANN) | 49–51 |
HSF-RBF ANN | Hybrid self-organizing fuzzy (HSF) and RBF ANN | 52 |
ICO-VSA RBF ANN | RBF ANN with improved chaos optimization and variable-scale analysis | 53 |
Abbreviation | Hybrid method | Reference |
---|---|---|
UPSO-FFANN | Feed-forward artificial neural network (FFANN) optimized by unified particle swarm optimization (UPSO) | 58 |
ADPSO-RBF ANN | Linearly decreased inertia weight particle swarm optimization (ADPSO) model for RBF ANN | 60 |
PSO ANN | Artificial neural network trained by particle swarm optimization | 61–63 |
BBPSO-AD RBF ANN | RBF ANN based on bare-bones particle swarm optimization (BBPSO) with an adaptive disturbance factor (AD) | 64 |
GC-PSO ANN | Group Contribution (GC) plus ANN plus PSO | 65 |
ALPSO-RBF ANN | RBF ANN with linearly decreased inertia weight (ALPSO) | 60 |
PSO-FNN | Fuzzy artificial neural networks (FANN) with PSO algorithm | 66 and 67 |
IOFC-PSO RBF ANN | RBF ANN using input–output fuzzy clustering (IOFC) and PSO | 68 |
HPSO-GSA FFANN | FFANN using hybrid particle swarm optimization (HPSO) and gravitational search algorithm (GSA) | 69 |
The PSO algorithm has several disadvantages, such as slow local search and premature convergence.70 Our research group has discussed and developed some hybrid methods based on the chaos theory, the adaptive PSO algorithm, and the clustering method in recent years. We have also presented several solubility models based on hybrid ANNs. Table 3 shows these solubility models we proposed in the past few years.
Abbreviation | Method description | Instance | Reference |
---|---|---|---|
CSAPSO-BP ANN | ANN trained by chaotic self-adaptive particle swarm optimization (CSAPSO) algorithm and back propagation (BP) algorithm | Solubility prediction of carbon dioxide (CO2) in polystyrene (PS), polypropylene (PP) and nitrogen (N2) in PS | 71 and 72 |
CSAPSO-KCM RBF NN | RBF ANN trained by CSAPSO and k-means clustering method (KCM) | Solubility prediction of N2 in polystyrene (PS) and CO2 in PS, PP, poly(butylene succinate) (PBS), and poly(butylene succinate-co-adipate) (PBSA) | 73 |
KCM-PSO FNN | Four-layer fuzzy neural network (FNN) model combining PSO algorithm and KCM | Solubility prediction of CO2 in PS, PP, and N2 in PS | 74 |
CSAPSO-FCM RBF ANN | RBF ANN model based on CSAPSO and fuzzy c-means clustering method (FCM) | Solubility prediction of CO2 in PS, PP, PBS and PBSA | 75 |
CSAPSO-KHM RBF ANN | RBF ANN model combined with CSAPSO algorithm and K-harmonic means clustering method (KHM) | Solubility prediction of supercritical carbon dioxide in 10 different polymers | 76 |
CEAPSO KHM RBF ANN | RBF ANN trained by accelerated particle swarm optimization (APSO) algorithm with chaotic enhanced disturbance factor (CE) and KHM algorithm | Solubility prediction of CO2 in polymers including PP, PS, poly(vinyl acetate) (PVA), carboxylated polyesters (CPEs) and PBSA | 77 |
There are mainly two types of solubility prediction models based on PSO and ANN. First, BP ANN solubility prediction models based on PSO and its variant; second, RBF ANN models based on several clustering method, PSO and its variant. The improved PSO algorithm called CSAPSO was developed.
The improved PSO algorithm based on chaos theory and self-adaptive weight strategy was developed, called CSAPSO. And three clustering methods such as k-means clustering method (KCM), fuzzy c-means clustering method (FCM) and K-harmonic means clustering method (KHM) are used to model training. The first type of model called CSAPSO-BP ANN was proposed by training the BP ANN with CSAPSO algorithm. The second type of model, such as CSAPSO-KCM RBF ANN, CSAPSO-FCM RBF ANN and CSAPSO-KHM RBF ANN, were established by optimizing the RBF ANN parameters by combining the improved CSAPSO algorithm with clustering methods. The experimental results show the superiority of each model in predicting dissolution. The overall scheme of the said model is shown in Fig. 2.
The detailed model of the process and the detailed performance analysis and discussion of these models can serve as reference in the relevant literature.
From the figures, we can summarize the following characteristics.
(1) The performances of the three models in the CSAPSO-C RBF ANN are the same.
The CSAPSO-KCM RBF ANN, CSAPSO-FCM RBF ANN, and CSAPSO-KHM RBF ANN models based on the CSAPSO and clustering methods are consistent in terms of prediction error and correlation. The accuracy and efficiency of the three models are basically the same. CSAPSO-KHM RBF ANN is slightly superior in terms of stability, and its prediction performance is suitable in most polymer systems. These can be attributed to the global search ability of the CSAPSO training algorithm and optimized model weight. The integration of each clustering method also makes the center of the basis function and expansion constant more reasonable, which eventually leads to the superior prediction performance of the model.
(2) The performance of the CSAPSO-C RBF ANN model is better than that of the training set.
Model training guarantees the performance, as shown in the training data. The model has suitable development and mining abilities. In particular, the model has a suitable prediction effect in the context of the experimental data and can be applied to predict new experimental data under the same experimental conditions.
(3) The CSAPSO-BP ANN model exhibits better stability in untested training experiments.
First, the CSAPSO and BP algorithms have a suitable ability to explore. The two algorithms also promote and complement each other in the exploration of the model and provide a solid theoretical basis. In particular, the exploration and development of this model are better. Here, pioneering performance refers to the absence of any target material under the premise of experimental data through other existing material prediction approaches of target substances.
(4) The predictive model has better performance, higher precision, and better correlation in the training set than in the non-trained set.
The essence of model prediction is the data fitting in mathematics, specifically by mining the inherent data law through known data and then predicting the fitting process of the new data. The data without training essentially have a similar principle (i.e., predicting new items through other similar data). The training set is clearly more theoretical than the unsupervised training set, and its comprehensive performance is better.
(1) ANN avoids the problem of non-steady-state correction in the traditional thermodynamic solubility model.
The supercritical gas in the traditional thermodynamic solubility prediction model is regarded as a compressed gas or expanded liquid. The solubility problem is determined by calculating the mass and volume. Under low temperatures and pressures, the gas can be regarded as an ideal gas, and the results are more accurate. By contrast, high temperatures and pressures in supercritical conditions transform the system into a non-steady state, thus the gas cannot be regarded as an ideal gas.
(2) ANN has suitable prediction reliability under high temperatures and pressures.
ANN predicts solubility by modeling the nonlinear relationship between the model input and output. For the model parameters, the numerical value of the condition data is unrecognized, and the prediction reliability is also suitable even at high temperatures and pressures.
(3) The ANN model has suitable fault-tolerance and strong anti-interference ability.
ANN establishes a non-linear relationship according to the training data, and the accuracy of the training data directly affects the comprehensive effect. Training data are generally real experimental data. However, experimental operation errors and other reasons can make some of the training data false, which are then called invalid data. An error in the experimental record and writing a higher or lower number of data bits can cause individual data to become extremely abnormal, which are then called disturbing data. ANN can train the model with other data when it encounters invalid or disturbing data. This model can then determine this type of abnormal data during training. The ANN model can slowly address such abnormal data by training normal data to minimize their impact on the model and deal with them effectively. Hence, this model has better fault-tolerance and stronger anti-interference ability.
The main deficiencies of the ANN solubility model are as follows:
(1) Lacks a rational explanation of the dissolving machine.
The essence of ANN prediction is data fitting. The model is a function of the black box, which can obtain the output as long as the input is received. However, what is the mechanism by which the output is obtained? What is the law between the input and output? What is the relationship between the output and the impact of factors on the model that cannot be reflected? Although the predictive effect of the solubility model has an impeccable function, its solubility mechanism still needs to be comprehensively explained.
(2) The prediction performance depends on the experimental data.
The realization principle of the ANN model is to study and train the experimental data, as well as record the law between the input and output data by network weight. If no experimental data are available, then the model cannot work. Moreover, the model performance and experimental data quantity and accuracy are also highly relevant. The more experimental data available, the better the training model. Hence, the ANN is extremely dependent on the experimental data.
(3) Its prediction performance depends on its training algorithm.
Model training adjusts the model parameters. A suitable training algorithm has a decisive influence on the model performance. However, the training algorithm does not have a unified standard, and the algorithm for suitable performance is not necessarily applicable to all types of ANN training. The selection of a training algorithm, which is often through comparison or the test method, is relatively difficult. Thus, this process is more difficult for non-computer professionals to grasp.
The combination of hybrid ANN technology and chemical engineering can be further developed and has a broad application prospects. Given the solubility prediction of supercritical carbon dioxide in polymers, the following aspects can be discussed intensively.
(1) Study on the solubility mechanism.
The actual process parameters of the solubility prediction model can be studied by combining with the actual industrial production processes and conditions, such as adsorption, diffusion, and interface renewal theory. The model can determine the law and mechanism of solubility in industrial production and provide theoretical guidance for the selection of process parameters.
(2) Explore the solubility of new ideas.
The most considered factors in current solubility studies are system temperature and pressure, and the solubility model of many influencing factors can reflect the real implementation rules better, which can improve model adaptability. Solubility prediction based on hybrid ANN technology, such as combining with diffusion theory and synergy theory, must also be studied further. Hybrid ANN methods can be adopted to solve different nonlinear problems in chemistry, material science, biology, and medicine.
(3) Study the solubility method of multi-scale calculation.
Multi-scale studies from different times and spatial scales can be referenced to study the physical and chemical properties of materials. Multi-scale solubility studies can also be utilized to analyze the solubility problems of different substances at the microscopic, mesoscopic, and macroscopic levels. For example, the ANN model can provide scientific and theoretical guidance for the selection of process parameters at the multi-scale level in the field of materials processing. It also has suitable application prospects in polymer self-assembly, phase rheological properties, and kinetic analysis. The multi-scale computation method has important interdisciplinary, cross-level, and time-span research values. The multi-scale calculation method will be discussed in future studies and applied to chemical, material, and other related computing fields.
ANN | Artificial neural network |
BP | Back propagation |
RBF | Radial basis function |
PSO | Particle swarm optimization |
CSAPSO | Chaotic self-adaptive particle swarm optimization |
SA | Simulated annealing |
GA | Genetic algorithms |
ACO | Ant colony optimization |
BA | Bees algorithm |
SVM | Support vector machine |
KCM | k-Means clustering method |
FCM | Fuzzy c-means clustering method |
KHM | K-Harmonic means clustering method |
PBS | Poly(butylene succinate) |
PBSA | Poly(butylene succinate-co-adipate) |
PP | Polypropylene |
PS | Polystyrene |
PLLA | Poly(L-lactide) |
PLGA | Poly(D,L-lactide-co-glycolide) |
HDPE | High-density polyethylene |
PVAc | Poly(vinyl acetate) |
PPO | Poly(2,6-dimethyl-1,4-phenylene ether) |
CPEs | Carboxylated polyesters |
ANFIS | Adaptive neuro-fuzzy inference system |
This journal is © The Royal Society of Chemistry 2017 |