Open Access Article
Saba Kalantarya,
Ali Jahani
*b,
Reza Pourbabakia and
Zahra Beigzadehc
aDepartment of Occupational Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran 1416753955, Iran
bDepartment of Natural Environment and Biodiversity, Faculty of Environment, College of Environment, Karaj 31746118, Iran. E-mail: ajahani@alumni.ut.ac.ir
cEnvironmental Health Engineering Research Center, Kerman University of Medical Sciences, Kerman 7616913555, Iran
First published on 12th August 2019
Prediction of the diameter of a nanofiber is very difficult, owing to complexity of the interactions of the parameters which have an impact on the diameter and the fact that there is no comprehensive method to predict the diameter of a nanofiber. Therefore, the aim of this study was to compare the multi-layer perceptron (MLP), radial basis function (RBF), and support vector machine (SVM) models to develop mathematical models for the diameter prediction of poly(ε-caprolactone) (PCL)/gelatin (Gt) nanofibers. Four parameters, namely, the weight ratio, applied voltage, injection rate, and distance, were considered as input data. Then, a prediction of the diameter for the nanofiber model (PDNFM) was developed using data mining techniques such as MLP, RBFNN, and SVM. The PDNFMMLP is introduced as the most accurate model to predict the diameter of PCL/Gt nanofibers on the basis of costs and time-saving. According to the results of the sensitivity analysis, the value of the PCL/Gt weight ratio is the most significant input which influences PDNFMMLP in PCL/Gt electrospinning. Therefore, the PDNFM model, using a decision support system (DSS) tool can easily predict the diameter of PCL/Gt nanofibers prior to electrospinning.
Although a wide variety of complicated electrospinning techniques such as coaxial,8 modified coaxial,9 tri-axial,10 side-by-side11 and other complicated techniques12 have been successively reported, only a few limited publications have reported manipulation of the quality of the nanofibers, which is often evaluated by their diameter. Fiber size distribution and morphology play a significant role in the porosity, the surface to volume ratio, functionality, and performance.13 The shape, diameter of the electrospun nanofibers, and spatial distribution have a close relationship with their functional performance and can be divided into four categories: polymer properties (molecular weight and solubility), properties of the working fluids (the concentration, solution viscosity, dielectric properties, surface tension, and conductivity), the operational conditions (applied voltage, the fluid flow rate, nozzle–collector distance, and nozzle diameter of the spinneret), and manipulation of the environmental conditions (temperature, atmospheric pressure, relative humidity, the possible vacuum and even hot air blowing). These parameters have a positive impact on downsizing of the nanofibers.14–18 Despite important experimental investigations to determine fiber diameter, using scanning electron microscopy, transmission electron microscopy, and atomic force microscopy for example, it is still time-consuming and expensive.18,19 Furthermore, complexities in the electrospinning method and many factors simultaneously affecting the preparation techniques cause the findings from statistical tools, such as response surface methodology and regression analysis, to be very noisy.16,20 Regression analysis is one of the traditional techniques that has been used for model generation but the accuracy decreases when the independent parameters increase. In complex phenomena modeling, methods such as an artificial neural network (ANN) are employed.21 An ANN is an attractive and flexible choice for solving linear and nonlinear multivariate regression and different problems because it is based on the natural neural network of the brain.21–23 An ANN consists of interconnected processing elements, such as an input layer, various hidden layers and an output layer which is capable of learning from samples, using transfer functions between neurons and a specific learning algorithm in the structure of a program without being affected by data noise.24–26 Nowadays, different models and learning algorithms can be applied to modeling and controlling the electrospinning processes.27,28 In this paper, we have compared the multi-layer perceptron (MLP), radial basis function (RBF), and support vector machine (SVM) models for predicting the diameter of PCL/gelatin nanofibers. The main objectives were to: (i) analyze the effects of weight ratios, applied voltage, injection rate, nozzle–collector distance, and their individual and interactive effects on the diameter of PCL/gelatin nanofibers; (ii) compare different data mining models to identify the most accurate model; (iii) detect the most significant factors affecting the diameter of PCL/gelatin nanofibers using sensitivity analysis; and (iv) design a decision support system (DSS) for predicting the diameter of electrospun PCL/Gt nanofibers.
000 g mol−1), gelatin from porcine skin type A (gel strength 300 g bloom), glacial acetic acid (AA), and formic acid (FA) were all provided by Sigma-Aldrich.
:
FA in a 9
:
1 ratio using a magnetic stirrer at room temperature for 4 h. Following this, PCL and gelatin (PCL/Gt) were mixed at seven different volume ratios (80
:
20, 70
:
30, 60
:
40, 50
:
50, 40
:
60, 30
:
70 and 20
:
80) for 20 h prior to electrospinning.29
In this study, three activation functions consisting of a hyperbolic tangent, logarithmic sigmoid, and linear transfer functions were examined to optimize the prediction of the diameter of the electrospinning PCL/Gt nanofibers model.
The backpropagation (BP) method is found to be the most popular and powerful nonlinear statistical method, therefore it is an effective technique for calculating the weight and biases of neurons. The BP algorithm uses learning rules to assign weight arrangements of neurons and layers to nodes based on the output of the network. The weights alter during the learning, and the process is repeated until the best performance is achieved, and the learning process will end.21,35
The aim of BP is to minimize the error between Y (average diameter of PCL/Gt nanofiber) and Ynet (MLP output) in which X and Y are given to the network and the weight of the PEs (w) and input samples (X) are adjusted, and an output of the jth PE on the kth (PEjk) is calculated using eqn (1):
![]() | (1) |
The specific functions known as the transfer or threshold functions are introduced to the network, and the output value of the neurons is presented in eqn (2):
![]() | (2) |
In the next step, the weights of the t numbers of the input/output parts (X and Y) will be changed using the delta rule in eqn (3):
| wjit = wjit−1 + Δwjit | (3) |
Several of the learning algorithms for BP have been used to end the learning process and for adjusting weights. In this paper, the validation data set performed was the generalization of MLP and avoids overtraining of the network. The final step is the test performance of the MLP using the test data set, which is not used in the training and validation data sets. To perform this, the samples were randomly divided into three subsets which include the training data set with 60% of the total samples, a validation data set with 20% of the total samples and a test data set with 20% of the total samples.
![]() | (4) |
![]() | (5) |
There are two data sets, including training and testing in the structure of SVM. The values of the target are based on an n-dimensional matrix in which the most accurate boundaries and margins possible are available.43,44
The SVM model algorithm equation can be expressed in eqn (6):
![]() | (6) |
Next, we provide the Gaussian RBF in eqn (7), as the activation function in this study. As is known, RBF is the most common function with a considerable ability to control the generalization of the SVM network.
| K(xi,xj) = exp(−γ‖xi − xj‖2) | (7) |
To achieve the most accurate SVM for predicting the fiber diameter of electrospinning PCL/Gt, eqn (8) should be minimized.
![]() | (8) |
presents the margin, ∑ξi is the training errors, and C is the tuning parameter respectively.
![]() | (9) |
![]() | (10) |
![]() | (11) |
![]() | (12) |
![]() | (13) |
A Taylor diagram was also used to show the accuracy and efficiency of the models based on the observed values. The Taylor diagram provided a polar plot to summarize multiple aspects of the model and the observed parameters (i.e., correlation coefficient, normalized standard deviation and RMSD).46–48
We need to find the factors with the most effect on the model outputs to modify the diameter of the target nanofibers. Sensitivity analysis was performed on the final model to detect the importance of the variables concerning their role in the model outputs.
| Activation function | Training function | Structure | Test set | Training data | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R2 | MSE | RMSE | MAE | WI | R2 | MSE | RMSE | MAE | WI | |||
| logsig–logsig–purelin | LM | 4-10-10-1 | 0.96 | 0.036 | 0.19 | 0.097 | 0.9999984204 | 0.987 | 0.014 | 0.118 | 0.075 | 0.9999999967 |
We present the best activation function equations in the structure of the MLP model using eqn (14) and (15).
![]() | (14) |
| Purelin(x) = f(x) | (15) |
According to the values of R2 (Table 1), ANN optimization detected the structure of ‘4-10-10-1’ for PDNFMMLP as the most successful structure of MLP in the prediction of the diameter of PCL/Gt electrospinning nanofibers and effect of electrospinning parameters on the diameter of the PCL/gelatin nanofibers. The determined structure contains four variables as inputs, ten neurons in the two hidden layers, and one neuron (diameter of nanofibers) in the output layer. To use the most accurate estimation functions in the hidden layer and output layer, logarithmic sigmoid and linear transfer functions were used as a learning function, respectively.
The scatter plot provides the correlation between variables which is used to define the accuracy of an ANN model.21,35 The scatter plot of the MLP outputs versus the target values of the PDNFMMLP for the training, validation, testing, and overall data sets are presented in Fig. 1. The determination of the coefficient (R2) proves the strong correlation between the PDNFMMLP outputs and the target values.
Fig. 2 compares the real (target) and simulated (output) values of PDNFMMLP in the data sets. A significant and distinctive agreement between values has been provided in Fig. 2.
![]() | ||
| Fig. 2 Target and output nanofiber diameter values for training, validation, and testing sets and all data. | ||
The PDNFMMLP, using four electrospinning parameters as model input variables, is the most accurate model for prediction of the diameter of electrospun PCL/Gt nanofibers against the changing electrospinning parameters. Eqn (16) is the logarithmic sigmoid and eqn (17) illustrates the PDNFMMLP in electrospun PCL/Gt nanofibers.
![]() | (16) |
| PDNFMMLP = purelin{logsig{∑LW2,1{logsig(∑IW1,1pi + b1)} + b2}} | (17) |
The electrospinning parameters (ratio blends of the polymer, applied voltage, injection rate and, the needle-to-collector distance), as input variables, and the diameter of the PCL/gelatin nanofibers as the outputs, were tagged in the software MATLAB R2016b. In model parameters optimization, 80% of samples (610 samples) were randomly defined as the training set to train the most accurate RBFNN, and 20% of samples (152 samples) were applied to test the performance of the PDNFMRBF. The aim of the training step was the network error minimization with RBFNN parameters values. Therefore, in the best PDNFMRBF performance, the number of neurons was 125, and the spread of the radial basis functions was 7. The best results for PDNFMRBF in the training and test data sets are shown in Table 2.
| Model | Spread | Neurons | Test set | Training data | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| R2 | MSE | RMSE | MAE | WI | R2 | MSE | RMSE | MAE | WI | |||
| RBF | 7 | 125 | 0.821 | 0.136 | 0.368 | 0.269 | 0.9999907541 | 0.818 | 0.191 | 0.437 | 0.231 | 0.9999997597 |
As shown in Table 2, the optimal architecture is determined by the values of R2 in the training and test data sets. The best architecture of RBFNN was determined to be 4-125-1 with four variables as inputs, 125 neurons in the hidden layer with a Gaussian transfer function, and one neuron (diameter of nanofibers) in the output layer.
The scatter plot of the RBFNN outputs versus the target values of the PDNFMRBF for the training and test sets, and all data are presented in Fig. 3. The determination of the coefficient (R2) shows a significant correlation between the PDNFMRBF outputs and targets values.
![]() | ||
| Fig. 3 Scatter plots of PDNFMRBF outputs versus the target values for the training and test sets and all data. | ||
Fig. 4 compares the real (target) and simulated (output) values of PDNFMRBF in the data sets. A notable and satisfying agreement between the values can be observed in Fig. 4.
The SVR regression performance is associated with the proper selection of the parameters, which are ε, C, and γ. The value of the ε is related directly to the number of support vectors.51 The value of γ is determined by the width of the bell-shaped curves in the structure of the SVM regression with a Gaussian function (as shown in eqn (7)). In this research, the polynomial kernel function was found to be more accurate than the Gaussian function; therefore, we did not achieve the value of γ. On the other hand, the value of C allows the SVM to gain more simple curves as the goal accuracy is obtained.42 Thus, we selected the values of the ε and C parameters to achieve a highly accurate SVM regression in the prediction of the diameter of the electrospun nanofibers. In the parameter optimization of PDNFMSVM, all data were divided into two subsets: 80% (610 samples) for the training network and 20% (152 samples) of data for testing the PDNFMSVM accuracy and generalization. Table 3 presents the most appropriate PDNFMSVM parameters and prediction accuracies for SVM regression of the train and test data.
| ε | C | Test set | Training data | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| R2 | MSE | RMSE | MAE | WI | R2 | MSE | RMSE | MAE | WI | ||
| 62.02 | 888.78 | 0.829 | 0.0067 | 0.082 | 60 | 0.9991904256 | 0.657 | 0.0001 | 0.012 | 60.41 | 0.9987218987 |
The best finding was obtained for data modeling without data standardization, while the data was standardized in the MLP and RBFNN modeling to achieve better and faster findings. As shown in Table 3, the best ε value was 62.02, the C value was 887.78 concerning the values of R2 in the training and test data sets. Other models with other values of ε and C show over-fitting and under-fitting in models. In over-fitting models, a significantly higher R2 value is achieved in the training data set, but the accuracy of the PDNFMSVM is decreased significantly in the test data test. The PDNFMSVM is close to that of the training data, otherwise the model is over-trained. In under-fitting models, the performance of PDNFMSVM is not acceptable, or the best performance is not obtained.
The results obtained from the scatter plot of the SVM outputs versus the target values of the PDNFMSVM for the training and test sets, and all data are set out in Fig. 5. The determination of the coefficient (R2) shows the acceptable correlation between the PDNFMSVM outputs and the target values.
![]() | ||
| Fig. 5 Scatter plots of PDNFMSVM outputs versus the target values for the training and test sets and for all data. | ||
Fig. 6 compares the real (target) and simulated (output) values of PDNFMSVM in the data sets. A notable and satisfying agreement between the values is observed in Fig. 6.
Fig. 7 provides the Taylor diagrams observed for the performances of the computing models (i.e., MLP, RBFF, and SVM). It is shown that the MLP model provided a higher RMSD and correlation coefficient compared to the RBFF and SVM models. Therefore, comparison of the findings of the models shows that the MLP is the most accurate model in the prediction of the diameter of the diameter of the PCL/Gt electrospun nanofibers (Fig. 7).
Comparing the findings of PDNFMMLP, PDNFMRBF, PDNFMSVM shows that PDNFMMLP is the most accurate model for the prediction of the diameter of the PCL/Gt electrospun nanofibers (Fig. 8). In comparison to RBFNN and SVM, the MLP model shows the highest R2 value in training, test, and total data sets. After randomizing the data, they were divided into training and test data sets so that the same training and test samples were used for the three modeling methods.
The properties and morphology of the electrospun nanofibers are critical factors when designing nanofibers for different applications. This property depends on many processing parameters.17,24 ANN models have been applied previously as a predictive modeling tool for electrospun nanofibers.52 For example, Ketabchi et al. developed and tested the accuracy of the ANN model for predicting the diameter of chitosan/PEO nanofibers in trials and studies and for analysis of the interactions between the involved electrospinning parameters and the diameter of the chitosan/PEO nanofibers with sufficient sensitivity and specificity.53 In this research, we attempted to validate the accuracy of data mining models such as MLP, RBFNN, and SVM, on the effects of processing parameters including the polymer weight ratios, nozzle–collector distance, applied voltage, and the injection rate on the average diameter of the electrospun PCL/Gt nanofibers.
As can be seen from Table 1, the MLP as an ANN modeling approach can successfully predict the structure of the PCL/Gt electrospun nanofibers with an accuracy of up to at least 0.96 (R2 in test data), so long as reliable and in range data are available to run the PDNFMMLP. Owing to the high degree of complexity in the relationships between the electrospinning parameters and the diameter of the nanofibers, these values were satisfactory.20 The successful application of a BP neural network and MLP in electrospinning studies has been proved in previously published research with a higher accuracy in comparison with the multiple regression models.31,54,55 The reliable results of ANN modeling in electrospinning nanofibers studies have been illustrated in previous studies in which Khanlou et al. aimed to employ an MLP network with a BP algorithm to assess the application of the ANN to predict and optimize the electrospinning parameters for polymethyl methacrylate nanofibers. Using an ANN with a three layer BP neural network, there is a perfect correlation between the targets and outputs. The correlation factors for the training and validation samples were 0.98 and 0.99, respectively.31 Another study showed the ability of an ANN to predict the diameter of polyurethane nanofibers by considering variables of the ratio of solvents, average molecular weight of polyurethane, concentration, voltage, distance, and the electric field. The results show that artificial neural networks can predict the diameter of electrospun polyurethane nanofibers well.56 In this research, PDNFMMLP provides a framework for accurate analyzing electrospinning parameters and the diameter of electrospun PCL/Gt nanofibers that will result in greater economy and save time. The results of the MLP approach, especially the greater accuracy (R2 = 0.96) obtained in comparison with the RBFNN (R2 = 0.821), and SVM (R2 = 0.829) results signify that PDNFMMLP can be used as a comparative impact assessment model for predicting the diameter of PCL/Gt nanofibers.
Fig. 10a and c shows the effect of the PCL/Gt weight ratios and applied voltage on the diameter of the fiber. As one can see, there is a negative correlation between the weight ratio and the voltage and fiber diameter, therefore the fiber diameter decreases with the increased voltage or weight ratios. The fiber diameter decreased upon increasing the content of PCL in the AA/FA solution, as shown in Fig. 10a. A reason for this result could be the presence of an emulsion, which can be weakened at higher PCL contents. PCL/Gt exhibit an emulsion structure when using an AA or AA/FA mixture as a solvent. The explanation for this emulsion structure is the absence, or very limited miscibility, of Gt and PCL and the relatively weak interaction with AA and FA. These results are in accordance with those obtained by Denis et al.29 Also, a decrease in the viscosity of the polymer solution can be ascribed as the cause of the decrease in the nanofiber diameter.57 As shown in Fig. 10b, the effect of distance on the diameter of the nanofiber is contradictory, the fiber diameter increases, as well as decreases, with an increase of the distance. Indeed, at a short spinning distance, there will not be sufficient time for the solvent to evaporate before the jet is placed on the collector owing to thicker nanofibers. Furthermore, the diameter of the nanofibers decreased with an increase in the spinning distance.58 The curve is downward-sloping for high values of the spinning distance. This result may be explained by the fact that by increasing the spinning distance, the jet has enough time to stretch and the solvent will have more time to evaporate before the jet is deposited on the collector leading to thinner fiber formation.58 Another reason is probably owing to breaking of the formed jet into two or more jets, leading to finer nanofibers.20 These results were reported in recent studies.56–58 Fig. 10c shows the effect of applied voltage on the diameter of the nanofibers. In fact, at a high applied voltage, the electric field strength is high, resulting in further stretching of the jet before it is deposited on the collector, and hence the fiber diameter will decrease. On the other hand, increasing the applied voltage will result in an increased surface charge on the droplet jet, favoring the formation of thinner fibers. This observation is in agreement with those from previously published reports.59–61
![]() | ||
| Fig. 10 The trend of the DPNFMMLP output changes upon varying: the (a) PCL/Gt weight ratio; (b) distance; (c) voltage; and (d) injection rate. | ||
Considering the trends observed in Fig. 10d, the injection rate has a two-fold impact on the diameter of the fiber. First, the diameter of the nanofibers increases with an increase in the volume injection rate. The published literature indicates that an increase in the injection rate of the solution typically increases the diameter of the nanofibers.62,63 This result could be explained by the fact that an increase in injection rate leads to an increase of the amount of polymer solution delivered to the top of the needle making the jet diameter increase.57,58,62 Therefore there is a limitation to this, after reaching an optimum value, the diameter of the nanofibers continuously decrease with an increase in the injection rate. Some studies show that the injection rate increase leads to a decrease in the diameter of the nanofibers.20,64 As increasing the injection rate will cause higher electrostatic forces, an increase in the volumetric charge density on the droplet jet, and a greater tensile force which might increase the stretching of the jet, resulting in the formation of thinner diameter nanofibers.59,65 The literature shows that ANN techniques have been designed with data analysis, modeling, simulation, and output.21 These models have been used to investigate the relationship between electrospinning parameter and the diameter of the nanofibers or fiber morphology .1,58
PDNFMMLP provides a new tool as a prediction model for the electrospinning of nanofiber considering the variables of the PCL/Gt weight ratios, applied voltage, distance and, injection rate. PDNFMMLP provides a new tool as a DSS in PCL/Gt electrospinning for prediction of the diameter of the nanofibers resulting from electrospinning. PDNFMMLP was developed for researchers or lab technician to predict the diameter of PCL/Gt nanofibers, which helps them to save time and money. In this study, which develops a tool for the prediction of the diameter of PCL/Gt nanofibers, the steps of model development and implementation are described as a DSS. We proposed the flow diagram shown in Fig. 11 to design a DSS using PDNFMMLP for PCL/Gt electrospinning. The described DSS has been designed using data analysis, modeling, simulation, and output. The output of PDNFMMLP is applicable in the electrospinning of PCL/Gt nanofibers, and the electrospinning processing parameters and diameter of PCL/Gt nanofibers could be modified based on the proposed DSS to reduce time and costs. The values of the optimization factors are not proposed for use in other research, but they could be used as alternatives values to optimize models in this kind of research. Finally, a graphical user interface (GUI) was designed to run the PDNFM model on new data for which the researchers are planning for PCL/Gt electrospinning. A GUI, as a user friendly tool, is designed to provide easy utilization of an ANN technique. The GUI provides a tool to predict nanofiber diameter before electrospinning by changing the values of the input parameters, which saves time and material.66
In general, it is necessary to mention that sizes at the nanoscale significantly impact the structural, mechanical, thermal, thermo-dynamic, kinetic, and electrical properties of materials.28 Furthermore, rapidly increasing interest has been shown by users, and this has led to the production of diverse nanofibers for versatile usage in various applications.67 Determination of the nanofiber size is, therefore, a matter of considerable importance in electrospinning for various applications.68 With the help of the PDNFM model, the changes are detectable before electrospinning. This means that a change in the diameter of the nanofibers and the morphology is measurable before electrospinning. The researchers, engineers, and experts working in academia and industry can easily predict the diameter of nanofibers in electrospinning. GUI as a DSS tool will be run on new data by simply inputting the diameter of the nanofiber, as shown in Fig. 12. As an example, Fig. 12 illustrates the effect of two different electrospinning processing parameters on the diameter of PCL/Gt nanofibers. We found the diameter of nanofibers to be thick (200 to 500 nm) in plan (a) for our specific application in PCL/Gt electrospinning. Therefore, we modified the electrospinning processing parameters of plan (a) to plan (b), in which the diameter of the nanofibers will be optimal in PCL/Gt electrospinning for our application. The modification was conducted by changing the most significant factors in the sensitivity analysis results and trends shown in Fig. 10. Overall, the findings of this study indicate the advantages of modeling and optimizing the diameter of electrospun nanofibers using an ANN and the advantages of these models for experts in the nano-field, tissue engineering, pharmaceutical, environmental, medical, food and engineering industries to help reduce product costs.
| This journal is © The Royal Society of Chemistry 2019 |