Yasuhito Kobayashiab,
Yuta Miyakec,
Fumitaka Ishiwaricde,
Shintaro Ishiwataa and
Akinori Saeki*cd
aDivision of Materials Physics, Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan
bInteractive Materials Science CADET, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan
cDepartment of Applied Chemistry, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan. E-mail: saeki@chem.eng.osaka-u.ac.jp
dInnovative Catalysis Science Division, Institute for Open and Transdisciplinary Research Initiatives (ICS-OTRI), Osaka University, 1-1 Yamadaoka, Suita, Osaka 565-0871, Japan
ePRESTO, Japan Science and Technology Agency (JST), Kawaguchi, Saitama 332-0012, Japan
First published on 16th May 2023
The bulk heterojunction structures of organic photovoltaics (OPVs) have been overlooked in their machine learning (ML) approach despite their presumably significant impact on power conversion efficiency (PCE). In this study, we examined the use of atomic force microscopy (AFM) images to construct an ML model for predicting the PCE of polymer:non-fullerene molecular acceptor OPVs. We manually collected experimentally observed AFM images from the literature, applied data curing and performed image analyses (fast Fourier transform, FFT; gray-level co-occurrence matrix, GLCM; histogram analysis, HA) and ML linear regression. The accuracy of the model did not considerably improve even by including AFM data in addition to the chemical structure fingerprints, material properties and process parameters. However, we found that a specific spatial wavelength of FFT (40–65 nm) significantly affects PCE. The GLCM and HA methods, such as homogeneity, correlation and skewness expand the scope of image analysis and artificial intelligence in materials science research fields.
Machine learning (ML) empowered by artificial intelligence (AI) has garnered significant attention as an alternative approach to rapidly discovering new materials and scientific principles.34–37 In this context, ML has been increasingly deployed for the exploration of organic electronic materials, such as OPVs,38–45 organic light-emitting diodes (OLEDs)46–48 and organic thin-film transistors (OTFTs).49–51 Typically, digital fingerprints associated with chemical structures, molecular properties obtained from experiments and quantum chemical calculations and other extracted feature variables are learned in conjunction with the respective objective variables, such as PCE of OPVs, photoluminescence quantum yield of OLEDs and charge carrier mobility of OTFTs. While the film morphology of these thin-film devices is crucial to their performance, attempts to incorporate image analysis data into ML models remain challenging. Some examples of AFM (or TEM) image analysis relating to OPVs include a fast Fourier transform (FFT) analysis of poly(3-hexylthiophene) (P3HT):[6,6]-phenyl-C61-butyric acid methyl ester (PCBM) film,22 image recognition of fibrous structures of P3HT film,52 miscibility of binary/ternary mixture in the framework of Flory–Huggins theory53 and electron spectroscopic imaging to identify polymer and NFA domains.54 Despite the informative acquisition of film surface morphology by widely used AFM, quantitative and comprehensive analyses of these images in large datasets have remained unexplored, possibly due to the non-uniformity in image quality, scattered location in digital space and diverse image processing methods.
In this study, we present an approach where AFM images were integrated into an ML model for predicting the PCE of binary (polymer and NFA) BHJ-OPVs. After gathering AFM images from literature sources, we performed data curation and analysis, followed by the construction of an ML model. We examined the combinational effects of material properties such as bandgap (Eg) and highest occupied molecular orbital (HOMO) of both polymers and NFAs. Our results indicate that even though achieving high prediction accuracy of PCEs is challenging, the characteristic FFT wavelength proved to be crucial to PCE prediction. Our study establishes a basis for interpreting AFM images and sets the stage for future exploration of OPV materials.
Fig. 1 Schematic of the workflow in this study. The example AFM images are taken from a literature24 with permission. |
Fig. 2a shows FFT analysis of AFM images for the low (1.03%),61 middle (8.02%)62 and high (17.1%)63 PCE. The 2-dimensional Fourier power spectra (FPS) were converted to isotropic ones (IFPS), which exhibit multiple characteristic peaks at shorter than 100 nm (Fig. 2b). The minimum and maximum spatial wavelengths are 11.72 (= 5.86 × 2) and 1500 (= 750 × 2) nm, respectively. For a comparison of spatial wavelength among different height-scale AFM images, IFPS was normalized by the intensity at 1500 nm. And then, the power values for each spatial wavelength (from 11.72 to 300 nm, 124 points in total) were used as the input parameters in the ML analysis. Instead of losing height information in the normalization and high pass filter (>300 nm−1) process, HA data (the maximum, mean, variance, energy, contrast, entropy, skewness, and kurtosis values shown in Fig. S2 (ESI†)) were incorporated together with IFPS. For example, the variance and skewness of the histogram are related to the surface roughness.
Fig. 2 FFT analysis of AFM images. (a) Upper panels: original AFM images (grey-scale); lower panels: their FFT images. The AFM images of BHJ films (A, B, and C) were taken from literature61–63 with permission and showed PCE of 1.03, 8.02 and 17.1%, respectively. (b) IFPS of A, B, and C. |
In addition to the FFT and HA analyses, we used GLCM that is a conventional but useful image texture analysis method applied to various images.59,60 As shown in Fig. 3a, an original grey-scale AFM image, each pixel of which possesses a brightness level (0–255), was analysed by counting the pixel values in the neighbour. The definition of neighbour is given by (δ, θ), where δ (from 1 to the maximum pixel size) is the distance from the centre pixel, and θ (from 0 to 90°) is the angle from the horizontal axis. An example of (δ, θ) = (1, 0°) is illustrated in Fig. 3b, together with the acquired GLCM matrix: P(i, j). The row (i) and column (j) of the matrix correspond to the neighbour pixel level and centre pixel level, respectively. This matrix example indicates high values in the left top, which means that the pixels with low brightness levels (1 or 2) locate at a specific region. The example also shows a convergence along the diagonal line, which indicates that the change in brightness level is moderate. If high counted values appear in the right top and/or left bottom, the change of brightness level is steep. The acquired GLCM with different δ (2–10 pixels) and the horizontal direction (θ = 0°) was then subjected to various statistics such as homogeneity, contrast, dissimilarity, angular second moment (ASM), contrast, energy and correlation (Fig. 3c).
A linear regression using least absolute shrinkage and selection operator (LASSO)64,65 was performed using Mordred descriptors, material properties and process parameters as the input parameters. The correlation of experimental and predicted PCE values are displayed in Fig. 4a, where the train data (n = 623) and test data (n = 267) exhibited the correlation coefficient (r) values of 0.802 and 0.747, respectively. The root-mean-square error (RMSE) values are appended in the caption. The r of train data is lower than those obtained by random forest (RF) regression (r = 0.83–0.85 for 5-fold cross validation) in the previous study using similar explanatory variables.55 However, we used LASSO in this study to discuss the polarity of variable coefficients. The negative and positive coefficients (top 30 for each sorted in decent order) are shown in the left and right panels of Fig. 4b, respectively (the list is provided in Table S3 (ESI†)). Most of the important variables are Mordred descriptors derived from the chemical structures, while the material properties of LUMO of NFA, PDI of polymer and Eg of NFA along with the process parameter of 1,2,4-trimethylbenzene solvent (Solvent_TMB) are ranked on the high level. The most important positive and negative parameters are averaged and centred Moreau-Broto autocorrelation of lag 0 weighted by ionization potential (AATSC0i) of polymer and Geary coefficient of lag 6 weighted by intrinsic state (GATS6s) of NFA, respectively.66,67 They are obtained by calculating autocorrelation and quantifying the clustering in 2-dimensional space, although their meaning is difficult to correlate directly with a physical property. However, multi-collinearity calculations of these parameters revealed that AATSC0i appears to correlate with AMID_X (averaged molecular ID on halogen atoms) and nF (number of F atoms) with r = 0.881 and 0.863, respectively (Table S4 (ESI†)). The addition of halogen atoms (typically fluorine) in the polymer is thought to fine-tune its energy level and improve the BHJ morphology. GATS6s was also found to correlate with NtN (the sum of nitrogen atoms) and C1SP1 (sp carbon bound to 1 other carbon) with r = −0.753 Table S5 (ESI†). This is linked to the cyano substituent (–CN) widely appended to efficient NFAs. Meanwhile, the positive, large coefficient of –LUMO of NFA indicates that the deepened LUMO level of NFA contributes to the increase of PCE, which is understood from the increased offset for efficient electron transfer from LUMO of polymer to that of NFA.
Fig. 4 Results of LASSO regression. (a) and (b) The explanatory variables are chemical structures (Mordred descriptors), material properties (bandgap, etc.), and process parameters (solvent, etc.). (c) and (d) The aforementioned parameters plus GLCM and HA data. (e) and (f) The aforementioned parameters plus FFT and HA data. The upper panels (a, c, and e) are the regression plots of experimental (horizontal) and predicted (vertical) PCE. The white-blue (n = 623) and dark blue (n = 267) circles are the train and test data, respectively. The correlation coefficient (r) values of the train and test data are appended. The lower panels (b, d, and f) are the feature importance in decent order. The green, red, orange, and blue bars correspond to the material properties and process parameters, GLCM parameters, HA parameters, and FFT parameters, respectively. A complete list of rankings is provided in Tables S3, S6, and S7 (ESI†). |
Even after additional inclusion of GLCM and HA data into the explanatory variables, the prediction accuracy was not improved. As shown in Fig. 4c, the r values of train and test data are 0.814 and 0.752, respectively, similar to those of Fig. 4a. The ranking of coefficients in Fig. 4d was also very similar, while the GLCM parameters were ranked on the 2nd (homogeneity of δ = 2) and 13th (correlation of δ = 4) in the negative polarity (the list is provided in Table S6 (ESI†)). In addition, the maximum height of HA was ranked the 6th in the negative polarity. These parameters represent the smoothness of a film, and thus, their negative impact on PCE (the smaller, the better) is readily reasonable.
As shown in Fig. 4e, the additional use of FFT and HA data as the explanatory variables exhibited almost unchanged, but marginal improvement in the r values of train (0.817) and test data (0.756). Notably, FFT parameters appeared on the high rank in their coefficient intensities (Fig. 4f). The spatial wavelengths of 65.22 and 40.54 nm were ranked the 2nd and 5th in the positive polarity, respectively. Interestingly, the positive impact of 40–65 nm spatial wavelength (their half-pitch is 20–33 nm) is consistent with the exciton diffusion length in OPV (∼20 nm),68,69 where a narrow bicontinuous network of BHJ maximizes the efficiencies of exciton diffusion and charge separation that benefit the OPV performance. Meanwhile, a spatial distance much larger than the exciton diffusion length causes a radiative and non-radiative loss of exciton, leading to the decrease in JSC and PCE. This finding is the same as the previous FFT study on P3HT:PCBM films with different thermal annealing process;22 however, it should be emphasized that the general importance of fine bicontinuous network shorter than ∼30 nm was found in a large data set (n = 890) comprising various types of polymers and NFAs.
Additionally, we examined RF algorithm instead of LASSO by using the same explanatory variables (Fig. S3†). The r values of train data were high (0.947–0.968), whereas those of test data showed a large decrease (0.814–0.781), possibly due to some overfitting. Similarly to the previous report55 and LASSO results, the most important feature was StN of Mordred descriptor for all models (Tables S9–S11 (ESI†)), which is associated with the number of triple bonds nitrogen of the cyano substituent. In the GLCM and HA-based model, the correlation of δ = 2 of GLCM was ranked the 22nd. In the case of FFT and HA-based models, no FFT spatial wavelength was ranked in the top 30th. This indicates that LASSO is more suitable than RF to focus on AFM-derived parameters.
Since LASSO is a sparse model and reduces the parameters after optimization, detailed examination over the whole spatial wavelength of FFT is unable to access. We, therefore, performed Ridge regression by using FFT and HA descriptors. Fig. 5a displays the results for the train and test data, of which r values are 0.844 and 0.755, respectively, mostly identical to those of LASSO. However, the ranking features appear to drastically change (Fig. 5b). Many material properties and process parameters are ranked on the top level in both positive and negative coefficients. In addition, HA parameters such as maximum height, mean, skewness and variance are ranked higher than 20th. These parameters are associated with the height scale roughness of a BHJ film.
Fig. 5 Results of Ridge regression. The explanatory variables are chemical structures (Mordred descriptors), material properties (bandgap, etc.), process parameters (solvent, etc.), and FFT and HA data. (a) Regression plot of experimental (horizontal) and predicted (vertical) PCE. The white blue (n = 623) and dark blue (n = 267) circles are the train and test data, respectively. The correlation coefficient (r) values of the train and test data are appended. (b) Regression coefficient ranking in decent order. The green, orange, and blue bars correspond to the material properties and process parameters, HA parameters, and FFT parameters, respectively. A complete list of rankings is provided in Table S8 (ESI†). |
An in-depth comparison of LASSO and Ridge results about the FFT spatial wavelength is provided in Fig. 6a. The continuous Ridge spectrum is very similar to the discrete LASSO spectrum, where the wavelength regions at 25–100 nm exhibit a positive impact on PCE. In the Ridge model, negative coefficients are observed at >150 nm. Such a large-scale structure is much larger than the exciton diffusion length and would deteriorate the device performance.
We next demonstrate a generation of virtual BHJ morphology suitable for an efficient OPV. An original grey-scale image generated by applying noise (Fig. 6b) was processed by FFT with a band pass filter (BPF), and new images were reconstructed by inverse FFT. The IFPS with/without BPF are appended in Fig. S4 (ESI†). As shown in Fig. 6c, a narrow fibrous network was acquired after applying BPF-1 that corresponds to the preferred wavelength at 25–100 nm. In contrast, wide and rough features are observed after applying BPF-2 at the un-preferred wavelength at >150 nm (Fig. 6d). We thus exemplified the importance of high frequency (wavelength: 40–65 nm and their half-pitch is 20–33 nm) bicontinuous network in BHJ, which is general regardless of the polymers and NFAs. To examine the direct effect of AFM images on the solar cell performance, we performed LASSO, Ridge and RF by solely using AFM data (GLCM, HA, FFT and their combinations) as the explanatory variables (Fig. S5 (ESI†)). The r values of LASSO and Ridge were very low (0.3–0.4) for both train and test data. The RF model showed a high r of ∼0.85 for train, but a significantly low r of 0.3–0.5 for test, due to overfitting. Accordingly, the PCE prediction with moderate accuracy needs material properties and process parameters. Given the almost unchanged r values irrespective of using AFM data, the chemical structures and material properties may implicitly involve miscibility and BHJ information for a target polymer:NFA blend. This envisions a complete virtual screening of OPV materials without explicitly considering the BHJ structure.
Footnote |
† Electronic supplementary information (ESI) available: Supplementary Tables S1–S11 and Fig. S1–S5. See DOI: https://doi.org/10.1039/d3ra02492j |
This journal is © The Royal Society of Chemistry 2023 |