A robust methodology for PEC performance analysis of photoanodes using machine learning and analytical data†
Abstract
Machine learning (ML) is increasingly applied across various fields, including chemistry, for molecular design and optimizing reaction parameters. Yet, applying ML to experimental data is challenging due to the limited number of synthesized samples, which restricts its broader application in device development. In energy harvesting, photoanodes are crucial for solar-driven water splitting, generating hydrogen and oxygen. We explored electrodes like hematite and bismuth vanadate for photocatalytic uses, noting varied photoelectrochemical performances despite similar preparations. To understand this variability, we applied a data-driven ML approach, predicting photocurrent values and identifying key performance influencers even with limited experimental data in the research development of inorganic devices. We have utilized multiple machine learning algorithms to predict the target value in the calculation process, where the contributions of the dominant descriptors were unknown. We introduced a novel methodology, incorporating clustering to manage multicollinearity from correlated analytical data and Shapley analysis for clear interpretation of contributions to performance prediction. This method was validated on hematite and bismuth vanadate, showing superior predictability and factor identification, and then extended to tungsten oxide and bismuth vanadate heterojunction photoanodes. Despite their complexity, our approach achieved determination coefficients (R2) with a prediction accuracy over 0.85, successfully pinpointing performance-determining factors, demonstrating the robustness of the new scheme in advancing photodevice research.
- This article is part of the themed collection: Analyst HOT Articles 2024