Open Access Article
Eva Unger
ab and
T. Jesper Jacobsson
*c
aHySPRINT Innovation Lab: Hybrid Materials Formation and Scaling, Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Kekuléstraße 5, 12489, Berlin, Germany
bDivision of Chemical Physics and Nano Lund, Lund University, Box 124, 22100 Lund, Sweden
cDepartment of Physics, Chemistry and Biology (IFM), Linköping University, Linköping, Sweden. E-mail: jacobsson.jesper.work@gmail.com; jesper.jacobsson@liu.se
First published on 29th October 2025
Reproducing literature results for perovskite-based optoelectronic devices is often surprisingly difficult. We argue that a major reason for this problem is insufficient data dissemination, and that this could be mitigated by a collective effort to develop and adopt standardised data-sharing protocols for experimental procedures.
Broader contextMetal–halide perovskites have become leading candidates for next-generation optoelectronics. A persistent challenge in achieving this has been reproducibility. Devices reported under seemingly identical conditions often yield widely varying results, raising concerns about reliability and slowing progress toward commercialisation. This problem is partly rooted in insufficient standards for data reporting, where critical fabrication and characterization details are often omitted or ambiguously described. Adopting FAIR (Findable, Accessible, Interoperable, Reusable) principles and machine-readable data protocols offers a practical path forward. Structured device descriptions would enable reproducibility across laboratories, support large-scale data aggregation, and unlock the potential of machine learning and autonomous experimentation. In this perspective, we present a vision and a path forward for how this could be implemented in the perovskite field. While focusing on perovskites, the problem, as well as the vision for how to solve it, extends to other experimental material science fields as well. |
Multiple factors underpin this variability. We argue that a core issue lies in inadequate standards for reporting device data. We further propose that by improving data management practices – through standardisation and accessible dissemination protocols – we could reduce performance variability, boost reproducibility, strengthen trust in published findings, and ultimately accelerate technological progress. While our discussion centers on perovskite solar cells, the same principles and practices are broadly applicable to other material systems and applications.
The standard medium for describing experiments is the methods section of journal articles. One of the insights from the Perovskite Database Project,6,7 where we extracted data on over 42
000 solar cell devices from 7370 publications, was that as a community, we struggle to write experimental sections with sufficient detail and clarity. Reproducibility is favoured by exhaustive technical detail, structure, rigor, repetition, and completeness. However, those are qualities often at odds with good narrative prose, and scientific journals are primarily vehicles for storytelling, which favours a concise, fluid, and engaging language. This tension between crafting an engaging narrative and reporting exhaustive technical detail frequently leads to gaps in the information needed to replicate a study. For example, effective storytelling tends to focus on novelty and highlights the differences between control devices and the improved devices that are central to the study, while omitting shared or routine details crucial for external reproducibility and cross-study comparisons. Even when experimental protocols are reasonably detailed and well described, challenges often arise when multiple device types are described. In such cases, compactness often takes precedence over clarity, leaving readers to piece together which conditions apply to which device in a way reminiscent of solving a detective mystery. Moreover, there are also many instances where parameters critical to the results have simply been omitted.10
These unsatisfactory standards in data dissemination are partly a matter of cultural norms and practices. Writing a clear, precise, and detailed experimental protocol is hard work. It requires time, effort, and careful thought – effort that may not be undertaken without proper incentives. Setting better standards will therefore require both proper tools and strong community norms, as well as explicit pressure from publishers, reviewers, and funders.
A practical path toward this vision would involve assigning each device a detailed protocol in a structured file format – JSON (JavaScript Object Notation) is a strong candidate being an open standard widely adopted for data storage, transmission, and dissemination through name-value pairs.11 Each device would thus have a dedicated file containing all relevant parameters necessary for reproducibility, as well as data and metadata from any measurements performed on that device. We envision a modular, hierarchical approach to describing perovskite devices, relying on reusable data blocks that capture every aspect of their fabrication and characterization (Fig. 2).
In this scenario, every published paper would include such a data file for each device discussed. Ideally, these files would also be deposited in a persistent online repository, making them citable, referable, and programmatically accessible (Fig. 2). This strategy aligns with the FAIR data principles (i.e. Findable, Accessible, Interoperable, Reusable),12,13 and would enhance the quality, value, and accountability of perovskite research. It would also create an aggregated resource that can be used to track research progress and for extracting additional insights using statistical methods such as machine learning.
As a pilot project in this direction, we have developed a hierarchical, modular JSON schema for describing perovskite compositions which we have described in more detail in a recent publication.14 Complementary online tools, hosted on the NOMAD platform,13,15 enable simple generation of perovskite composition files and provide persistent, citable, and programmatically accessible online storage. This provides a clear example of how a standardised protocol can be designed and made available online together with functionality for generating and storing structured descriptions in an online repository.
Building on insights from this pilot, we are now developing protocols and utilities for a more complete description of both single-junction and tandem perovskite solar cells. Creating data schemas that are sufficiently precise and complete to allow for consistent reproducibility across laboratories is, by necessity, an iterative process. We have recruited numerous volunteers to assist with this effort. To identify critical process and material parameters essential for reproducible results, we are also conducting round-robin experiments in which device recipes are cross-validated across multiple labs.
A cultural shift is likewise needed to realise this vision. A critical mass of researchers must recognise the value of sharing structured device data in accordance with FAIR principles. Without widespread buy-in, even the best tools and protocols will fail to gain traction. Encouragingly, there are signs that such a shift is underway. Growing interest in artificial intelligence – spurred by tools like ChatGPT – has increased awareness of AI's potential, as well as the need for abundant, well-structured data to power it. This awareness may help catalyse a broader adoption of FAIR data-sharing practices.
Nevertheless, while the availability of standards, user-friendly online utilities, successful implementation examples, and demonstrated benefits may encourage many to adopt this vision, it will not be enough for everyone. If only a few groups adopt good standards, the quality of their papers will improve, but the broader field will not benefit very much. This is a classic network effect. The overall benefits rise dramatically as more groups comply with the same improved reporting standards. Some will see only the additional work involved and conclude that they will themselves not reap any immediate benefits. This is essentially the “tragedy of the commons”, where the collective good is undermined by individual reluctance to contribute.16 To achieve widespread compliance, appropriate incentives must therefore be established.
Journals, publishers, and editors are uniquely positioned to create and enforce such incentives, both because they are few in number and because they act as the gatekeepers of peer-reviewed research. By encouraging the use of proper standards and promoting online data repositories that serve the needs of their scientific communities, publishers can have a profound impact. Once proper standards and infrastructure have been established and accepted, they can go on and mandate that device data must be reported in accordance with best practices. Such policies would create the pressure needed to ensure compliance and help drive the cultural shift required for a more valuable representation of perovskite devices.
To convince a community to buy into this vision will likely require a combination of defined standards, reasonably user-friendly procedures for using them, and demonstration of utility, paired with pressures from publishers and funding agencies.
New and better reporting standards are now being developed. We hope that the community will adopt and continue to refine these standards, and that publishers will help to enforce them. A better, FAIRer, and more reproducible future is possible – at least if we are willing to invest the necessary effort, embrace the required changes, and work collectively to enforce them.
| This journal is © The Royal Society of Chemistry 2025 |