Open Access Article
Chenyao Ma
a,
Yuhang Wang
c,
Di Zhang
cd,
Wei Duab,
Qiang Gaoa,
Rui Sua,
Kan Xua,
Huan Gua,
Limin Li*ab,
Piao Ma*ab and
Hao Li
*c
aSuzhou MatSource Technology Co., Ltd, Suzhou 215000, Jiangsu, China. E-mail: lilimin@matsourceai.com; mapiao@matsourceai.com
bGusu Laboratory of Materials, Suzhou 215000, Jiangsu, China
cAdvanced Institute for Materials Research (WPI-AIMR), Tohoku University, Sendai 980-8577, Japan. E-mail: li.hao.b8@tohoku.ac.jp
dFrontier Research Institute for Interdisciplinary Sciences (FRIS), Tohoku University, Sendai, 980-8577, Japan
First published on 17th June 2026
Energy materials underpin global energy transformation and low-carbon development, and their innovation and performance optimization are essential to removing bottlenecks in energy conversion and storage. Conventional research and development modes, including empirical, theoretical, and computational methods, are restricted by low efficiency or inherent conflicts between accuracy and system scale, and can hardly satisfy the requirements of modern high-precision, high-throughput material investigations. The 4th+ paradigm, an extension of the data-driven paradigm (4th paradigm) empowered by advanced artificial intelligence (AI) and data science, takes material databases, universal machine learning interatomic potentials (MLIPs), large language models (LLMs), intelligent agents, and full-process closed-loop systems as the core methodological framework of this perspective. It delivers atomic-scale modeling accuracy and efficient knowledge extraction capabilities to overcome these limitations, distinguished by its more sophisticated methodologies for generating, processing, and extracting knowledge from data. This perspective summarizes the advances of this paradigm in energy materials research and clarifies the mechanisms by which MLIPs and LLMs break through traditional constraints. On this basis, we further construct a future-oriented research framework consisting of four modules and outline prospective development trends to advance intelligent research platforms and accelerate the discovery and industrial translation of high-performance energy materials.
Each paradigm overcomes the limitations of its predecessor while giving rise to new research and development challenges. The evolution of energy materials R&D has spanned empirical, theoretical, and computational paradigms. As the core research direction of this evolutionary trend, AI-driven discovery focuses on designing, optimizing and predicting properties of energy materials via ML, data mining and high-throughput computation to assist conventional experiments. Many traditional trial-and-error experiments are inefficient, costly, and stochastic, generating fragmented data that impedes rational materials design. Theoretical advances, including quantum chemistry and density functional theory (DFT), offer atomic-level insights into electronic and crystal structures.5 Nevertheless, DFT exhibits inherent limitations: an accuracy-scale trade-off, poor adaptability to complex interfaces and disordered systems, and low efficiency in long-time dynamic simulations, which restrict the high-precision, high-throughput development of energy materials. The universal MLIPs bridge the gap between quantum-mechanical precision and large-scale exploration of energy materials, thereby overcoming the computational limitations of conventional DFT-based methods. The data-driven (4th) paradigm has ushered in a new era for energy materials research.9 In this perspective, we use the term “4th+ paradigm” to emphasize that current AI-driven materials discovery remains fundamentally data-dependent, while being substantially enhanced by MLIPs, LLMs, and autonomous agents. In contrast, conventional high-throughput screening (HTS) and self-driving labs (SDLs) have greatly accelerated materials exploration, yet most existing implementations remain task-specific and fail to achieve full integration with standardized databases, physics-aware models, literature-mined knowledge, and long-term feedback infrastructures. Differently, the 4th+ paradigm organically unifies standardized databases, MLIP calculation and LLM knowledge mining to build a complete closed-loop workflow spanning data input, computational prediction and autonomous experimental validation. It integrates universal MLIPs with advanced natural language processing tools to address longstanding experimental and theoretical bottlenecks in the field.10,11 In particular, MLIPs can achieve near DFT-level computational accuracy and boost the efficiency of large-scale simulations by orders of magnitude compared with traditional methods, enabling atomic-level investigations of complex systems. Furthermore, advanced text mining techniques can extract and synthesize knowledge from massive volumes of literature and datasets. The deep integration of computational modeling and knowledge synthesis underpins a data-centric research framework, shortens the R&D cycle, and drives the high-precision prediction and rational design of energy materials. Against this backdrop, energy materials research supported by advanced computational intelligence has entered a new stage of opportunity.
Drawing on high-quality databases, ML regression models (particularly MLIPs), LLMs, and full-cycle closed-loop research systems, this perspective focuses on the data-decisive 4th+ paradigm to systematically review the applications of AI models in the rational design and discovery of energy materials. It analyzes how modern computational methods break through traditional research bottlenecks, aiming to provide support for building standardized, intelligent research platforms and advancing the efficient discovery and industrial translation of high-performance energy materials.12,13
Early materials databases were centered on experimental data (Fig. 2a). Extensive explorations have been carried out by researchers across various fields, as exemplified by several specialized databases: PolyInfo,18 which focuses on polymer structure–property relationships; Starrydata2,19 dedicated to functional inorganic materials (e.g., thermoelectric and magnetic materials); the Crystallography Open Database (COD), which compiles small-molecule crystal structures; and the Inorganic Crystal Structure Database (ICSD),20 which stores experimentally determined inorganic crystal structures. For example, the COD is an open-access repository containing approximately 150
000 small-molecule crystal structures, supporting global data sharing, curation, and reuse of crystallographic knowledge.21 In parallel, the ICSD is a curated database of experimentally determined inorganic crystal structures, providing reliable structural data to support materials characterization, comparison, and discovery.20 However, experiment-based databases face a key challenge: material performance strongly depends on testing conditions, process parameters, and sample preparation details.22 The same material may yield significantly different data under different laboratories and testing conditions. Without standardized recording and extraction of their metadata, the reference value of a single performance value is limited.
With the rapid development of computational materials science, especially the development of DFT and high-throughput computing, computational materials databases have emerged (Fig. 2b). Through large-scale first-principles calculations, such databases provide systematic data for the stability, electronic structure, and thermodynamic properties of materials. A typical example is the Materials Project database, which opens crystal structure, energetics, and other data of inorganic materials and supports application programming interface (API) calls.23 Other databases with similar functions include the Catalysis-Hub24 and Open Catalyst.25 The value of these databases lies in their ability to quickly screen candidate materials before experiments, transforming materials search from empirical exploration into a quantifiable computational process. However, computational databases also have inherent limitations. The vast majority of computational data is based on idealized assumptions, such as perfect crystals and ideal surface structures. While these assumptions ensure the feasibility and repeatability of the calculations, they also make it difficult for the results to directly reflect the complexities of actual operating conditions. Many real-world factors (e.g., surface reconstruction, defect sites, solvent effects, electrode potentials, and interfacial interactions) are often simplified or neglected.
Based on the respective advantages and disadvantages of the two types of databases mentioned above, a new database format has emerged: an integrated platform that combines computational results and experimental metrics. Fig. 2c summarizes representative platforms sorted by five material categories: catalysis, solid electrolytes, hydrogen storage, porous MOFs and two-dimensional materials. For catalytic research, the Digital Catalysis Platform (DigCat: https://www.digcat.org/),26 integrates abundant computational surface libraries and standardized experimental datasets. Via embedded functional interfaces, the platform correlates theoretical adsorption energies with practical device performance and enables instant access to both computational predictions and experimental validations. For solid electrolyte development, Digital Battery Platform (DigBat) focuses on solid-state electrolytes,27,28 emphasizing the standardization of core parameters such as ionic conductivity and migration activation energy; while the Digital Hydrogen Platform (DigHyd) extracts structured data from literature images through a multi-agent workflow to build a high-quality database;29,30 in the porous material category, the ARC-MOF database contains around 280
000 experimentally reported and computationally generated MOF structures with DFT-derived atomic and adsorption parameters;31 these data have been used to support comparative evaluation of CO2 capture performance across different MOF materials.32 For low-dimensional materials, X2DB collects 370 experimentally synthesized two-dimensional materials and links experimental findings with computational databases for cross-scale property characterization.33 These integrated platforms share a common feature. They connect computational descriptors, experimental records, contextual metadata, and tool interfaces. This transforms material data from simple aggregation into a dynamic infrastructure that supports AI access, experimental validation, and process feedback.
However, current materials databases still face an important challenge. Publication bias causes successful results to be widely reported, while “failed experiments” and negative results are rarely included. This can make AI models overly optimistic about the synthesizability of materials. In the future, negative results should be systematically collected and standardized, with clear categories for synthesis failures and performance failures.
![]() | ||
| Fig. 3 ML for materials research. (a) Evaluation of ML models and their predictions for new materials. (b) Example of a standard neural network employed for fitting potential-energy surfaces. (c) MLIPs training for carbon dynamics analyses. (d) MLIPs for the bulk-surface structure and energy of Ru-based high-entropy alloys (e) MLIPs for surface reconstruction analyses. Adapted with permission from: (a) ref. 19 © 2024 Springer Nature (b) ref. 35 © 2007 American Physical Society (c) ref. 40 © 2024 Springer Nature (d) ref. 41 © 2025 American Chemical Society (e) ref. 42 © 2025 Wiley-VCH. | ||
MLIPs represent a pivotal application of ML in materials research.34 Boasting core advantages of high efficiency and precision, they play a crucial role in atomic-scale materials simulations and catalysis studies. The generalized neural-network representation of high-dimensional potential-energy surfaces proposed in the seminal paper published in 2007 is generally regarded as the prototype of current MLIPs, providing core ideas and a theoretical framework for the subsequent development and innovation of MLIP methods (Fig. 3b). These later developments built on and broadened the foundational concepts introduced by Behler and other early MLIP pioneers.35–37 For example, researchers proposed the Gaussian Approximation Potential (GAP), Many-body Atomic Cluster Expansion (MACE) model, and the MACE-MP foundation model, which overcame critical bottlenecks in earlier MLIPs, enabled robust generalization across diverse chemical systems and length scales, and achieved landmark results in materials simulation, chemical calculation, and other fields. Based on this methodological framework, researchers further optimized and expanded the applicability of the GAP and MACE models, addressing key technical challenges in MLIP development for large-scale system simulations and complex reaction predictions, and promoting the transformation of MLIPs from theoretical research to practical application with large-scale implementation.38,39 These works have greatly promoted the in-depth integration of MLIP technology with fields such as energy materials and catalytic chemistry, providing new and efficient tools for related research.
In practical applications, MLIPs integrated with molecular dynamics can accurately replicate graphene growth on Cu substrates and carbon deposition on various metal surfaces. As a representative class of advanced energy materials, such carbon-based functional materials play a vital role in electrochemical energy storage and conversion, and this work lays a solid foundation for their targeted development (Fig. 3c).40 For the acidic oxygen evolution reaction (OER), MLIPs combined with replica-exchange molecular dynamics resolve the bottlenecks plaguing Ru-based multicomponent energy materials, including insufficient stability, ambiguous atomic-scale mixing and phase formation pathways, and low-efficiency high-throughput screening routines. This integrated strategy characterizes the atomic-scale mixing behaviour of Rux(Ir,Fe,Co,Ni)1−x and RuIr-based alloys, verifies homogeneous bulk face-centered cubic (fcc) phase mixing and reveals the formation of minor hexagonal close-packed (hcp) phase, and establishes a high-throughput simulation protocol to rapidly screen promising RuIr-containing quinary materials, delivering rigorous theoretical guidance for high-performance Ru-based OER energy components (Fig. 3d).41 In the realm of electrocatalysis and sustainable carbon utilization, MLIPs greatly accelerate the simulation efficiency of CO2 electroreduction on Sn-based substrates, reproduce surface reconstruction, and when coupled with pH-field coupled microkinetic models, uncover pH-dependent structure–activity relationships.42 This establishes a feasible route for the rational design of high-performance electroactive materials for CO2 reduction (Fig. 3e). Going forward, with the advancement of multi-fidelity training and active learning, and in-depth integration with LLMs and automated experimental platforms, MLIPs will further drive catalysis research toward the “data-driven, simulation-predicted, experimental-validated” closed-loop paradigm, fuelling the rational design and industrial application of high-efficiency catalysts and empowering innovations and industrialization in energy materials. Nevertheless, MLIP techniques still suffer from inherent drawbacks including prediction uncertainty for configurations far outside training datasets, low reproducibility induced by differing training hyperparameters, and substantial experimental validation costs required to verify computational predictions.
![]() | ||
| Fig. 4 LLMs and AI agents for materials research. (a) Schematic diagram and evaluation methods of the DIVE workflow. (b) El Agente autonomous agent for quantum chemistry. (c) Big data-driven AI analysis of hydride SSEs. (d) Establishment of a large prediction model for catalyst design. (e) Accelerated discovery of ORR electrocatalysts in Pt-based high-entropy alloys. Adapted with permission from: (a) ref. 43 © 2026 Royal Society of Chemistry. (b) Ref. 44 © 2025 Cell Press. (c) Ref. 46 © 2025 Wiley-VCH (d) ref. 5 © 2026 Wiley-VCH (e) ref. 47 © 2024 Wiley-VCH. | ||
El Agente44 is a multi-agent system built on LLMs, facilitating the popularization of computational quantum chemistry. This system can efficiently accomplish core tasks such as molecular structure optimization and material property prediction, and shows great potential in transforming traditional research models in fields such as drug discovery and materials science; its future integration with SDLs will further accelerate the process of scientific discovery, making it a breakthrough tool supporting large-scale and inclusive quantum chemistry research and education, and opening up an avenue for the application of LLMs in chemical research (Fig. 4b).44
In terms of subsequent research on energy materials, LLMs facilitate research hypothesis generation, candidate system screening, and preparation optimization, guiding the rational design of high-performance energy materials.45 In the development of hydride-based solid electrolytes, LLMs collaborate with comprehensive solid electrolyte databases and ab initio molecular dynamics simulations to break through the efficiency bottlenecks of conventional trial and error approaches. They rapidly identify promising candidates with low activation energies and provide key technical support for revealing complex ion migration mechanisms and developing high-performance solid electrolytes (Fig. 4c).46 Furthermore, the application of LLMs in energy catalysis continues to expand. For high efficiency oxygen reduction electrocatalysts based on Pt-containing quinary high entropy alloys, LLMs supply elemental libraries and combinatorial blueprints, laying the foundation for one-step fabrication, high throughput testing, activity screening, and theoretical validation, thus accelerating the discovery of high performance multi element catalytic materials. Future efforts will focus on improving decision-making reliability, addressing the limitation that AI cannot fully replace researchers' professional judgment, and advancing the transition of AI agents from “auxiliary R&D” to “collaborative, human-supervised autonomous R&D”, so as to provide intelligent impetus for the comprehensive advancement of the 4th+ paradigm (Fig. 4d and e).5,47 Meanwhile, existing LLMs face prominent challenges such as factual hallucination during literature reasoning, uncertain output reliability, inconsistent experimental reproducibility, and expensive practical validation when guiding material screening and experimental design.48
Within this framework, high-quality structured experimental data is essential for all intelligent algorithms and models. Material testing and performance testing produce raw experimental data, which is standardized and stored in the material database to form reusable data resources. The database integrates multi-source experimental information, extracts the inherent relationships among material structure, composition and performance, and supplies high-quality training samples and feature inputs for ML models. On this basis, ML models predict high-potential candidate materials by mapping relationships in the data, providing clear guidance for experiments and reducing unnecessary trial and error. Experimental results obtained from validation are fed back to the database in real time to supplement performance data, calibrate model deviations and improve prediction reliability. LLMs further combine experimental data, model outputs and literature knowledge to generate new research hypotheses and experimental plans for continuous iteration. To quantify this closed-loop workflow, clear specifications are defined for each functional module: structured experimental data and standardized computational datasets serve as core module inputs, while optimized crystal structures, predicted material properties and refined preparation routes are defined as main outputs. Deviations between experimental measured values and model predictions exceeding preset error thresholds act as key feedback triggers to update database records and retrain ML/LLM models. The core decision criteria are defined by whether property prediction errors fall within pre-set acceptable tolerance ranges. Typical success metrics include reduced experimental trial counts, improved prediction accuracy of target performance and enhanced synthesis reproducibility of candidate materials. The system realizes highly integrated and autonomous operation of the entire research and development process, eliminates obstacles in traditional research modes, and establishes a positive cycle of data, model, experiment, and knowledge.4
Only through the deep integration of high-quality experimental data and AI can the new materials research paradigm be achieved, featuring intelligence-driven, high-efficiency R&D, autonomous closed-loop, paradigm shift, and precision innovation. Experimental data lays a solid foundation for AI models, while AI extracts structure and performance relationships to guide experimental design and screen promising material systems. This synergistic data-model-experiment mode overcomes the problems of fragmented processes and high costs in traditional research, promotes the transformation from experience-driven to intelligent closed-loop development, and finally realizes efficient and precise materials innovation.
| This journal is © The Royal Society of Chemistry 2026 |