Open Access Article
Geunho Choi†
a,
Changhwan Lee†b,
Jieun Kima,
Insoo Yeb,
Keeyoung Jung
*cd and
Inchul Park
*a
aEnergy Materials R&D Laboratories, POSCO N.EX.T Hub, POSCO Holdings, Incheon, 21985, Republic of Korea. E-mail: inchul@posco-inc.com
bAI & Robotics Convergence R&D Laboratories, POSCO N.EX.T Hub, POSCO Holdings, Seoul, 06194, Republic of Korea
cDepartment of Advanced Components and Materials Engineering, Sunchon National University, Suncheon, 57922, Republic of Korea. E-mail: keeyoung.jung@scnu.ac.kr
dInstitute for Battery Research (IBR), Sunchon National University, Suncheon, 57922, Republic of Korea
First published on 2nd December 2025
Microstructures often dictate materials' performance, yet they are rarely treated as an explicit design variable because microstructures are hard to quantify, predict, and optimize. We built an image-centric, closed-loop framework that makes microstructural morphology a controllable objective and demonstrate its use case with precursors for Li- and Mn-rich layered oxide cathodes. This work presents an integrated, AI-driven framework for the predictive design and optimization of lithium-ion battery cathode precursor synthesis. This framework integrates a diffusion-based image generation model, a quantitative image analysis pipeline, and a particle swarm optimization (PSO) algorithm. By extracting key morphological descriptors such as texture, sphericity, and median particle size (D50) from SEM images, the platform accurately predicts SEM-like morphologies resulting from specific co-precipitation conditions, including reaction time-, solution concentration-, and pH-dependent structural changes. We experimentally validated that our optimization pinpoints synthesis parameters yielding user-defined target morphologies of Li- and Mn-rich (LMR) layered oxide materials, with predicted and synthesized structures showing close agreement. This framework offers a practical strategy for data driven materials design, enabling both forward prediction and inverse design of synthesis conditions and paving the way toward autonomous, image-guided microstructure engineering.
New conceptsWe introduce a generative, image-guided inverse-design framework that elevates microstructures from a passive synthesis outcome to an explicit design variable. Unlike conventional optimization strategies that focus on composition or bulk properties and assess morphology only retrospectively, our approach directly encodes images as quantitative descriptors and embeds them into a closed-loop optimization cycle.The key breakthrough lies in combining image-based quantification, diffusion-driven forward simulation, and global optimization into an integrated workflow that enables both prediction and inverse design of precursors for Li- and Mn-rich cathodes. This framework achieves experimental realization of user-defined morphologies and links targeted microstructures to improve electrochemical stability. Beyond batteries, the concept is material-agnostic: by swapping descriptors and training data, the same quantification prediction optimization triad can be extended to catalysts, alloys, and porous membranes where geometry governs function. This work provides the community with a generalizable blueprint for morphology driven discovery, transforming microstructural control from empirical trial and error into a systematic, data driven design paradigm. |
Conventional material optimization frameworks generally address microstructures in a relatively indirect manner, with a focus on maximizing scalar targets (e.g., capacity and conductivity) or adjusting composition and stoichiometry.8 This approach is often predicated on the assumption that microstructural details will emerge because of processing-related decisions. Consequently, electron microscopic image-level features are seldom explicitly delineated as design objectives. In the domain of battery cathode development, for instance, synthesis recipes are optimized to enhance electrochemical properties. The microstructure is evaluated post hoc using bulk average properties such as XRD phase refinement, BET surface area, particle size distribution, and porosity. This approach contrasts with the more conventional practice of designing these proxies upfront. The quantification of an entire SEM or TEM image in a form suitable for optimization remains a non-trivial task. Previous studies have employed a limited set of summary metrics, such as the phase area fraction or characteristic length scales to incorporate morphology into design.9,10 However, these coarse descriptors may overlook the subtle textural and topological nuances that are crucial for performance. Furthermore, there is a lack of discussion regarding the sufficiency of indirect measures, resulting in a suboptimal representation of rich, image-derived information within the inverse design loop.
In addition to the limitations of coarse descriptors, effective image-based inverse design of microstructures confronts several intertwined challenges. The process of distilling the rich, multi-scale patterns visible in SEM and TEM images into quantitative features is exceptionally difficult. Conventional metrics such as median grain sizes, porosity, or aspect ratios capture only fragmentary aspects of morphology, leaving subtle textural and topological nuances unquantified. The establishment of robust, predictive links between synthesis parameters and the resulting microstructure remains largely empirical. Minor changes in precursor concentration, temperature, or mixing protocol can provoke disproportionate or unpredictable morphological shifts. This forces reliance on laborious trial-and-error tuning. Because of the absence of an integrated closed-loop framework that treats image-derived morphology as an explicit design variable, researchers resort to an inefficient Edisonian cycle (adjust, synthesize, characterize, and repeat), an approach that becomes untenable as materials systems become more complex. In order to surmount the aforementioned impediments, it is imperative to employ methodologies that systematically encode substantial image data, facilitate dependable process-morphology predictions, and underpin autonomous, closed-loop design of targeted microstructures.
To address these gaps, we propose a closed-loop, image-driven inverse design framework that integrates wavelet-based image quantification, diffusion-based generative modelling, and global optimization into a unified methodology. First, microstructure images are transformed into a compact “morphology fingerprint” that captures multi-scale texture, particle sphericity, and size distribution metrics via image-based quantitative morphology analysis.11,12 Subsequently, a conditional diffusion model functions as a forward simulator, synthesizing realistic SEM images from process parameters or target descriptors with high fidelity.13,14 Next, a particle swarm optimization (PSO) algorithm is employed to iteratively adjust co-precipitation conditions.15 This process is intended to direct the generated images toward the desired morphology. The efficacy of each iteration is evaluated using quantitative morphology metrics, which serve to determine the algorithm's “fitness”. By treating the microstructure image itself as the design objective rather than relying on scalar proxies, our approach actively explores the space of possible morphologies in a feedback loop. We validated this framework by synthesizing Li- and Mn-rich layered oxide cathodes under the optimized conditions predicted by our algorithm, and SEM characterization confirmed that the experimentally obtained microstructures closely matched the target morphology fingerprint.
Sphericity and D50 of secondary particles are widely used because they predict packing density, stress distribution during electrode calendaring, and fracture resistance under compression.28 AI-assisted segmentation enabled calculation of both D50 and sphericity from the projected area and equivalent radius (Fig. 2b). The resulting D50 values matched laser diffraction data within ±4% (Fig. 2c).
The texture descriptor fills the gap left by subjective terms such as “plate-like”, “rod-like” or “needle-like”.16–18 Although such qualitative labels may hint at the dominant crystallographic facets, they are inherently unreliable because their meaning is observer-dependent and they reduce complex three-dimensional geometry to a single adjective. Most importantly, they do not provide a continuous metric that can be incorporated into engineering models or statistical analyses.
Because texture is computed directly from voxel-level geometry, it remains reproducible even when phase boundaries are indistinct, enabling robust quantification of the primary particle shape (Fig. 2a). Several groups have attempted to quantify the primary-particle shape by first segmenting the particles and then extracting geometric metrics from the segmented volumes. While conceptually attractive, this “direct segmentation” route becomes highly sensitive to hyper-parameter choices or to the specific distribution of labels in the training set once inter-phase boundaries are diffuse or crystallographic domains intergrow.
To verify that the selected descriptors carry complementary information, we calculated pairwise Pearson correlation coefficients among D50, mean sphericity, and texture (SI Fig. S4). The values were −0.38 (D50 vs. sphericity), −0.17 (D50 vs. texture), and 0.26 (sphericity vs. texture), indicating only weak correlations. This confirms that the domain knowledge-based descriptor set is mutually independent and therefore suitable for quantitative morphology analysis. The correlation with experimental variables further demonstrates how particle shape can be tuned. Texture exhibited a strong dependence on pH, as well as on the concentrations of NaOH and NH4OH. It reached values ≥0.7 when the concentrations of NaOH and NH4OH were set to ×0.25 and ×0.09, respectively, under pH 10 conditions that produced high texture value (thin, faceted lamellae). In contrast, texture dropped below 0.3 when the concentrations were changed to ×0.08 and ×0.29 at pH 11, or when O2 induced Mn2+ promoted platelet thickening. Sphericity increased monotonically with reaction time, while its standard deviation decreased, indicating progressive morphological homogenization. Higher concentrations of NaOH enhanced nucleation density and resulted in more spherical particles, whereas lower pH or the presence of oxygen gas reduced sphericity.
000× images, highlighting accurate replication of the rapid texture relaxation observed even in the very first nucleation steps (∼ 120 min). In 3000× images (Fig. 3a), secondary particle densification and aggregate coalescence are reproduced with striking visual fidelity.
Furthermore, although the model is not constrained by explicit physical equations, it internalizes the mapping between synthesis parameters (pH, NaOH, NH4OH, time, etc.) and precursor morphology, generating results that mirror the underlying growth dynamics. In practical co-precipitation, nascent nuclei rapidly aggregate, producing abrupt morphological changes; subsequently, crystals embedded within the agglomerates grow more slowly.29,30 The diffusion model reproduces this sequence: during the first 120 min it tracks the rapid evolution of secondary and primary particles, and at later times it recreates the Ostwald-ripening regime in which convex regions dissolve, concave regions grow, and overall particle sphericity increases.31 For each synthesis condition, texture is computed per particle and summarized as the condition-level mean (µ) and standard deviation (SD). To quantify model variability, we draw 100 independent conditional generations per condition and compute the same per-particle texture, reporting the model µ ± SD and overlaying individual sampled estimates. As shown in Fig. 3d, model means track experimental means with R2 = 0.98 for texture; D50 trajectories in Fig. 3c align with R2 = 0.91. These results indicate that the model preserves underlying structural statistics rather than merely imitating visual style.
Our model demonstrates robust performance within the training manifold, effectively capturing morphology variations across multiple synthesis parameters. Morphology evolution under varying pH conditions is accurately reproduced, and intermediate states are reliably interpolated (Fig. S5). In this univariate pH conditioned case, the model was trained using experimental data at pH 10.0, 10.7, and 11.0; the generated morphologies at these same points closely match the experimental textures, confirming reconstruction fidelity. The interpolated prediction at pH 10.5, which was absent from the training dataset, further highlights the model's capacity to generalize within the same chemical series, consistent with known growth mechanisms such as the directional adsorption of metal–ammonia complexes on the (001) plane.18
Building on this capability, forward predictions were then performed in a two-dimensional synthesis parameter space in which pH and initial NH4OH concentration were varied together from pH 10.0/0.29 M NH4OH to pH 11.0/0.57 M NH4OH to generate intermediate conditions via linear interpolation of the conditioning inputs (Fig. S6; using the same trained diffusion backbone and protocol as in the pH only case). All other synthesis variables were fixed within that separate experimental series. The predictions reveal a monotonic decrease in D50 with increasing pH and ammonia concentration, while particle sphericity remains essentially constant across the interpolation domain.
Experimental validation verified the model's accuracy. SEM images of the synthesized precursor (Fig. 4b, bottom) reveal densely packed secondary particles and finely textured primary networks that mirror virtual predictions (top). Descriptor discrepancies between prediction and experiment remain ≤5% across all metrics. Electrochemical cycling tests show a 3% smaller capacity fade and improved voltage retention (smaller average-voltage decay) relative to baseline LMR cathodes under identical conditions (SI Fig. S7 and S8).
The high-fidelity synthetic micrographs generated entirely in silico demonstrate the feasibility of rapid morphology scouting across extensive compositional spaces (e.g., the full pH–NH4OH range) in mere minutes, thereby compressing weeks of wet-lab trial-and-error into GPU time. Every pixel in the conditional diffusion generator is conditioned on the same descriptor set used for experimental quantification, allowing researchers to trace visual motifs back to numerical targets. Tens of thousands of hypothetical conditions can thus be screened per hour on a single consumer GPU (throughput unattainable with conventional CSTR campaigns) and the linear scaling of optimization cost with the descriptor count permits seamless integration of further constraints (impurity limits, BET, and grain-boundary texture) without architectural overhaul.
Some localized deviations from monotonic behavior observed in interpolated morphologies (the Forward prediction via diffusion-based image generation section) can be attributed to nonlinear interactions between chemical equilibria (e.g., complexation efficiency, and supersaturation) and kinetic factors (e.g., nucleation and coarsening). Such discrepancies are common in multivariate synthesis landscapes and highlight opportunities for refinement through expanded data coverage or conditioning guided by physical constraints.
Although demonstrated for Li- and Mn-rich layered-oxide precursors, the framework has the potential to be material agnostic. The framework is morphology first under fixed composition; composition conditioned inputs (e.g., Ni/Mn ratio) can be incorporated in the conditioning vector when harmonized multi-chemistry datasets become available. This potential can be realized by swapping the training image set and descriptor definitions, enabling immediate transfer to catalysts, additively manufactured alloys, or porous membranes where geometry governs function. The key insight is that while each material system requires domain-specific morphological descriptors such as active site exposure area in catalysts or pore connectivity in membranes the core ‘quantification–prediction–optimization’ closed-loop architecture remains universally applicable. This structural invariance suggests that the framework's value lies not in its current implementation but in its systematic approach to bridging the gap between morphological targets and synthesis protocols. By translating explicit morphological targets into experimentally verifiable recipes, the integrated AI system furnishes a blueprint for accelerating morphology-driven discovery not only in battery materials but also in catalytic, pharmaceutical, and structural domains.
Several challenges remain before the full implementation of autonomous morphology design becomes a standard practice. First, applicability is limited to the present training domain and image-based objectives (single reactor and restricted pH-temperature window); extrapolation and strict physical consistency are limited, and federated learning across routes is needed for generality. Second, electrochemical performance is still analyzed offline; incorporating cycling data directly into the loss function would create a fully closed loop that couples structure and property optimization. Third, residual errors associated with complex kinetics–thermodynamics coupling could be reduced by augmenting the training set around these regions or by introducing physics-informed priors.
The ongoing work integrates multi-reactor datasets and streams real-time electrochemical feedback, with all code and pretrained weights to be released under an open-source license. Coupling this image-driven inverse-design paradigm with robotic synthesis and high-throughput imaging holds great promise for achieving truly closed-loop optimization, thereby accelerating the discovery process for lithium-ion batteries and beyond. The framework is modular but transferable only with re-specified inputs/constraints and independent validation for each new system. We anticipate that data-guided morphology engineering will soon advance from proof-of-concept to a standard tool in the materials-by-design platform, ultimately accelerating the decades-long lab-to-fab timeline for morphology-sensitive materials.
:
Mn = 33
:
67). The as-precipitated powders were washed, filtered, and vacuum dried at 100 °C. The detailed reactor setup and synthesis protocol are described in SI (S1.1.1).
The resulting precursors were examined using a SEM (JEOL JCM-6000) at two magnifications (3000 and 15
000) to capture both secondary and primary particle morphologies. A total of 55 distinct synthesis conditions were recorded, each linked with paired SEM images and particle-size data (Microtrac S3500, D50 values). Image acquisition parameters and data-pairing procedures are given in S1.1.2 and S1.1.3.
For lithiation, the hydroxide precursor was mixed with LiOH·H2O at a Li/(Ni + Mn) molar ratio of 1.38 and calcined at 850 °C for 10 h to obtain Li- and Mn-rich layered oxide (Li1.16Ni0.28Mn0.56O2). Electrode fabrication and electrochemical assembly details are provided in S1.1.4.
Primary-particle features were characterized via wavelet-based texture analysis, capturing hierarchical structural roughness and anisotropy directly from 15
000× magnified SEM images without explicit particle segmentation (S1.2.1). The texture energy, computed as the mean-square magnitude of wavelet coefficients, served as a scalar metric, reflecting morphological complexity.
Secondary-particle morphology was quantified through instance segmentation using the Segment Anything Model (SAM) to identify individual particle boundaries from 3000× magnified images. Circularity-based sphericity (Ψ = 4πA P−2, where Ψ denotes the sphericity, A is the projected area of the particle, and P is its perimeter.) and image-derived D50 values were computed from the segmented masks (S1.2.2). Ensemble-level means (µ) and standard deviations (SD) of these metrics were used as quantitative condition-level descriptors.
During training, the base diffusion weights were frozen while ControlNet layers were optimized via denoising score-matching (ε-prediction) using all available labeled images. Model convergence was monitored using the peak signal to noise ratio (PSNR) and structural similarity (SSIM) metrics, applying early stopping when PSNR > 34 dB and SSIM > 0.965 stabilized within 10 epochs. These training details and architectural schematics are fully described in S1.3.
Users may specify target attributes such as D50 (D), sphericity (M), standard deviation of sphericity (S), and texture (T) numerically or through reference images. The algorithm generates candidate parameter sets, produces corresponding synthetic SEM-like images, quantifies their morphology, and iteratively minimizes a weighted objective function:
The optimization proceeds until convergence to a minimal F, yielding synthesis conditions that reproduce the target morphological characteristics. Full mathematical formulation and implementation details are provided in S1.4.
This integrated workflow bridges experiment, quantitative image analytics, and generative modeling to establish a closed-loop system for both forward prediction of morphology and inverse design of synthesis conditions. All algorithms, training parameters, and implementation codes are detailed in SI Section S1.
Due to company data policies, the raw datasets used for model training cannot be made publicly available; however, further details may be provided by the corresponding author upon reasonable request within confidentiality constraints.
The custom code developed for image analysis, diffusion-based image generation, and optimization is provided as a zip file.
Footnote |
| † These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2026 |