Seoyeah
Oh
a,
Kyeom
Choi
a,
Jihyeon
Park
a,
Geonho
Kim
a,
Seoyoung
Yoon
a,
Dongjun
Kim
a,
Seokhee
Lee
b and
Jiwon
Kim
*acd
aSchool of Integrated Technology, College of Computing, Yonsei University, 85 Songdogwahak-ro, Yeonsu-gu, Incheon, 21983, Republic of Korea. E-mail: jiwon.kim@yonsei.ac.kr
bEnergy and Environmental Division, Korea Institute of Ceramic Engineering and Technology, 101 Soho-ro, Jinju, Gyeongsangnam-do, 52851, Republic of Korea
cIntegrated Science and Engineering Division, Underwood International College, Yonsei University, 85 Songdogwahak-ro, Yeonsu-gu, Incheon, 21983, Republic of Korea
dDepartment of Integrated Biotechnology, Graduate School, Yonsei University, 85 Songdogwahak-ro, Yeonsu-gu, Incheon, Incheon, 21983, Republic of Korea
First published on 23rd July 2025
Designing cathode materials is crucial for developing advanced Li–S batteries, but conventional trial-and-error methods are time- and resource-intensive. This study employs machine learning (ML) with feature analysis, data augmentation, and backward prediction using particle swarm optimization (PSO) for rapid discovery and inverse design of cathode materials. The predictive model with XGBoost achieved high accuracy with a determination coefficient (R2 = 0.8345) and a mean absolute error (MAE = 4.48%) in estimating capacity retention. The PSO-based backward prediction identified titanium (Ti) and 2-methylimidazole (2-MeIM) as optimal MOF precursors. A Ti/2-MeIM MOF is synthesized in the form of Ti/Zn-ZIF via post synthetic exchange, and subsequent carbonization yields Ti-derivative embedded nitrogen-doped carbon (NC–Ti) as a sulfur host material (S@NC–Ti). S@NC–Ti demonstrated average capacity retentions of 62.3%, 72.1%, and 65.3% at 0.1 C, 0.5 C, and 1.0 C, respectively, aligning with ML predictions. Furthermore, forward prediction successfully anticipated a capacity retention of 75.16% for the Ti/Zn bimetallic ZIF, a carbon precursor, at 1.5 C with a 65 wt% sulfur–carbon ratio, matching the experimental result of 84.13% within a 12% error margin. This study highlights the potential of ML-driven approaches in accelerating cathode material development for Li–S batteries.
New conceptsThis work presents an empirical data-driven inverse design strategy via machine learning (ML), in which targeted capacity retention of lithium–sulfur (Li–S) batteries guides the composition of metal–organic frameworks (MOFs) used as carbon-based precursors for cathodes. While prior ML-assisted material design approaches rely on synthetic data that may not fully reflect experimental complexity, our model captures MOF composition-capacity retention correlations aligning with physicochemical trends observed in empirical datasets. Experimental validation of MOF compositions identified through backward prediction closes the design loop, demonstrating the rationality of performance-oriented cathode optimization for Li–S batteries. Data augmentation mitigates experimental data scarcity by adding Gaussian noise to the feature space—a technique that expands the dataset with slight deviations while preserving the original data characteristics, thereby improving the model's generalization. Among predicted candidates, a MOF with Ti and 2-methylimidazole combination—the most frequently occurring metal–ligand pair—was synthesized and carbonized into a Ti-derivative–containing N-doped carbon. When applied to sulfur cathodes, experimental capacity retention fell within 12% of the predicted value, confirming the predictive reliability of our performance-tailored method in practice. This work offers a generalizable framework for data-driven compositional engineering in complex electrochemical systems, leveraging real-world performance constraints. |
ML has been widely utilized in ESS design to improve performance such as energy density and stability by predicting complex processes and rapidly identifying materials.18–20 For example, Liow et al. used ML with particle swarm optimization (PSO) to design a cathode for Li-ion batteries with a target specific initial capacity (e.g., 150, 175, and 200 mAh g−1), which was subsequently validated experimentally.18 A crystal graph convolutional neural network based ML model was employed to screen binary alloy anodes for metal-based batteries (e.g., Li, Na, K, Zn, Mg, Ca, and Al) by evaluating formation energy, potential, and specific capacity from a large materials database.19 Furthermore, Wang et al. used a data-driven ML approach to identify oxygen-rich porous carbon as an active material for aqueous supercapacitors from previously reported works.20
Among various ESS, Li–S batteries are promising candidates for secondary battery systems owing to their high energy density (2600 Wh kg−1), abundant sulfur supply, and cost-effectiveness.21,22 However, Li–S batteries still face severe challenges such as shuttle effects, sluggish sulfur redox reactions (SRR), and Li anode corrosion caused by complex reaction chemistry. As efforts to overcome these challenges increase, ML has been increasingly used to accelerate material screening, cell component design, and optimization of operating conditions for enhanced Li–S battery systems.23–25 In particular, ML-driven cathode development has garnered considerable attention, as an improved cathode mitigates the shuttle effects, enables high active material loading, and enhances electrode stability. Notably, achieving high cycling retention by alleviating the shuttle effects is a pivotal goal in cathode material design for Li–S batteries, since shuttle effects cause irreversible capacity loss due to self-discharge, low coulombic efficiency, and the formation of unstable and dendritic lithium–metal anodes, hindering long-term battery performance and commercialization. To address this, ML has been employed to reduce shuttle effects, assisting the cathode material discovery. For example, Zhang et al. utilized transfer learning based on the 2DMatPedia database to rapidly identify 14 promising AB2-type sulfur host structures to reduce shuttle effects.26 Lujie et al. combined ML with density functional theory (DFT) in a three-step protocol to design anchoring materials (AMs) for lithium sulfide (Li2S) adsorption: evaluating Li2S anchoring properties, selecting 44 types of two-dimensional AMs, and analyzing their geometrical characteristics for Li2S anchoring effects.27
Although ML accelerates cathode material discovery and design, experimental validation is essential to ensure its practical applicability. Zhiyuan et al. integrated DFT calculations with an ML framework consisting of filter, wrapper, and embedded modules to identify key factors influencing the catalytic activity of high-efficiency DAC-based multi-site catalysts, achieving an initial specific energy of 436 Wh kg−1 in a Fe–Co DASC cathode pouch cell.28 Similarly, Han et al. used binary descriptors based on band match and lattice mismatch indices to design NiSe2 as a catalyst for sulfur host materials using ML, confirming that Li–S batteries with NiSe2 maintain high gravimetric energy even under harsh conditions (e.g., high sulfur loading or at low temperature of −20 °C).29
In this work, we employed ML to expedite cathode material design for Li–S batteries, where metal–organic framework (MOF)-derived porous or hollow carbon structures were chosen as cathode materials for their superior electrical conductivity, structural stability, and design flexibility.30,31 As cathode precursors, MOFs provide tuneable pore structures and they enhance electrochemical activity through the carbonization of metal and ligand components.32,33 First, we constructed datasets incorporating MOF composition (metal and ligand type), synthesis and carbonization conditions for MOF-derived carbon (i.e., synthesis and carbonization time and temperature), weight ratio of sulfur to carbon, and C-rate as input features, with cycling retention after 100 cycles as the output feature. Second, ML identified the relationship between input and output features using SHapley Additive exPlanations (SHAP) explainer to select key input features crucial for output features. The limited data size was expanded to 1040 by adding feature-specific Gaussian noise to each datapoint, while maintaining patterns and dataset properties. An eXtreme gradient boosting (XGBoost)-based predictive model, trained on 80% of augmented dataset, achieved superior predictive accuracy (coefficient of determination >0.92), and demonstrated strong generalization (coefficient of determination difference between training and testing datasets = 0.0042). Subsequently, ML performed backward prediction to inversely design a MOF precursor for cathode materials, integrating titanium (Ti) and 2-methylimidazole (2-MeIM). To validate the prediction, a MOF comprising Ti and 2-MeIM was synthesized via post-synthetic exchange (PSE) by converting zinc (Zn) in ZIF-8 to Ti (Ti/Zn-ZIF), followed by carbonization to form hollow nitrogen-doped carbon with Ti-derivatives (NC–Ti) as a sulfur host (S@NC–Ti). S@NC–Ti demonstrated lower polarization, higher electrical conductivity, and enhanced SRR kinetics than S@NC and S@C owing to evenly distributed Ti-derivatives within the carbon matrix. For cycling performance, S@NC–Ti exhibited average capacity retentions of 62.3%, 72.1%, and 65.3% after 100 cycles at 0.1 C, 0.5 C, and 1.0 C, respectively. Furthermore, S@NC–Ti exhibited a capacity retention of 84.13% under the condition of a 65 wt% sulfur–carbon ratio at 1.5 C. This result falls within a 12% error margin of the forward prediction result, which was derived by combining the backward-predicted conditions (ligand, S-wt, and C-rate) with the experimentally applied bimetallic ZIF (Ti and Zn for metal). This study provides a basis for rational cathode material design for Li–S batteries.
![]() | ||
| Fig. 1 Scheme of machine-learning (ML)-assisted inverse design process of the metal–organic framework (MOF) for the cathode of the Li–S battery and its experimental validation. | ||
While each input feature contributes to retention prediction, high feature dimensionality can cause overfitting, computational inefficiency, and decreased predictive accuracy.34,35 To address this issue, we employed SHAP values with domain knowledge to guide data-driven feature importance evaluation. Also, Categorical Boosting (CatBoost), which effectively processes both nominal (e.g., metal and ligand) and numerical features (e.g., S-wt, C-rate, S-Time, S-Temp, C-Time, and C-Temp), was used to derive SHAP values for our dataset. Importantly, SHAP global importance scores were averaged across ten independently shuffled datasets. As shown in Fig. S1 (ESI†), the global SHAP importance values categorized the eight input features into a high-SHAP group—comprising C-rate (4.78 ± 0.30), metal (4.62 ± 0.48), and S-wt (2.13 ± 0.16)—and a mid-SHAP group, which included C-Temp (1.32 ± 0.14), ligand (1.12 ± 0.20), S-Time (0.95 ± 0.18), S-Temp (0.77 ± 0.14), and C-Time (0.64 ± 0.12). The SHAP decision plot (Fig. S2, ESI†) visually implies that all features contribute in a multivariate and interdependent manner, which may reflect the complex interactions governing Li–S battery chemistry. However, such multivariate contribution does not necessarily imply equal importance in predictive modelling; hence, we applied a dual-filter feature selection framework to refine the input space and alleviate redundancy. In this framework, each input feature was evaluated using two complementary criteria: (i) statistical importance derived from SHAP values and (ii) mechanistic relevance determined by domain knowledge. SHAP offered a data-driven evaluation of feature importance related to capacity retention, while domain knowledge complemented this by ensuring that selected features were mechanistically relevant to the design objective and Li–S battery systems. Accordingly, features with high SHAP importance were preliminarily retained and confirmed to be chemically consistent with known electrochemical behaviors. Specifically, the C-rate regulates electrochemical reaction rates during battery operation, affecting capacity retention.36 During carbonization, metal in MOFs transforms into various metal-derivatives, enhancing cycling performance of Li–S batteries by absorbing LiPS and facilitating SRR.37 Optimizing S-wt also influences both discharge capacity and capacity retention.38,39 Among the mid-SHAP group features, only ligand was retained due to its pivotal role in both the predictive modelling objective and its mechanistic relevance to capacity retention in Li–S batteries.40 Ligand is not only one of the key components in MOF formation but also serves as a precursor for the cathode material after carbonization. In addition, the identity of ligands determines the pore structure, specific surface area, heteroatom doping, and electrical conductivity of the MOF-derived carbon, all of which are closely related to LiPS confinement and redox kinetics—key factors that contribute to the capacity retention of Li–S batteries.41,42
To further support SHAP-driven and domain-informed feature selections, we conducted visual analysis using scatter and box plots. Selected features exhibited clear associations with retention, such as an optimal region for S-wt or distinct distributional differences across different categorial variables (e.g., metal and ligand types). On the other hand, time- and temperature-related features exhibited either localized patterns confined to sparsely sampled regions or insignificant trends, both of which limit their reliability and generalizability across the dataset (see Fig. S3 for detailed information, ESI†). Additionally, time- and temperature-related features showed a low correlation with retention and exhibited correlation-based redundancy with metal and ligand (see Fig. S4 for detailed information, ESI†). Such a low correlation and correlational redundancy may introduce model instability or reduce interpretive clarity, thereby supporting the rationale for exclusion of time- and temperature-related features. Accordingly, four features (i.e., C-rate, metal, S-wt, and ligand) were retained as important features based on both SHAP importance and domain relevance. Among retained features, the C-rate was further examined due to its highest SHAP value and role as an operating condition, unlike other features referring to material properties. As shown in SHAP decision plots (Fig. S5, ESI†), the SHAP values for a given C-rate vary substantially (ranging approximately from −6 to +6) depending on the categorical types or values of other features, indicating that the model does not learn a nonlinear dependence on the C-rate. Rather, the C-rate acts as a conditional variable whose influence is modulated through interactions with material descriptors. Therefore, this dual-filter feature selection framework—combining SHAP-based importance grouping with domain-guided interpretation—yielded a reduced yet robust feature set (C-rate, metal, S-wt, and ligand) that balances statistical significance with functional relevance. Refined input space not only improved model interpretability but also laid the groundwork for the subsequent predictive modelling and validation, the results of which are detailed in Section 2.4.
![]() | ||
| Fig. 2 Comparison of the principal component distributions between the original (blue stars) and augmented (orange circles) dataset. | ||
![]() | ||
| Fig. 3 Prediction performance of (a) decision tree regressor (DTR), (b) random forest regressor (RFR), (c) gradient boosting regressor (GBR), and (d) eXtreme gradient boosting (XGBoost) algorithm. | ||
Subsequently, we performed backward prediction using the particle swarm optimization (PSO) integrated with an XGBoost-based predictive model to inversely design advanced MOFs for cathode materials. The target output—capacity retention above 68.5% after 100 cycles—was determined based on the average retention value of the dataset, ensuring reliable backward prediction aligned with the state-of-the-art MOF-derived carbon for sulfur cathodes while allowing room for capacity retention enhancement. During PSO, particles explored the solution space to estimate input feature values (i.e., metal, ligand, S-wt, and C-rate), which were then fed into XGBoost to predict the output value (i.e., capacity retention after 100 cycles). Input spaces were constrained within the training data range, and categorical variables were represented via one-hot encoding (see Section 4.2 for details). The expected output was evaluated to ensure that it exceeded 68.5%. PSO then iteratively adjusted inferred input features using feedback from XGBoost to refine solutions. Backward prediction yielded 500 possible MOF designs (Table S3, ESI†), providing a data-driven set of candidates predicted to satisfy the retention target. Compared to retention statistics of the original dataset (mean = 68.50%, variance = 16.85), those of PSO-derived designs exhibited higher predicted retention (mean = 78.66%) with reduced variance (8.11%). These results indicate that the ML-guided optimization strategy effectively identifies input feature combinations with higher predicted retention, providing a rational basis for experimental prioritization. In addition, PSO-derived candidates revealed distinct design patterns. Specifically, consistently high predicted retention across a wide range of C-rates (0.05–2.0 C) suggests that the model favored input combinations capable of sustaining capacity retention under varying cycling conditions. S-wt showed a concentrated distribution within the 60–80 wt% range, reflecting the known optimization window that balances sufficient sulfur loading with mitigation of the polysulfide shuttle effect (Fig. S8 (a)–(d), ESI†). For the categorial variables, the one-hot encoded metal and ligand features demonstrated selective sampling behavior, where specific categories were associated with higher (e.g., Fe, or terephthalic acid) or stable (e.g., Ti, or 2-MeIM) predicted retention than others (Fig. S8(e)–(h), ESI†). This selective behavior indicates the model's ability to distinguish the impact of individual distinct categories on retention performance, despite categorical features being encoded without explicit chemical descriptors. Such patterns may reflect underlying chemical effect—such as conductivity, redox activity, and polysulfide confinement—associated with metal–ligand combinations, sulfur content, and C-rate, even though these properties were not explicitly included in the input features. The indicative alignment between predicted design patterns and domain-relevant features likely emerged through the combined mechanism of: (i) the model's learning of input–output correlations from the training dataset, (ii) model-guided global exploration with PSO as evidenced by a PCA projection, and (iii) repeated convergence toward high-performance input combinations (Fig. S9, ESI†). Taken together, our backward prediction strategy integrated both broad design space exploration and locally optimized solution refinement, enabling data-driven discovery of MOF candidates for carbon-based sulfur cathode.
To gain insights into MOF precursors for carbon-based cathode materials in Li–S batteries from the backward prediction results, metal–ligand combinations were ranked by their frequency of co-occurrence among the 500 PSO-derived candidate input sets, and the most frequently appearing combination was subsequently selected for experimental validation. The highest-ranked MOF combination, titanium (Ti) and 2-methylimidazole (2-MeIM), appeared in 41 predictions (8.2%). Notably, previous studies showed Ti transforms into Ti-derivatives, such as TiO2 and TiN, during carbonization, enhancing the electrochemical activity of Li–S batteries by facilitating LiPS adsorption and improving charge transport.45,46 Nitrogen-doped porous or hollow carbon structures derived from 2-MeIM enhance conductivity and stability, improving electrochemical performance.47 The highest-ranked metal and ligand combination had average values of 71.31 wt% (±10.75 wt%), 1.10 C (±0.46 C), and 73.53% (±2.74%) for S-wt, C-rate, and retention, respectively. Within these ranges, commonly used experimental values —65 wt% for S-wt and 1.5 C for C-rate—were selected for experimental validation. This approach reduces the resources required to test all 41 S-wt and C-rate combinations while providing practical feasibility guidelines. The impact of those conditions on retention was assessed through additional forward predictions and experiments, detailed in Section 2.6.
Scanning electron microscopy (SEM), and transmission electron microscopy (TEM) images show its rhombic dodecahedron morphology with an average edge length of 150 nm (Fig. S3a, ESI† and Fig. 5(a), (b)). In contrast, Ti/Zn-ZIF shows a hollow-structured rhombic dodecahedron morphology with a shell arising from the cation exchange between Zn in ZIF-8 and Ti from TiCl4 (Fig. S3b, ESI† and Fig. 5(c), (d)). Powder X-ray diffraction (PXRD) analysis confirms that Ti/Zn-ZIF retains the crystal framework of ZIF-8 (Fig. S4, ESI†) during the hollow structure formation by cation exchange. The Brunauer–Emmett–Teller (BET) method was used to characterize the specific surface area and pore structure (Fig. S5a and b, ESI†). The N2 isotherm of Ti/Zn-ZIF displays both micropores and mesopores, similar to ZIF-8, except for the hollow structure indicated by a hysteresis loop under high relative pressure (P/P0 > 0.5). Ti/Zn-ZIF has a specific surface area of 971.99 m2 g−1, which is lower than 1575 m2 g−1 for ZIF-8, resulting from the partial dissolution of its porous structures during PSE. Simultaneously, the average pore volume slightly increased from 2.14 cm3 g−1 for ZIF-8 to 2.44 cm3 g−1 for Ti/Zn-ZIF due to the hollow structure of Ti/Zn-ZIF. Compositional properties of Ti/Zn-ZIF were analyzed by energy dispersive X-ray spectroscopy (EDS) with TEM and X-ray photoelectron spectroscopy (XPS). EDS mapping confirms uniform distribution of N, Zn, Ti, and Cl in the shell of Ti/Zn-ZIF (Fig. 5(e)). The high-resolution XPS spectrum provides detailed information on the chemical environment of Ti in the Ti/Zn-ZIF shell, revealing two characteristic Ti 2p peaks. Fig. 5(f) shows that Ti exhibits two peaks at 457.9 eV for Ti 2p3/2 and 463.7 eV for Ti 2p1/2 lower than that of conventional titanium dioxide (459.5 eV and 465.2 eV for Ti 2p3/2 and Ti 2p1/2, respectively).55 This shift to lower binding energies suggests increased electron density around Ti, resulting from its coordination with N atoms in 2-MeIM, where N donates a lone pair of electrons to Ti.56
High-temperature carbonization of Ti/Zn-ZIF produced NC–Ti as a cathode material for Li–S batteries. NC–Ti has a large BET surface area (582.02 m2 g−1) and high average pore volume (3.26 cm3 g−1), forming a well-developed meso-micro pore structure (Fig. S5c, ESI†). The PXRD and Raman spectroscopy of NC–Ti reveal the structural characteristics of the carbon shell. The PXRD pattern of NC–Ti shows only (002) and (100)/(101) diffraction peaks, indicating graphitic carbon while the Raman spectra display typical D/G bands confirming amorphous or nanocrystalline carbon (Fig. S6, ESI†). The SEM image in Fig. S7a (ESI†) shows that NC–Ti retains a slightly shrunken rhombic dodecahedron morphology with hollow structure, similar to Ti/Zn-ZIF. The TEM image confirmed the inner hollow structure of NC–Ti (Fig. 6(a)). These structural characteristics offer advantages such as high sulfur loading capacity, adaptability to volume changes during charge/discharge, improved electrolyte penetration, and enhanced Li+ ion diffusion.57 The HAADF-STEM image reveals that numerous Ti-derivative nanoclusters exist and Ti single atoms (Ti–SAs) as bright spots are randomly distributed throughout the carbon shell without agglomeration (Fig. 6(b) and (c)). EDS mapping confirms uniform N and Ti distribution, traces of Cl, and complete Zn removal during the carbonization (Fig. 6(d)–(g)).
![]() | ||
| Fig. 6 Images of (a) TEM, and (b) and (c) HAADF-STEM with different magnifications. Elemental mappings of NC–Ti for (d) N, (e) Zn, (f) Ti, and (g) Cl, respectively. | ||
The chemical environments of Ti, N, and O in NC–Ti were further investigated by XPS analysis. The high-resolution Ti 2p spectrum deconvoluted Ti 2p3/2 and Ti 2p1/2 peaks into Ti3+ and Ti4+ oxidation states, indicating Ti–N and Ti–O bonding.58 The N 1s spectrum deconvolutes into four components: pyridinic N (398.25 eV), metal–N (398.8 eV), pyrrolic N (399.86 eV), and graphitic N (400.71 eV), with the metal–N peak confirming the presence of Ti–N bonding and supporting the existence of Ti–SA. The O 1s spectrum exhibits peaks at 533.31 eV, 531.74 eV, and 530.2 eV, corresponding to adsorbed H2O (H2Oads), C
O species, and O2−, respectively. The O2− peak indicates fully oxidized Ti species, possibly TiO2. The absence of detectable TiO2 nanoclusters in the XRD and Raman analyses likely results from their small size and low crystallinity, which may stem from limited carbonization time. TEM and XPS results suggest the coexistence of TiO2 nanoparticles and Ti–SA in NC–Ti, where TiO2 strongly adsorbs LiPS,59 while Ti–SA catalyzes conversion of LiPS,58 thereby improving capacity retention of the Li–S battery.
The metal–ligand combination suggested by the model through backward prediction was partially adjusted based on domain knowledge and experimentally implemented as a Ti/Zn–ZIF structure, which still retained the Ti–2-MeIM framework. The NC–Ti electrode fabricated through carbonization exhibited good agreement with the predicted retention target, with a deviation of less than approximately 10%. The superior electrochemical performance of NC–Ti can be attributed to the contributions of TiO2, Ti–SA, and nitrogen-doped porous carbon structures formed during the carbonization process, which provides LiPS adsorption site, facilitates LiPS conversion reaction, and enhances conductivity, respectively. The quantitative agreement between the predicted and experimental retention values suggests that our model may have indirectly captured the underlying physicochemical mechanisms influencing performance through the relationship between output (i.e., retention) and composition-based input variables, such as the types of metals and ligands. Although the domain-guided synthesis enabled implementation of the Ti–2-MeIM framework, it was necessary to verify whether this composition resided within the model's learned design space—ensuring that it represented not an exception, but a generalizable case supported by the model. Accordingly, we conducted forward predictions and experimental validation, maintaining Ti and Zn as MOF precursor metals, and 2-MeIM as a ligand. Values of S-wt and C-rate were selected based on the average and standard deviation derived from the backward prediction results: 65 wt% for S-wt and 1.5 C for C-rate. The forward prediction result met the output criteria for backward predictions (Table 2), and experimentally verified. As shown in Fig. 7(f), S@NC–Ti exhibited a capacity retention of 84.13% after 100 cycles at 1.5C with 65 wt% sulfur loading, starting from an initial capacity of 509.85 mAh g−1 and retaining 428.95 mAh g−1. This result not only satisfied the backward prediction criteria but also matched the forward prediction result within a 12% error margin, confirming the robustness of our ML model in both forward and backward predictions. The agreement between the predicted and experimental retention values can be attributed to the model's ability to generalize to unseen combinations. Notably, the implemented Ti/Zn-ZIF was not included in the original dataset, yet the model generated consistent predictions by leveraging patterns learned from a diverse set of metal and ligand combinations in the augmented training set. Building on this, we conducted an additional experimental validation to enhance the clarity and credibility of our predictive model; metal–ligand combination of Ti and 2,5-dihydroxyterephthalic acid (H4-dobdc) was selected, which ranked sixth in predictive frequency (Table 1). The Ti–H4-dobdc combination was not included in our original dataset and, to the best of our knowledge, its application as a cathode material for Li–S batteries has not been reported. Thus, we synthesized a carbonized Ti–H4-dobdc MOF and applied it as a sulfur cathode for a Li–S battery. Electrochemical testing confirmed a capacity retention of 77.84% after 100 cycles, successfully meeting the performance target of the backward prediction (Fig. S17, ESI†). To further assess the model's predictive capability, we re-entered the Ti–H4-dobdc pair into the forward prediction model. The sulfur content (S-wt) and C-rate were set to 65 wt% and 1.5 C, respectively—values representative of the statistical distribution of PSO-derived results, falling within one standard deviation of their respective means. The model predicted a capacity retention of 77.74%, while the experimentally measured retention was 77.84% with a prediction error of only 0.12%, demonstrating the practical validity of the model-generated candidates. Our closed-loop validation process—comprising inverse design, candidate selection, experimental evaluation, and forward prediction across multiple validation points—reinforces the predictive robustness and generalizability of the model within a chemically plausible design space.
| Ranking | Metal | Ligand | Count |
|---|---|---|---|
| 1 | Ti | 2-Methylimidazole | 41 (8.2%) |
| 2 | Fe | Terephthalic acid | 39 (7.8%) |
| 3 | Fe | 2-Methylimidazole | 38 (7.6%) |
| 4 | Fe | 2,5-Dihydroxyterephthalic acid | 36 (7.2%) |
| 5 | Fe | 2-Aminoterephthalic acid | 35 (7.0%) |
| 6 | Ti | 2,5-Dihydroxyterephthalic acid | 34 (6.8%) |
| 7 | Fe | Trimesic acid | 33 (6.6%) |
| 8 | Co | 2,5-Dihydroxyterephthalic acid | 31 (6.2%) |
| 9 | Ti | Terephthalic acid | 28 (5.6%) |
| 10 | Co | Terephthalic acid | 22 (4.4%) |
| Metal | Ligand | S-wta | C-rate | Pred. retentionb (12% error margin) | Exp. retentionc |
|---|---|---|---|---|---|
| a Sulfur loading amount in host material expressed as weight percentage. b Prediction capacity retention after 100 cycles using machine learning with input features obtained from real experimental conditions (metal) and those predicted through backward prediction (ligand, S-wt and C-rate). c Measurement of capacity retention after 100 cycles. | |||||
| Ti/Zn | 2-Methylimidazole | 65 wt% | 1.5 C | 75.16% (66.14–84.18%) | 84.13% |
| Ti | 2,5-Dihydroxyterephthalic acid | 65 wt% | 1.5 C | 77.74% (68.41–87.07%) | 77.84% |
Finally, we demonstrated that augmenting the dataset enhances the reliable inverse design of MOF for carbon materials used as Li–S battery cathodes. Built solely on experimental data without predefined physiochemical guidance, the augmented dataset retained the original structural information while introducing slight variations for generalization. The ML algorithm trained on the augmented dataset exhibited high predictive accuracy and successfully identified MOF precursors that met the backward prediction retention criteria. Nevertheless, further advancements could be suggested to our ML prediction for designing practical and high-performance cathodes for Li–S batteries. For example, a dual-output ML model predicting both retention and initial capacity would support the design of high-performance cathode materials for Li–S batteries. Adding input features related to the N/P ratio and electrolyte composition enables ML to capture complex electrochemical interactions during cycling, thereby enhancing capacity retention prediction. Furthermore, incorporating physiochemical guides such as density functional theory or molecular dynamics into the ML pipeline or experimental validation process could provide mechanistic insights into electrochemical reactions during battery operation and improve the overall cathode design framework.
Key features were identified using SHapley Additive exPlanations (SHAP) based on a CatBoost algorithm, which can handle both categorical features (e.g., metal and ligand) and numerical features (e.g., S-Time, S-Temp, C-Time, C-Temp, S-wt, C-rate and retention). Since CatBoost employs target-based encoding for categorical features—an approach inherently sensitive to the order of input data, we repeated the entire procedure (i.e., dataset shuffling, model training, and SHAP computation) ten times to mitigate order-dependent bias in feature importance estimation. Global feature importance was then determined by averaging the absolute SHAP values across ten runs, with standard deviation reported to indicate consistency.
Nominal data in metal and ligand were converted into binary vectors via one-hot encoding by eliminating potential order assumptions and preserving their discrete characteristics, improving ML interpretability. Data augmentation was performed by introducing additive Gaussian noise with varying standard deviations set to 5% of the standard deviation of the original data: 0.389% for S-wt, 0.0376C for C-rate, and 0.843% for retention. The intrinsic nominal properties of the one-hot encoded features were preserved by duplicating metal and ligand data without modification. The reliability of the augmented data was assessed using metrics such as Wasserstein distance (WD), cumulative probability function (CPF), and probability density plot for datapoint distribution, and principal component analysis (PCA) for inter-feature correlations.
XGBoost was selected for forward prediction. The original dataset was first split into 80% training and 20% test sets before augmentation was applied. Noise was then added only to the training set to generate the augmented training dataset, while the test set was kept in its original form. This approach ensured that the test set contained truly unseen data points, allowing for a more accurate assessment of the model's generalization performance. The hyperparameters of XGBoost were optimized using grid search with 10-fold cross-validation, and predictive performance were evaluated on the remaining test set by coefficient of determination (R2) and mean absolute errors (MAE).
Backward prediction was performed using PSO with a swarm size of 30 and a maximum 200 iterations. During PSO, particles explored the input space defined by four variables: metal, ligand, sulfur weight ratio to MOF-derived carbon (S-wt), and C-rate. The exploration input space boundaries were strictly constrained within the ranges observed in the augmented training dataset. For the continuous variables, S-wt ranged from 49.19 to 90.04 wt% and C-rate ranged from 0.05 to 2.01 C. For the categorical variables, metal and ligand were one-hot encoded into binary vectors, with each feature constrained to binary values (0 or 1). The metal and ligand feature sets contained 11 and 5 distinct chemical identities, respectively. The metal set included Co, Ti, Zn, Al, Ni, Fe, and In, as well as heterometallic combinations such as Fe/Zn, Co/Zn, Co/Fe, and Co/Ni. The ligand set included terephthalic acid, 2-aminoterephthalic acid, trimesic acid, 2,5-dihydroxyterephthalic acid, and 2-methylimidazole.
:
Ti molar ratio of 10
:
1. All processes were conducted under an argon (Ar)-filled glove box (KK-011AS, Korea Kiyon). Ti/Zn-ZIF was washed several times with MeOH by centrifugation and dried under vacuum overnight at a temperature of 60 °C.
:
6.5 by mortar grinding and placed into a ceramic boat. The mixture was heated to 155 °C for 24 hours, and final products were referred to as S@NC–Ti and S@NC, respectively. For comparison, the same process was applied to Super P (S@C).
:
1, v/v) as an electrolyte. The areal sulfur mass loading on the electrode was in the range from 1.5 to 2.0 mg cm−2, and the electrolyte to sulfur ratio was fixed to 20 μL mg−1.
Footnote |
| † Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5mh00635j |
| This journal is © The Royal Society of Chemistry 2025 |