Open Access Article
Kang Liang†
,
Xinke Qi†,
Xu Xiao,
Li Wang
* and
Jinglai Zhang
*
Henan Key Laboratory of Protection and Safety Energy Storage of Light Metal Materials, College of Chemistry and Molecular Sciences, Henan University, Kaifeng 475004, China. E-mail: chemwangl@henu.edu.cn; zhangjinglai@henu.edu.cn
First published on 23rd December 2025
Self-healing polyurethanes (PUs) exhibit an inherent trade-off between mechanical strength and self-healing efficiency. Although optimizing the feed ratio can address the above limitations, identifying an appropriate ratio through trial-and-error is not straightforward. Machine learning offers promising approaches for composition-property optimization. However, the multi-property optimization for PUs with specific ingredients remains challenging, especially with minimal samples. A chemically-informed active learning (CIAL) framework is developed that integrates domain knowledge with machine learning to optimize fluorescent self-healing PU with merely 20 experimental datasets. By combining gradient boosting regression with multi-objective optimization, the framework successfully achieves co-optimization of mechanical properties and self-healing efficiency, with the relative error of the comprehensive performance index below 12%. The framework's efficiency is further highlighted to achieve optimal results with only 15 samples when discrete performance data are available. The developed P20B sample serves as an intelligent protective coating for Q235, simultaneously achieving real-time fluorescence visualization of damage sites and long-term anti-corrosion. This work provides an innovative solution for intelligent design of polymer materials using tiny experimental data.
Given the inherent trade-off between self-healing capability and mechanical properties, rational adjustment of reactant molar ratios represents a feasible strategy to achieve optimal performance balance.20 However, the conventional trial-and-error approach is inefficient for exploring the complex multidimensional parameter space (e.g., the combinatorial explosion in a four-component system), which fundamentally limits the attainment of desired property equilibrium. Against this backdrop, machine learning has emerged as a revolutionary tool to accelerate material design by exploiting multidimensional relationships between material properties and compositions.21–23 However, its application in PU systems still confronts two key challenges.
Existing literature data not only suffer from limited sample sizes but also exhibit significant heterogeneity, particularly for PU systems with precisely controlled reactants.24,25 Although constructing initial datasets through experiments can improve data quality, conventional approaches require substantial investments of manpower and cost. Taking 50 samples as an example, the entire process from synthesis to performance characterization typically requires 1–5 months for experienced researchers (depending on synthesis complexity). It is of practical significance to minimize data requirements. The second fundamental challenge lies in the predominant focus on optimizing individual performance metrics, while neglecting the inherent trade-off between these parameters. This single-objective optimization paradigm frequently yields materials inadequate for real-world engineering applications that require balanced multi-property performance.26,27 These limitations become particularly pronounced in self-healing PU systems requiring simultaneous optimization of both self-healing and mechanical properties. While Zheng et al.28 successfully employed a reverse design strategy to predict feed ratios for PU elastomers based on merely 25 data points with prediction accuracy (R2) exceeding 0.88, only mechanical properties are optimized in their work. Overall, achieving accurate prediction of feed ratios for self-healing PUs using small-sample data, even less than 25 samples while simultaneously balancing self-healing efficiency and mechanical performance remains a significant challenge. To the best of our knowledge, no relevant studies addressing this specific issue have been reported to date.
In this work, we introduce a chemically-informed active learning (CIAL, Fig. 1) framework for the efficient design of fluorescent self-healing PU, enabling multi-objective optimization with only 20 samples. The distinctive feature of the CIAL framework lies in its deep integration of chemical expertise throughout the entire machine learning pipeline from descriptor construction and data analysis to iterative sample selection, achieving synergistic optimization between human decision-making and optimization algorithm. Taking the self-developed fluorescent self-healing PU (PSSNM) as an example (Fig. S1), merely three iterations with 20 experimental data points successfully established six gradient boosting regression (GBR) models. These models accurately predicted the material's tensile strength (σ), elongation at break (ε), toughness (T), and their corresponding self-healing efficiencies (ησ, ηε, ηT) (with average prediction accuracy R2 > 0.8). Based on these high-precision models, a Pareto front was recommended through Non-dominated Sorting Genetic Algorithm III (NSGA-III) optimization. Experimental validation demonstrated excellent agreement between predicted and measured mechanical properties for optimal sample P20B (with relative errors <12% for comprehensive performance indicators), fully confirming the reliability of the CIAL framework. Furthermore, the property predictions of another self-healing PU system (PBP) using the CIAL framework demonstrated that merely 15 sample sets were sufficient to achieve desirable results. The predicted data showed excellent agreement with experimental measurements, with all comprehensive performance metrics maintaining an average relative error below 13%, which conclusively validated the universality of this framework. Utilizing the optimal P20B sample, an intelligent protective coating was developed for steel that integrates both self-healing functionality and real-time corrosion warning capability. This innovative coating solution provides a promising approach for intelligent protection of critical infrastructure such as oil pipelines. Through the deep integration of chemical intuition and data intelligence, this study establishes a universal machine learning framework that overcomes the “performance trade-off effect” in multifunctional PU materials with tiny data requirements.
By randomly varying the stoichiometric ratios of four components in PSSNM, nine samples were prepared and their key properties were measured including tensile strength, elongation at break, toughness, and self-healing performance (Table S1). Using the molar ratios of four raw materials as inputs, three models were trained using the GBR algorithm to establish relationships between material composition and three mechanical properties, respectively. Considering that the mechanical property values span 2–3 orders of magnitude, logarithmic transformations were applied. Self-healing performance was determined based on the ratios of tensile strength, elongation at break, and toughness after repair relative to their original values. Accordingly, three additional models were developed to predict the self-healing performance. The models demonstrate excellent fitting performance on the training set (R2 = 0.99, 0.93, 0.99, 0.99, 0.91, and 0.99, respectively), but display inferior predictive accuracy on the testing set (R2 = 0.30, 0.15, 0.40, 0.10, 0.35, and 0.19, respectively). The elevated mean absolute error values (MAE = 0.41, 0.25, 0.37, 0.10, 0.02, and 0.10, respectively) for the testing set (Fig. S2 and S3a) further confirm the presence of severe overfitting. This observation aligns with previous literature reports on machine learning performance with small datasets, where overfitting tends to occur due to limited data availability. Expanding the dataset size is a crucial approach to mitigate this issue. To distinguish the different models, the above six models are designated as MR-9.
To address the issue of data scarcity, the original dataset was expanded from 9 to 12 samples by supplementing three additional experimental specimens and systematically characterizing their performance parameters. The relevant formulations and corresponding performance are listed in Table S2. Based on the enhanced dataset, six predictive models were retrained using the same GBR algorithm (named MR-12). The degree of overfitting is significantly reduced, as evidenced by testing set R2 values of 0.57, 0.17, 0.66, 0.69, 0.66, and 0.52 (Fig. S3b and S4). Correspondingly, the MAE values show notable reductions compared to the original MR-9 models. However, the current model's predictive accuracy remains insufficient to meet the requirements for precise prediction of optimal material compositions. These results suggest that increasing the dataset size can partially alleviate overfitting. However, simply expanding the dataset neither fundamentally enhances model generalization capability nor aligns with our original objective of achieving “accurate prediction with tiny samples”. Therefore, subsequent research will focus on integrating materials science mechanistic analysis with expert knowledge to comprehensively investigate the causes of overfitting.
To reveal the contribution of each descriptor to prediction and investigate the underlying causes of overfitting, feature importance analysis based on GBR algorithm was conducted for the MR-9 and MR-12 models (Fig. S5). The results demonstrate that in the MR-9 model, both IPDI and NIAM variables show zero importance scores for toughness prediction, indicating their negligible contributions to the predictive outcomes. Similarly, these two variables exhibit importance scores below 10% in tensile strength prediction. The MR-12 model also presents inappropriate feature selection with the NIAM variable demonstrating near-zero importance scores for both toughness and corresponding self-healing performance predictions, while maintaining below 10% importance for tensile strength and corresponding self-healing performance predictions. Features with low importance typically contain considerable amounts of irrelevant or redundant information. Such noise is difficult to effectively identify and suppress during model training, leading to overfitting of anomalous patterns in the training data.30 As training progresses, the model's reliance on these noisy features intensifies, further diminishing its adaptability to new (testing) data and significantly compromising its generalization performance and predictive reliability. These findings suggest that models constructed from randomly selected data lack effective learning mechanisms. Therefore, integrating domain expertise to screen critical features and establishing high-quality specialized databases are essential for enhancing model predictive performance and reliability.31
Based on established synthetic protocols for PUs containing dual chain extenders, the most representative stoichiometric formulation follows the molar ratio PTMEG/IPDI/HEDS/NIAM = 0.25
:
0.5
:
0.125
:
0.125.33 This formulation adheres to the fundamental principle of maintaining a 1
:
1 molar ratio between hydroxyl groups and isocyanate groups, theoretically ensuring complete consumption of all functional groups. In this system, PTMEG, HEDS, and NIAM all serve as dihydroxy compounds, and their combined stoichiometry is designed to balance the isocyanate equivalents. Specifically, the 1
:
2 molar ratio between PTMEG and IPDI first generates NCO-terminated prepolymers, which subsequently react with the two chain extenders. The disulfide-containing HEDS and NIAM primarily contribute to the self-healing capability and mechanical strength. To achieve an optimal property balance, two distinct extenders are initially incorporated in equimolar proportions.
According to the above classical molar ratio of four components, at least 27 samples are theoretically required to establish four reactants with equal gradient variations. To minimize the required data, we strategically recombine the four components into three key parameters: reactive group molar ratio (Φ1), chain extender ratio (Φ2), and hard/soft segment ratio (Φ3). They are defined as:
![]() | (1) |
![]() | (2) |
![]() | (3) |
This parametric reconstruction not only retains the regulatory dimensions of the original four-component system but also significantly reduces the experimental scale. Three descriptors (Φ1, Φ2, Φ3) directly encode core chemical knowledge: functional group stoichiometry, the interplay between dynamic covalent bonds and hydrogen bonds, and the hard/soft segment microphase separation principle. Gouveia et al.34 demonstrated that increasing the NCO
:
OH molar ratio, i.e., Φ1, can enhance the tensile strength of PUs, while adjusting the molar ratio of two chain extenders can effectively regulate hydrogen bonding content, microphase separation degree, self-healing capability and crystallinity.35,36 Most notably, the modulation of hard/soft segment ratio (Φ3) enables not only performance transitions from soft to hard or brittle to tough, but also variations in the material's self-healing properties by regulating molecular chain mobility.37,38 By systematically adjusting these three variables with equal gradient intervals, the minimum required data are successfully reduced from 27 to 9 while still obtaining meaningful patterns in the four-reactant ratio combinations. This refined experimental design achieves a threefold reduction in sample size without compromising the precision of the system.
Taking the classical dual-chain-extender formulation (PTMEG/IPDI/HEDS/NIAM = 0.25/0.5/0.125/0.125) as our reference system, Φ1, Φ2, and Φ3 are 1, 0.5, and 0.33, respectively. By setting Φ1, Φ2, and Φ3 gradient intervals at 1/10, 1/4 and 1/3, the dataset containing nine representative formulations was established. Based on this, nine representative material formulations (3 levels) were designed to train the GBR models (detailed composition ratios and corresponding performance are listed in Table S3). With the molar ratios of four raw materials and three key variables as model inputs, predictive models were constructed using the GBR algorithm (MD-9A). As demonstrated in Fig. S6a, the MD-9A models exhibit higher precision than the MR-9 models for most prediction targets, and even outperform some MR-12 models, strongly demonstrating the effectiveness of the refined dataset and the importance of human decision in the optimization process. The utilization of three descriptors (Φ1, Φ2, Φ3) means that the model learns structure–property relationships within a chemically meaningful and constrained framework, which is helpful to enhance the reliability of the GBR models, especially for small datasets. The reliability of MD-9A was validated using the leave-one-out cross-validation (LOO-CV) method (Fig. S7). The following models are also testified by the LOO-CV method. Based on the established single-objective predictive models, the NSGA-III algorithm was further employed for multi-objective optimization. To facilitate the comprehensive evaluation of PSSNM, mechanical property indicators (σ, ε, T) and self-healing capabilities (ησ, ηε, ηT) were transformed into two comprehensive targets, Z1 and Z2, respectively. They are defined as:
![]() | (4) |
![]() | (5) |
Subsequently, the Pareto front was recommended and plotted in Fig. S6b and c. It should be noted that the Pareto front in this study is dynamic and iteratively updated, defined as the set of non-dominated solutions within the current dataset. The goal of multi-objective optimization is to identify candidate points with predicted performance that surpasses this current front. After experimental validation, these candidates are incorporated into the dataset to update the front, meaning that breaking through the front represents an iterative and proactive expansion of the performance space. Unlike static Pareto fronts determined after optimization,39 the dynamic front here is updated in real time with new data, enabling efficient discovery of high-performance candidates and adaptive exploration of the evolving chemical space. However, the optimization results do not break through the Pareto front due to constraints imposed by the variable space constructed from the initial nine datasets. While the optimization algorithm can effectively explore existing data space, the inherent limitations of sample space objectively constrain the enhancement of optimization effects, highlighting the necessity to expand datasets or adapt sampling strategies.40
To expand the parameter exploration space, this study refined the experimental design strategy by increasing the Φ1 gradient interval from 0.1 to 0.2, enabling a more comprehensive and systematic search for optimal solutions. Accordingly, nine additional representative material formulations (Table S4) were employed for model training, with the newly designed nine formulations used to retrain the predictive model (MD-9B). The model demonstrates exceptional fitting performance on the training set, with R2 values of 0.99, 0.99, 0.99, 0.43, 0.56, and 0.79 respectively, while maintaining significant predictive accuracy on the testing set, yielding corresponding R2 values of 0.99, 0.76, 0.99, 0.31, 0.36, and 0.64 (Fig. S6d). Compared with MR-9 and MR-12 models, these results further validate that improving data quality is substantially more effective than indiscriminate data expansion for enhancing predictive accuracy, thereby providing empirical evidence for the robustness and effectiveness of integrating DoE with human decision strategies. Unfortunately, the optimization results still fail to completely surpass the Pareto front due to limited quantity of available data (Fig. S6e and f).
Since data scarcity is the critical bottleneck, we integrated data from the above two phases to construct a new dataset containing 15 samples (detailed data are listed in Table S5). The MD-15 models were trained (model accuracy detailed in Fig. S8a) followed by multi-objective optimization. Interestingly, the MD-15 models not only maintain high predictive accuracy but also successfully break the Pareto front (Fig. S8b and c). Two optimal points (denoted as P15A and P15B) were subsequently selected for experimental verification, and the corresponding samples were prepared according to the two recommended formulations outlined in Table S6. The mechanical and self-healing properties of P15A and P15B samples were evaluated by tensile testing (Fig. S8d), and the detailed mechanical property data and corresponding self-healing efficiency are summarized in Fig. S8e, f and Table S7. The tensile strength, elongation at break, and toughness of P15A are 6.06 MPa, 165.04%, and 8.32 MJ m−3, while the corresponding predicted values are 10.65 MPa, 837.94%, and 23.87 MJ m−3. The relative errors between logarithmic values of predicted and measured mechanical properties are 31.3%, 31.8%, and 49.7%, respectively, which are far beyond the acceptable error limitation. P15B exhibited even greater deviations between predicted and experimentally measured values, with relative errors of 116.4%, 122.0%, and 455.8% for tensile strength, elongation at break, and toughness, respectively, suggesting prediction failure.
The Φ2 values consistently approach zero for both P15A and P15B in 15-entry dataset, corresponding to near-zero HEDS molar ratio in these two samples (as shown in Table S6). Notably, this parameter range is absent from the original database. Thus, the inaccurate predictive results may also stem from a significant gap in the parameter space coverage, particularly within the Φ2 value range, besides insufficient data volume. When the multi-objective optimization algorithm recommends samples that are chemically unreasonable or fall outside an existing experimental design space (e.g., for formulations with extremely low Φ2 values, indicating near-absence of the dynamic disulfide bond carrier HEDS), the decision whether to synthesize these samples from high-uncertainty regions should be guided by integrated expertise in chemistry and machine learning. Since disulfide bonds are essential for self-healing performance, the samples without disulfide bonds can fill the specific “chemical gap” in the parameter space, thereby enhancing prediction precision.
Experimental results demonstrate that P20C sample exhibits optimal self-healing performance, while P20B shows the highest tensile strength. Moreover, P20B achieves the best balance between mechanical features and self-healing performance. The successful synthesis of P20B and P20C was confirmed by fourier transform infrared spectroscopy (FT-IR) and proton nuclear magnetic resonance (1H NMR) spectra (Fig. S9 and S10). FT-IR analysis confirms the formation of urethane linkages and the presence of naphthalene rings in both P20B and P20C, with residual –NCO observed only in P20B due to excess IPDI. 1H NMR spectra further verify the successful incorporation of PTMEG, IPDI, HEDS, and NIAM into the polymer backbones. To investigate the underlying mechanism of these performance differences between P20B and P20C, we systematically analyzed their hydrogen bonding fractions and microphase separation behavior. The P15B was also measured as a baseline control representing high hydrogen-bonding conditions. The absorption region related to C
O group (1600–1760 cm−1) in the FT-IR spectrum was deconvoluted into three characteristic peaks: free C
O at 1718 cm−1, ordered H-bonded C
O at 1636 cm−1, and disordered H-bonded C
O at 1687 cm−1 (Fig. S11).41 Quantitative analysis indicates that the total proportion of ordered and disordered H-bonded C
O in P15B reaches 70.0%, significantly higher than that in P20B (57.5%) and P20C (53.2%) (Fig. 2i). This difference in hydrogen bond density directly influences the microphase separation behavior of the materials.42 The microphase separation properties of three aforementioned samples were characterized by means of atomic force microscopy (AFM). As shown in Fig. 2j, AFM results confirm that P15B exhibits the most pronounced microphase separation due to its higher hydrogen bond density and hard segment content, while P20B and P20C show intermediate and weaker microphase separation characteristics, respectively.
In-depth structure–property relationship studies reveal that the high hydrogen bond density in P15B originates from its elevated molar ratio of NIAM monomer and higher hard segment content. The synergistic effect between the rigid aromatic rings in NIAM and the dense hydrogen-bonding network in the hard segments forms a highly rigid three-dimensional network structure. This not only severely restricts molecular chain mobility but also eliminates material elasticity, resulting in typical plastic deformation behavior and extremely poor stretchability and self-healing performance. Furthermore, the exceptionally low relative content of dynamic disulfide bonds (HEDS) in P15B further deteriorates its self-healing capability. In contrast, P20C significantly reduces hydrogen bond density by decreasing the NIAM molar ratio and hard segment content. While this modification provides excellent chain mobility and self-healing performance, it comes at the cost of substantially reduced load-bearing capacity. These comparative results clearly demonstrate the inherent trade-off between mechanical properties and self-healing capability in PU materials. The moderate hydrogen bond density, optimal rigid ring amount, well-designed disulfide bond content, and appropriate phase separation degree in P20B ensure mechanical strength while maintaining sufficient self-healing efficiency. This multi-scale structural coordination strategy provides important guidance for designing high-performance self-healing PU materials, fully demonstrating that rational design of molecular composition and structural parameters enables precise control of material properties.
To further validate the generalizability of this model, we prepared 15 new self-healing PU samples (labeled PBP) using the identical gradient design principle (Fig. S12 and Table S13). In PBP, PTMEG as the soft segment was reacted with 4,4′-dicyclohexylmethane diisocyanate (HMDI) to form a prepolymer. Then, 4,4′-biphenol (BP) and 2,6-pyridinedimethanol (PDM) were added step by step as chain extenders to obtain PBP. Using the molar ratios of four new raw materials and three key variables as input parameters, a predictive model designated as MG-15 was developed based on the GBR algorithm. As shown in Fig. 4a, the MG-15 model exhibits excellent fitting performance on the training set and maintains good predictive accuracy on the testing set, demonstrating high overall precision. Furthermore, the robustness of the model is evaluated using LOO-CV, with the results (Fig. S13) indicating that the MG-15 model performs comparably to the reference models MD-15 and MD-20, thereby confirming its strong generalization capability and stability. Building upon the established single-objective predictive models, the NSGA-III was employed for multi-objective optimization. As illustrated in Fig. 4b and c, the optimization process successfully surpasses the Pareto front. Notably, expert chemical judgment was applied to assess the rationality of the recommended formulations. Consequently, two optimized formulations (PG15A and PG15B) were selected for experimental validation. The synthetic formulations with detailed feed ratios are summarized in Table S14, while Fig. 4d–f, S14 and Table S15 present the corresponding properties. The tensile strength, elongation at break, and toughness of PG15A are 3.91 MPa, 2482.09%, and 61.41 MJ m−3. These measurements demonstrate excellent agreement with model predictions (4.33 MPa, 2894.40%, and 33.87 MJ m−3, respectively), as evidenced by logarithmic relative errors of merely 1.9–14.5%. The self-healing efficiencies also demonstrate good agreement between experimental measurements (ησ = 89.0%, ηε = 81.8%, ηT = 82.6%) and model predictions (ησ = 100.0%, ηε = 97.9%, ηT = 93.0%), with relative errors ranging from 12.4% to 14.9%. PG15B exhibits comparable performance to PG15A, with property prediction errors maintained within 1.3–19.5%. The above results testify the scientific validity and practical applicability of the proposed modeling and optimization framework. The Z1 and Z2 values with relative errors for PG15A and PG15B are below 13% (Fig. S15 and Table S16). As to the recommended PG15C sample, the predicted HMDI proportion is excessive leading to a pronounced imbalance in the stoichiometry of –NCO groups. From a fundamental chemical perspective, the PG15C will not exhibit the ideal features. Consequently, the experimental results significantly deviate from predictions (Fig. S14 and Table S15). The incorporation of chemical domain knowledge enables effective pre-screening of predicted formulations, significantly enhancing the reliability of machine learning predictions by eliminating unrealistic candidates. Principal component analysis (PCA) was employed to reduce the dimensionality of the model inputs. As shown in Fig. S16, the recommended formulations from the MD-15 and MD-20 models (e.g., P15A, P20A) and those from the MG-15 model (PG15A, PG15B) are all located within a consistent region defined by PCA1 values between −1 and 0 and PCA2 values between 2 and 4. In contrast, PG15C falls outside this region, exhibiting inferior performance. PCA can serve as a complementary tool for preliminary evaluation of the reliability of recommended formulations following multi-objective optimization.
For the PSSNM system, a larger experimental dataset (20 samples) is required to effectively surpass the Pareto front and achieve successful predictions, whereas only 15 samples are necessary for the PBP system. This difference is mainly attributed to performance distribution in the dataset. The MD-15 dataset exhibits a relatively uniform performance distribution (Fig. S17a–c); nevertheless, the samples recommended by the MD-15 model exhibit much larger tensile strength than those in the original dataset, indicating the considerable potential for further improvement in this performance dimension. Conversely, the optimized values for elongation at break and toughness fall within the original data range, suggesting that these properties may be approaching their theoretical limits within the current material system. The samples (P20A and P20B) recommended by the MD-20 model further support this interpretation, achieving improvements in tensile strength while maintaining high predictive accuracy (Fig. S17d–f). MG-15 contains several extreme outliers and the samples recommended by the MG-15 model show only marginal or no improvements across all three properties (Fig. S17g–i), indicating that the material performance rapidly approaches its theoretical limits under the current optimization cycle. To further investigate the effect of incorporating high-performance samples into the training data, two strategies were explored. In the first strategy, P20A and P20B were used to replace samples P1 and P3 (Entries 1 and 4, Table S5) in the MD-15 dataset. In the second strategy, P20A and P20B were directly added to the MD-15 dataset without removing any existing samples, yielding an expanded training dataset. New models (denoted as MD-15A and MD-15B) were trained based on these datasets and subsequently subjected to multi-objective optimization. As shown in Fig. S18, both MD-15A and MD-15B yield recommended samples exhibiting only marginal or no improvements across all three target properties, consistent with the trends observed for the MG-15 model. In general, the presence of exceptional high-performance data points enables satisfactory prediction outcomes even with remarkably small sample sizes (as few as 15 samples). For data exhibiting concentrated distribution, a targeted expansion to 20 data points can yield comparable predictive performance.
Notably, the incorporation of the fluorescent NIAM moiety endows the material with unique photoluminescent properties, exhibiting a strong emission peak at 530 nm under 365 nm UV excitation (Fig. S20). When the coating is damaged, the rough fracture surface enhances the refraction and resists the internal reflection, leading to localized fluorescence intensification at the crack sites (Fig. 5e).44,45 As the self-healing process progresses, the surface gradually becomes smooth, accompanied by a corresponding decrease in fluorescence intensity until complete recovery to the initial state. This feature enables real-time visual monitoring of coating damage and precise tracking of the self-repair progress. As shown in Fig. 5f, the scratched region exhibits significantly enhanced fluorescence under UV illumination, whereas the fluorescence signal gradually diminishes to the background level upon healing. Optical microscope images (Fig. S21) confirm the complete disappearance of surface cracks after 6 h of healing at 60 °C, which unequivocally validates the reliability of this fluorescence-based damage warning mechanism. This multifunctional smart coating material, integrating corrosion protection, damage warning, and self-healing capabilities, holds significant scientific and practical value for extending the service life of metallic materials and enhancing the safety of engineering structures.
Materials, characterization details, mechanical and self-healing testing methods, and other machine learning details are stated in the SI.
Footnote |
| † These authors contributed equally to this work. |
| This journal is © The Royal Society of Chemistry 2026 |