Hongchul Shinab,
Taeyoung Yoon
*b and
Sungmin Yoon
*b
aDepartment of Mechanical Engineering, Korea University, Seoul, 02841, Republic of Korea
bDepartment of Mechanical Engineering, Changwon National University, Changwon, 51140, Republic of Korea. E-mail: tyyoon@changwon.ac.kr; yoonsm@changwon.ac.kr
First published on 13th May 2025
Low-cycle fatigue (LCF) data involve complex temporal interactions in a strain cycle series, which hinders accurate fatigue life prediction. Current studies lack reliable methods for fatigue life prediction using only initial-cycle data while simultaneously capturing both temporal dependencies and localized features. This study introduces a novel deep-learning-based prediction model designed for LCF data. The proposed approach combines long short-term memory (LSTM) and convolutional neural network (CNN) architectures with an attention mechanism to effectively capture the temporal and localized characteristics of stress–strain data from acquisition through a series of cycle strain-controlled tests. Among the models tested, the LSTM-contextual attention model demonstrated superior performance (R2 = 0.99), outperforming the baseline LSTM and CNN models with higher R2 values and improved statistical metrics. The analysis of attention weights further revealed the model's ability to focus on critical timesteps associated with fatigue damage, highlighting its effectiveness in learning key features from LCF data. This study underscores the potential of deep-learning-based methods for accurate fatigue life prediction in LCF applications. This study provides a foundation for future research to extend these approaches to diverse materials with varying fatigue conditions and advanced models capable of incorporating non-linear fatigue mechanisms.
Recent advancements in material modeling have aimed to improve the accuracy of fatigue life predictions.9 Many studies have used cumulative data from multiple loading cycles to predict fatigue life, often relying on empirical relationships or machine learning models.10–12 While these approaches provide valuable insights, they face limitations in capturing local stress–strain behavior during high strain cycles, which are crucial for understanding material failure. In addition, these models often require extensive datasets, which may not always be available in industrial settings.13
Historically, empirical models have been used to relate fatigue life to mechanical properties such as damage parameters under monotonic loading.10–12 However, these models often oversimplify the intricate relationships between cyclic loading, material microstructure, and fatigue behavior. Machine learning methods have shown potential to overcome these limitations by learning complex, nonlinear relationships directly from data.14,15 However, the interpretability of these models and their reliance on large datasets are significant obstacles.
Convolutional Neural Networks (CNNs) have been associated with image analysis but have also demonstrated effectiveness in time-series data analysis.16,17 Unlike Recurrent Neural Networks (RNNs), which process data sequentially and rely on hidden states to capture dependencies, CNNs utilize convolutional layers to extract local patterns and features directly from the input data.18 This approach allows CNNs to identify localized temporal patterns, such as stress or strain peaks within a cycle, without being constrained by sequential processing.
A key advantage of CNNs over RNNs is their resilience to issues like the vanishing gradient problem.19 In RNNs, gradients can diminish exponentially during backpropagation over time, leading to the loss of information from earlier timesteps and limiting the model's ability to capture long-term dependencies. In contrast, CNNs leverage hierarchical feature extraction through multiple convolutional layers, enabling the model to capture both short- and long-range temporal dependencies effectively. Additionally, the use of pooling layers in CNNs helps reduce the dimensionality of the data while preserving critical features, further mitigating the computational challenges associated with deep network architectures.20 The long Short-Term Memory (LSTM) networks address this limitation by introducing cell states and gating mechanisms, specifically designed to retain important information over extended sequences.21,22 The forget gate determines which information to discard, the input gate decides which information to add to the cell state, and the output gate regulates the flow of information to the next time step. These mechanisms allow LSTM networks to prioritize and maintain relevant temporal features, making them more robust in modeling long-term dependencies compared to standard CNNs.23 In the context of fatigue life prediction, where stress–strain behavior over time plays a critical role, LSTMs provide an effective way to capture the progression of material degradation within a cycle.
Furthermore, the addition of contextual attention mechanisms enhances LSTM networks by focusing on the most critical temporal features within a sequence while maintaining the data's temporal order.24 First, unlike self-attention, which models all pairwise interactions within a sequence, contextual attention respects the sequential structure of time-series data.25,26 This is particularly important in fatigue life analysis, where the cyclic stress–strain relationship depends on the ordered progression of states within a single cycle. Second, contextual attention emphasizes localized features, such as stress or strain peaks, in their immediate context.27 These points, which are highly indicative of fatigue life behavior, are often overlooked by self-attention's distributed focus across the entire sequence.28 Third, self-attention mechanisms are better suited for tasks requiring global dependencies, such as natural language processing, where temporal order is less critical.29,30 However, in fatigue life analysis, the localized interactions within the stress–strain cycle, particularly at extreme conditions, are the key drivers of material failure and life prediction.31 Fourth, the use of contextual attention directly aligns with the nature of the dataset by highlighting these critical localized dynamics while preserving the temporal flow.32,33 This makes contextual attention a better fit for predicting fatigue life from a single cycle, where local stress–strain patterns are the most significant contributors to accurate life predictions.34
Recent advancements in fatigue life prediction of metallic materials have leveraged machine learning techniques to address the challenges of accurately reflecting mechanical properties and complex loading conditions. Chen et al. proposed a multi-view neural network model that integrates frequency domain analysis with CNN and LSTM to predict the fatigue life of metallic materials under multiaxial loading, demonstrating robust performance across diverse experimental datasets.11 Similarly, Min et al. explored the interplay between processing parameters, post-treatment conditions, and fatigue properties of additively manufactured metals, utilizing machine learning models, including CNNs, to predict fatigue life and crack growth rates.12 Moon et al. focused on the influence of surface roughness and pore characteristics on the fatigue life of laser powder bed fusion Ti–6Al–4V alloys, employing neural networks to establish quantitative relationships with high predictive accuracy.10
Unlike other methods that rely on large datasets with multiple cycles or various material parameters, our model takes a unique approach by using a contextual attention mechanism and focusing on data from just the initial single cycle.32 The contextual attention mechanism improves accuracy and interpretability by highlighting the most important features within the initial cycle, helping the model identify early signs of fatigue that might be missed when analyzing many cycles. Using only the initial cycle makes the model faster and more efficient, as it doesn't need to process large datasets, while still capturing critical material properties like microstructure or surface conditions. However, our study is the first to apply a contextual attention mechanism for fatigue life prediction, highlighting its novelty and contribution to the field.
In this study, we propose an LSTM-based fatigue life prediction framework that incorporates a contextual attention mechanism. Empirical fatigue dataset was used to assess our prediction model. Unlike traditional models that require multiple cycles of data, this approach predicts fatigue life using only the stress–strain data from a single initial cycle. By focusing on local stress–strain interactions during extreme strain events, the model provides improved accuracy while maintaining interpretability. To ensure accessibility and reproducibility, the model is deployed as a Python, allowing seamless integration into industrial workflows. This study sheds light on the AI-based fatigue field with predicting final fracture from experimental data.
Fe | C | Si | Mn | P | S | Ni | Cr | Mo |
---|---|---|---|---|---|---|---|---|
Balance | ≤0.08 | ≤1.00 | ≤2.00 | ≤0.045 | ≤0.03 | 10.00–14.00 | 16.00–18.00 | 2.00–3.00 |
The experiments followed the ASTM E606-21 Standard Test Method for Strain-Controlled Fatigue Testing and compared two treatment types: (1) Type 316 stainless steel subjected to heat treatment only and (2) Type 316 stainless steel subjected to additional high-density pulsed electric current (HDPEC) processing after heat treatment.7,37 Dogbone-shaped fatigue specimens, with a thickness of 1.5 mm and a shape radius of 20 mm, were prepared, as shown in Fig. 1. A notch with a radius of 0.2 mm and length of 2 mm was machined at the center of one side to induce mode I fatigue cracking.36 The study focused on evaluating one-way cracking to assess the effects of HDPEC on damage evolution during fatigue. An electric current was applied to the HDPEC-treated specimens, and their fatigue behavior during crack initiation and propagation was compared to that of heat-only specimens.13
Each sample in the dataset represents a single cycle with 50 timesteps, where stress and strain values were recorded at regular intervals.13,38,39 The dataset spans various strain ranges (εmin to εmax), including ranges such as [0.18–0.36], [0.27–0.54], [0.198–0.396], [0.216–0.432], [0.234–0.468], [0.252–0.504], [0.108–0.216], [0.126–0.252], [0.144–0.288], and [0.162–0.324]. Each sample also includes its fatigue life (Nf), defined as the number of cycles until material failure, which ranges from 2000 to 90000 cycles. Data with incomplete cycles (cycle 1) or irregular stress–strain behavior were excluded to ensure consistency and reliability in the analysis.
![]() | (1) |
The fatigue life value Nf was normalized separately using MinMax scaling, which scaled Nf values to the range [0, 1] according to:
![]() | (2) |
K-fold cross-validation and stratified K-fold cross-validation differ in how they handle randomization and data distribution. K-fold relies on random shuffling to ensure diversity in each fold, whereas stratified K-fold explicitly maintains the proportional representation of features or labels across folds, making it useful for datasets with imbalanced distributions because it prevents underrepresented groups from being excluded. However, K-fold may introduce an imbalance unless the dataset is uniform. In this study, a stratified K-fold was initially applied to preserve the distribution of material treatments and strain ranges. Despite this, K-fold with shuffling ultimately provided superior predictive accuracy and stability because the added diversity from randomization enhanced the model's ability to generalize to unseen data.41 Therefore, K-fold cross-validation with shuffling was chosen for constructing the final model.
For K-fold cross-validation, the dataset D was divided into k folds, denoted as D = D1, D2, ⋯, Dk. For each fold i, training set Ti was defined as Ti = D − Di and the validation set Vi was defined as Vi = Di. The model was trained on Ti and evaluated on Vi, and the performance scores across all folds were aggregated to compute the overall validation score:
![]() | (3) |
![]() | (4) |
The cumulative energy Ecumulative absorbed by the material across all cycles was calculated as the sum of the plastic deformation energy for all cycles
![]() | (5) |
To analyze the behavior of stress and strain over time, the differences and variances of stress and strain values at each timestep were computed. These metrics provide insights into the dynamic changes and variability of stress and strain throughout the fatigue cycles.
The differences in stress and strain between consecutive timesteps were calculated as:
Δσt = σt+1 − σt | (6) |
Δεt = εt+1 − εt | (7) |
To evaluate the variability in stress and strain at each time step, we computed the variance across all samples as follows:
![]() | (8) |
![]() | (9) |
Correlation analysis and flattened principal component analysis (PCA) were conducted to assess whether the time-series stress and strain data could effectively predict the fatigue life Nf. The correlation analysis was aimed at identifying the timesteps in the stress–strain cycle that showed a significant relationship with Nf, while flattened PCA was used to evaluate whether the dataset could be meaningfully separated according to Nf values.
The purpose of the temporal correlation analysis was to examine the relationship between Nf and each feature at individual timesteps, identifying specific points in the cycle that are most predictive of fatigue life.43 To accomplish this, we computed a temporal correlation coefficient at each timestep, denoted as rt to capture the linear association between Nf and the stress or strain values at timestep t. This approach allows us to quantify the relationship between Nf and each specific point within the cycle. The temporal correlation coefficient rt for a given timestep t is defined as:
![]() | (10) |
The purpose of the flattened PCA was to determine if Nf values could be meaningfully separated in the reduced dimensional space, thus suggesting that the stress–strain data contain patterns indicative of different fatigue life outcomes.44 In this approach, each stress–strain cycle was flattened into a single vector, capturing the entire cycle's stress and strain values. PCA was then applied to these flattened vectors to reduce dimensionality and extract the principal components that account for the most variance in the dataset.
The first two principal components (PC1 and PC2) were plotted against each other, with each point colored according to its Nf value.45 This visualization revealed potential clusters in the data, showing whether samples with similar Nf values tended to group together in PCA space. Such clustering would indicate that the primary components capture significant features of the stress–strain data that are relevant to fatigue life prediction. The transformation for each principal component PCk is given by:
![]() | (11) |
To systematically identify the optimal architecture for each model, we performed hyperparameter optimization tailored to both the LSTM and CNN based frameworks.47 For the LSTM with contextual attention model, we utilized Optuna, a Bayesian optimization framework, to minimize validation MAE. The search space included the number of LSTM layers (ranging from 2 to 6), the number of units per layer (32 to 256), dropout rates (0.1 to 0.5), dense layer configurations (32 to 128 units), and learning rates (from 1 × 10−5 to 1 × 10−2, log-uniform). The best-performing model consisted of five stacked LSTM layers with 128 units per layer, intermediate dropout of 0.3, a 64-unit dense layer, and an Adam optimizer with a learning rate of 0.001.
For the CNN-based models, we applied Keras Tuner with the Hyperband algorithm, which efficiently searched the space of candidate architectures. The hyperparameters included the number of convolutional layers (2 to 4), number of filters (16 to 128), kernel sizes (3 to 7), dense layer units (32 to 128), dropout rates (0.1 to 0.4), and learning rate (1 × 10−4 to 1 × 10−2). The final CNN model selected through this process employed three convolutional layers with kernel size 5, max pooling, 0.2 dropout, and a 64-unit dense layer.
Final model selection across all architectures was based on quantitative performance metrics, including R2, MAE, and RMSE averaged over five independent training iterations. As reported in ESI Table S2,† the LSTM-contextual attention model consistently demonstrated statistically significant improvements over all other models (p < 0.05), supporting its suitability for fatigue life prediction using single-cycle time series data.
The final model, which combined LSTM with contextual attention, consisted of five stacked LSTM layers followed by a custom attention layer. Each LSTM layer was accompanied by a dropout layer to mitigate overfitting. The attention mechanism computed weights for each timestep (αt) as:
![]() | (12) |
![]() | (13) |
This mechanism enabled the model to focus on critical stress–strain regions within a single cycle.
The coefficient of determination (R2) measures how well the predicted values align with the true values and is defined as:14
![]() | (14) |
The mean absolute error (MAE), which quantifies the average absolute difference between the predicted and true values, is given as:48
![]() | (15) |
The root mean square error (RMSE), a metric that penalizes larger errors more heavily, is defined as:48
![]() | (16) |
With increasing cycle numbers, the hysteresis loop area progressively decreased, and the stress values declined as the material underwent fatigue damage. Near the fatigue failure, as observed in cycles 19260 and 15
640, the loop area became minimal, indicating a sharp reduction in the energy required for plastic deformation. This trend suggests that material strength weakens over time owing to cumulative fatigue damage, and in the final cycles, the energy drops sharply, indicating the inability of the material to sustain further deformation.
The stress–strain envelope curves in Fig. 4b provide further insight into the fatigue progression. In cycle 20, the material exhibited the highest stress and strain values and consumed most of the deformation energy. However, in intermediate cycles, such as cycles 7820 and 9640, the stress values gradually decrease, reflecting the progressive strength reduction of the material.13 Near the end of fatigue life, the envelope curves narrowed significantly and the stress values became minimal, indicating that the material approached its failure point. This behavior highlights the limited capacity of the material to accommodate additional deformation as fatigue progresses.1
To quantify the energy changes further, the cumulative plastic deformation energy was calculated by integrating the hysteresis loop areas. Fig. 5 compares the cumulative energies of the heat- and heat-HDPEC-treated specimens across various strain ranges. The results show that cumulative energy decreases with an increasing number of cycles, which is consistent with the trends observed in Fig. 4.
![]() | ||
Fig. 5 Comparison of cumulative energy between heat-treated and heat-HDPEC-treated Type 316 stainless steel specimens across various strain ranges (Table S1, ESI dataset†). The red bars represent heat-treated specimens, while the orange bars represent heat-HDPEC-treated specimens. |
In low strain ranges, HDPEC-treated specimens exhibit higher cumulative energy compared to heat-treated specimens, suggesting that HDPEC treatment improves the material's ability to sustain deformation energy over a longer period. In contrast, at high strain ranges, cumulative energy decreases rapidly for both specimen types, indicating accelerated fatigue damage and shorter fatigue lives. Nevertheless, the HDPEC-treated specimens retain relatively higher cumulative energy levels compared to the heat-treated specimens, even in high strain ranges.
These results suggest that heat-HDPEC treatment enhances fatigue resistance by reducing the plastic deformation energy during cyclic loading.3,13 The improved energy retention in the heat-HDPEC-treated specimens indicates that the treatment delayed the onset of significant fatigue damage under low strain conditions. Conversely, under high-strain conditions, energy depletion occurs more rapidly, accelerating fatigue failure.
The combination of hysteresis loop analysis (Fig. 4) and cumulative energy comparison (Fig. 5) demonstrated that heat-HDPEC treatment improved the fatigue resistance of the material. The heat-HDPEC-treated specimens showed a slower decline in plastic deformation energy and sustained more energy through later cycles than the heat-treated specimens. These findings confirm that heat-HDPEC treatment extends fatigue life by delaying fatigue damage accumulation and maintaining material strength over a greater number of cycles.
![]() | ||
Fig. 6 Temporal correlation analysis between stress (blue), strain (red), and fatigue life. Correlation analysis for specimens with fatigue life in the range of (a) 0–20000 and (b) 0–80000 cycles. |
![]() | ||
Fig. 7 Flattened PCA analysis of stress–strain data for type 316 stainless steel specimens. Analysis for specimens with fatigue life in the range of (a) 0–20000 cycles and (b) 0–80000 cycles. |
Fig. 6 shows that both the stress–fatigue life and strain–fatigue life correlations exhibit periodicity across timesteps, with significant peaks and troughs corresponding to the critical phases within the stress–strain cycles. Positive correlation peaks, nearing +1, highlight time intervals where changes in stress and strain are highly predictive of fatigue life, whereas negative correlation peaks, nearing −1, indicate intervals where these changes are inversely correlated with fatigue life. The similarity between the stress and fatigue life and strain–fatigue life correlation curves suggests that both features provide complementary and essential information for predicting fatigue life. These findings emphasize the necessity of capturing localized temporal dynamics within stress–strain cycles for accurate predictive modeling.
Fig. 7 shows the distribution of data under the two conditions. The left panel includes fatigue life values ranging from 0 to 20000, whereas the right panel incorporates the entire dataset, spanning fatigue life values from 0 to 90
000. In the left panel, the data points align along a linear trend, as denoted by the red dashed lines, suggesting that the primary variance in the data is explained by the linear relationships between the first two principal components. This linearity indicates that the stress–strain patterns in the lower fatigue life regimes are uniform and predictable. In contrast, the right panel shows a more complex distribution when the full dataset is considered. While most data points conformed to the linear trend observed in the left panel, a distinct cluster emerged for fatigue life values exceeding 60
000, as highlighted by the red circle. This deviation suggests the presence of non-linear dynamics or unique stress–strain behaviors associated with high fatigue life.
Combining temporal correlation analysis (Fig. 6) and flattened PCA (Fig. 7) provided significant insights into the behavior of stress and strain data across different fatigue life regimes. Fig. 6 emphasizes the critical importance of specific time intervals within the stress–strain cycles, supporting the necessity of temporal attention mechanisms in prediction models. Meanwhile, although majority of the data in Fig. 7 can be described by linear trends, high fatigue life cases exhibit distinct non-linear complexities, necessitating tailored modeling approaches for these regimes.
To apply these findings, both LSTM and CNN models were employed to predict the fatigue life, capturing the linear and non-linear dynamics revealed by Fig. 6 and 7, respectively. This combined approach aims to improve predictive accuracy by integrating the temporal and spatial features inherent in the stress–strain data.
Similarly, the CNN model (Fig. S4†) extracts localized spatial patterns from the input stress–strain data through convolutional layers. In a CNN with attention model (Fig. 3), the attention mechanism assigns importance to specific local features, further refining the predictive process. While the baseline CNN effectively captures local dependencies, it lacks the ability to model the long-term temporal relationships inherent in the LCF data.
The results demonstrated the superior performance of the LSTM with attention model for fatigue life prediction (Fig. 8 and S5†). Statistically significant improvements across all performance metrics (R2, MAE, and RMSE) were observed when comparing the LSTM with attention model to the baseline LSTM model (Table S2,† p-values: R2 = 0.0001, MAE = 0.0003, and RMSE = 0.0000). The attention mechanism allows the model to focus on critical timesteps within the cyclic data, leading to improved predictive accuracy and reliability.
When compared with the CNN with attention model, the LSTM with attention also achieved statistically significant improvements (Table S2,† p-values: R2 = 0.0320, MAE = 0.0474, RMSE = 0.0006). These results highlight the ability of LSTM to effectively capture the long- and short-term temporal dependencies inherent in the LCF data. While the attention mechanism benefits both models, the LSTM with attention better combines global temporal patterns and localized critical features, which are essential for accurate fatigue life prediction. By contrast, the addition of attention to CNN did not result in statistically significant improvements (Table S2,† p-values: R2 = 0.9379, MAE = 0.2364, RMSE = 0.9999). CNN, which excels in local feature extraction, struggles to model the sequential nature of LCF data. Even with the inclusion of attention, CNN processes sequences as discrete segments, limiting its ability to capture broader temporal dependencies.49
The LCF data exhibited intricate stress–strain interactions over multiple cycles, often involving nonlinear evolution. The LSTM with attention model effectively addresses this complexity by capturing long-term dependencies across cycles while focusing on key phases within each cycle. In contrast, the local feature extraction of CNN cannot fully leverage the sequential nature of LCF data, leading to a lower performance. These findings underscore the importance of selecting models that align with the inherent characteristics of data, particularly for time series problems such as low-cycle fatigue.50 The combination of LSTM and attention provides a robust solution for capturing both global and localized temporal patterns, making it well-suited for fatigue life prediction in LCF datasets.
Short fatigue life | Long fatigue life | |
---|---|---|
Heat | Hysteresis loop tends to be quite wide, indicating substantial plastic deformation | Even though some plasticity emerges, the attention remains mostly near the tensile peak |
The attention mechanism primarily focuses on the tensile peak (red). | The attention mechanism primarily focuses on the tensile peak (red). | |
HDPEC | Hysteresis loop is wide, but more local variations in deformation may appear | Cyclic behavior in-between elastic and plastic regimes may prominent, in which plastic proportion lost its effect in this mechanical response |
The attention mechanism focuses not only tensile peak (red) but also transitions between tension and compression (purple). | The attention mechanism focuses transitions between tension and compression, not peak stress |
Fig. 9 presents the attention weight distributions computed by our model when predicting the fatigue life solely from these 20th-cycle hysteresis loops, where the red line denotes the global maximum attention, the blue line denotes the global minimum, and the purple line denotes the local maximum. The red line appears near the tensile peak or regions where the plastic strain is prominent, particularly in the short-life specimens. For heat-HDPEC specimens with a long fatigue life, attention often shifts toward the midrange stress or strain, reflecting the cyclic behavior dominated by elastic responses and the reduced influence of plastic deformation. Such specimens tended to exhibit pronounced plastic deformation relatively early, causing the model to devote maximum attention. The blue line represents regions of low informational value, occurring in near-elastic unloading segments or areas with minimal stress–strain variations. While it can appear in both short- and long-life materials under either heat or heat-HDPEC treatment, it is often more noticeable in long-life specimens that have not accumulated substantial plasticity and therefore have extended elastic-like intervals. The purple line represents additional local maxima that are not as dominant as the red line and can appear near transition points, such as tension-compression transitions, or when minor asymmetry or ratcheting emerges. Notably, heat-HDPEC specimens often exhibit cyclic behavior transitioning between the elastic and plastic regimes, where the plastic proportion loses its significance in the mechanical response.7,8,37 Even with high fatigue life, subtle changes in midrange stress or strain can serve as significant predictors (Fig. 9 purple line).
![]() | ||
Fig. 9 Attention weights from the best-performing LSTM-contextual attention model across different fatigue life (red line: global maximum, blue line: global minimum, purple line: local maximum). |
When examining how the attention of the model adapts to different fatigue lives, short-life specimens frequently show wide loops with significant plastic behavior, causing the red line to cluster around the maximum stress region. Midrange lives exhibit partial plasticity in cycle 20, and the model may highlight both the tensile peak (Fig. 9, red line) and tension–compression transitions (Fig. 9, purple line). Long-life specimens often exhibit cyclic behavior, predominantly in the transitional regime between the elastic and plastic responses. In these cases, the effect of the plastic proportion diminishes, leading to a more uniform attention distribution and smaller local peaks without a single dominant maximum. For the heat-treated group, the loop shape was often more predictably guided by sequential hardening/softening, making the maximum stress region an important area of attention. However, the shorter-life heat specimens can exhibit substantial plastic deformation, thus reinforcing the red line peak in the early cycles.51 In contrast, the heat-HDPEC specimens, which include many long-life samples, exhibit localized deformation onsets during mid-level stress or strain, yielding a stronger purple-line local maxima. This behavior implies that even an elastic loop can contain hidden features that the model considers critical for an accurate life prediction.52
In summary, short-life specimens show prominent plastic deformation in the maximum stress regions, whereas long-life specimens exhibit transitional elastic–plastic behavior in the midrange stress or tension–compression transitions. The heat-treated specimens showed predictable hardening/softening patterns, and the heat-HDPEC specimens demonstrated an extended fatigue life with respect to localized midrange stress or strain changes.
Furthermore, unlike other studies, in which code availability and accessibility are unavailable, our research provides an open Python package (Table 3). The proposed fatigue life prediction model was developed as an open-source Python package to ensure practical usability and accessibility. The model predicts fatigue life using stress–strain data from the initial cycles (20 cycles), enabling efficient and early stage estimation. To enhance usability, the package includes automated data preprocessing functions that streamline the handling of raw stress–strain data, thereby eliminating the need for manual intervention. The optimized inference process ensured that predictions were generated in less than 5.0 seconds, as verified during multiple trials under standard testing conditions, including tests conducted on Google Colab without GPU acceleration. The tests were conducted on an Intel Xeon CPU with two vCPUs and 13 GB of RAM. The implementation and tutorial are available on GitHub (https://github.com/mschongchulshin/fatigue_life_predictor/tree/main) to ensure transparency, reproducibility, and ease of use. This package provides a robust and efficient solution for fast and reliable fatigue life predictions based on initial cycle data.
Dataset | AI model | Accuracy | Interpretability | Code availability accessibility | Ref. |
---|---|---|---|---|---|
Micro-CT images including pore size, pore density, surface roughness | Bayesian neural network | Not explicitly mentioned | Not explicitly mentioned | Not available | 11 |
Entire cycles including additive manufacturing process parameters | FNN, CNN, random forest, ANFIS | R2 > 0.97 | None explicitly mentioned | Not available | 12 |
Entire multiaxial loading cycles including strain paths and material properties | Multi-view deep learning | R2 = 0.94 | Self-attention mechanisms | Not available | 10 |
Initial single cycle of hysteresis loop | LSTM | R2 = 0.99 | Contextual-attention mechanisms | Python package provided | Our research |
Overall, this study demonstrates the potential of deep-learning-based approaches to capture the temporal properties of LCF data and accurately predict the fatigue life. Future studies should expand the applicability of the proposed methods to diverse materials and fatigue environments. Moreover, investigating advanced deep learning architectures, such as more refined attention mechanisms or transformer-based models, may further improve the capture of non-linear fatigue behaviors and cumulative damage processes, thereby enabling more robust and reliable fatigue life predictions in complex scenarios.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5ra01578b |
This journal is © The Royal Society of Chemistry 2025 |