Open Access Article
Babak Valipour Goodarzi
and
Reza Foudazi
*
School of Sustainable Chemical, Biological and Materials Engineering, The University of Oklahoma, Norman, OK, USA. E-mail: rfoudazi@ou.edu
First published on 26th May 2026
Elastoviscoplastic (EVP) materials pose significant challenges for predictive modeling, especially under large amplitude oscillatory shear (LAOS) conditions due to their nonlinear rheological behavior. While various models have been proposed to describe yielding and viscoelastic transitions, their application to experimental data remains limited due to computational complexity, poor generalizability, and difficulty in fitting noisy data. This work re-evaluates rheological modeling of EVP materials by using a physics-informed neural network (PINN) approach that embeds a modified Saramito model into the training framework. By directly fitting time-dependent stress data on typical EVP materials, this method circumvents the need for gradient estimation from noisy measurements and offers a data-efficient, differentiable, and generalizable alternative to conventional fitting methods. A shear-thinning formulation is introduced to reflect the decreasing viscosity at higher strain amplitudes. The proposed PINN approach is validated using both synthetic and experimental data, demonstrating stable recovery of physical parameters, enhanced interpretability, and improved predictive capability across a range of strain amplitudes. This framework bridges microstructural deformation modes with rheological modeling, offering a powerful tool for understanding and predicting nonlinear viscoelastic behavior in soft matter.
Fig. 1 shows a typical amplitude sweep curve of type III nonlinear behavior to illustrate the elastic, plastic, and viscous-dominant regions depending on the applied strain amplitude. The elastic-dominant region is located at low strain amplitudes where storage modulus (G′) is significantly larger than loss modulus (G″), and both are relatively strain-independent.9 In this region, material behaves like a solid, storing energy elastically, implying that the microstructure remains mainly intact.10–12 Beyond the linear, elastic-dominant regime, G′ declines while G″ increases, as the material begins to yield due to structural reformation.13,14 This intermediate regime is known as the plastic or transition region, and often extends to the crossover point where G′ = G″. Above this point, where G″ overtakes G′, the post-yield liquid-like behavior reflects dominance of viscous dissipation as the irreversible flow results from structural breakdown leading to bulk fluidization.11,15 Concentrated nanoemulsions, as one of the examples of EVP materials with type III nonlinear behavior,16 have rich structures, properties and applications,8,17,18 highlighting the importance of studying their rheology. Generally, dilute emulsions show Newtonian behavior, while emulsions with sufficiently high volume fraction of dispersed phase become viscoelastic and EVP.19
![]() | ||
| Fig. 1 Dynamic moduli of type III nonlinear behavior for a typical elastoviscoplastic material from strain amplitude sweep test in oscillatory mode. | ||
Several studies have tried to develop a proper model for mimicking the viscoelasticity,20–22 such as the Oldroyd model (Fig. 2a),23 and the Jeffrey's model (Fig. 2b).24 These models are not able to account for the nonlinearity due to yielding in EVP materials. Many of these efforts involve modified versions of the Oldroyd model to account for the yield stress.20,25,26
![]() | ||
| Fig. 2 Mechanical analogs for different models, with components arrangement for: (a) Oldroyd model, (b) Jeffrey's model, (c) Saramito model, and (d) strain-separated Saramito model. | ||
The complexity of numerical fitting is the primary challenge in the fitting of yielding in conditional discrete EVP models. Model fitting typically relies on gradient-based optimization methods, which begin with an initial set of parameter guesses and iteratively update them by evaluating whether the model satisfies the governing equations under the current parameter set. This approach also requires finite difference calculations of gradients of the data to minimize the residual of differential equations, which can become misleading when dealing with noisy data. Even if we circumvent the disturbed gradients of noisy data by directly solving the equation, a numerical solver must go through all the data points, which means the solver's iteration loop should be inside the optimization iteration loop. This can make the convergence extremely time-consuming, while the discontinuity in the discrete model may also result in diverging during solving the equation in each loop.
To address all the associated challenges with numerical fitting, we adopt a physics-informed neural network (PINN) approach for fitting constitutive equations.27,28 In recent years, PINN has emerged as an efficient mathematical tool for solving and fitting differential equations27,28 in many engineering and science fields, including material science,29,30 chemistry,31–33 fluid mechanics,34–36 and specifically rheology.37–40
Unlike classic neural network approaches, which act like a black-box predictor with no physical intuition, PINN embeds a constitutive model to set a meaningful theoretical base.27 PINN fitting algorithm uses gradient descent logic or more sophisticated variants of the gradient descent, like the Adam optimizer.41 The only difference in comparison to conventional numerical approaches is that neural networks' output can serve as an intermediate between the experimental data and the model, to alleviate the challenges related to a lack of data points or noise. The detailed instructions of this method for rheology models with sets of examples are provided by Mahmoudabadbozchelou et al.39
PINN optimizes the neural network to align its output predictions with experimental data by minimizing a corresponding loss function known as data loss. Simultaneously, it optimizes the coefficients of a user-suggested governing equation in a way that its solution matches the expected values by minimizing a corresponding loss function known as physics loss. The simultaneous optimization of these two loss functions is based on the minimization of the weighted summation of them. Unlike traditional numerical methods, which can become unstable due to differentiation of noisy data, PINNs can still converge to physically consistent solutions as regularization can prevent overfitting to the noise of data.42 Moreover, the gradient calculation for a neural network is more efficient. This is due to the fact that the neural network itself is a continuous, differentiable function, even though we only evaluate it at discrete points, and the automatic differentiation can provide all the gradient values simultaneously. In contrast, conventional finite difference approximations require step-by-step calculations, which are much slower.
Aside from the advantages of PINN, some limitations of the conventional gradient descent approach persist. First of all, an improper learning rate is prone to time-consuming convergence and skipping the global minimum of loss function, or getting trapped in local minima instead of locating the global minima.43 Another significant limitation arises from the quality of experimental data. Sparsity and noise in data can still affect the accuracy of PINN.44 Thus, PINNs require careful weighting between data-driven loss and physics-based residual loss. This balancing prevents overfitting of the neural network to noise and forces the output to follow the physics of the model, which introduces additional hyperparameters (α). Depending on the complexity of the model, feeding one dataset from one rheological measurement protocol may not be enough for the PINN to recover correct parameters for a complex model.45
Few studies used PINN or any other fitting method to directly fit rheological models to LAOS data.39,46 To the best of our knowledge, none of them ended up with a model that accurately covers various strain amplitudes with a single set of fitting parameters. In the present work, therefore, we address two challenges. First, we suggest a model that can capture physical behavior at various strain amplitudes from linear to nonlinear regions. Second, we develop a new 2-step PINN framework to fit the suggested model simultaneously to several stress datasets of different strain amplitudes. Instead of directly penalizing residuals by gradient calculation, we enforce the governing equation through a cumulative reconstruction of stress with integration, which eliminates the need for explicit initial-condition constraints commonly used in PINN formulations, leading to faster convergence. In the following sections, we present a brief review of the EVP models and the associated challenges with conceptual interpretations and numerical complexities in fitting them to measured experimental data. Then, we address these challenges by suggesting an updated version of the model and the corresponding fitting approach to overcome the limitations.
σ = η + τ
| (1) |
![]() | (2) |
only represents the relaxation of the blue box section in Fig. 2c.
Considering yield as a point where unrecoverable strain becomes effective, we can relate the yield stress of Saramito model (σ0) to τ0 as follows:
σ0 = η y + τ0
| (3) |
y is the strain rate at which the slit is activated, which is dependent on frequency and strain amplitude. Therefore, depending on the
y, σ0 changes. It should be noted that σ0 is referring to the onset of plastic flow, not the flow from the rest state.
Donley et al.21 revisited Saramito model into a strain-separable version as in Fig. 2d, where the unrecoverable yielding-controlled dashpot is placed in series to a Kelvin section. Kamani et al.49 replaced the linear unrecoverable dashpot in the same arrangement with a Herschel–Bulkley version and removed the slit. This model is known as KDR (Kamani–Donley–Rogers). Unlike the classical Saramito model, where yielding is governed by a fraction of total stress, the strain-separable versions define yielding based on the total stress, so the yield is constant in this model (in contrast to eqn (3)).
While the Saramito model is commonly used to represent EVP material, to the best of our knowledge, no study has successfully fitted this model to rheological data. For example, Cheddadi et al.50 used the Saramito model to simulate foam flow by estimating the model's parameters according to prior experiments, rather than fitting data. This approach does not use the model to predict the flow behavior but rather assesses the model's capability to explain EVP behavior. In another study, Abdelgawad et al.51 used the Saramito model with a Herschel–Bulkley unrecoverable dashpot to simulate the flow of a Pluronic F127 solution through a single periodic wavy channel, and estimated constant parameters of the model from experiments. They observed only a qualitative match of flow fields and velocity profiles that does not fully align with the experimental data. Fraggedakis et al.26 tried to fit a Carbopol data to the Saramito using a nonlinear regression approach by minimizing the error between experimental and simulated dynamic moduli, rather than directly fitting the model to stress data. However, this approach failed, and to address this issue, they replaced the linear slit-controlled dashpot with an isotropic-kinematic hardening model (IKH model).52 While they achieved a satisfactory fitting, this approach only considers the first harmonic of stress when fitting the model to G′ and G″ in the nonlinear region of the data. Even for the strain-separable Saramito and KDR, model parameters to predict LAOS curves were estimated from linear viscoelastic moduli and steady-shear flow curves,21,53 which do not appear to accurately fit nonlinear stresses.
As we show in the results and discussion section, there is no unique single set of parameters for the model to represent the real type III materials at all strain amplitudes. This implies that the assumption of constant model parameters across different strain amplitudes is invalid, as the underlying microstructure formation and deformation may be strain dependent. There are some experimental findings discussing the structural changes under high strains/stresses. For example, the Payne effect21,54 leads to a decrease in the elasticity and enhanced plastic flow with strain amplitude as the microstructure undergoes progressive breakdown. In the classic view, the Saramito model assumes a constant modulus for all elements to represent energy storage and loss, which is not consistent with microstructural changes. In other words, changes in the viscoelasticity of EVP materials cannot be solely attributed to variations in the contributions of linear springs and dashpots. The need for parameter adjustment reflects the model's inability to inherently capture the continuous evolution of viscoelastic and plastic contributions across different deformation amplitudes.
To extend the Saramito model to a more comprehensive version, we suggest a new approach in updating the model. First, we propose a physically meaningful correspondence between each component of the Saramito model and the underlying structural phenomena that govern rheology. Table 1 lists different phenomena for storing or dissipating energy in each element of the Saramito model. When the material structure is distorted reversibly in emulsions19 both energy and strain remain reversible upon removal of the stress load. This reflects elastic behavior, which can be represented by the spring element. In contrast, some processes involve energy dissipation without contributing to permanent deformation, like the rolling of particles over each other in colloidal materials,19 or the local vacancy diffusion of particles into neighboring voids.55 These phenomena are captured with the recoverable-strain dashpot (i.e., the dashpot in parallel to the spring). The rest of the irreversible structural breakdowns lead to simultaneous energy dissipation and unrecoverable deformation that are represented by the unrecoverable dashpot, which is activated above the yielding point.
| Element | Examples |
|---|---|
| Spring (G) | • Reversible colloidal deformation56 |
| • Reversible structure distortion19 | |
| • Reversible stretching and recoiling of polymer chains57 | |
| • Swelling tendency of hydrogels due to osmotic pressure58 | |
| Recoverable dashpot (η) | • Diffusion to void spaces55 |
| • Rolling of particles/colloids59 | |
| • Polymer reptation60 | |
| Unrecoverable dashpot (ηp) | • Irreversible structure breakdown61,62 |
| • Mutual displacement of large blocks63 | |
| • Sliding64 | |
| • T1 neighboring switch in foams65,66 |
Agrawal et al.67 analyzed the effect of two different values of η = 500 Pa s extracted from the linear viscoelastic region and hypothetical η = 1 Pa s in the Saramito and KDR model, while keeping other parameters of the model fixed to study the accuracy of the model in predicting LAOS stress of Pluronic F127 hydrogel. The result showed that η = 500 Pa s is accurate in the linear region but overpredicts stress at large amplitudes, while with η = 1 Pa s it underpredicts the stress at low amplitudes. This can be interpreted in a way that dissipation mechanisms, such as droplet rolling, are more significant at low strain amplitudes or linear regimes. Such observations indicate that assuming a fixed value of η is a major disadvantage of these models, preventing them from accurately representing the reality of type III materials.
In this work, we suggest a nonlinear strain rate thinning recoverable dashpot (η(
)). This modified model will remain thermodynamically consistent with respect to the second law of thermodynamics, regardless of how η(
) function is defined. Based on applied power (P) and stored energy (S) the model must satisfy:
| D = P − Ṡ≥ 0 | (4) |
![]() | (5) |
![]() | (6) |
P = σ = τ + 2η( )
| (7) |
![]() | (8) |
), the model remains thermodynamically consistent. Several candidate functions can offer shear rate thinning behavior, such as empirical power decay functions,68 like Carreau fluid model, where the viscosity decreases smoothly from a low-rate plateau to a high-rate plateau.69 In the power functions, the power index can adjust the curvature for accurate fitting. However, the trade-off is that this additional index makes the model more complicated. In contrast, an exponential decay form can be viewed as a simplified monotonic decay with fixed curvature, chosen as a lower-parameter alternative. Moreover, in natural processes, Euler's number e ≈ 2.71828 appears wherever growth or decay is continuous and proportional to the current state.70 This proportionality in natural phenomena makes their mathematical descriptions commonly exponential, e.g., in biological growth,71–73 physical decays,74–76 and chemical reactions.77,78 Accordingly, the exponential trend commonly exists in rheology, where the exponential stress relaxation is a well-established concept.79,80
For modified Saramito model, we redefine η of the recoverable dashpot in Fig. 2c (black colored dashpot) as eqn (9) to capture the underlying continuous structure breakdown, which causes unrecoverable dashpot effectiveness to diminish.
![]() | (9) |
* are the zero-shear viscosity, infinite-shear viscosity, and critical shear rate respectively. The general form of the updated model is still similar to the classic Saramito model and can be fitted by PINN in a similar way. The only difference is that instead of fixed values of η, there are different corresponding values at each specific strain rate. For further simplicity and parameter reduction, we fix η∞ = 0 since we expect the failed structure under strain to dominantly exhibit unrecoverable energy dissipation, and the recoverable dashpot contribution disappears. While the classic Saramito switches between Kelvin and Oldroyd below and above yield, imposing a decaying recoverable dashpot makes the model softly switch between Kelvin and Maxwell, with a gradual Oldroyd transition in between. However, there is still the sharp activation of constant ηp, decided by the constant τ0. Even though in reality there is no sharp yield transition, the simplistic view is that at some point the structure fails irrecoverably, which implies yielding.19,81 In the Results and discussion section, we show that this modification to the model enables it to represent oscillatory behavior from linear to nonlinear regions of strain amplitude.
We have implemented two customized PINN algorithms for fitting the Saramito model, which is detailed in the Appendix. The Python script is also available on GitHub, based on the PyTorch library. The first algorithm is the simple PINN fitting of the classical Saramito model to stress data at a single strain amplitude, shown in Fig. 3. The second one, presented in Fig. 4, is a two-step approach for the simultaneous fitting of the our modified Saramito model to stress data at several strain amplitudes. In the Results and discussion, we describe why this approach is essential to get meaningful fitting.
, τ0 = 75 Pa, ηp = 20 Pa s, and η = 10 Pa s for the strain amplitude of 0.5 and frequency of 10 rad s−1. The fitting to stress vs. time is acceptable, and all fitting variables converge towards their corresponding expected original reference values. Additional plots are presented in the supporting information (SI). Fig. S1 shows the optimization of total loss with respect to epoch progress. Fig. S2 shows the optimization of fitting parameters of the Saramito model with respect to epoch progress.
To investigate the influence of the weighting factor of α on the total loss function (data loss + α × physics loss) on PINN performance, we compared their fitting results after 10
000 epochs using four values of α in the range of 0.6 to 1.5 in Table 2. Despite variations in total loss, the recovered values of λ, η, ηp, and τ0 remained relatively stable across different α values, showing only slight deviations. This suggests that the general framework of the PINN is sufficiently robust and stable for parameter estimation. Notably, α = 1.5 slightly compromises parameter accuracy (e.g., ηp = 16.10 vs. the original model value of 20), which can be attributed to the lower contribution of data loss in optimization, leading to a slightly off-fitted neural network. On the other hand, α = 0.6 produced the lowest loss with estimated parameters very close to the expected values, suggesting it may provide the most reliable trade-off between data fitting and physical model constraint enforcement. Fig. S3 compares the loss and elastic moduli of the original model and PINN fitted model in amplitude sweep results, confirming stable convergence of PINN regardless of various α values.
000 epochs
| Predefined coefficients | α = 1.5 | α = 1.0 | α = 0.8 | α = 0.6 | |
|---|---|---|---|---|---|
| Loss = 4.83 × 10−4 | Loss = 1.486 × 10−4 | Loss = 6.036 × 10−5 | Loss = 5.177 × 10−5 | ||
| λ (s) | 0.05 | 0.0405 | 0.0453 | 0.0470 | 0.0462 |
| η (Pa s) | 10 | 10.25 | 10.80 | 10.60 | 10.61 |
| ηp (Pa s) | 20 | 16.10 | 17.61 | 18.37 | 18.04 |
| τ0 (Pa) | 75 | 84.71 | 81.18 | 79.48 | 80.18 |
Aside from the advantages of PINN, some limitations of the conventional gradient descent approach persist. First, a small learning rate leads to slow convergence, and a large learning rate can skip the minimum value. The loss landscape may also have multiple local minima, making it challenging for gradient descent algorithms to locate the global minimum consistently.43 Additionally, several possible fits of a complex model for a single set of experimental data may exist. Thus, the fitting algorithm may not accurately recover the original parameters.45 As an example, for the synthetic data from the parameter sets of
, τ0 = 25 Pa, ηp = 10 Pa s, and η = 20 Pa s, the single Lissajous curve at the strain amplitude of 0.5 is not informative enough for recovering the fitting parameters because the fitting ends up with a simpler model expression where τ0 converges to zero (see Table 3 and Fig. S4), even though the Fig. 5c and d suggests satisfactory stress prediction. In other words, the PINN fitting parameters with τ0 = 0 Pa result in a stress function, which is numerically indistinguishable from the original synthetic data.
| Predefined coefficients | Epoch = 10 000 (8 seconds) |
Epoch = 50 000 (40 seconds) |
Epoch = 90 000 (72 seconds) |
|
|---|---|---|---|---|
| Loss = 2.531 × 10−4 | Loss = 1.441 × 10−4 | Loss = 1.428 × 10−4 | ||
| λ (s) | 0.1 | 0.078 | 0.088 | 0.079 |
| η (Pa s) | 20 | 13.71 | 12.94 | 11.82 |
| ηp (Pa s) | 10 | 14.48 | 17.86 | 18.20 |
| τ0 (Pa) | 25 | 14.48 | 0.00 | 0.00 |
An important observation is when τ0 is comparable to the induced stress in the spring, a more pronounced yield transition is observed during the oscillatory cycle at large strain amplitudes. The yielding produces stronger distortions in the stress–strain Lissajous curves. Physically, a higher τ0 delays the onset of plastic flow, resulting in more abrupt yielding once the stress threshold is exceeded. In contrast, decreasing τ0 reduces the nonlinearity and numerical resolution for fitting the parameters of the model.
This fitting failure is not an issue only associated with the PINN, but a general limitation of any fitting approach when the data is not informative for the corresponding model. As a solution, we suggest a two-step fitting to two separate linear and nonlinear oscillatory stress datasets of a sample. In this approach, G and η can be obtained by fitting the linear data with the Kelvin–Voigt model, which is equivalent to the unyielded version of the Saramito model. Then, by fixing these obtained parameters in the Saramito model, we proceed to PINN fitting of the nonlinear yielding oscillatory stress data for obtaining τ0 and ηp. Table 4 compares the original and fitting parameters by this two-step approach for two examples with different values of λ. By capturing recoverable elements of the model separately from the linear rheological data, the approach is less prone to misleading fitting of nonlinear behavior and getting trapped in a local minimum. Furthermore, it exhibits faster performance in view of less calculation at each epoch and also the fewer required epochs for PINN to fit only τ0 and ηp parameters.
000 epochs with a two-step approach for two sets of parameters
| Predefined coefficients | α = 0.6 | Predefined coefficients | α = 0.6 | ||
|---|---|---|---|---|---|
| Loss = 5.13 × 10−6 | Loss = 7.48 × 10−5 | ||||
| λ (s) | 0.1 | 0.114 | λ (s) | 0.05 | 0.051 |
| η (Pa s) | 20 | 20.00 | η (Pa s) | 20 | 20.00 |
| ηp (Pa s) | 10 | 11.42 | ηp (Pa s) | 10 | 10.10 |
| τ0 (Pa) | 25 | 24.46 | τ0 (Pa) | 25 | 25.11 |
To demonstrate the advantage of PINN fitting, it is compared with conventional fitting methods, such as gradient descent, in Table S2. The conventional fitting methods led to divergence due to severe noise in the data because it introduces drastically misleading values either by differentiation or integration during residual checking. It should be noted that by adjusting a good initial guess, Levenberg–Marquardt can converge for some noisy examples, but remains unstable, ending up with unreasonable infinite values. The comparison of fitting approaches for nanoemulsion sample at 10 rad s−1 and the strain amplitude of 0.5 is provided in Table S1.
To assess the Saramito model's predictivity for real type III materials, we fit the data at the strain amplitude of 0.5 and then compared the model's prediction with experimental data at lower and higher strain amplitudes, as shown in Fig. 6b. The model significantly underpredicts the stress magnitude below the strain amplitude of 0.5 and overpredicts the stress magnitude above 0.5 strain. However, a meaningful fitted model should be able to predict unseen experimental data, with the same set of parameter constants. Otherwise, it is not useful for simulations and practical applications.
To come up with a meaningful fitted model for all strain amplitudes, the two-step fitting can be an appropriate approach, as worked for synthetic data. One other possible solution might be to fit the model to all strain amplitudes at the same time with a cumulative loss function. However, this approach does not lead to convergence during the fitting process, indicating that the Saramito model is inherently unable to accurately capture the flow behavior across the entire range of strain amplitudes.
In the next steps, we fit the modified Saramito model which assumes a decaying recoverable dashpot as in eqn (9). The increased number of parameters complicates the fitting due to the exponential decay function. Therefore, we use an enhanced version of the 2-step fitting framework as shown in Fig. 4. First, G and η0 can be fitted to the model for very small strain amplitudes in the linear region, e.g. 0.01. As discussed in the previous section, this approach ensures fitting the unyielded regime with the unyielded model. The remaining parameters can be fitted using separate PINNs, but with an aggregated loss function for several strain amplitudes in the nonlinear regions. Here, we consider an equally weighted culminative loss function, so optimization sensitivity to each strain amplitude is equal for all of them. Obviously, based on this cumulative fitting approach and more complex model, a meaningful convergence requires more epochs and results in larger values of the loss function.
Fig. 7a shows the fitting of Saramito with the updated decaying recoverable dashpot to the samples of the nanoemulsion with 50
000 epochs (120 seconds). In addition to the enhanced fitting of the modified Saramito model, the 2-step approach offers a fair fit for the wide range of linear to non-linear regimes with a single set of parameters. However, there is a slight off prediction at the strain amplitude of 0.1, where the transition from elastic to plastic regime happens. This is due to the assumption of a constant unrecoverable dashpot and the sharp yielding transition, which is a trade-off for avoiding over-parametrization.
![]() | ||
| Fig. 7 Fitting of (a) modified Saramito model with decaying recoverable dashpot; (b) classical Saramito model, and (c) KDR model to nanoemulsion data. | ||
In Fig. 7b and c, we also included fitting to the classical Saramito and KDR model as two important EVP models. We explained the equations of the KDR model in supporting information (SI). By adopting a similar 2-step fitting approach for them to fix the fitted parameter from linear data and fitting the rest of the parameters to nonlinear datasets, both classical Saramito and KDR models fail to converge meaningfully. The issue with these models is that enhancing the accuracy of fitting in the linear region decreases accuracy in the nonlinear region fitting, and vice versa.
To assess the uncertainty of the modified Saramito, a bootstrap analysis was employed. In this approach, we redistribute the error of each datapoint to generate new noisy data and check how the fitted parameter changes. Performing five bootstraps, the stage-1 parameters, G and η0, remained unchanged across bootstrap replicates. Among the stage-2 parameters, the yield stress τ0, plastic viscosity ηp, and rate-softening parameter
* showed relatively small variability across the bootstrap runs. According to Table 5, τ0 remained within a narrow range of approximately 204–218 Pa, while ηp varied between about 22.4 and 25.7 Pa s. These results suggest that the nonlinear constitutive parameters are reasonably robust to the noise level present in the experimental data, although this should be interpreted as a local uncertainty estimate rather than a full global identifiability analysis. It should be noted that this uncertainty is dependent on the state of the model. For example, if one dashpot dominates the stress signal, it will increase the uncertainty for recovering other parameters.
000 epochs with a two-step approach for two sets of parameters
| Parameter | Mean | Standard deviation | 95% Lower confidence interval | 95% Upper confidence interval |
|---|---|---|---|---|
| G (Pa) | 9664.82 | 0.00 | 9664.82 | 9664.82 |
| η0 (Pa s) | 160.38 | 0.00 | 160.38 | 160.38 |
| η∞ (Pa s) | 0.00 | 0.00 | 0.00 | 0.00 |
* (s−1) |
2.28 | 0.12 | 2.16 | 2.41 |
| ηp (Pa s) | 24.01 | 1.32 | 22.41 | 25.75 |
| τ0 (Pa) | 211.20 | 5.93 | 203.92 | 217.71 |
In Fig. 8, we showed two more examples of fitting the modified Saramito to the kraft real mayonnaise sauce and the pepsodent complete care toothpaste. The strain amplitudes of the input data for the two-step fitting must capture the linear to completely yielded regimes. Thus, the selection of the strain amplitudes depends on the material. For example, for the mayonnaise, we considered a strain amplitude of 2 to cover the fully yielded regime. The pronounced yield in data is the informative pattern for fitting the model. The model shows a fair fit for the mayonnaise as a type III soft matter. We also included the toothpaste example as a type I material, for which the model still provided an acceptable approximation, but with lower accuracy at higher amplitudes due to smaller yields.
![]() | ||
Fig. 8 Fitting of the modified Saramito to (a) kraft real mayonnaise (50 000 epochs), and (b) pepsodent complete care toothpaste (80 000 epochs). | ||
Having a single set of parameters for modeling EVP materials in all strains provides the opportunity for systematic studies and simplicity for interpretation. The current model is inherently frequency-dependent, due to the presence of combined spring and dashpots. However, it does not consider the direct functionality of dashpots to frequency. Accordingly, for further improvement, we need to embed the frequency dependence in the elements. In general, soft materials exhibit weak frequency dependence.16,82 One possible approach for our future work is adopting the strain rate frequency superposition introduced by Wyss et al.82 By using this approach, we can approximate the response at other frequencies by altering strain amplitude.
τM-NN = σNN − η
| (1A) |
![]() | (2A) |
term mimics yield-like behavior if |τM-NN| is bigger than τM0. Otherwise, this term becomes zero. Having
At each datapoint, a cumulative summation results in τmodel values while complying with the initial value. Then, we can back-calculate total stress (σmodel), which is still entirely differentiable for straightforward gradient-based optimization after passing through elementary tensor operations. Thus, the model is not directly solved, but stress is being checked to see if it is satisfying the model regarding the initial value. This approach is faster than calculating
from autograd command and checking the residual of the equation. Moreover, we do not need additional loss term to enforce the initial condition, but inherently it is embedded in the integration.
| Lossdata = MSE(σNN, σ) | (3A) |
| Lossphy = MSE(τNN, τmodel) + MSE(σmodel, σNN) | (4A) |
In case of clean and noiseless experimental data, MSE(σmodel, σNN) leads to faster convergence as it provides an accurate reference of convergence from the first epoch. Finally, the total loss will be a weighted summation of data and physics losses. By considering the weight factor α for the physics loss, we can control how strongly the PINN enforces governing equations versus fitting observations during minimization of total loss, ultimately improving training stability.83
| Total loss = Lossdata + αLossphy = Lossdata + α(Lossphy1 + Lossphy2) | (5A) |
, the PINN algorithm will update the τM0, λ, η, ηp, and σNN at each epoch with the Adam optimizer to minimize the Total Loss. In the case of simultaneous fitting, we add the total loss of all separate datasets together for the optimization.
![]() | (6A) |
![]() | (7A) |
![]() | (8A) |
![]() | (9A) |
![]() | (10A) |
![]() | (11A) |
The Python code is also available on GitHub. The code was run on the G4 GPU of the Google Colab service. In terms of training duration, we observed that every 10
000 epochs requires approximately 8 seconds for the single fitting and 24 seconds for the two-step fitting.
| This journal is © The Royal Society of Chemistry 2026 |