Open Access Article
Sobin Alosious
ab,
Yuhan Liu
c,
Jiaxin Xu
b,
Gang Liu
d,
Renzheng Zhang
b,
Meng Jiang
ad and
Tengfei Luo
*abef
aLucy Family Institute for Data and Society, University of Notre Dame, Notre Dame, IN 46556, USA. E-mail: tluo@nd.edu
bDepartment of Aerospace and Mechanical Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
cDepartment of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN 46556, USA
dDepartment of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
eDepartment of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, IN 46556, USA
fCenter for Sustainable Energy at Notre Dame (ND Energy), University of Notre Dame, Notre Dame, IN 46556, USA
First published on 17th June 2026
The discovery of polymers with targeted properties is challenged by the vast chemical design space and the limited availability of consistent, high-quality data across multiple properties. In this work, an integrated polymer informatics framework is presented that combines the automated molecular dynamics engine for polymer simulations (ADEPT) workflow with multi-task and multi-fidelity machine learning (PolyGraphMT). Polymer repeat units are represented as molecular graphs and processed using a graph neural network to learn structure–property relationships. Starting from SMILES representations for monomers, ADEPT automates the construction of atomistic models and the evaluation of their properties using molecular dynamics simulations and density functional theory calculations. The simulation data are combined with curated experimental data and group contribution theory estimates to construct a unified dataset of approximately 62
000 polymer property values spanning 28 properties across thermal, mechanical, transport, electronic, optical, and structural properties. Using this dataset, inter-property correlations are analyzed, and multi-task learning strategies are evaluated for joint property prediction. The results show that multi-task models achieve performance comparable to single-task models in data-rich regimes and exhibit superior accuracy as training data become limited. In addition, fidelity-aware learning demonstrates improved agreement with experimental data for representative multi-fidelity properties. The trained models are further applied to large-scale property prediction for polymers in the PolyInfo database (∼13
000 polymers) and the PI1M virtual polymer library (∼1 million polymers), producing physically consistent property distributions across a broad chemical space. Overall, the proposed framework provides a structured approach for scalable prediction and screening of polymer properties across multiple property types and data fidelity levels.
Data availability is a central requirement for materials informatics, as machine learning (ML) models rely critically on the quality, quantity, and diversity of training data.8,9 In inorganic materials science and small-molecule chemistry, large open databases derived from high-throughput first-principles calculations and curated experimental measurements, such as the Materials Project, AFLOW, OQMD, and QM9, have played a decisive role in advancing data-driven materials discovery.10–13 In the polymer domain, dedicated databases such as PolyInfo and Polymer Genome have enabled important progress by compiling experimental measurements and first-principles data for selected properties.7,14 While these databases have enabled important progress in polymer informatics, the underlying data remain highly sparse and heterogeneous. For most polymers, only a limited number of properties are reported, often measured under different processing histories, molecular weights, and testing conditions.
Beyond database development, recent advances in polymer informatics have also focused on polymer representations, large-scale generative design frameworks, and foundation-model-inspired learning approaches. Several studies have emphasized the importance of chemically meaningful polymer representations for enabling scalable ML workflows. In particular, the BigSMILES formalism introduced a stochastic extension of SMILES specifically designed to describe macromolecular structures, including copolymers and branched architectures, while preserving compatibility with text-based cheminformatics pipelines.15 More recently, open polymer data infrastructures such as CRIPT have further highlighted the need for standardized and interoperable polymer data representations for community-scale informatics efforts.16 At the same time, generative and language-model-based approaches have rapidly expanded the scope of AI-driven polymer design. Jackson and co-workers introduced the open macromolecular genome (OMG), a large-scale polymer database and generative framework designed to enable synthetically accessible polymer discovery through reaction-aware generative modeling.17 In parallel, Ramprasad and co-workers developed polymer language-model frameworks such as polyBERT, which treat polymer SMILES representations as a chemical language and enable fully machine-driven polymer informatics pipelines through transformer-based representation learning.18 Earlier efforts from the same group also demonstrated the effectiveness of multi-task learning and multi-fidelity learning strategies for polymer property prediction using heterogeneous experimental and computational datasets.19,20 A variety of polymer representation strategies have additionally been proposed for polymer informatics, including periodicity–aware graph encodings,21 polymer fingerprinting and descriptor-based approaches,19 and simpler SMILES-derived molecular descriptors. While these representations differ in their treatment of polymer connectivity and periodicity, many have demonstrated competitive performance across polymer property prediction tasks. The primary focus of the present work is therefore not the development of a fundamentally new polymer representation scheme, but rather the integration of heterogeneous multi-property and multi-fidelity datasets within a unified graph-learning framework. Consequently, the overall workflow is expected to remain compatible with alternative polymer representation strategies. Together, these studies highlight the rapidly evolving ecosystem of polymer representations, databases, generative models, and transferable learning frameworks that are shaping modern polymer informatics.
In addition, recent studies have demonstrated the feasibility of high-throughput molecular dynamics (MD) workflows for generating polymer property data at scale.22,23 Hayashi et al.23 introduced RadonPy, an open-source Python framework that enables fully automated, high-throughput all-atom MD simulations for polymer property prediction. The workflow integrates polymer structure generation from SMILES representations, force-field assignment, equilibration, equilibrium and nonequilibrium MD simulations, and automated trajectory post-processing within a unified pipeline, enabling systematic property generation directly from chemical structure.
More recently, Yoshida et al.24 extended such automated simulation workflows toward the construction of PolyOmics, an omics-scale computational polymer database generated using fully automated MD pipelines. The database comprises physical property data for over 105 polymeric materials and includes thermal, mechanical, dielectric, and transport properties computed under standardized simulation protocols. PolyOmics further demonstrated that ultralarge computational polymer datasets can be effectively leveraged for machine learning through simulation-to-real (Sim2Real) transfer learning, where models pretrained on computational data are fine-tuned using limited experimental measurements. The study additionally reported power-law scaling behavior in prediction accuracy with increasing dataset size, highlighting the potential of ultralarge simulation datasets as foundational resources for polymer informatics.
At the same time, it is well established that simulation-derived properties, particularly those obtained from classical MD, often exhibit systematic deviations from experimental measurements due to force-field limitations, finite-chain effects, incomplete equilibration, and the classical treatment of atomic vibrations.25,26 Nevertheless, MD predictions frequently preserve strong correlations with experimental trends for multiple polymer properties, including thermal conductivity (κ), specific heat capacity (Cp), density (ρ), and glass transition temperature (Tg), across chemically diverse polymer classes.27–30 This suggests that simulation data can serve as informative lower-fidelity proxies for learning global structure–property relationships, even when systematic offsets relative to experiment remain present.
ML has emerged as a powerful framework for predicting polymer properties directly from chemical structure, enabling rapid evaluation of materials that would be infeasible to characterize experimentally or computationally at scale.8 Single-task learning approaches, in which independent models are trained for individual properties, have demonstrated promising accuracy for targets such as Tg, elastic moduli, and dielectric response.31 However, these models are inherently data-inefficient for sparsely sampled properties and do not exploit the strong correlations among polymer properties.32
Multi-task learning addresses these limitations by learning shared representations across multiple tasks, enabling information transfer between related properties. Kuenneth et al.19 demonstrated that multi-task neural networks trained on a dataset comprising 36 polymer properties for approximately 13
000 polymers consistently outperform single-task models, with error reductions of up to 20–30% for sparsely sampled properties. Similarly, Queen et al.33 developed POLYMERGNN, a graph neural network framework for multi-task prediction using an experimentally curated dataset of over 240 synthesized polyesters, achieving predictive performance of R2 ≈ 0.72 for Tg and R2 ≈ 0.70 for intrinsic viscosity under five-fold cross-validation. Although this work focused exclusively on experimental data, it demonstrated the effectiveness of shared latent representations for learning correlated polymer properties.
Despite these advances, polymer property datasets remain inherently heterogeneous, combining experimental measurements with MD, density functional theory (DFT), and group contribution (GC) data that differ substantially in fidelity. Treating all data sources as equally reliable can bias learned models toward abundant but lower-fidelity data, while discarding computational data limits chemical coverage. Although multi-fidelity learning strategies have been explored in broader materials science contexts, their systematic integration with multi-task learning for polymer property prediction remains limited.34–36
The present work differs from existing Sim2Real transfer-learning frameworks in several important ways. PolyOmics primarily focuses on pretraining ML models on large homogeneous MD datasets followed by fine-tuning on limited experimental data for downstream tasks.24 In contrast, the framework developed here is designed around the joint integration of heterogeneous datasets spanning multiple physical domains, properties, and fidelity levels within a unified multi-task graph-learning framework. Rather than treating simulation and experiment through a sequential pretraining-fine-tuning paradigm alone, the present approach incorporates experimental measurements together with multiple computational data sources of differing physical accuracy and computational cost, including DFT, MD, and GC estimates, through fidelity-aware weighted optimization and task-grouped multi-property learning. Importantly, not all properties contain the same fidelity combinations, and the effective multi-fidelity coverage therefore remains strongly property-dependent. This formulation enables the model to exploit both cross–property correlations and complementary information across heterogeneous fidelities while remaining scalable to diverse polymer property domains.
In this work, we introduce ADEPT, an automated molecular dynamics engine for polymer simulations, for systematic generation of polymer properties from chemical structure, and integrate it with a newly developed multi-task, multi-fidelity ML framework (referred to here as PolyGraphMT) for large-scale polymer property prediction. ADEPT is designed for high-performance computing, enabling massively parallel, high-throughput property calculations using property-specific MD workflows, including both equilibrium and nonequilibrium methods. It combines atomistic MD and DFT calculations with curated experimental data and GC estimates to construct a heterogeneous dataset of approximately 62
000 data points spanning 28 properties across thermal, mechanical, transport, electronic, optical, and structural domains.
Using this dataset, we analyze inter-property correlations, evaluate task grouping strategies, and quantify the benefits of multi-task learning under varying data availability while accounting for data fidelity. The PolyGraphMT framework represents polymer repeat units as molecular graphs and employs graph neural networks to learn structure–property relationships across multiple properties and fidelity levels.
The integrated ADEPT-PolyGraphMT framework enables (i) simultaneous multi-property prediction, (ii) fidelity-aware learning across heterogeneous data sources, and (iii) scalable screening over large polymer spaces. The resulting models enable unified prediction and consistent screening across both experimental polymer databases (∼13
000 polymers) and large virtual polymer libraries (∼1 million polymers), which is not achievable with existing fragmented workflows. This establishes a scalable framework for data-driven polymer discovery by integrating property generation, model training, and large-scale screening. The overall framework integrates the ADEPT simulation engine with the PolyGraphMT learning architecture.
Each polymer is represented by its repeat-unit chemical structure and may be associated with one or more target properties evaluated at different fidelity levels. Let i ∈ {1, …, N} index polymers,
index target properties (for example, Cp, κ, and Tg), and
index data fidelity levels corresponding to experimental, MD, DFT, or GC sources.
For a given polymer i, property p, and fidelity level f, the observed value is denoted as yi,p(f). In practice, only a subset of all possible (i, p, f) combinations is available, leading to a sparse, heterogeneous supervision structure. The learning objective is to construct a predictive model that leverages shared information across related properties and fidelity levels, while treating experimental data as the highest-fidelity reference during model training and evaluation.
, where
denotes the set of atoms (nodes) and
denotes the set of covalent bonds (edges). Node features encode atomic identity and local chemical attributes, while edge features represent bond connectivity and bond order.This graph-based representation preserves the chemical structure of the repeat unit and enables the use of message-passing neural networks to learn structure–property relationships directly from molecular connectivity.38
The graph representation used for machine learning is constructed from standardized polymer repeat units rather than explicitly polymerized multi-chain structures. Consequently, message passing occurs only within the repeat-unit molecular graph and does not explicitly propagate across covalently connected repeat units in a full polymer chain. To reduce ambiguities associated with terminal capping or monomer fragmentation, polymers are represented using repeat-unit SMILES containing explicit polymerization points denoted by *. Finite-chain atomistic polymer models are generated separately within the ADEPT workflow for MD simulations, with chain lengths selected to balance computational tractability and approximate convergence for high-throughput property calculations.
to a fixed-dimensional latent representation
. This mapping is expressed as
![]() | (1) |
The encoder comprises multiple graph convolution layers in which atomic feature vectors are iteratively updated through neighborhood aggregation. At each layer, atomic representations are updated using information from directly bonded neighbors, followed by nonlinear activation and normalization. After message passing, a global pooling operation aggregates the node-level representations into a single polymer-level embedding zi. This shared embedding serves as the common input for all downstream property prediction tasks and follows standard message-passing neural network implementations.38 Additional implementation details of the network architecture and training configuration are provided in the Methods section (see Methods – PolyGraphMT Framework).
| ŷi,p = gp(zi), | (2) |
All prediction heads operate on the same encoder output zi, which enforces a shared representation across tasks while allowing property-dependent nonlinear transformations in the final layers. This design supports information sharing among related properties without requiring identical functional forms for different prediction targets.39
This formulation incorporates heterogeneous data sources without introducing separate prediction branches or explicit bias-correction terms for individual fidelity levels. Differences in data fidelity are handled through loss weighting during optimization, allowing lower-fidelity data to contribute to representation learning while higher-fidelity experimental data exert a stronger influence during model fitting and evaluation. Detailed formulations of the loss function, normalization strategy, and training procedures are provided in the Methods section (see Methods – PolyGraphMT Framework: Multi-Task Multi-Fidelity Learning).
Accordingly, the present framework represents a weighted heterogeneous learning strategy rather than an explicit discrepancy-learning or fidelity-mapping architecture commonly used in parts of the multi-fidelity learning literature. The current implementation emphasizes shared representation learning across heterogeneous fidelity sources using a common property-specific prediction head and fidelity-aware loss weighting.
Table 1 summarizes the polymer property dataset used in this work, comprising approximately 62
000 data points spanning thermal, mechanical, transport, gas permeability, electronic/optical, and structural properties. The dataset integrates information from multiple sources, including experiments, MD, DFT, and GC estimates, and covers a wide range of property magnitudes with highly variable data availability across tasks. The thermal subset includes Tm, Tg, αT, κ, and Cp, with Cp and Tg having the most extensive coverage and the most diverse fidelity composition. Mechanical properties (E, G, K, and ν) and transport properties (η and D) are obtained from MD simulations. Gas permeability properties (PHe, PH2, PCO2, PN2, PO2, and PCH4) are derived from experiments and span multiple orders of magnitude. Electronic and optical properties computed at the DFT level include α, EHOMO, ELUMO, Eg, µ, and Etotal, which are combined with MD-derived information to estimate bulk properties (n, ε, and εr). Structural and physical properties (Rg and ρ) provide direct links between polymer conformational statistics, packing behavior, and macroscopic properties. Overall, the large number of target properties, heterogeneous data fidelity, and strongly imbalanced data availability across properties (Table 1) motivate the adoption of a unified multi-task, multi-fidelity ML framework. Although the overall dataset spans 28 polymer properties across multiple experimental and computational sources, the fidelity coverage and overlap structure remain strongly property-dependent. Substantial multi-fidelity overlap is primarily available for properties such as specific heat capacity (Cp), glass transition temperature (Tg), thermal conductivity (κ), and density (ρ), whereas several mechanical properties are predominantly MD-derived and many electronic or optical descriptors are primarily obtained from DFT calculations. Even for representative multi-fidelity properties, only a subset of polymers contains both experimental and lower-fidelity computational data, while many polymers remain unique to a single fidelity source. A detailed property–fidelity availability matrix together with overlap statistics for representative multi-fidelity properties is provided in the SI (Table S1 and Fig. S1).
| Property | Symbol | Unit | Source | Points | Data range |
|---|---|---|---|---|---|
| Thermal | |||||
| Melting temperature | Tm | K | Exp | 3671 | [210.6–873.1] |
| Glass transition temperature | Tg | K | Exp/MD | 7360 | [134.1–768.1] |
| Thermal diffusivity | αT | m2 s−1 | MD | 799 | [3.5 × 10−8–8.3 × 10−7] |
| Thermal conductivity | κ | W m−1 K−1 | MD/Exp | 2327 | [0.002–1.59] |
| Specific heat capacity | Cp | J kg−1 K−1 | Exp./MD/GC | 13 104 |
[439.2–2831.7] |
![]() |
|||||
| Mechanical | |||||
| Young's modulus | E | GPa | MD | 1012 | [0.36–11.04] |
| Shear modulus | G | GPa | MD | 1012 | [0.12–4.2] |
| Bulk modulus | K | GPa | MD | 1017 | [0.72–10.9] |
| Poisson ratio | ν | — | MD | 1012 | [−0.17–0.48] |
![]() |
|||||
| Transport | |||||
| Viscosity | η | Pa s | MD | 704 | [9.5 × 10−5–0.11] |
| Diffusivity | D | cm2 s−1 | MD | 700 | [2.19 × 10−9–0.096] |
![]() |
|||||
| Gas permeability | |||||
| He permeability | PHe | Barrer | Exp | 466 | [0.05–17800] |
| H2 permeability | PH2 | Barrer | Exp | 511 | [0.02–36800] |
| CO2 permeability | PCO2 | Barrer | Exp | 756 | [1.2 × 10−6–47000] |
| N2 permeability | PN2 | Barrer | Exp | 798 | [1.6 × 10−4–16600] |
| O2 permeability | PO2 | Barrer | Exp | 807 | [7.0 × 10−7–18700] |
| CH4 permeability | PCH4 | Barrer | Exp | 683 | [4.1 × 10−4–35000] |
![]() |
|||||
| Electronic/optical | |||||
| Polarizability | α | a.u | DFT | 2036 | [1.9–70.3] |
| HOMO energy | EHOMO | eV | DFT | 2916 | [−13.4–−6.84] |
| LUMO energy | ELUMO | eV | DFT | 2916 | [−3.01–3.08] |
| Band gap | Eg | eV | DFT | 2916 | [5.4–16.7] |
| Dipole moment | µ | Debye | DFT | 2916 | [0.003–12.49] |
| Total electronic energy | Etotal | eV | DFT | 2916 | [−3.2 × 106–−2.5 × 106] |
| Refractive index | n | — | MD + DFT | 744 | [1.02–1.84] |
| Dielectric constant | ε | — | MD + DFT | 744 | [1.06–12.7] |
| Permittivity | εr | — | MD + DFT | 744 | [9.4–113.1] |
![]() |
|||||
| Structural/physical | |||||
| Radius of gyration | Rg | Å | MD | 2500 | [9.17–75.22] |
| Density | ρ | g cm−3 | MD/Exp | 3643 | [0.11–2.97] |
Fig. 2 presents the NEMD results used to validate the MD-derived κ values used in this work. A representative simulation snapshot showing the imposed heat-flow direction and boundary configuration is presented in Fig. 2(a), confirming the formation of a well-defined transport geometry without structural distortion.29 The cumulative energy exchanged with the heat reservoirs shows a clear linear dependence on time (Fig. 2(b)), indicating the establishment of a stable steady-state regime from which the heat flux is reliably extracted. The corresponding temperature profile along the transport direction exhibits a well-defined linear region within the central conduction zone (Fig. 2(c)), with minimal curvature and noise, enabling robust estimation of the temperature gradient. The resulting κ values are directly compared with experimental measurements in Fig. 2(d), considering only data reported between 20 °C and 30 °C to match the room-temperature MD simulation conditions. The MD predictions show good agreement with experiments, yielding an R2 value of 0.75 and low absolute error. These results indicate that the MD workflow captures the dominant thermal transport trends across diverse polymer chemistries and provides physically consistent κ values suitable for subsequent ML analysis.
Fig. 3(a and b) illustrates the MD-based evaluation of Tg. In MD simulations, Tg is identified from the temperature dependence of ρ and reflects changes in molecular mobility and thermal expansivity in amorphous polymers. Fig. 3(a) shows the variation of ρ with temperature for a representative polymer. Two approximately linear regimes are observed, corresponding to the glassy and rubbery states. To estimate Tg, linear fits are performed separately for the low-temperature and high-temperature regions using different numbers of data points. Multiple fits are performed, and the final Tg is taken as the mean of the resulting intersection temperatures. If any fitted Tg value falls outside the initial temperature-scanning range, it is discarded. This approach captures changes in volumetric thermal expansion while reducing sensitivity to the specific choice of fitting window. The parity plot in Fig. 3(b) compares MD-predicted Tg values with experimental data and shows good agreement, with an R2 value of 0.72 and low prediction error across chemically diverse polymers. The estimated MD-derived Tg values remain sensitive to fitting-window selection, equilibration time, finite-time averaging, and the inherent limitations of simulation-based dilatometry approaches.41–44 Although the present stepwise equilibration procedure reduces direct dependence on a continuous cooling rate, the resulting Tg estimates should still be interpreted as semi-automated simulation-derived approximations rather than exact thermodynamic transition temperatures.
Fig. 3(c and d) presents the MD-based evaluation of the bulk modulus K. Fig. 3(c) shows the time evolution of K obtained from simulations initiated from different configurations, illustrating temporal fluctuations around a stable mean. To validate the reliability of the MD predictions, polymers with experimentally reported K values were collected from PoLyInfo and other literature sources,45 considering only measurements conducted between 20 °C and 30 °C to match the room-temperature simulation conditions. The parity plot in Fig. 3(d) compares MD-predicted and experimental K values and shows reasonable agreement, with an R2 value of 0.64 and low absolute error. The remaining deviations are attributed to variations in experimental conditions, polymer synthesis routes, and measurement techniques, which introduce noise into the reported modulus values. To assess the influence of polymer morphology on modulus evaluation, multiple independent initial structures were examined. The resulting variations in K were small, indicating that modulus predictions are largely insensitive to morphological randomness in the initial configurations.46 To further reduce potential morphology-related bias, all reported MD-derived K values were averaged over simulations performed using three distinct initial structures for each polymer.
Although the finite-deformation approach reduces some of the convergence issues associated with fluctuation-based elastic-property calculations, the predicted elastic constants can still be influenced by morphology-dependent packing effects, residual internal voids, and finite-chain free-volume effects. Consequently, apparent agreement for individual elastic properties should not be interpreted as proof of a fully converged thermodynamic state point for all polymer systems. More rigorous structural convergence analysis and uncertainty propagation remain important future directions for large-scale automated polymer simulation workflows.
Fig. 4(a and b) evaluates the accuracy of MD-predicted ρ through direct comparison with experimental data and illustrates the effect of bias correction. The parity plot in Fig. 4(a) shows a systematic deviation between MD predictions and experiments, despite a clear linear correlation across the sampled density range, indicating the presence of a global bias. The initial MD predictions yield a low R2 value of 0.21. After applying a linear calibration using polymers common to both datasets, the agreement improves substantially, as shown in Fig. 4(b), with the R2 increasing to 0.81. This corresponds to an improvement of approximately 286%, demonstrating a significant enhancement in agreement with experimental ρ values.
Fig. 4(c–e) presents the corresponding analysis for the Cp. Fig. 4(c) shows a representative enthalpy–temperature relationship obtained from MD simulations, where block-averaged enthalpy varies linearly with temperature, allowing stable estimation of Cp from the slope. Direct comparison with experimental values reveals that MD-predicted Cp is systematically overestimated, as shown in Fig. 4(d), consistent with previous studies based on classical MD simulations.23,47,48 The overestimation of Cp arises from the classical treatment of vibrational degrees of freedom in MD simulations. In classical mechanics, all vibrational modes are fully excited according to the Boltzmann distribution, independent of vibrational frequency. In contrast, quantum mechanics predicts partial population of high-frequency vibrational modes at room temperature, following Bose–Einstein statistics. As a result, classical MD assigns excess vibrational energy to stiff bond-stretching and bond-bending modes, leading to inflated Cp values relative to experiments.25,26 Despite this systematic bias, MD-predicted Cp values exhibit a strong monotonic correlation with experimental data, indicating that MD captures the underlying structure–property trends. This behavior and its implications for data-driven correction and transfer learning have been analyzed in detail in our previous work.48 The initial MAE of 1713.79 J kg−1 K is reduced to 192.40 J kg−1 K after applying a global linear correction, corresponding to an error reduction of approximately 89%. The corrected parity plot in Fig. 4(e) shows improved agreement with experiments, with reduced scatter and substantially lower prediction error.
The bias-corrected parity plots shown in Fig. 4(e) are included only as a diagnostic analysis to illustrate that the MD-derived Cp values preserve a strong systematic correlation with experimental trends despite the known classical overestimation of heat capacity. These corrected values were not used as inputs for model training or multi-fidelity learning. Instead, the multi-fidelity framework operates directly on the original heterogeneous datasets through fidelity-aware weighting without applying manual bias correction to the training labels. Importantly, MD-derived Cp values were not included in the joint multi-fidelity Cp learning framework because classical MD systematically overpredicts Cp due to the classical treatment of vibrational modes. Consequently, the fidelity-aware Cp analysis combines experimental and GC data rather than experimental and MD data. In contrast, for density (ρ), the original non-bias-corrected MD data were used directly within the multi-fidelity learning framework.
An extensive convergence test with respect to different optimization procedures has been conducted in our previous work.29 To further examine possible residual free-volume effects and equilibration-related density deviations, additional post-equilibration NPT validation simulations were performed for a representative subset of polymers initialized from the final equilibrated structures. Density, volume, and chain-size observables were monitored over the validation window, and mean-squared internal distance scaling was analyzed from unwrapped trajectories as a qualitative chain-statistics stability check. The corresponding analyses are provided in the SI, including density/volume stability plots, radius-of-gyration trajectories, internal-distance scaling behavior, and quantitative stability metrics (Fig. S4–S6 and Table S2). Overall, the validation subset exhibited relatively small density drift, modest late-time volume fluctuations, and no obvious signatures of abrupt chain collapse or unstable late-time structural relaxation.
The dielectric-property workflow combines DFT-derived electronic polarizabilities with dipolar fluctuations obtained from classical MD simulations to estimate the static dielectric response.23 Specifically, the electronic contribution is estimated through the Lorentz–Lorenz relation using isotropic polarizabilities,49 while the orientational contribution is obtained independently from simulation-cell dipole fluctuations. The total dielectric constant is then approximated through an additive decomposition of electronic and dipolar contributions. We note that this workflow combines several approximations, including monomer-level electronic structure calculations, isotropic polarizability assumptions, classical MD dipole fluctuations, and additive partitioning of dielectric contributions. Consequently, the predicted dielectric and permittivity distributions, particularly for extreme high-value candidates identified within PI1M, should be interpreted cautiously and primarily as screening-level estimates requiring future validation. Nevertheless, the predicted refractive-index range obtained in this work (n = 1.02–1.84) remains physically reasonable for polymeric materials. The present framework therefore aims to provide scalable approximate dielectric-property estimation suitable for large-scale polymer screening rather than quantitatively definitive dielectric predictions for all polymer classes Also, even if the method may contain systematic errors, the relative ranking of the properties should still be validated in general.
The present workflow should not be interpreted as independently validating each simulated property in isolation, since several quantities remain thermodynamically coupled through density, volume, and related structural state variables. Systematic deviations in density can therefore propagate into derived quantities such as thermal diffusivity, refractive index, and dipolar dielectric response, while compensating errors may contribute to apparently good agreement for some observables. Consequently, agreement for an individual property does not necessarily guarantee that the simulated systems reproduce a fully self-consistent experimental thermodynamic state point. The present results primarily demonstrate the ability of the workflow to capture useful structure–property trends at scale, while more rigorous state-point validation, structural convergence analysis, and uncertainty propagation remain important directions for future work. Representative repeat-unit structures corresponding to selected polymers from the training and validation datasets are provided in the SI (Fig. S3) to aid interpretation of the reported property trends and representative examples discussed throughout the manuscript.
Having established the physical consistency of the MD-derived properties and quantified their systematic deviations relative to experiments, we now turn to data-driven polymer property modeling using ML. The validated MD, experimental, and GC datasets provide a complementary foundation for learning shared structure–property relationships across multiple targets. In the following sections, we examine how multi-task and multi-fidelity ML strategies leverage these datasets to improve predictive performance, particularly in data-scarce regimes.
Fig. 5 presents a Spearman rank correlation heatmap of the polymer properties included in the analysis. Only property pairs with at least 30 shared data points are retained to ensure statistical reliability. The heatmap reveals clear blocks of correlated behavior within subsets of properties, along with weaker but non-negligible cross–property associations, highlighting the potential benefit of learning shared representations across multiple targets. Strong correlation patterns are observed among electronic and dielectric descriptors, including α, Eg, EHOMO, ELUMO, µ, n, ε, εr, and Etotal, consistent with their shared dependence on electronic structure and polarization response. Mechanical properties (E, G, K, and ν) also exhibit internally consistent correlations, reflecting their common sensitivity to stiffness and packing constraints in amorphous polymers. Thermal properties (Cp, Tg, Tm, αT, and κ) show more heterogeneous correlations, indicating that while certain structural factors influence multiple thermal responses, each property retains distinct physical drivers. The heatmap further shows measurable cross–domain correlations between structural descriptors (ρ and Rg) and several thermal and mechanical properties, as well as selected electronic and dielectric quantities. These patterns suggest that polymer packing and conformational statistics encode shared information that can support prediction across different property classes, even when direct physical coupling is indirect. Guided by these observations, two complementary grouping strategies are adopted for multi-task learning, as summarized in Table 2. First, physical groupings are defined based on property domains (electronic, thermal, mechanical, permeability, and other). Second, correlation-based groups (G1–G4) are constructed by clustering properties that exhibit stronger Spearman rank correlations in Fig. 5. Although the correlation-based task groups used in the present study were selected using the observed correlation structure together with considerations of physical interpretability and sufficient data availability, the grouping process could also be automated using clustering algorithms applied to the property correlation matrix. For example, pairwise distances derived from correlation strength could be used together with hierarchical clustering or related graph-based partitioning methods to construct task groups in a fully data-driven manner. Such automated grouping strategies represent a promising direction for future extensions of the framework. These grouping schemes enable a systematic evaluation of whether correlation-aware task selection provides additional benefits beyond conventional domain-based multi-task learning, which is examined in the following section.
| Group type | Group name | Properties |
|---|---|---|
| Physical | Electronic | α, Eg, ε, Etotal, EHOMO, ELUMO, µ, n, εr |
| Physical | Thermal | Cp, Tg, Tm, αT, κ |
| Physical | Mechanical | K, ν, G, E |
| Physical | Permeability | PHe, PH2, PCO2, PN2, PO2, PCH4 |
| Physical | Other | η, D, ρ, Rg |
| Correlation | Group 1 (G1) | α, Eg, EHOMO, ELUMO, n, Etotal |
| Correlation | Group 2 (G2) | Cp, ELUMO, Eg, Rg, ρ, Tg |
| Correlation | Group 3 (G3) | Tg, Tm, η |
| Correlation | Group 4 (G4) | PHe, PH2, PCO2, PN2, PO2, PCH4, Cp, κ |
We now turn to data-driven polymer property modeling using multi-task and multi-fidelity ML. Instead of training independent models for each target property, the proposed framework learns a shared molecular representation across multiple properties while using property-specific prediction heads. This design enables information sharing among related targets and allows lower-fidelity data to support representation learning, while higher-fidelity data provides stronger supervision where available. The following results evaluate the impact of task grouping and joint learning on predictive accuracy, using single-task models as a baseline.
Fig. 6 compares the scaled MAE obtained from single-task and multi-task ML models across different property groups and task configurations. In Fig. 6(a), electronic and dielectric properties generally show reduced error under multi-task learning, particularly when correlation-based groupings (G1 and G2) are used. In contrast, training a single model across all properties does not consistently improve performance, suggesting that indiscriminate task aggregation can yield limited gains or negative transfer for some electronic targets. Thermal properties (Fig. 6(b)) exhibit a more mixed response. Multi-task learning improves prediction accuracy for Cp, whereas only modest or negligible changes are observed for Tm and Tg. This behavior is consistent with the partially distinct physical mechanisms underlying these thermal properties. In several cases, correlation-based groupings perform comparably to, or even better than, purely domain-based groupings, suggesting that statistically informed task selection can be beneficial. Mechanical properties (E, G, K, and ν) show relatively small differences between single-task and multi-task models (Fig. 6(c)). This indicates that the available MD data already provide adequate supervision for these targets, limiting the additional benefit of task coupling. Transport and structural properties (Fig. 6(e)) display selective improvements under multi-task learning, particularly for ρ and Rg, where shared structural information may contribute to improved predictions. Overall, Fig. 6 shows that the effectiveness of multi-task learning is strongly property-dependent and reflects the balance between positive transfer and task interference. Multi-task learning can reduce prediction error when task groupings capture meaningful statistical or physicochemical relationships, allowing the shared encoder to learn more transferable latent representations from limited data. This effect is particularly beneficial for sparsely sampled properties, where information from correlated tasks can improve generalization. However, when tasks exhibit weak correlations, substantially different data distributions, or large imbalances in dataset size and fidelity, joint optimization can introduce negative transfer, in which unrelated tasks interfere with one another and reduce predictive accuracy. Consequently, certain properties or task groupings remain better suited to single-task learning despite the shared-representation advantages of the multi-task framework. These observations support the use of correlation-aware task grouping and provide a basis for the multi-fidelity analysis presented in the following section. Comparison of R2 for different polymer properties obtained from single-task and multi-task learning models is provided in the SI (Fig. S2).
Fig. 7 highlights the robustness of the multi-task learning framework relative to single-task models under varying training data availability. The analysis examines the effect of progressively reducing the fraction of training data while keeping the test set fixed. Training fractions of 100%, 50%, 10%, 5%, and 1% are considered, corresponding to approximately 10
000, 5000, 1000, 500, and 100 training data points, respectively.
When the full training dataset is used, single-task and multi-task models exhibit comparable performance in terms of both MAE and R2, with the multi-task model showing slightly lower MAE and marginally higher R2 values (Fig. 7). This indicates that, in the data-rich regime, both models are capable of learning accurate structure–property relationships for Cp. As the training data size decreases, a consistent performance gap emerges across both error and correlation metrics. The multi-task model maintains lower MAE and higher R2 values than the single-task model at all reduced training fractions. The difference becomes more pronounced at lower data availability, where the single-task model shows a clear degradation in predictive accuracy, while the multi-task model retains comparatively stable performance. This behavior is observed consistently in both MAE (Fig. 7(a)) and R2 (Fig. 7(b)), indicating improved robustness of the multi-task model under limited training data. These results indicate that multi-task learning improves data efficiency by leveraging shared information across related tasks, leading to better generalization when training data are scarce. The observed trends support the use of multi-task learning for polymer property prediction in scenarios where high-fidelity experimental data are limited.
Parts of the present Cp analysis build upon our previous single-property transfer-learning study, which focused specifically on integrating MD, GC, and experimental datasets for Cp prediction together with analysis of the systematic overprediction of Cp in classical MD simulations.48 In the present work, these Cp-related discussions are included primarily to provide context for the fidelity-aware learning strategy. The primary new contribution of the current study is the extension from a single-property transfer-learning framework to a unified multi-task and multi-fidelity graph-learning framework spanning 28 polymer properties across multiple physical domains.
Fig. 8 examines the influence of multi-fidelity training strategies on experimentally relevant property prediction using representative polymer properties with different experimental and lower-fidelity data availability. To systematically evaluate the role of lower-fidelity data and fidelity-weighting strategies, we compared experimental-only training together with multiple multi-fidelity formulations, including equal weighting, linear fidelity weighting, cosine fidelity weighting, and unscheduled combined-fidelity training. The corresponding parity plots for representative Cp models are shown in Fig. 8, while quantitative results for additional properties are summarized in the SI (Table S4).
For Cp, the experimental dataset is relatively sparse (118 experimental samples), while the GC dataset provides substantially broader chemical coverage (12
076 samples). Under these conditions, incorporating GC data generally improved experimentally evaluated prediction performance relative to experimental-only training. The experimental-only model yielded an average R2 value of 0.48 ± 0.31 and an MAE of 186.39 ± 58.15 J kg−1 K, whereas the multi-fidelity models achieved improved average performance, with the best results obtained using linear fidelity weighting (R2 = 0.56 ± 0.27, MAE = 171.61 ± 65.59 J kg−1 K). The remaining fidelity-weighting strategies produced comparable performance improvements relative to the experimental-only baseline. These results suggest that lower-fidelity GC data provide useful representation-learning benefits by expanding chemical coverage when experimentally measured data are limited.
The behavior observed for other properties further highlights the property-dependent nature of multi-fidelity learning. For density (ρ), where both experimental and MD datasets are relatively large and physically correlated, fidelity-aware learning produced more consistent improvements over experimental-only training, with cosine weighting yielding the best overall performance (R2 = 0.78 ± 0.06 compared with 0.70 ± 0.17 for experimental-only training). In contrast, for Tg, the experimental dataset already contains substantially more data than the corresponding MD dataset, and all fidelity strategies produced nearly identical performance (R2 ≈ 0.89), indicating limited additional benefit from lower-fidelity integration. Thermal conductivity (TC) remained challenging across all training strategies because of the extremely limited experimental dataset (65 samples), resulting in large variance and unstable generalization behavior regardless of the fidelity treatment.
Overall, these results indicate that the effectiveness of multi-fidelity learning depends strongly on the balance between experimental-data availability, lower-fidelity dataset coverage, and the degree of correlation between fidelity levels. The primary benefit of the multi-fidelity framework arises from the incorporation of broader lower-fidelity chemical information when experimental datasets are sparse, while differences among specific fidelity-weighting schedules remain comparatively modest for most properties. Fig. 6 and 8 therefore address complementary objectives. Fig. 6 evaluates shared representation learning within the multi-task framework, whereas Fig. 8 focuses specifically on the influence of heterogeneous fidelity integration on experimentally relevant prediction performance.
Having established the predictive accuracy and robustness of the proposed ML models, we next apply them to large-scale polymer property prediction. Large-scale property prediction is performed for polymers in the PolyInfo database, comprising approximately 13
000 real polymers, and the PI1M (ref. 50) database, containing one million virtual polymers. In total, predictions were generated for 28 polymer properties, corresponding to approximately 3.6 × 105 property values for the PolyInfo dataset and 2.8 × 107 property values for the PI1M virtual polymer library. For each target property, the model configuration that achieved the best validation performance in the preceding analyses is selected. Depending on the property, this corresponds to either a single-task model or a multi-task model with an appropriate task grouping. Table 3 summarizes the minimum, median, and maximum predicted values for all properties across both datasets. For the PolyInfo database, the predicted ranges for thermal properties such as Tm, Tg, κ, and Cp are consistent with experimentally reported values, with median predictions lying within physically expected regimes. Mechanical properties (E, G, K, and ν) exhibit relatively narrow distributions, reflecting the constrained mechanical response typical of amorphous polymers. Electronic and optical properties display broader ranges, particularly for α, µ, and ε, consistent with their sensitivity to chemical composition and electronic structure. Structural properties (Rg and ρ) show limited variation, reflecting packing constraints in polymer melts. For the PI1M database, the predicted medians for most properties remain comparable to those observed for PolyInfo, while the overall property ranges are generally broader. This widening reflects the substantially larger and more diverse chemical space represented in the virtual polymer set. Importantly, predicted values across all properties remain within physically reasonable bounds, indicating stable model behavior when extrapolated to a larger design space.
000 real polymers) and the PI1M database (1 million virtual polymers)
| Property | PolyInfo database | PI1M database | ||||
|---|---|---|---|---|---|---|
| Min | Median | Max | Min | Median | Max | |
| Thermal | ||||||
| Tm (K) | 207.87 | 509.66 | 880.45 | 203.78 | 447.25 | 918.02 |
| Tg (K) | 143.58 | 394.01 | 702.26 | 113.38 | 338.99 | 691.51 |
| αT (m2 s−1) | 5.0 × 10−8 | 1.8 × 10−7 | 5.6 × 10−7 | 2.5 × 10−8 | 1.8 × 10−7 | 6.0 × 10−7 |
| κ (W m−1.K) | 0.11 | 0.26 | 0.49 | 0.14 | 0.26 | 0.83 |
| Cp (J kg−1.K) | 437.43 | 1257.86 | 2560.19 | 551.00 | 1304.21 | 2796.52 |
![]() |
||||||
| Mechanical | ||||||
| E (GPa) | 1.46 | 5.26 | 10.77 | 0.82 | 5.18 | 10.16 |
| G (GPa) | 0.59 | 2.01 | 4.08 | 0.43 | 1.95 | 3.57 |
| K (GPa) | 1.20 | 4.90 | 8.61 | 1.24 | 4.84 | 9.95 |
| ν | 0.08 | 0.31 | 0.43 | 0.08 | 0.32 | 0.51 |
![]() |
||||||
| Transport | ||||||
| η (Pa s) | 7.1 × 10−3 | 1.1 × 10−2 | 2.9 × 10−2 | 6.4 × 10−3 | 1.3 × 10−2 | 2.9 × 10−2 |
| D (cm2 s−1) | 2.7 × 10−8 | 3.0 × 10−4 | 9.0 × 10−2 | 1.3 × 10−8 | 1.1 × 10−3 | 2.9 × 10−1 |
![]() |
||||||
| Gas permeability | ||||||
| PHe (Barrer) | 0.55 | 14.93 | 26 495.19 |
1.14 | 21.80 | 37 246.60 |
| PH2 (Barrer) | 0.56 | 17.83 | 38 526.12 |
0.56 | 20.19 | 75 825.66 |
| PCO2 (Barrer) | 2.3 × 10−4 | 9.81 | 61 187.62 |
4.3 × 10−3 | 14.80 | 54 059.37 |
| PN2 (Barrer) | 5.7 × 10−4 | 0.46 | 18 698.51 |
1.5 × 10−4 | 0.86 | 40 807.90 |
| PO2 (Barrer) | 9.1 × 10−4 | 1.64 | 23 378.65 |
6.4 × 10−4 | 2.17 | 30 797.60 |
| PCH4 (Barrer) | 1.7 × 10−3 | 0.53 | 24 869.14 |
5.6 × 10−4 | 1.85 | 58 958.58 |
![]() |
||||||
| Electronic/optical | ||||||
| α (a.u.) | 2.10 | 45.39 | 93.94 | 0.25 | 40.04 | 89.77 |
| EHOMO (eV) | −12.96 | −8.67 | −6.74 | −11.98 | −9.37 | −8.09 |
| ELUMO (eV) | −1.84 | 0.57 | 3.24 | −2.97 | 1.03 | 3.17 |
| Eg (eV) | 6.57 | 9.15 | 15.48 | 7.55 | 10.91 | 16.70 |
| µ (Debye) | 0.02 | 2.90 | 6.92 | 0.01 | 2.97 | 8.24 |
| Etotal (eV) | −3.0 × 106 | −8.9 × 105 | −4.7 × 104 | −4.3 × 106 | −8.3 × 105 | −4.1 × 104 |
| n | 1.26 | 1.55 | 1.74 | 1.25 | 1.52 | 1.91 |
| ε | 2.08 | 3.58 | 16.19 | 2.13 | 3.90 | 21.82 |
| εr | 13.77 | 24.88 | 78.64 | 13.54 | 25.65 | 86.77 |
![]() |
||||||
| Structural | ||||||
| Rg (Å) | 9.21 | 19.49 | 42.95 | 8.50 | 18.06 | 43.35 |
| ρ (g cm−3) | 0.69 | 1.24 | 2.79 | 0.85 | 1.20 | 2.80 |
Fig. 9 presents representative predicted distributions for selected properties, including Tm, K, and ε, for both PolyInfo and PI1M polymers. The PolyInfo distributions show a single dominant peak with a spread comparable to that observed in experimental datasets, whereas the PI1M distributions exhibit smoother profiles and extended tails due to the larger sample size. The minimum and maximum values shown are consistent with the summary statistics reported in Table 3 and further illustrate the broader coverage achieved for the virtual polymer library. Together, the results in Table 3 and Fig. 9 indicate that the trained models generate physically consistent property distributions at scale while preserving trends observed in experimentally characterized polymers. This capability enables systematic screening and comparative analysis of large polymer libraries across multiple property dimensions.
Taken together, the results indicate that combining physics-based simulations, experimental data, and ML within a unified framework enables consistent and scalable prediction of multiple polymer properties. Validation against experimental measurements provides confidence in the simulation-derived datasets, while correlation-aware multi-task learning and fidelity-weighted training improve predictive performance, particularly when experimental data are limited. The large-scale predictions for both real and virtual polymer libraries demonstrate the potential of the proposed framework for systematic evaluation of polymer properties across diverse chemical spaces.
000 property values spanning multiple property classes and data fidelity levels.
Using this dataset, inter-property correlations are quantified to assess the suitability of joint learning across targets. The results show that the effectiveness of multi-task learning depends on task-relatedness, with correlation-informed groupings providing more consistent improvements than uniform task aggregation. The analysis further indicates that multi-task learning improves data efficiency, with performance differences between single-task and multi-task models becoming more pronounced as the available training data are reduced. In addition, fidelity-aware training strategies improve predictive accuracy when combining experimental and computational data by balancing broad coverage from lower-fidelity sources with stronger supervision from experimental measurements.
The trained models are subsequently applied to large-scale property prediction for polymers in the PolyInfo database and the PI1M virtual polymer library. The resulting property distributions remain physically reasonable and reflect trends observed in experimentally characterized polymers, supporting the use of the models for systematic screening across extended chemical spaces.
Overall, the results demonstrate that integrating simulation-derived data, experimental measurements, and ML within a unified framework provides a practical approach for multi-property polymer prediction. The proposed methodology supports the scalable evaluation of both known and virtual polymers and provides a structured framework for polymer screening and analysis in settings where data availability and fidelity vary across properties.
![]() | (3) |
![]() | (4) |
![]() | (5) |
For each polymer, the temperature-scanning range was defined as a bounded interval around an initial estimate of Tg. This initial estimate was obtained from a separate multilayer perceptron model trained on experimentally reported Tg values. The MD temperature scan was then performed over a restricted window of ±150 K around the estimated Tg, rather than across a broad global temperature interval. This procedure limits the required simulation window while ensuring adequate sampling of both glassy and rubbery regimes.
In the present workflow, Tg is estimated from the temperature dependence of density using independently equilibrated simulations at discrete temperature intervals rather than from a continuous cooling trajectory at a fixed cooling rate. An auxiliary MLP model trained on experimental Tg data is used only to guide the initial fitting-window selection and is not used directly as the final simulation label.
In the equilibrium approach, after sufficient equilibration, production simulations were performed at 300 K and 1 atm to sample thermodynamic fluctuations. The Cp was computed from enthalpy fluctuations using the fluctuation–dissipation relation,60
![]() | (6) |
In addition, a non-equilibrium approach based on the enthalpy–temperature relationship was employed. The system was first equilibrated at 290 K and 1 atm, followed by a controlled temperature ramp from 290 K to 310 K under NPT conditions. Average enthalpy values were recorded as a function of temperature, and Cp was obtained from the slope of a linear fit to the enthalpy–temperature curve,47
![]() | (7) |
This non-equilibrium approach generally provides more stable estimates of Cp, but requires separate temperature-ramping simulations and therefore incurs additional computational cost; accordingly, all Cp values reported in this work were obtained using the NEMD method. A detailed comparison between EMD and NEMD approaches for polymer Cp, including quantitative error analysis, is provided in our previous work,48 where NEMD was shown to offer improved numerical stability and was therefore adopted in the present study.
![]() | (8) |
and the linear thermal expansion coefficient was obtained as αL = αV/3.
The temperature gradient was extracted from the central conduction region by fitting the linear portion of the steady-state temperature profile, excluding bins adjacent to the thermostatted regions to minimize boundary effects. The heat flux was obtained from the cumulative energy exchanged with the thermal reservoirs and normalized by the cross-sectional area perpendicular to the transport direction. κ was then evaluated using Fourier's law,
![]() | (9) |
To ensure reliable estimation of κ, only simulations exhibiting a linear increase in cumulative exchanged energy with time and a well-defined linear temperature gradient in the central region were retained for analysis. Reported κ values correspond to block-averaged means computed over the steady-state portion of the trajectory, with uncertainties estimated from the variance across independent time blocks.
![]() | (10) |
![]() | (11) |
![]() | (12) |
Within linear elasticity, the stress–strain relationship is given by
![]() | (13) |
For isotropic amorphous systems, the bulk modulus (K) and shear modulus (G) were computed using the Voigt–Reuss–Hill averaging scheme applied to the stiffness and compliance tensors. Young's modulus (E) and Poisson's ratio (ν) were then derived from K and G using standard isotropic relations,60
![]() | (14) |
![]() | (15) |
To ensure statistical reliability, for each simulation, final mean values of elastic properties were averaged over 100 configurations evenly sampled from an 8 ns production trajectory. Reported values correspond to ensemble averages, with coefficients of variation (defined as the ratio of standard deviation to the mean) not exceeding 0.33.
Monomer geometries were generated from repeat-unit SMILES using RDKit, where polymerization placeholders (*) were replaced with tritium tags ([3H]) to preserve valence and identify connection sites.61 Initial three-dimensional coordinates were generated using the ETKDGv2 algorithm with explicit hydrogens added prior to export in XYZ format.51
All electronic-structure calculations were performed using Psi4 (ref. 62). To obtain stable and physically reasonable geometries, a three-stage optimization protocol was applied: (i) HF/STO-3G pre-optimization, (ii) refinement at HF/6-31G, and (iii) final optimization using the range-separated, dispersion-corrected ωB97M-D3BJ functional with the 6-311+G(2d,p) basis set.63 Geometry and self-consistent field convergence thresholds were progressively tightened at each stage.
The static dipole polarizability was evaluated at the optimized geometry using a finite-field approach. Single-point DFT calculations were performed under uniform electric fields of magnitude ± δ, with δ = 10−4 atomic units, applied independently along each Cartesian direction. The polarizability tensor was obtained from central finite differences of the induced dipole moments,
![]() | (16) |
and reported in Å3 (ref. 64).
Single-point electronic properties were then computed at the optimized geometries using ωB97M-D3BJ with density fitting and tight self-consistent field convergence. HOMO and LUMO energies were extracted from the converged orbital spectrum and reported in eV, and the electronic band gap was computed as Eg = ELUMO − EHOMO. Total electronic energies were reported in eV, and dipole moments were reported in Debye. For iodine-containing monomers, a mixed-basis treatment was employed in which LanL2DZ was applied to iodine while all other atoms used 6-311G(d,p).65
The electronic contribution to the dielectric response was estimated by first computing the refractive index (n) from the DFT-derived isotropic polarizability using the Lorentz–Lorenz relation,49
![]() | (17) |
The orientational, or dipolar, contribution to the static dielectric constant (εdip) was obtained from MD simulations using dipole moment fluctuations,
![]() | (18) |
The total static dielectric constant (ε) was calculated as
| ε = εel + εdip − 1, | (19) |
![]() | (20) |
denotes the set of polymers for which property p at fidelity level f is available,
is the corresponding number of observed data points, and wp,f is a weighting factor used to balance the contributions from different properties and data fidelities.The normalization by
ensures that properties or fidelity levels with larger datasets do not dominate the optimization objective. The weighting factors wp,f provide a mechanism to adjust the relative influence of experimental and computational data during training, reflecting their differing reliability without enforcing explicit bias correction. The present framework adopts a prescribed fidelity hierarchy in which experimental data are treated as the highest-fidelity reference, followed by DFT, MD, and GC data. This ordering is motivated by the overall physical accuracy and level of approximation associated with each method, while recognizing that the effective reliability may still vary across different properties. The fidelity weights used in the loss function are therefore prescribed based on the assumed relative reliability of the corresponding data sources rather than learned adaptively from the data. Importantly, not all properties contain the same fidelity combinations. In particular, GC data are only used for Cp in the present work. Since classical MD systematically overpredicts Cp due to the classical treatment of vibrational modes, the fidelity-aware Cp analysis was restricted to GC and experimental data, where the lower-fidelity trends remain more consistent with the experimental target space. Consequently, MD and GC data are not simultaneously combined within the current Cp multi-fidelity training setup. Although the present framework uses fixed fidelity weights, future extensions could incorporate adaptive fidelity weighting or discrepancy-learning approaches in which the relative reliability of different fidelity sources is learned directly from the data.
Each target property was normalized independently using statistics computed from the training data prior to optimization. This normalization prevents properties with larger numerical magnitudes or different physical units from disproportionately dominating the loss function.
Hyperparameters, including learning rate, batch size, network depth, and regularization parameters, were optimized using Bayesian optimization implemented with Optuna.66 Model selection was based primarily on validation mean absolute error (MAE), while RMSE and coefficient of determination (R2) were additionally used for performance evaluation and comparison. For multi-task models, validation metrics were computed either as aggregated measures across tasks or with respect to a designated primary task, while the training objective itself remained unchanged. All reported model performances were averaged over five random seeds, and the variability across seeds was used to estimate uncertainty in the reported metrics.
The encoder consists of multiple message-passing layers in which node embeddings are iteratively updated through graph convolution, normalization, nonlinear activation, optional residual connections, and dropout. For the GINE backbone, edge features are explicitly incorporated during message passing, whereas GIN and GCN variants use only node features. Graph-level polymer embeddings are obtained through global pooling, with mean pooling used in the present study.
The number of message-passing layers, embedding dimensions, normalization strategy, activation functions, dropout values, and prediction-head architectures were optimized using Optuna based on validation performance and training stability. The hyperparameter search space included 3–7 message-passing layers, graph embedding dimensions ranging from 256 to 768, and prediction-head depths of 1–3 fully connected layers.
To enable simultaneous multi-property learning, all tasks share the same molecular graph encoder while maintaining separate task-specific multilayer perceptron (MLP) prediction heads. Each prediction head uses the same architectural template consisting of repeated linear layers and nonlinear activations followed by a task-specific output layer. Fidelity information is incorporated through learned fidelity embeddings, which are either concatenated with the graph embedding or applied through feature-wise linear modulation (FiLM) prior to property prediction. A summary of the key architectural hyperparameters and training configurations used in the PolyGraphMT framework is provided in Table S2 of the SI.
Supplementary information (SI): additional analyses of the multi-fidelity dataset, property overlap statistics, multi-task learning performance, model architecture and training details, representative polymer structures, post-equilibration structural validation, and supplementary benchmarking results. See DOI: https://doi.org/10.1039/d6dd00206d.
| This journal is © The Royal Society of Chemistry 2026 |