Gustavo Chaparro and
Erich A. Müller
*
Sargent Centre for Process Systems Engineering, Department of Chemical Engineering, Imperial College London, London, SW7 2AZ, UK. E-mail: e.muller@imperial.ac.uk
First published on 19th June 2025
The parameterisation of the force field of a molecular system is essential for accurately describing and predicting macroscopic thermophysical properties. Here, we discuss three approaches to obtain the molecular parameters (σ, ε, and λr) of the Mie force field from experimental data for quasi-spherical molecules. The first approach is based on a classical strategy that considers fitting only to vapour–liquid equilibria data. The second approach entails a simultaneous fit to equilibrium properties and liquid shear viscosity. Finally, a third approach incorporates solid–fluid equilibrium data. The fitting procedure is facilitated by the use of recently published machine-learned equations of state for the Mie particle, which allows the prediction of thermophysical properties given a set of molecular parameters. The goodness-of-fit is assessed based on the deviations between calculated and experimental data. We also assess the behaviour of the thermal conductivity and speed of sound of the saturated liquid phase to evaluate the transferability of the molecular parameters to properties not used in the parametrisation. Apart from the singular case of monoatomic molecules, no single set of parameters can simultaneously describe the fluid phase equilibria, transport, and solid transition properties of quasi-spherical molecules. This result highlights the limitations of the Mie potential for modelling the thermophysical properties of small molecules. Therefore, a compromise must be made, either to achieve a good description of a specific set of properties or to attain modest accuracy across all phase space.
Design, System, ApplicationThe fitting of force field parameters to experimental thermophysical data typically relies solely on comparisons with vapour–liquid equilibrium data. We utilise physics-informed machine-learned equations of state to evaluate this classical approach against two alternative parameterisation strategies that incorporate either transport or solid-phase properties. Using the Mie forcefield as a benchmark, it is observed that for simple monoatomic substances, all schemes are essentially equivalent, yielding a good overall fit. However, for more complex molecules, the limitation of the Mie forcefield becomes evident, as no single set of parameters is seen to accurately and simultaneously predict all studied properties. These results highlight the need to develop more robust and detailed force fields for molecular modelling. |
Regardless of the type of force field (i.e., classical or machine-learning-based) and its molecular resolution (AA, UA, or CG), its applicability depends on the selection of molecular parameters that, along with the state conditions (i.e., density, temperature, and composition), define a Hamiltonian that can be translated into macroscopic observables through molecular simulations and statistical thermodynamics. The appropriate parameterisation of the force field (i.e., the fitting of the molecular parameters) is crucial for its success in predicting thermophysical properties. The parameterisation of a given force field can follow either a bottom-up16,17 or a top-down13 approach. In bottom-up approaches, the force field is fitted to reproduce forces, bond distances, and angles from rigorous quantum mechanical calculations. This approach is also the ansatz of machine learning potentials that aim to bridge ab initio accuracy with larger scales.9,18 Conversely, the top-down approach directly optimises the molecular parameters to reproduce relevant experimental data. This method links a molecular model to its implied thermophysical properties and is the most efficient from an engineering and design perspective.
For the sake of conciseness, we shall henceforth discuss the parameterisation of the Mie potential, shown in eqn (1).
![]() | (1) |
Although the fitting of force fields to experimental data seems a judicious approach, the details of how to proceed remain elusive: which properties should be targeted? Which ones provide the most information? How can one guarantee the robustness and transferability of the optimised parameters? A century ago, Lennard-Jones pioneered the answering of these questions by parameterising the Mie potential using a top-down approach, which led to the well-known Lennard-Jones (LJ) potential (i.e., ULJ = 4ε[(σ/r)12 − (σ/r)6]).22,23 He recognised that the molecular parameters of the Mie potential (or the LJ potential) could be fitted to either the second virial coefficient or the viscosity of a dilute argon gas. However, no unique set of parameters could be isolated: “… so it does not prove possible to obtain a molecular model which will simultaneously explain the two sets of experimental facts”.23 The results from Lennard-Jones already showed that even for simple molecules like argon, there are trade-offs with respect to which property to choose for fitting.24
Within a modern approach, one can target the simultaneous fitting of multiple thermophysical properties and larger data sets. Take, for example, the SAFT-γ Mie force field,13 a CG force field based on the Mie potential. The parameterisation of this force field is performed by invoking an analytical equation of state (the SAFT-VR-Mie EoS25), which accurately maps the force field parameters to macroscopic fluid phase equilibria. By using the equation of state as a surrogate of the underlying MD results, optimised molecular parameters that best fit the vapour–liquid equilibria over a broad range of state points can be effectively determined. The SAFT-γ Mie force field has since been successfully used to model equilibrium properties of alkanes,26 carbon dioxide,27 greenhouse gases,28 water,29 fluorinated compounds30,31 and polymers32–34 amongst others. However, limitations have been seen when attempting to employ the force fields to describe transport properties and to determine the triple point and onset of the solid phases.
In spite of its limitations, the above approach can be generalised by employing the corresponding states principle. Mejía et al.35 developed a corresponding states correlation that relies on the critical temperature (Tc), a saturated liquid density (ρl at Tr = 0.7) and the Pitzer acentric factor (ω = −1 − log10(Psat/Pc) at Tr = 0.7) to obtain the molecular parameters of homonuclear Mie chains. This approach, a.k.a. the M&M correlation, has been successfully applied to model the phase equilibria and interfacial properties of a diverse range of industrially relevant molecules, including alkanes, aromatics, gases, and solvents.36 Moreover, this corresponding states principle has been applied to 6000+ molecular fluids, whose parameters have been freely published in the Bottled SAFT webpage.37 Alternatively to the M&M correlation, Hoang et al.38 proposed a similar corresponding states correlation for homonuclear Mie chains. Interestingly, they proposed using the reduced liquid viscosity data point (η1 at Tr = 0.7) as an additional parameter into the correlation. This approach suggests that transport properties could be employed in addition to the volumetric equilibrium properties to obtain more robust and transferable molecular models.
That being said, this contribution focuses on the single-bead CG modelling of quasi-spherical molecules using the Mie potential, eqn (1). Three main parameterisation approaches are explored and compared. The first approach follows the classical strategy for parameterising force fields where only vapour–liquid equilibria and fluid data are considered. The second approach follows the corresponding states principle of Hoang et al.,38 where the liquid shear viscosity data is considered in addition to the VLE data. The third parameterisation approach includes solid–fluid data along with fluid phase equilibria. Details of the different parameterisation approaches and how the thermophysical properties are modelled are discussed in section 2, while the discussion of the results is given in section 3. Finally, the conclusions are summarised in section 4.
Here, we consider three parameterisation approaches to obtain the molecular parameters (σ, ε, λr) of the Mie potential. We formulate a general objective function as follows.
![]() | (2) |
PVLEb | ρl,VLEb | ΔHVLEb | PSLEc | PSVEd | ΔHSVEd | ηe | |
---|---|---|---|---|---|---|---|
a Thermophysical properties considered in the objective function (OF) are indicated by a “✓”.b Vapour–liquid equilibria (VLE) properties are also referred to as “vaporisation” properties. These include the vaporisation pressure (PVLE), saturated liquid density (ρl,VLE), and vaporisation enthalpy (ΔHVLE). VLE properties can be obtained either with the FE-ANN EoS40 or FE-ANN(s) EoS,41 based on eqn (3).c Solid–liquid equilibria (SLE) properties are also referred to as “melting” properties. Only the melting pressure (PSLE) is considered here.d Solid–vapour equilibria (SVE) properties are also referred to as “sublimation” properties. These include the sublimation pressure (PSVE) and sublimation enthalpy (ΔHSVE). SLE and SVE properties are obtained using the FE-ANN(s) EoS.e The shear viscosity (η) is modelled using the ANN-based models developed in ref. 42, based on eqn (6). | |||||||
OF1 | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ |
OF2 | ✓ | ✓ | ✓ | ✗ | ✗ | ✗ | ✓ |
OF3 | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✗ |
Thermophysical properties under VLE conditions are generated within the range Tr = T/Tc ∈ [0.55, 0.95]. The VLE data includes saturation pressure, saturated liquid density, vaporisation enthalpy, and saturated liquid viscosity. In addition, the thermal conductivity and speed of sound of the saturated liquid phase are generated to assess the transferability of the molecular parameters to unseen thermophysical properties. Similarly, the melting pressure data is generated in the Tr = T/Tc ∈ [0.55, 1.1] range, and the sublimation pressure and enthalpies are generated in the Tr = T/Tc ∈ [0.45, 0.55] range. These constitute our database of experimental information.
![]() | (3) |
![]() | (4) |
![]() | (5) |
The second objective function (OF2) requires predicting the shear viscosity of the Mie fluid. For this purpose, we employ the artificial neural network (ANN)-based models for the transport properties of the Mie fluid reported in ref. 42. The recommended model for the shear viscosity (η) is based on an ANN and utilises a semi-log scale. This model is formulated using reduced units and can be converted into real units, as shown below.
![]() | (6) |
The behaviour of the speed of sound and the thermal conductivity will also be studied to assess the transferability of the optimised molecular parameters (σ, ε, λr) to other thermophysical properties. The speed of sound can be obtained by any EoS as follows.
![]() | (7) |
Here, w is the speed of sound (in m s−1), κT is the isothermal compressibility (in Pa−1) and CV and CP are obtained isochoric and the isobaric heat capacities (in J K−1 mol−1). The FE-ANN/FE-ANN(s) EoSs explicitly model the residual Helmholtz free energy and the ideal contribution can be accounted for analytically as shown in eqn (5). The isothermal compressibility is directly obtained from the Helmholtz free energy as κT = (∂ρ/∂P)T/ρ. It is recommended that the ideal and residual contributions for the heat capacities be considered separately.
CV = CidV + CresV | (8a) |
CP = CidP + CresP | (8b) |
Finally, the thermal conductivity is obtained from the recommended ANN-based model developed in ref. 42. This model uses reduced units and a semi-log scale. Therefore, the thermal conductivity of the Mie fluid is obtained as follows.
![]() | (9) |
![]() | (10) |
![]() | (11) |
The ideal gas isochoric heat capacity is obtained from the correlations of the DIPPR project 801.46 The number of non-vibrational degrees of freedom of a molecule depends on its molecular structure. Monoatomic molecules have three non-vibrational degrees of freedom. Linear molecules, such as carbon monoxide, have five non-vibrational degrees of freedom. Non-linear molecules, like methane, have six non-vibrational degrees of freedom. The correction term κvib is small for liquid phases. For the molecules of interest of this work κvib/κANN ≲ 10−2. Finally, the thermal conductivity is obtained by adding eqn (9) and (10).
κ = κANN + κvib | (12) |
The optimised molecular parameters and their corresponding deviations are summarised in Table 3. This table also includes the parameters predicted by the M&M35 and the Hoang et al.38 corresponding states correlations. Overall, it is evident from this table that the VLE can be modelled with low deviations (MAPE ≲ 5%) when it is the sole target data (i.e., OF1). It must be noted that the parameters obtained from this objective function are in good agreement with the M&M correlation. This similarity results from the M&M correlation using the critical temperature, acentric factor, and liquid density to predict molecular parameters that correctly predict the VLE conditions at Tr = 0.7. However, this correlation is parametrised to predict a correct density behaviour, and higher deviations (>10%) are observed for the saturation pressure (PVLE). This higher deviation in pressure is also observed from the parameters obtained in the correlation of Hoang et al.38 This correlation targets a correct density and shear viscosity behaviour. The objective of the parameters provided by this correlation is comparable to the ones obtained from using OF2.
a Thermophysical properties included in the objective functions are coloured with a light grey background. Refer to eqn (2) and Table 1 for further details about OF1, OF2, and OF3.b The objective functions are optimised using pseudo-experimental data obtained from the NIST TRC.43c The reported Mean Absolute Percentage Error (MAPE) is obtained as ![]() |
---|
![]() |
Using OF1 (or the M&M correlation) provides a good description of the VLE. However, the overfitting of the VLE phase envelope commonly comes at the cost of increasing the repulsions and interaction energy of the Mie potential. As will be shown later in Fig. 3–5, the apparent excellent agreement with the experiments stems from extending the validity range of a fluid EoS to regions below the triple temperature where a solid phase would exist. This overfitting, in itself, is not problematic if only fluid properties are considered, but it has implications if one wishes to transfer the model to other thermophysical properties. For example, an erroneous prediction of triple points will substantially influence viscosity predictions, which are known to rise exponentially as one approaches solid-phase conditions.53
The ratio between the critical (Tc) and triple (Ttr) points for the selected molecules is illustrated in Fig. 1. From this figure, it is apparent that Tc/Ttr is accurately represented for monoatomics when using only VLE data. However, including data on shear viscosity (OF2) or solid–fluid equilibria (OF3) data is essential to more effectively predict this ratio for polyatomic molecules. The proposed parameterisation approaches can also predict other transport properties, such as the self-diffusivity coefficient. As shown in Fig. S.5 in the ESI,† the self-diffusivity can be accurately predicted by the three different objective functions as long as the stable fluid region is correctly predicted (i.e., correct Tc/Ttr ratio). This behaviour results from the dependence of the self-diffusivity on the molecular size (related to σ) for non-associating molecules. This result has been observed in the early13 and recent54 parameterisation of the SAFT-γ force field and implies that this property is not determinant when parameterising coarse-grained force fields that ignore explicit polar moments and electrostatics. These general observations set the stage for the main discussion below, which explains how achieving acceptable fits for certain properties comes at the price of producing unreliable representations of others. The discussion will be presented below based on the aforementioned molecular groups (i.e., monoatomic, linear, and tetrahedral molecules). Further results for molecules not covered in this section are available in section S.2 of the ESI.†
![]() | ||
Fig. 1 Critical and triple temperature ratio of selected quasi-spherical molecules. Grey bars with diagonal hatch refer to the value obtained from the NIST TRC database.43 Blue bars with a circle pattern refer to the values obtained from OF1 (VLE only). Orange bars with a star pattern refer to the values obtained from OF2 (VLE + shear viscosity). Green with a dotted pattern refers to the values obtained from OF3 (VLE + SLE + SVE). |
The results of Table 3 are best interpreted graphically. The phase equilibria and selected thermophysical properties results of argon are illustrated in Fig. 2. Similar figures for krypton and xenon can be found in the ESI.† Fig. 2 takes into account the three sets of parameters along with two possible modelling approaches. Fig. 2(a)–(c) depict the phase equilibria results modelled solely with fluid-phase data. Generally, good agreement can be noted with the reference VLE data when using the OF1 and OF2 parameterisation approaches. Fig. 2(d)–(f) examine the same set of molecular parameters but using the FE-ANN(s) EoS. This EoS broadens the prediction to encompass the entire phase diagram, including VLE, SLE, SVE, and the triple point. It can be observed that even when solid-phase data are excluded from the parameterisation, all modelling approaches converge to predict a similar phase envelope, equilibrium pressure, and enthalpy of phase change. These sets of parameters also accurately predict the triple temperature, which is marked by a solid black line in the figures.
![]() | ||
Fig. 2 Selected thermophysical properties argon. (a) and (d) Phase envelope. (b) and (e) Clapeyron plot. (c) and (f) Enthalpy of phase change. (g) Speed of sound. (h) Shear viscosity. (i) Thermal conductivity. (a)–(c) Phase equilibria modelled using the FE-ANN EoS developed ref. 40. (d)–(g) Phase equilibria and speed of sound modelled using the FE-ANN(s) EoS developed in ref. 41. FE-ANN/FE-ANN(s) EoSs are based on eqn (3). (h) and (i) The shear viscosity and thermal conductivity are modelled using the ANN-based models developed ref. 42, based on eqn (6) and (12). Dashed blue lines and symbols refer to results using parameters optimised with OF1 (VLE only). Dotted orange lines and symbols refer to results using parameters optimised with OF2 (VLE + shear viscosity). Solid green lines and symbols refer to results using parameters optimised with OF3 (VLE + SLE + SVE). Refer to eqn (2) and Table 1 for further details about the objective functions. Filled circle is the critical point. Filled square is the triple point. Pseudo-experimental data obtained from NIST TRC.43 Upright triangles are VLE data, diamonds are SLE data, and downward triangles are SLE data. For reference, the solid black line refers to the triple temperature. |
The results for the speed of sound, shear viscosity and thermal conductivity of saturated liquid argon are presented in Fig. 2(g)–(i). Similarly to the phase equilibria results, the three parameterisation approaches accurately predict these thermophysical properties. These findings further confirm that the Mie potential can model the thermophysical properties of simple monoatomics with precision. Notably, the optimal repulsive exponents for these molecules are closer to the value of 13–13.5 than to the “classical” value of 12 employed in the LJ model.
The results are presented for nitrogen in Fig. 3. Fig. 3(a)–(c) illustrate the VLE results using the fluid FE-ANN EoS. A good agreement can be observed for the VLE phase envelope, vaporisation pressure, and vaporisation enthalpy using OF1 or OF2. However, it is important to note that the fluid FE-ANN EoS is stretched to predict VLE at conditions below the triple temperature. In a molecular simulation context, the Mie particle exhibits coexistence between a fluid and a solid phase in those conditions. In Fig. 3(d)–(f), the entire phase diagram is modelled using the FE-ANN(s) EoS with different parameter sets. These figures clearly demonstrate that the triple point predicted by the parameters obtained from OF1 is above some experimental VLE data points. Under those conditions, the VLE is unstable, and the Mie particle exhibits SVE. Incorporating the shear viscosity or solid phase data reduces the repulsive exponent and interaction energy of the Mie potential and, consequently, lowers the triple point. The parameters from OF2 and OF3 model the VLE similarly to those from OF1, but they provide a better description of the triple point, SLE and SVE. It is noteworthy that the order of magnitude of the melting pressure (∼103 MPa) is considerably higher than that of the vaporisation (∼10−1 MPa) and sublimation pressures (∼10−3 MPa). Although the molecular parameters obtained from OF3 predict the correct melting pressure trend, the reported relative errors are significant (∼50%). Ultimately, the differences in the various molecular parameters result in noticeably distinct SLE densities. This information could be helpful in distinguishing optimal molecular parameters; however, this data is not available from the NIST TRC correlations.43
![]() | ||
Fig. 3 Phase equilibria modelling of nitrogen. (a) and (d) Phase envelope. (b) and (e) Clapeyron plot. (c) and (f) Enthalpy of phase change. (g) Speed of sound. (h) Shear viscosity. (i) Thermal conductivity. (a)–(c) Phase equilibria modelled using the FE-ANN EoS developed ref. 40. (d)–(g) Phase equilibria and speed of sound modelled using the FE-ANN(s) EoS developed in ref. 41. FE-ANN/FE-ANN(s) EoSs are based on eqn (3). (h) and (i) The shear viscosity and thermal conductivity are modelled using the ANN-based models developed ref. 42, based on eqn (6) and (12). Dashed blue lines and symbols refer to results using parameters optimised with OF1 (VLE only). Dotted orange lines and symbols refer to results using parameters optimised with OF2 (VLE + shear viscosity). Solid green lines and symbols refer to results using parameters optimised with OF3 (VLE + SLE + SVE). Refer to eqn (2) and Table 1 for further details about the objective functions. Filled circle is the critical point. Filled square is the triple point. Pseudo-experimental data obtained from NIST TRC.43 Upright triangles are VLE data, diamonds are SLE data, and downward triangles are SLE data. For reference, the solid black line refers to the triple temperature. |
The liquid speed of sound results of Nitrogen are shown in Fig. 3(g). It is observed here that the three parameterisation approaches led to a similar description of this property. However, it should be noted that the described ranges for these properties depend on the predicted critical and triple points. For the case of the shear viscosity, shown in Fig. 3(h), it can be observed that including properties related to the repulsive branch of the force field enhances the accurate description of this property. In this instance, the parameters from OF2 and OF3 exhibit a similar behaviour. This finding contrasts with the observations for the thermal conductivity, illustrated in Fig. 3(i). These results graphically illustrate the trade-offs inherent in the coarse-grained modelling approach, specifically regarding which properties to fit (in this case, shear viscosity or thermal conductivity). Similar conclusions can be drawn from the analysis of the data for carbon monoxide, which is included in section S.2 of the ESI.†
![]() | ||
Fig. 4 Phase equilibria modelling of methane. (a) and (d) Phase envelope. (b) and (e) Clapeyron plot. (c) and (f) Enthalpy of phase change. (g) Speed of sound. (h) Shear viscosity. (i) Thermal conductivity. (a)–(c) Phase equilibria modelled using the FE-ANN EoS developed ref. 40. (d)–(g) Phase equilibria and speed of sound modelled using the FE-ANN(s) EoS developed in ref. 41. FE-ANN/FE-ANN(s) EoSs are based on eqn (3). (h) and (i) The shear viscosity and thermal conductivity are modelled using the ANN-based models developed ref. 42, based on eqn (6) and (12). Dashed blue lines and symbols refer to results using parameters optimised with OF1 (VLE only). Dotted orange lines and symbols refer to results using parameters optimised with OF2 (VLE + shear viscosity). Solid green lines and symbols refer to results using parameters optimised with OF3 (VLE + SLE + SVE). Refer to eqn (2) and Table 1 for further details about the objective functions. Filled circle is the critical point. Filled square is the triple point. Pseudo-experimental data obtained from NIST TRC.43 Upright triangles are VLE data, diamonds are SLE data, and downward triangles are SLE data. For reference, the solid black line refers to the triple temperature. |
![]() | ||
Fig. 5 Phase equilibria modelling of tetrafluoromethane. (a) and (d) Phase envelope. (b) and (e) Clapeyron plot. (c) and (f) Enthalpy of phase change. (g) Speed of sound. (h) Shear viscosity. (i) Thermal conductivity. (a)–(c) Phase equilibria modelled using the FE-ANN EoS developed ref. 40. (d)–(g) Phase equilibria and speed of sound modelled using the FE-ANN(s) EoS developed in ref. 41. FE-ANN/FE-ANN(s) EoSs are based on eqn (3). (h) and (i) The shear viscosity and thermal conductivity are modelled using the ANN-based models developed ref. 42, based on eqn (6) and (12). Dashed blue lines and symbols refer to results using parameters optimised with OF1 (VLE only). Dotted orange lines and symbols refer to results using parameters optimised with OF2 (VLE + shear viscosity). Solid green lines and symbols refer to results using parameters optimised with OF3 (VLE + SLE + SVE). Refer to eqn (2) and Table 1 for further details about the objective functions. Filled circle is the critical point. Filled square is the triple point. Pseudo-experimental data obtained from NIST TRC.43 Upright triangles are VLE data, diamonds are SLE data, and downward triangles are SLE data. For reference, the solid black line refers to the triple temperature. |
The effect of the various choices of parameters in the predicted phase diagram is shown in subfigures (d) to (i). For the case of Methane, Fig. 4, it observed that when including solid phase (OF3) or shear viscosity (OF2) data the correct behaviour for the VLE is retained with an improvement on the prediction of the triple point, SLE and SVE. Both OF2 and OF3 objective functions lead to lower repulsions and interaction energies of the system and similar molecular parameters. In this context, it is observed that incorporating a thermophysical property related to the repulsive interactions benefits the description of the phase diagram and transport properties. This result is also observed for linear molecules and aligns with the corresponding states methodology proposed by Hoang et al.38
For the case of tetrafluoromethane, Fig. 5, the limitations of the Mie potential are further unveiled. In Fig. 5(d)–(f), it is shown how the highly repulsive parameters (i.e., λr = 34) obtained from OF1 lead to a phase diagram where a considerable part of the experimental VLE data points are below the triple line. Below this temperature, the Mie particle will exhibit a solid phase. Moreover, when incorporating solid (OF3) or shear viscosity (OF2) data, there is no set of parameters that can adequately describe all properties (hence the title of this manuscript). It must be noted that all these parameter sets fail to describe the triple temperature of tetrafluoromethane. It is not a surprise that other thermophysical properties, such as the speed of sound, Fig. 5(g), and thermal conductivity, Fig. 5(i), are not described correctly either. Tetrafluoromethane (CF4), similar to methane (CH4), is a non-polar molecular. However, the fluorine atoms in CF4 are highly electronegative and larger than the hydrogen atoms in CH4. The fluoride atoms in CF4 can lead to strong local polar moments, an overall octopolar moment and steric effects that a simple single-bead coarse-grained model seems incapable of taking into account. These results align with the findings of Bell57 that suggested including these polar moments in an EoS to improve the description of derivative data of refrigerants.
It has been found that a single set of parameters can accurately describe the entire range of properties for simple monoatomic molecules, such as argon, krypton, and xenon. An optimal set of parameters is encountered regardless of the thermophysical properties used to parameterise the Mie potential. The expected simple nature of noble gases, described as isotropic particles with a fixed repulsion and dispersion, most likely contributes to this result. However, the limitations of the Mie potential become evident beyond these simple cases. For linear and tetrahedral molecules, the numerical solution of the parameterisation overfits the vapour–liquid equilibrium when it is the sole target property. Notably, this approach is a customary parametrisation strategy in SAFT-like EoSs25,39 and leads to a displacement of the triple point, resulting in a region of the VLE phase envelope that is unstable (frozen) as it lies below the triple temperature. This behaviour is ignored when using the EoS in the context of fluid phases, but it will lead to incorrect phase behaviour when deployed in a molecular simulation. Furthermore, this suboptimal parameterisation leads to a poor description of other thermophysical properties, particularly transport properties. In contrast to noble gases, which interact solely through simple repulsion and van der Waals forces, polyatomic molecules can exhibit additional anisotropic inter- and intramolecular interactions. Examples include dipolar and higher polar moments, polarisation effects, and steric effects caused by molecular geometry. These interactions are not accounted for when modelling a polyatomic molecule using a coarse-grained approach with a single bead per molecule.
In contrast to the exclusive use of VLE data, the proposed parameterisation approaches suggest that incorporating properties such as shear viscosity and/or solid–fluid equilibria data results in a different set of molecular parameters. The inclusion of these properties in the parameterisation influences the interaction energy and repulsive exponent of the molecular model. This stems from the fact that both of these properties are strongly affected by short-range interactions. Shear viscosity relates to the friction between molecules, which is pertinent at short distances. Likewise, the solid phase and crystal structure directly relate to relatively short-distance steric packing. In contrast, vapour–liquid equilibria is a consequence of the interplay between repulsive and attractive interactions. This interplay can be characterised by multiple sets of roughly equivalent molecular parameters that correctly describe the VLE region.19 This multiplicity (and ultimate mathematical global minimum in the parameter search) is not a guarantee that the parameters will extrapolate effectively to other thermophysical properties. For this reason, we advise to include properties related to the repulsions of the system when parameterising force fields. This observation aligns with the corresponding states methodology proposed by Hoang et al.,38 which takes the shear viscosity into account in the force field parametrisation.
Incidentally, the use of viscosity or solid properties in the overall fit reduce the rather large repulsive exponent (λr > 20) obtained when “force-fitting” large or polar molecules to VLE. The apparent success of UA force fields based on the Mie potential in describing both fluid phase equilibria and transport properties,58,59 despite being fitted only to VLE, seems to be aided by a fortunate choice of a conservative repulsive exponent within the range of 14–16. We conjecture that more robust and transferable parameter sets could be obtained by incorporating information such as viscosity, triple point data, or similar that directly pertains to the repulsive branch of the potential.
In summary, we have found that
• Relying exclusively on VLE as the target property in the objective function carries a risk of “overfitting” the Mie potential. This seemingly “optimal” parameterisation does not guarantee that other thermophysical properties can be accurately modelled or transferred. A clear sign of overfitting, as well as of an inadequate molecular model, is the shift of the triple point, which reduces the effective VLE region predicted by the model compared to experimental data.
• The parametrisation based exclusively on VLE data does not sufficiently quantify system repulsions. We propose a compromise parameterisation that we believe offers greater robustness than solely concentrating on VLE data. We recommend including a property that is significantly influenced by these short-range interactions, such as viscosity or solid-phase data, in the force field parameterisation. The limited effectiveness of VLE modelling in this context highlights the shortcomings of the molecular model or force field employed (which is the reasoning behind the manuscript's running title). Often, the Mie model's parameter set cannot simultaneously account for all properties.
As an end note, we highlight that the parameterisation approaches presented in this manuscript rely on physics-informed artificial neural networks to model thermophysical properties. These thermophysical property models are equations of state derived directly from discrete molecular simulation data and physics-inspired constraints employing machine learning techniques. They serve as reliable surrogates for molecular simulation results and represent promising paths for parameterising force fields using a top-down approach. It should be noted that the observations made in this manuscript are limited to simple spherical geometries and London-like dispersive interactions. Future work will focus on extending the framework to incorporate additional molecular geometries and interactions.
Footnote |
† Electronic supplementary information (ESI) available: Includes details about reduced Mie units and further modelling results of quasi-spherical molecules. See DOI: https://doi.org/10.1039/d5me00048c |
This journal is © The Royal Society of Chemistry 2025 |