Ivan Antolovića,
Simon Stephan
b and
Jadran Vrabec
*a
aThermodynamics, Technische Universität Berlin, Ernst-Reuter-Platz 1, 10587 Berlin, Germany. E-mail: vrabec@tu-berlin.de
bMolecular Thermodynamics Group (MTG), RPTU Kaiserslautern, Erwin-Schrödinger-Straße 44, 67663 Kaiserslautern, Germany
First published on 15th September 2025
The predictive power of the COSMO-SAC activity coefficient model is rigorously put to the test using an extensive dataset of binary liquid–liquid equilibria (LLE). Two model variants, COSMO-SAC-2010 and COSMO-SAC-dsp, are evaluated across 2478 binary systems and nearly 75000 experimental data points. They achieve a success rate exceeding 90% in detecting the occurrence of LLE, demonstrating strong qualitative performance across chemically diverse systems. In benchmark comparisons, COSMO-SAC-2010 sets the standard for nonaqueous systems, while COSMO-RS performs best for aqueous mixtures, placing the two at a broadly comparable overall level with complementary strengths. COSMO-SAC-dsp yields larger deviations but provides broader coverage, particularly for polar and hydrogen-bonding systems. Both reliably capture systematic trends across homologous series, making them effective tools for solvent screening and thermodynamic consistency analysis. A high-throughput and fully automated computational framework—integrated into the open-source package ThermoSAC—enables adaptive Gibbs energy screening, LLE tracing, and anomaly detection. This work establishes COSMO-SAC as a leading framework for predictive thermodynamics and offers reproducible benchmarks and tools for future model development, such as those based on machine learning.
In recent years, LLE modeling has gained significant attention in various fields, including the development of sustainable separation processes and pharmaceutical drug formulation. The advent of green solvents, such as ionic liquids (IL) and deep eutectic solvents (DES), has necessitated predictive models for screening viable solvent systems efficiently.1 Similarly, predictive modeling has been applied to aqueous sugar solutions and fruit juices to enhance separation processes in the food industry.2 Moreover, LLE play a crucial role in biphasic CO2 capture, where phase separation is used to facilitate solvent regeneration and energy-efficient CO2 recovery.3,4 In pharmaceutical applications, LLE principles govern the partitioning of drugs in aqueous two-phase systems (ATPS) and influence the miscibility of amorphous solid dispersions (ASD), impacting drug stability and bioavailability.5,6
A widely used predictive model for fluid-phase equilibria is based on the continuum-solvation formalism COSMO (COnductor-like Screening MOdel),7 which forms the foundation of a family of COSMO-based approaches that avoid component-specific empirical parameters by using molecular surface charge distributions (σ-profiles) from quantum-chemical calculations. These distributions are used to estimate activity coefficients, enabling fully predictive phase behavior modeling. The approach was later extended to COSMO-RS (COSMO for Real Solvents)8–10 and COSMO-SAC (COSMO – Segment Activity Coefficient).11 COSMO-SAC has been applied to vapor–liquid (VLE),12–14 solid–liquid (SLE),15–17 and LLE,18,19 offering a unified thermodynamic framework for diverse systems.
Building upon prior research on the prediction of phase equilibria, this study extends the application of COSMO-SAC to the complex domain of LLE, representing a critical test of its generalizability. Specifically, the focus lies on the ab initio prediction of LLE using two COSMO-SAC variants: COSMO-SAC-2010 (no dispersion term) and COSMO-SAC-dsp (with dispersion). A large-scale evaluation is conducted on the basis of 2478 binary systems, systematically assessing the models' accuracy in predicting experimental phase behavior.
The primary objectives of this work are to (i) benchmark the predictive power of COSMO-SAC against experimental LLE data, (ii) assess whether dispersion corrections included in COSMO-SAC-dsp systematically improve predictions, (iii) develop and validate an automated workflow for high-throughput LLE screening, and (iv) provide an open-source benchmark dataset to facilitate future advancements in LLE modeling. Through large-scale computational predictions combined with rigorous validation, the study aims to refine the applicability of COSMO-SAC across diverse solvent systems and separation processes.
Beyond its methodological contributions, this work supports broader efforts to advance predictive thermodynamics for industrial applications. The large-scale evaluation highlights strengths and limitations of COSMO-SAC-2010 and COSMO-SAC-dsp for LLE modeling and informs future improvements in phase equilibrium predictions. Ultimately, the findings contribute to the development of more efficient, sustainable separation technologies and reinforce the role of computational models in solvent selection and process design.
![]() | (1) |
Systems | Data points | Substances | |
---|---|---|---|
Experimental data | 6153 | 120![]() |
2462 |
COSMO-SAC-2010 | 2478 | 74![]() |
933 |
COSMO-SAC-dsp | 2258 | 71![]() |
870 |
In total, 2262 σ-profiles are present in the UD database. Among the 2462 substances compiled in the experimental dataset, 1076 have a matching σ-profile. However, only 933 can form binary systems where valid σ-profiles are available for both components, resulting in 2478 systems and 74296 data points for evaluation.
For COSMO-SAC-dsp, an additional constraint was imposed, requiring σ-profiles to have a valid dispersion parameter. However, 237 out of the 2262 σ-profiles had a missing dispersion parameter, as the value εmolecule defined in eqn (1) was absent. This led to a reduction by 220 systems. Consequently, the COSMO-SAC-dsp dataset comprises 2258 binary systems with 71280 data points formed by 870 unique substances.
Most experimental LLE data (70.1%) were measured at ambient pressure, either indicating values near 1 bar or cases with absent pressure information, for which ambient pressure was assumed. The remaining 29.9% span elevated pressures from 1.2 bar to 12000 bar, with the majority below 2000 bar. The temperature ranges from 87.6 K to 694.6 K and exhibits a pronounced clustering around standard laboratory conditions, with 51.0% of data concentrated in the interquartile range of 293.1 K to 344.3 K.
The COSMO-SAC activity coefficient model solely depends on temperature and mole fraction, with no explicit pressure dependence. As a result, all model predictions correspond to ambient pressure conditions. Although activity coefficients in the liquid phase are often considered pressure-invariant due to low compressibility, elevated pressures can still affect phase behavior. Therefore, high-pressure experimental data were omitted during quantitative comparisons.
Family | Members | Examples |
---|---|---|
Gases | 4 | Carbon dioxide, hydrogen sulfide, sulfur dioxide |
Multifunctionals | 158 | Tetrachloromethane, furfural, vinyl acetate |
Other nitrogens | 20 | Nitrobenzene, hydrazine, nitric acid |
Alkanes | 129 | Cyclohexane, biphenyl, n-undecane |
Alkenes | 52 | Cyclohexene, ethylene, 1-pentene |
Alkynes | 8 | Phenylacetylene, ethyne, 1-hexyne |
Aromatics | 59 | Benzene, phenol, furan |
Carbonates | 4 | Ethylene carbonate, propylene carbonate |
Epoxies | 2 | 1,2-Propylene oxide, 2,2-dimethyloxirane |
Esters | 86 | Ethyl acetate, dimethyl adipate, amyl formate |
Halogenated hydrocarbons | 77 | Ethyl iodide, chloroform, dichloromethane |
Halogens | 2 | Bromine, iodine |
Ethers | 43 | Diethyl ether, tetrahydrofuran, ethoxybenzene |
Peroxy (no acids) | 2 | 1-Methyl-1-phenylethyl hydroperoxide |
Acids | 55 | Formic acid, hydrogen iodide, lauric acid |
Anhydrides | 2 | Acetic anhydride, maleic anhydride |
Amines | 45 | Aniline, ethylenediamine, piperidine |
Carbonyls | 56 | Acetone, 2-butanone, acrolein |
Thiols | 10 | Ethanethiol, 2-propanethiol, hexylmercaptan |
Thioethers | 3 | Diethyl sulfide, dimethyl sulfide, dibutylsulfide |
Alcohols | 91 | 2-Propanol, cyclopentanol, n-tridecanol |
Amides | 11 | Acetamide, acetanilide, formamide |
(Iso)nitriles | 11 | Acetonitrile, 1,4-dicyanobutane, benzylcyanide |
Sulfoxides and sulfonyls | 2 | Dimethyl sulfoxide, sulfolane |
Water | 1 | Water |
Total | 933 |
The following classification scheme applies specifically to the experimental data, distinguishing LLE systems based on their phase separation behavior:
• UCST: phase separation occurs upon cooling, meaning that the homogeneous phase is stable at higher temperatures, but undergoes demixing as the temperature decreases.
• LCST: phase separation occurs upon heating, where the homogeneous phase is stable at lower temperatures, but demixes as the temperature increases.
• Closed-loop: the system exhibits both UCST and LCST behavior, forming a phase separation loop with a miscibility gap in an intermediate temperature range.
• Other: unusual patterns that do not conform to classical UCST or LCST behavior, including island-type, hourglass-shaped, or diagonally aligned phase separation regions.
• NoType: insufficient data prevents reliable classification.
Most classified experimental systems exhibit UCST behavior (∼92%), while LCST and closed-loop systems form smaller fractions. This predominance aligns with expectations, as many known LLE systems exhibit temperature-dependent phase behavior favoring UCST-type separation. Table 3 summarizes the classification results of the experimental data.
This classification provides a framework for subsequent statistical analyses. Specifically, the interpretation of temperature deviations depends on whether a system follows UCST, LCST, or closed-loop behavior. Systems with a closed-loop phase behavior, for instance, may exhibit non-monotonic temperature deviations that require specialized analytical treatment, as discussed in Section S4 of the SI.
Beyond the lack of anchor points, LLE calculations also present challenges in solving the equilibrium conditions. Unlike other phase equilibrium types, LLE calculations require evaluating activity coefficients for each component in all phases. For a binary system, this results in four separate evaluations—compared to just one for SLE and two for VLE. This means that all four activity coefficients must be considered simultaneously, making LLE calculations more demanding. The fundamental equilibrium conditions for each phase equilibrium type are given by
![]() | (2) |
![]() | (3) |
LLE: xL1iγL1i = xL2iγL2i, | (4) |
As shown in eqn (2)–(4), each phase equilibrium type imposes different modeling requirements when expressed in terms of activity coefficients. The VLE includes fugacity coefficients of the vapor phase, while SLE depend on the Gibbs energy of fusion to describe the equilibrium between the solid and liquid phases. In contrast, LLE rely solely on activity coefficients, without the need for experimental reference data or an equation of state. This makes LLE structurally distinct in modeling and highly sensitive to the choice of the activity coefficient model. At the same time, LLE are an ideal benchmark for evaluating thermodynamic models, as any discrepancies directly reflect their ability to capture liquid-phase interactions.
Due to its sole dependence on the activity coefficient model, LLE are highly sensitive to model configurations. Unlike VLE and SLE, which show minor variations upon parameter changes, LLE predictions can shift drastically. This was evidenced in our preceding work,6 where altering the combinatorial term—switching from Staverman–Guggenheim to free volume or removing it—led to an “entropic catastrophe”. While SLE remained relatively stable upon such changes, LLE showed extreme variations, highlighting strong model sensitivity.
To achieve this, a structured screening approach was developed to identify initial values solely from the thermodynamic model. By systematically evaluating the Gibbs energy of mixing Δgmix across a range of temperatures and a fixed number of mole fractions, a comprehensive search was conducted in the T − x plane, independent of system-specific experimental data, as shown in Fig. 1. The Gibbs energy of mixing was determined by
![]() | (5) |
The temperature range for screening was set between 0 K and 1000 K. To improve computational efficiency, an adaptive temperature step size was implemented. For temperatures above 200 K, a step size of 40 K was applied, while in the intermediate range between 100 K and 200 K, a finer step size of 20 K was used. Below 100 K, where phase separation is expected to be more pronounced, the step size was further refined to 10 K to enhance resolution.
It is important to note that the predicted LLE curves may extend below the melting point of one or both components, indicating metastable states with respect to SLE. For example, pure ethylene glycol solidifies near 260 K (see Fig. 1). While SLE are not modeled in this study, the associated thermodynamic constraint is acknowledged and has been addressed previously in the context of COSMO-SAC-based SLE modeling.6
To improve LLE detection, mole fraction spacing was adapted in addition to temperature. While initial scans used uniform spacing, analysis of 2478 binary systems showed that higher resolution near the infinite dilution limits significantly improved detection and initial value quality. A sigmoidal transformation based on a logistic function was applied to redistribute the originally equidistant grid accordingly. The mapping function is given by
![]() | (6) |
Since most experimental systems exhibit a UCST (91.9%), screening was performed from low to high temperatures, denoted as forward screening. Larger miscibility gaps typically occur at lower temperatures and yield more pronounced Gibbs energy curves, facilitating LLE detection by providing clearer separation signals.
Conversely, screening from high to low temperatures often first encounters the UCST region, which is already challenging to detect, even when its exact location is known. Furthermore, initial values found near the UCST often lead to uncertainties about whether true equilibrium conditions have been identified, as calculations in this region tend to scatter and require refinement (see Section S1 of the SI). To ensure stability and robustness in detecting LLE, it is therefore preferable to begin the search at temperatures far away from critical solution temperatures.
However, reverse screening from high to low temperatures was conducted as well to identify and mitigate “early scan errors”. This step ensures that no relevant phase separation regions are overlooked during forward screening. Once a potential LLE binodal was detected, screening was stopped to avoid unnecessary computational effort, as this initial value served as a starting point for subsequent LLE tracing, which is described in Section 2.4.
From the Gibbs energy of mixing curve, the binodal was determined using the alternating tangents method proposed by von Solms et al.27 First, the spinodal points were identified, and the derivative was approximated using central differences. Starting from one spinodal point, a tangent was constructed toward the opposite spinodal point, as shown in Fig. 2. This approach was found to be both robust and reliable. While the original method defines bounds between the spinodal points and the pure component edges, an improvement was introduced by incorporating derivative information. Since the tangent must have equal slopes at the equilibrium points, the boundaries were further refined to lie between the spinodal and local minima of the Gibbs energy of mixing curve. This refinement is particularly useful when handling multiple LLE regions at a single temperature.
Ultimately, the initial value screening provides two key pieces of information: (1) whether an LLE exists for the given system, and (2) its temperature and mole fraction (T − x) coordinates. These data form the basis for further calculations, ensuring that numerical solvers start from well-defined initial conditions. By combining systematic screening with bidirectional evaluation, the accuracy and robustness of LLE detection are significantly improved.
To address this, a reverse screening procedure was introduced, screening from high to low temperatures. This method served as a safeguard against early scan errors, where the algorithm terminates after identifying the first LLE region, potentially missing additional phase separation regions. Such errors were particularly evident in systems where the detected LLE did not align with experimental data, suggesting the presence of another, previously undetected binodal at higher temperatures. By reversing the screening direction, these additional LLE regions—otherwise overlooked—were successfully identified.
Reverse screening revealed previously undetected phase separation in four water-containing systems. These island-type LLE regions appeared only with COSMO-SAC-dsp, while COSMO-SAC-2010 predicted a single, continuous region. A good example is water + valeronitrile, as shown in Fig. 3, where COSMO-SAC-dsp predicts two distinct LLE regions, while COSMO-SAC-2010 yields just one. Forward screening became trapped in the smaller LLE1, missing the larger LLE2 at higher temperatures. Reverse screening successfully identified LLE2, resolving early scan errors and improving agreement with experimental data.
As the tracing proceeds in temperature direction, LLE curves can terminate in two distinct ways: (a) at a critical point, where the compositions of the two liquid phases become identical, or (b) at a vapor–liquid–liquid equilibrium (VLLE), where the LLE binodal intersects a VLE envelope, forming a three-phase region. While this study does not explicitly resolve VLLE due to the absence of vapor-phase fugacities, binodal termination behavior was still considered during anomaly detection and binodal classification.
However, selecting an appropriate step size ΔT is not trivial. If too large, it can produce rough or inaccurate curves, particularly near the critical solution temperatures, where the slope (∂T/∂xi)p of the binodal approaches zero. In extreme cases, the UCST or LCST may be missed due to overshooting. Conversely, a very small step size ensures high resolution, but significantly increases computational cost due to the large number of required steps.
Manual step size adjustment is feasible for a small number of systems, but impractical for tackling 2478 systems and two model variants, amounting to approximately 5000 different LLE curves. A constant step size may perform well in steep regions but remains inadequate near the UCST. To ensure robustness, reproducibility, and computational efficiency, an adaptive sampling method with a dynamically adjusted ΔT was required.
ΔT ∝ Δxn. | (7) |
The exponent n was introduced to refine resolution near the UCST when necessary (n > 1), as seen in a polymer amorphous–amorphous phase separation study.6 However, in this work, the exponent was kept at unity, n = 1. This simple adjustment significantly improved results over the constant step-size method, as illustrated in Fig. 4a and c.
Several adaptive methods were explored, including those inspired by mechanical motion (velocity, acceleration), geometric concepts (arc length), and Taylor approximations. While some offered theoretical advantages, they often suffered from numerical instability or lacked robustness across all systems. Further details, including equations and performance insights, are given in Table S1 of the SI. Ultimately, the simple miscibility-gap scaling according to eqn (7) proved to be most reliable and broadly effective.
To improve efficiency, a dynamically adjusted initial guess was implemented using a Taylor expansion. Although the step-size methods discussed earlier are formulated in terms of the miscibility gap width Δx, the actual predictive update applies to the vector of mole fractions, xLLE1 = (xL11, xL21). Nevertheless, for consistency with the previous equations, we retain the notation Δx in the predictive expression
![]() | (8) |
The most frequent anomaly observed was a sudden bend of the LLE curve with rising temperature, inconsistent with meaningful phase separation. This irregularity, termed here as a bifurcation anomaly, involves an abrupt slope change in one curve branch—typically the one farther from the pure component boundary. While the widening miscibility gap initially suggests a UCST, the distorted curvature signals a breakdown of equilibrium. An analysis of the Gibbs energy of mixing reveals the root cause: instead of two, these systems exhibit four spinodal points, allowing for two distinct LLE regions, each with its own LCST. As temperature rises, the curves intersect at a point where the inner binodal points coincide, marking the onset of the anomaly. Beyond this point, the tangents connecting inner and outer binodal points intersect, and the system transitions into a single LLE.
This transition is illustrated in Fig. 5, where inset (a) shows the Gibbs energy of mixing below the intersection point, featuring two well-separated miscibility gaps. These represent two independent LLE regions governed by their respective LCST. As temperature rises, the inner binodal points converge, merging the two curves. Inset (b) captures the behavior above the intersection, where only a single LLE remains and the inner binodal tangents become unstable, as described by Novák et al.33 Notably, four spinodal points do not always yield dual LLE curves unless the tangents connecting binodal pairs are true tangents, not secants—a condition requiring the inner and outer points to align consistently without producing physically unrealistic crossings.
Such complex phase behavior is not merely theoretical. The binary system methylisopropyl ketone + water exhibits two biphasic regions at different pressures and even three coexisting liquid phases (LLLE) at intermediate pressure, as shown experimentally by Steiner and Schadow.34 COSMO-SAC predicts this behavior from first principles, supporting the plausibility of bifurcation anomalies and demonstrating its capability to capture complex multiphase equilibria. The alternating tangents method27 was extended to handle multiple spinodal regions, enabling robust treatment and preventing misinterpretation.
![]() | (9) |
![]() | (10) |
![]() | (11) |
Fig. 6 summarizes the success rates of both model variants under two conditions: the full dataset comprising 2478 binary systems with available experimental data and σ-profiles, and a reduced dataset limited to 2258 systems with assigned dispersion parameters εmolecule. In the full dataset, COSMO-SAC-2010 identified LLE in 2155 systems and COSMO-SAC-dsp in 2075, with combined predictions capturing 2240 systems (90.4%). This number is likely to be higher if dispersion parameter data were complete.
Because COSMO-SAC-dsp applies only to systems with valid εmolecule values, its predictions are restricted to a subset of 2258 binary systems. Within this set, COSMO-SAC-2010 and COSMO-SAC-dsp predict LLE in 1994 and 2075 cases, respectively, with a combined success rate of 92.1%. Importantly, COSMO-SAC does not predict LLE by default; each result emerges from the model's underlying thermodynamics. The high success rates thus reflect genuine predictive performance, not an artifact of the method.
The results in Fig. 6 reveal several key trends: COSMO-SAC-2010 achieves success rates of 87.0% and 88.3% in the full and constrained datasets, respectively. COSMO-SAC-dsp reaches 83.7% in the full dataset, with an estimated increase to 90.5% if dispersion parameters were available for all σ-profiles. Despite its broader coverage, COSMO-SAC-dsp does not supersede the 2010 variant: it misses LLE in four systems captured by COSMO-SAC-2010, while uniquely identifying 85 systems not detected by the latter. These differences underscore the complementary nature of the two model variants.
Fig. 7 shows typical phase behaviors predicted by COSMO-SAC. Most common are (a) UCST and (b) closed loop LLE with both UCST and LCST. Other topologies include (c) hourglass shapes, (d) droplet-like boundaries, and (e) valley-shaped gaps. Some systems exhibit (f) emerging or (g) fully detached immiscibility regions. More complex cases include (h) double LCST, (i) LCST with an island, and (j) two UCST and two LCST forming distinct, closed loops. The most intricate case is (j), with two UCST and two LCST. Unlike bifurcation anomalies, both form distinct closed loops, resembling a fused dual LLE system.
Further deviations from classical behavior appear in panels (k) and (l). One system exhibits a diverging LLE, with boundaries extending beyond the screening range. Another reveals a chromosome-like topology where the solver passed a pinch point and uncovered a second stable solution. The lower branch, indicated by a dashed line, is interpreted as a metastable artifact, reflecting the occasional stabilization of spurious LLE solutions in systems with island-type complexity. Interestingly, none of the COSMO-SAC predictions display a purely LCST-driven demixing—a significant departure from experimental observations. Since LCST behavior is generally associated with entropy-driven phase separation,35,36 this discrepancy suggests that the COSMO-SAC models may overemphasize enthalpic interactions, potentially due to inherent modeling assumptions.
This trend is visualized in Fig. 8, which shows the UCST distribution and associated critical mole fractions. For clarity, mole fractions were mirrored such that COSMO-SAC-2010 data appear in the left half (xi ∈ [0, 0.5]) and COSMO-SAC-dsp in the right half (xi ∈ [0.5, 1]). This transformation—xi → min(xi, 1 − xi) or xi → max(xi, 1 − xi), depending on the variant—preserves physical validity while enhancing interpretability and symmetry. It enables a clearer analysis of clustering and systematic behavior across both models.
Most predicted UCST lie between 200 K to 500 K, with critical mole fractions clustering around xi = 0.25 to 0.75. A noticeable high-temperature cluster above 700 K, composed almost entirely of aqueous systems, appears in both model variants (see Fig. S5 of the SI). At lower temperatures, critical points tend to shift toward composition edges, suggesting broader asymmetry. Despite the lack of thermodynamic anchor points and the model's sole reliance on γi, nearly all predicted UCST lie within the experimentally sampled range of 87.6 K to 694.65 K, underscoring the predictive power of COSMO-SAC.
Statistically, COSMO-SAC-dsp predicts a higher UCST than COSMO-SAC-2010 for 79.6% of all systems, indicating a systematic overestimation of the miscibility gap width. In the subset of cases where COSMO-SAC-dsp yields a lower UCST, 87.7% involve aqueous mixtures, suggesting that dispersion corrections disproportionately affect strongly polar or hydrogen-bonded systems.
To ensure a robust quantitative comparison, additional data exclusion steps were applied beyond that for the qualitative assessment. Specifically, all experimental data points measured at pressures above 10 bar were excluded, following the procedure of Fingerhut et al.14 In their VLE study, this threshold was chosen to approximate ideal gas behavior. In the context of LLE, although the liquid phase is often treated as incompressible, pressure effects cannot be fully neglected at elevated pressures. Experimental LLE data frequently showed steep or discontinuous curve segments, which likely belong to remote isobars and are not directly comparable to ambient-pressure predictions.
Among the 87 directly comparable chemical family pairs, COSMO-SAC-2010 performs better in 44 cases (i.e. 50.6% of all pairs), COSMO-SAC-dsp in 20 cases (23.0%), with near-identical accuracy (<1% difference) for the remaining 23 cases (26.4%). The most notable improvements from dispersion were found for amines + aromatics (AADx reduced from 50.5% to 21.9%), alkanes + halogenated hydrocarbons (ΔAADx: −12.7%), and acids + aromatics (ΔAADx: −11.6%). Aqueous systems also benefit notably, e.g., water + epoxies (ΔAADx: −11.0%) and water + (iso)nitriles (ΔAADx: −8.6%).
Conversely, the highest AADx for COSMO-SAC-dsp occurs for esters + multifunctionals (48.2%) and gas-containing pairs, due to carbon dioxide. The largest deteriorations appear for other nitrogens with ethers, alkenes, alcohols, or alkanes (up to ΔAADx: +21.6%), and for alkanes + anhydrides. Still, over a quarter of family pairs yield comparable AADx for both models, especially in weakly interacting systems, such as alcohols + gases or alkanes + ethers. Best performance is seen for water with alkynes, alkanes, alkenes, or halogenated hydrocarbons, and for (iso)nitriles or amides with aromatics.
The largest individual AADx values occur for the “Gases” and “OtherNitrogens” families, though closer inspection reveals that these families are represented by only a few components after exclusion based on pressure, dispersion parameters, and data availability. In the case of “Gases”, only carbon dioxide remains, while the “OtherNitrogens” family is dominated by nitromethane and nitroethane—precisely the systems where COSMO-SAC-dsp predicts diverging LLE behavior, leading to disproportionately large deviations. In contrast, chemically more balanced families, such as acids and esters, exhibit similar AADx values across both model variants. Overall, the AADx amounts to 9.9% for COSMO-SAC-2010 and 13.0% for COSMO-SAC-dsp, with the difference largely driven by these few problematic systems.
This result is remarkable when viewed in the broader context of previous studies. The total AADx for LLE predictions aligns closely with previous solubility studies applying COSMO-SAC to pharmaceutical systems (12.6%),6 despite the absence of external anchor points such as the melting temperature used in SLE. Furthermore, activity coefficient deviations of COSMO-SAC can span a wide range—from 0% near the pure component limit to as much as 300% at infinite dilution.14 Given that LLE is determined solely by γi across the entire composition space, this level of accuracy is notable. By contrast, the lower AADx typically observed in VLE can be partially attributed to the presence of a reference point, such as a boiling point, which constrains both temperature and composition predictions. Furthermore, the evaluation by Fingerhut et al.14 was limited to vapor-phase mole fractions, whereas LLE requires accurate predictions for both coexisting liquid phases.
COSMO-SAC-2010 | COSMO-SAC-dsp | |||||
---|---|---|---|---|---|---|
AD | AAD | Neff | AD | AAD | Neff | |
x1 | 0.0021 | 0.0988 | 40![]() |
0.0017 | 0.1298 | 43![]() |
T | 35.85 K | 81.83 K | 40![]() |
62.56 K | 99.85 K | 39![]() |
K1 | −0.0717 | 0.1344 | 9803 | −0.1031 | 0.1707 | 10![]() |
xL11 | −0.0587 | 0.1052 | 20![]() |
−0.0959 | 0.1369 | 21![]() |
xL21 | 0.0627 | 0.0923 | 20![]() |
0.0995 | 0.1227 | 21![]() |
The contributing data points are well balanced not only between phases, but also between temperature and mole fraction for COSMO-SAC-2010, ensuring an overall unbiased evaluation. COSMO-SAC-dsp generally covers more data by predicting LLE for a larger number of systems, but also exhibits more temperature mismatches. This is because diverging LLE—predicted exclusively by COSMO-SAC-dsp—lack a defined UCST, preventing meaningful comparison and thus reducing the effective sample size for AADT.
The AADT reaches approximately 82 K for COSMO-SAC-2010 and 100 K for COSMO-SAC-dsp, which may appear large. However, due to the geometry of typical LLE envelopes—flat near the UCST and steep at the binodal edges—small composition errors can produce large temperature deviations, and vice versa. This sensitivity is examined in Section S7 of the SI using synthetic LLE curves. The results show that minor shifts in mole fraction can cause significant temperature errors when projected onto steep binodal regions, and small temperature offsets can distort composition near the critical points. These effects highlight the need for caution when interpreting AAD in LLE contexts and suggest that additional metrics may help to provide a fuller picture of model performance.
Table 5 compares the performance of various LLE prediction methods using the reference dataset. Unlike the AADx metric used throughout this work, the textbook employs the percentage average absolute logarithmic deviation (AALDS), defined as
![]() | (12) |
Method | Nonaqueous | Aqueous | Overall | |||||
---|---|---|---|---|---|---|---|---|
Systems | Points | AALDS | Systems | Points | AALDS | Systems | AALDS | |
UNIFAC-VLE | 47 | 4334 | 86.3 | 42 | 7576 | 62.1 | 89 | 70.9 |
UNIFAC-NIST/KT | 42 | 3982 | 79.8 | 42 | 7576 | 124.6 | 84 | 109.2 |
UNIFAC-NIST/Mod | 47 | 4334 | 49.5 | 41 | 7537 | 66.9 | 82 | 60.5 |
UNIFAC-Dortmund | 43 | 5063 | 65.1 | 36 | 5962 | 68.2 | 79 | 66.8 |
COSMO-RS/FSAC2 | 25 | 3419 | 124.4 | 27 | 4376 | 61.0 | 52 | 88.8 |
COSMO-RS/GAMESS | 40 | 4204 | 139.9 | 36 | 5905 | 55.3 | 76 | 90.5 |
COSMO-SAC-2010 | 34 | 3121 | 37.9 | 47 | 7772 | 112.6 | 81 | 91.2 |
COSMO-SAC-dsp | 39 | 3805 | 116.7 | 45 | 7178 | 130.9 | 84 | 126.0 |
Among all models considered in ref. 37, COSMO-SAC outperforms COSMO-RS for nonaqueous systems, with COSMO-SAC-2010 setting the benchmark by achieving the lowest AALDS of all models. For aqueous mixtures, the picture is reversed: COSMO-RS performs best, while COSMO-SAC-2010 is moderate and COSMO-SAC-dsp weakest. Overall, COSMO-SAC-2010 reaches a level comparable to COSMO-RS, with each showing complementary strengths. While the UNIFAC-NIST/Mod variant attains the lowest overall AALDS, this reflects extensive parameterization to experimental data, whereas COSMO-SAC and COSMO-RS are fully predictive. From this perspective, COSMO-SAC emerges as a solid, open-source alternative to COSMO-RS.
![]() | ||
Fig. 11 Predicted LLE envelopes for mixtures of acetonitrile with n-alkanes (n-butane to n-hexadecane) using COSMO-SAC-2010. |
Despite deviations in the exact location of the binodal curves, COSMO-SAC-2010 successfully reproduces key qualitative trends observed experimentally. In particular, it captures the systematic shift of the critical point toward higher temperature and higher acetonitrile mole fraction as the chain length increases. This behavior reflects the rising hydrophobicity of longer alkanes and their reduced affinity for polar solvents such as acetonitrile.
Moreover, the model accurately predicts the change of the LLE envelope shape across the series: while the n-butane mixture exhibits an almost symmetric phase diagram, the systems with longer alkanes—such as hexadecane—show pronounced asymmetry, with the binodal curve being increasingly skewed toward pure acetonitrile. The fact that such shifts in topology are qualitatively captured without any parameter optimization, highlights the power of COSMO-SAC in predicting relative tendencies within chemical families.
From a practical perspective, this makes COSMO-SAC particularly useful for solvent screening tasks, where relative positioning of miscibility gaps may be more informative than exact critical temperatures. Applications include the selection of extraction solvents, where homologous series like alkanes, alcohols, or esters are frequently evaluated, as well as the design of phase-change solvent systems for separations or pharmaceutical formulations. In such contexts, the ability to reliably capture directional trends within chemical families—even without perfect numerical agreement—can significantly accelerate and de-risk early-stage decision making. This underscores the potential of COSMO-based models not just as quantitative predictors, but as powerful qualitative guides in chemical process design.
While only COSMO-SAC-2010 results are shown in Fig. 11 for clarity, it is worth noting that COSMO-SAC-dsp exhibits comparable predictive behavior.
The LLE.miscibility() method encapsulates the full adaptive tracing algorithm developed in this work, including forward and reverse screening, automatic step-size control, and optional spinodal detection. A dedicated GMixScanner class further supports robust and parallelizable detection of initial LLE regions through Gibbs energy scans, with methods such as find_first_binodal() and find_all_binodal() tailored for early screening and anomaly detection. The package, including documentation and examples, is available at https://github.com/ivanantolo/thermosac and is intended as a reusable tool for model development, reproducibility, and educational applications.
A central contribution of this work lies in the development of an automated, robust workflow for high-throughput LLE tracing, featuring forward and reverse screening, adaptive sampling, and anomaly detection. The framework allows for precise localization of phase boundaries and critical points, and facilitates comprehensive statistical evaluation across systems with diverse phase behaviors—including UCST, LCST, and complex bifurcating or closed-loop topologies.
The COSMO-SAC model achieves a qualitative success rate exceeding 90% in identifying LLE occurrence, demonstrating robustness across chemically diverse systems without reliance on system-specific tuning. Benchmarking against other predictive models shows that COSMO-SAC outperforms COSMO-RS for nonaqueous systems, with COSMO-SAC-2010 setting the benchmark. For aqueous mixtures the trend is reversed, with COSMO-RS performing best, while COSMO-SAC-2010 is moderate and COSMO-SAC-dsp weakest. Overall, COSMO-SAC-2010 is comparable to COSMO-RS, with complementary strengths, whereas the UNIFAC-NIST/Mod variant attains the lowest total AALDS through parameterization to experimental data.
Both COSMO-SAC variants effectively capture systematic trends within homologous series, correctly predicting relative shifts in critical temperature and asymmetry of the miscibility gap. This capability is valuable for solvent screening and process design, where relative performance within chemical families often outweighs the need for exact numeric agreement. Quantitatively, COSMO-SAC-2010 achieves a total AADx of 9.9% and AADT of 81.8 K, confirming it as the more precise variant overall. COSMO-SAC-dsp provides broader coverage, though with slightly larger deviations (13.0% AADx, 99.9 K AADT).
In summary, this work delineates the performance landscape of COSMO-SAC for LLE prediction and provides an open-source infrastructure with benchmark data to support future model development. The present insights advance the use of COSMO-based methods in solvent screening and process design, with implications for separation technologies, green chemistry, and pharmaceutical applications. Ongoing research into refined interaction terms, extended segment descriptors, and hybrid modeling strategies—as exemplified by developments in openCOSMO-RS, which incorporates London-type dispersion potentials39–42—promises to further enhance the reliability and reach of first-principles phase equilibrium predictions.
Supplementary information: methodological details on LLE tracing refinements, adaptive sampling strategies, and anomaly corrections, together with additional figures and statistical analyses. It further includes critical point distributions and special system classes (aqueous and diverging systems) as well as complete system lists for figures in the main text. See DOI: https://doi.org/10.1039/d5dd00259a.
This journal is © The Royal Society of Chemistry 2025 |