Lander
Bogers
,
Faezeh
Khodabandehlou
and
Christian
Maes
Department of Physics and Astronomy, KU Leuven, Belgium. E-mail: christian.maes@kuleuven.be
First published on 23rd June 2025
Steady nonequilibria dissipate energy and, when changing external parameters, an extra or excess heat accompanies the relaxation to the new nonequilibrium condition. For nonequilibrium systems in contact with a thermal bath, the heat capacity is defined as that excess heat per degree temperature for a quasistatic change of the bath temperature. It is fairly common to find negative heat capacities for steady nonequilibrium systems, in contrast with the situation for systems in thermal equilibrium. We discuss and illustrate the origin of that negative thermal response using Markov models. We find that the negativity results from an anticorrelation between quasipotential and (a change in) pseudopotential, the first measuring (excess) heat (and Clausius entropy), and the latter being related to the Boltzmann entropy. It can be quantified via an appropriate choice of effective temperatures.
The thermal response refers to the behavior of a physical system under changes in temperature or when subjected to heat pulses. For a macroscopic system in equilibrium, one can measure the change in its temperature by heating it. That can be done under various constraints Z such as, e.g. for a gas keeping its volume or pressure constant, and gives rise to the standard definitions of (equilibrium) heat capacities CZ(T). In short, δQrevZ = CZ(T)dT where QrevZ is the reversible heat given to the system to increase its temperature by T → T + dT while keeping the constraint Z. Interestingly, heat capacity may on the one hand be used to measure energy, entropy, or enthalpy, and on the other hand inform about the variance (fluctuations) of those potentials in the corresponding equilibrium ensemble. However, those relations are not given when the interaction becomes long-ranged such as for Newtonian gravity or when the system is nonthermodynamic such as in finite-size clusters,6–8 where the equivalence of ensembles is violated; negative heats become possible there.
The situation for nonequilibrium heat capacities is conceptually similar but results in other expressions. We refer to9–16 for introductions and examples.
The interest in the notion of nonequilibrium heat capacity derives from the wish to quantify thermal response, and to understand what information is encoded in it concerning the nonequilibrium condition and its dynamics. A variety of toy-examples have been studied so far,9,10,14,16–18 and sufficient conditions have been formulated for a nonequilibrium extension of the third law of thermodynamics.11,13 It has also been observed that this nonequilibrium heat capacity can become negative, which makes the question of the present paper: to understand the meaning and the origin of that negativity. We focus on models with a discrete state space, that show a typical Schottky anomaly already for zero driving.19,20 Some driven examples of Section VI show an inverted Schottky anomaly, visible already in the right plot of Fig. 3 for the next example. For examples of Markov diffusions with negative specific heats, we refer to;14,17,18,21 the understanding of negativity there is the same as for the jump processes that we consider below.
After the Glossary (which is next), Section III reminds the reader of the relevant setup. For nonequilibrium steady conditions, we need an open system, and we identify a thermal bath in its environment where the system dissipates heat. Upon small and slow changes dT in the bath, an excess of heat δQexc = C(T)dT is absorbed by the system that defines the heat capacity C(T) at temperature T of the thermal bath. We ignore from now on in the notation the possibility of different constraints Z, which can act on the system itself and on the bath. It is no longer true that C(T) needs to be positive and indeed we know plenty of examples where C(T) < 0. It is an interesting feature which goes hand in hand with the typical extra we get from nonequilibrium heat capacities: C(T) is able to pick up dynamical or kinetic information about the system which is not available when scanning it in equilibrium.
Section IV gives the theoretical framework for understanding the occurrence and the implication of negative specific heats for nonequilibrium systems. The heat capacity gets written as a covariance in the stationary distribution. The two involved random variables are anticorrelated when the heat capacity is negative. They are related on the one hand to the quasipotential, which gives the expected excess heat,11 and to the pseudopotential, which governs the population statistics in the form −1/βlog
ρ(x).22 It is there that we see how Clausius and Boltzmann entropies separate.
Section V defines the relevant effective temperatures and we use it to quantify the negativity of the heat capacity. In particular, a sufficient relation follows for that negativity.
Finally, in Section VI we present a number of discrete models with agitated and double-channel transitions to illustrate the theory and the specific origin of negativity at very low and at intermediate temperatures. Each time, we see that the probability of the quasi-ground state (minimizing the quasipotential or expected excess heat) is increasing with temperature (quite unlike the situation for systems in thermal equilibrium) and the effective temperatures (defined with respect to that quasi-ground state) are decreasing with temperature. It indicates a population anomaly (even leading to an inversion): as the temperature of the bath increases, low-lying quasi-energies get more populated. It can also happen without population anomaly when the effective temperatures become almost constant as function of the bath temperature (again, unlike in equilibrium).
(1) Steady nonequilibrium condition: if we consider an open system, it matters a great deal what boundary conditions and what external forces are applied. A standard setup is that of the so-called canonical ensemble where the system is in weak contact with a thermal bath, and thermal equilibrium is obtained for the system. However, if additionally, rotational forces act on the system, or time-dependent fields, or gradients in chemical potential or pressure are maintained etc. the system may be kept in a steady nonequilibrium condition. An example would be a chemical reactor with sources and sinks, in which the concentrations of the various species are kept stationary while chemical currents are maintained.
(2) Stationary density at temperature T: the open system (as above) is subject to a dynamics, here modeled as a Markov process. It makes sense to define a Master equation giving the updating of probabilities ρ(x,t) of a state x at time t. We can think of ρ(x,t) as the occupation fraction of the state x for many independent identical copies of the system. The Master equation depends on transition rates k(x,y) which indicate the probability per unit time for having a transition x → y when the system is in state x. We assume that k(x,y) is nonzero if and only if k(y,x) is nonzero. The (time-dependent) Master equation for the (time-dependent) probabilities is
![]() | (II.1) |
The eqn (II.1) gives the time-dependent probability, analogous to the time-dependent solution of Fokker–Planck or Smoluchowski equations. The stationary Master equation puts the left-hand in (II.1) equal to zero. For a finite number of states, all of which can be reached starting from an arbitrary state, it can be shown (Perron–Frobenius theorem) that there is a unique stationary (time-independent) probability ρ:
![]() | (II.2) |
We call the stationary process an equilibrium process when there is detailed balance: there exists an energy function E(x) so that for all pairs of states x, y,
k(x,y)e−βE(x) = k(y,x)e−βE(y) | (II.3) |
We are mostly interested in nonequilibrium processes where (II.3) is violated for some pair x, y of system states. On the other hand, we do attach a physical meaning to the ratio
![]() | (II.4) |
(3) Generators: the backward generator L for a Markov process with transition rate k(x,y) is defined as
(4) Excess heat: the notion of excess heat is explained in Section III and in Fig. 1 in particular. One basically considers the Joule heat flux which is constantly dissipated in a thermal bath for a steady nonequilibrium systems. That heat flux (dissipated power) depends on the temperature of the thermal bath. By changing that temperature, an extra heat flux appears, called excess heat.
(5) Thermal response: for a system that can exchange energy with a (thermal) heat bath at temperature T, we speak about the thermal response when probing system properties as T is varied.
Quasistatic response corresponds to very slow changes in that temperature, with respect to system relaxation times.
Heat capacity measures a thermal response in terms of the excess heat (see above) absorbed by the system.
(6) Quasipotential: the quasipotential V is a certain function of the system state x, taking real values V(x). Sometimes we add a subscript of temperature T, making VT(x), to emphasize its temperature-dependence. In the present paper, the quasipotential is a measure of expected excess heat, when the system is in state x. It is mathematically defined in (III.3).
We take the quasipotential always to be centered, meaning that the expected average 〈V〉 = 0 vanishes for the corresponding stationary probability.
For an equilibrium process, with (II.3), the quasipotential V(x) = E(x) − 〈E〉, reduces to the (centred) energy function.
(7) Pseudopotential: in the case of nonequilibrium processes, we most of the time have no idea or explicit construction of the stationary probability ρ. We call pseudopotential Φ its logarithm
(8) Expected power versus stationary power: given the heat q(x,y) in the transition x → y we define the expected power (heat flux)
(9) Covariance: given a probability ρ on the system states, we define the covariance 〈f;g〉 = 〈fg〉 − 〈f〉 〈g〉 (connected two-point function) between any two functions f and g. It measures the correlation, positive or negative, as is important for the understanding of negative specific heat in (IV.3).
(10) Kinetic aspects: to study nonequilibrium regimes, either transient (toward relaxation to equilibrium) or steady (as in the present paper), we need more than the usual thermodynamic functions and principles. Kinetics matters, which is not so surprising from the dynamical perspective. One of the interesting aspects of nonequilibrium calorimetry, is that it can detect some of those kinetic aspects. More specifically, dynamical activities (like in switches and their switching rate) and energy barriers (in transition rates) will be detected and characterized via heat capacities for the nonequilibrium system.
(11) Effective temperature there have appeared various effective temperatures in transient and steady nonequilibrium processes, for example, to restore the (equilibrium) fluctuation–dissipation relation. In the present paper, in (V.2), we use a state-dependent effective temperature T(x). It depends on the state x and also on some reference state x* (not indicated). We will take x* to be a state with minimal quasipotential, and define T(x) starting from the pseudopotential as defined above (and in (V.2)). It gives a measure of population-change or population inversion. Under detailed balance (in equilibrium), T(x) = T is the temperature of the heat bath.
For more precision and mathematical details, we refer to the original papers14,17 and to more recent publications.10–12,15,16 Experiments are in progress, with27,28 as early measurements.
The theoretical modeling so far is restricted to Markov processes. We have here time-homogeneous processes Xt that converge exponentially fast to a unique stationary distribution ρ(x), x ∈ K over state space K, satisfying the stationary Master equation of the form L†ρ = 0, where L† is the forward generator. More specifically, let k(x,y) be the transition rate to jump from state x to state y, then the stationary Master equation is
![]() | (III.1) |
The solution ρ is the stationary distribution. To emphasize its temperature dependence, we sometimes denote it by ρT. The subscript T is also used for other quantities.
The backward generator L generates the time evolution in the sense that
etLg(x) = 〈g(Xt)|X0 = x〉 | (III.2) |
After identifying the heat flux , we define the quasipotential VT = V as the centered function, 〈VT〉T = 0, that solves the Poisson equation,30
![]() | (III.3) |
![]() | (III.4) |
Example III.1 (agitated three-level system) consider an agitated molecular system, where the hierarchy of energy levels randomly switches. Such a molecular switch can be modelled as a Markov jump process on a ladder with (to be specific) three levels; see Fig. 2. Each leg σ = ± has three states η = 1, 2, 3, so that the states are of the form x = (η,σ). Each state located in σ = − has an energy E(η, −) = (η − 1)ε and the states on σ = + has an energy E(η, +) = (3 − η)ε, and the process is switching legs at rate α. The transition rates are chosen as
![]() | (III.5) |
The heat flowing to the bath at inverse temperature β equals ε at each transition η → η′ where the level is changing. The changing of legs is work done by external sources.
For completeness, the stationary distribution, the quasipotential and the heat capacity of this three-ladder are calculated in Appendix A. Stationary distributions are obtained by solving the steady master equations. By solving the Poisson equation in (III.3), the quasipotentials of each state are determined, where and
flowing local detailed balanced condition.23
The plots of the heat capacity for varying α, Δ and temperature T are shown in the Fig. 3. Taking α = 0 corresponds to the equilibrium case. For α > 0, the nonequilibrium heat capacity depends kinetically on the barrier Δ and may become negative (here, at low temperatures for large enough Δ).
![]() | ||
Fig. 3 Heat capacity of a 3-level ladder as a function of temperature T for different values of α, Δ and ε as defined in the transition rates (III.5). We observe an inverted Schottky anomaly for large enough Δ when ε = 1. |
For some regimes of the values, the heat capacity becomes negative and the inverted Schottky anomaly is observed. The dependence of heat capacity on α and Δ is summarized in Fig. 4, where the colors shows different values of the heat capacity.
![]() | ||
Fig. 4 The heat capacity of a 3-level ladder as a function of Δ, α and T for fixed ε = 1, the transition rates are given in (III.5). The color bar represents the values of the heat capacity. |
This example is continued in Section VI.1.
Example III.2 (heat conducting system, ref. 33). A pleasant and interesting discussion of negative response is contained in the paper by Zia et al.33 We disagree with their identification of the specific heat with the temperature-derivative of the expected energy – that is a procedure that works in equilibrium at constant volume, but is not correct for steady nonequilibria. Yet, their example is very instructive. We repeat it here in the corrected version, using the quasipotential (III.3).
Consider a three-level system (states are a, b, c) in contact with two baths. See Fig. 5, where the two thermal baths are suggested at inverse temperatures β1, β2 and the two energy gaps ε, δ > 0 are shown.
![]() | ||
Fig. 5 Heat conducting three-level system of Example III.2, after.33 |
The stationary distribution is given by and
, where
is the normalization. In equilibrium, where β1 = β2 = β, the heat capacity is
We look at the nonequilibrium situation where β2 ≠ β1. The system plays the role of a (molecular) conductor between two thermal baths. The heat capacity is now a matrix Cij, i, j = 1, 2 for quantifying the excess heat absorbed by the system from the i-th bath when changing the temperature in bath j. The sum C2,2 + C2,1 + C1,1 + C1,2 = Ceq when β1 = β2.
The expected heat fluxes to bath 2 are: and
and to bath 1 are:
and
. Therefore, solving the Poisson eqn (III.3) gives the quasipotential for bath 1,
![]() | ||
Fig. 6 Heat capacities for Example III.2 quantifying the excess heat absorbed by the system. Parameter values are δ = 3, ε = 1, β2 = 1 in the left plot and β1 = 1 in the right plot. |
More relevant is the sum C1,1 + C2,1, which is the total extra heat absorbed by the system when slightly increasing the temperature of the first thermal bath. We have
![]() | (IV.1) |
Another way to express heat capacity as a correlation function is less explicit but conceptually more attractive. In fact, the heat capacity (III.4) or (IV.1) can be written as
![]() | (IV.2) |
![]() | (IV.3) |
![]() | (IV.4) |
Inspecting (IV.4), the negativity of the nonequilibrium heat capacity signifies at least an anticorrelation between the heat absorbed and the change in occupation with increasing bath temperature. One possibility is that increasing the temperature may, at least in certain regimes, lead to higher occupation of low-energy states. That is not unrelated to the idea of negative temperatures for gravitating or for very inhomogeneous and smaller systems for which the potential energy is anticorrelated with the kinetic energy and/or that fail to satisfy equivalence of ensembles.6–8 What happens is that increasing the temperature may, at least in certain regimes, lead to higher occupation of low-energy states.
ρeq(x) ∼ e−βE(x) | (V.1) |
Fixing the bath temperature T, we denote by x* any specific state that has the smallest quasipotential: for all x,
ΔV(x) = V(x) − V(x*) ≥ 0 |
![]() | (V.2) |
Writing in terms of the inverse bath temperature β, we have
![]() | (V.3) |
![]() | (V.4) |
![]() | (V.5) |
![]() | (V.6) |
The situation is then as follows:
When there exists a temperature range where is positive (unlike equilibrium), then a negative heat capacity will arise (and can only arise) from a sufficiently decreasing effective temperature Teff(x) in temperature T (for some state x). We illustrate that in the next section. Indeed, in nonequilibrium, due to the driving, population inversion can occur, and
is easily made positive for some temperature range.
We can also get a negative heat capacity when is negative (like in equilibrium), as long as the slope of Teff against T is sufficiently small: we need that
(for some state x). We will encounter that for instance in Fig. 7 and 8 around T = 0.5. It is the situation when the effective temperature (almost) stops to depend on the environment temperature.
![]() | ||
Fig. 8 Left: Stationary occupation of (1, −) in Example III.1, for varying temperatures. The top curve shows the equilibrium case (α = 0), and the lower curve is for α = 0.1. In nonequilibrium there exists a range of temperatures where for Δ = 1, α = 0.1 and ε = 1 in Example III.1, ![]() |
As a sufficient condition for negative heat capacity, we then have
![]() | (V.7) |
We can rewrite (V.7) using the heat bath temperature directly, for easier intuition,
![]() | (V.8) |
We repeat that the effective temperature (V.2) is defined with respect to some minimizer x* of the quasipotential, and we note that can also change with temperature T. In many cases at low temperatures x* = g where g denotes the state that maximizes the pseudopotential Φ of (IV.4) and hence the stationary probability, ρT(g) ≥ ρT(x): the dominant state g can then be identified with the state x*, that minimizes the quasipotential. That is certainly the case close-to-equilibrium,12,35 but is not always true in nonequilibrium.
Example VI.1 (Example III.1 continued: negative heat capacity around zero temperature). The low-temperature behavior of the heat capacity is
Observe that the quasipotential VT((1, −) = x*) is minimal at all temperatures T; see Fig. 7. That minimizer for the quasipotential maximizes the pseudopotential, in the sense that ρ(1, −) = ρ(3, +) ≥ ρ(2, −) = ρ(2, +) ≥ ρ(3, −) = ρ(1, +) for all temperatures as well.
Next, we see from Fig. 8, that the stationary occupation of x* = (1, −) is increasing with temperature over T < 0.5, , which is different from the equilibrium situation where the ‘ground state’ loses occupation at higher temperature. At the same time, as shown in the right figure, for the same temperature range,
and
. That produces the negative heat capacity according to (V.8).
Example VI.2 (uneven ladder: negative heat capacity at intermediate temperatures). Consider the uneven 3-level ladder, with an unbalanced energy gap, in Fig. 9. The energies E(η,σ) are
E(1, −) = 0, E(2, −) = 2ε, E(3, −) = 3ε. | (VI.1) |
k−(1,2) = k+(3,2) = e−β(Δ+ε), k−(2,3) = k+(2,1) = e−β(Δ+ε/2) |
k+(1,2) = k−(3,2) = e−β(Δ+ε), k+(2,3) = k−(2,1) = e−β(Δ+ε/2) |
kη(−, +) = kη(+, −) = α > 0, ∀η = 1, 2, 3. |
![]() | ||
Fig. 9 Uneven 3-level ladder of Example VI.2. To be compared to the 3-level ladder in Fig. 2; now the energy gaps are unbalanced. |
The heat capacity of this system can become negative over intermediate values of temperature for certain values of α, ε and Δ; see Fig. 10, where the heat capacity is plotted for fixed α = 0.1, ε = 2 and different Δ.
![]() | ||
Fig. 10 Left: Heat capacity of Example VI.3 for varying temperature, different values of Δ and fixed values of α = 0.1, ε = 2. Right: Quasipotentials at Δ = 3, α = 0.1 and ε = 2, where x* = (1, −). |
In Fig. 10, the heat capacity remains positive for Δ = 1. For higher values of Δ, it becomes negative at low temperatures and for even higher values of Δ, it becomes negative at higher (intermediate) temperatures.
For an understanding in terms of (V.8), we observe from Fig. 11 that the stationary occupation of (1, −) can be increasing with temperature for high enough Δ. On the other hand, the effective temperature of the other states located on the leg – are seen for Δ = 3 in the right plot. For the intermediate range of temperatures where the heat capacity is negative, Teff(3, −) is strongly decreasing in temperature.
Example VI.3 (double-channel two-level model with negative heat capacity at intermediate temperatures). Consider the two-level system with states 1 and 2, with possible transitions over two channels, + and −. See Fig. 12, and ref. 17.
![]() | ||
Fig. 12 Double-channel 2level-model of Example VI.3. The plus-channel is more reactive than the minus-channel for ϕ > 0 in (VI.2). |
The transition rates are
![]() | (VI.2) |
In Fig. 13 and 14, the heat capacity is plotted for different values of ϕ and w in varying temperatures. As it is shown in higher values of driving w the heat capacity can be negative at intermediate temperatures; see Fig. 13. The mechanism is different from the previous example (where the barrier was the relevant variable for the nonequilibrium regime); here it is the asymmetry between the two channels that does the job (of negative heat capacity).
The analytical expression of the heat capacity is given in Appendix B, and there is a transition in ϕ, depending on the driving W, between completely positive and partially negative temperature-regimes of the heat capacity.
The quasipotential is plotted in Fig. 15 as a function of temperature.
V(2) < V(1) for w = 4, ϕ = ε = 1 and T ∈ [1.7,2.2], and x* = 2. For the negativity of the heat capacity we see from Fig. 17 that is positive for T ∈ [1.7,2.2]. We have plotted the effective temperature of state 2 in Fig. 16.
![]() | ||
Fig. 16 The effective temperature of state 1 with respect to temperature for Example VI.3. The right plot is the zoom in the range of temperatures where the heat capacity is negative. |
![]() | ||
Fig. 17 Stationary distribution for Example VI.3, with T ∈ [2,10] at w = 4 and ϕ = ε = 1. It is important that ![]() |
Again, where the effective temperature is decreasing with temperature we get a negative heat capacity; see Fig. 13.
Observe from Fig. 15 that although x* = 1, the stationary probability ρ(1) is not the highest at all temperatures.
Solving the stationary Master equation for the rates given in (III.5), the stationary distribution is
![]() | ||
Fig. 18 Stationary distributions of the three-level ladder given in Example III.1 for different values of α, Δ and temperature T. |
The quasipotential equals
Since the steady average of the quasipotentials is zero, there must be at least one state with a negative quasipotential. In this example, the states with the highest stationary occupation correspond to those negative quasipotentials.
The heat capacity is computed from (III.4) to be
At very low temperatures with Δ > 0,
With αε > 0, at low temperatures, the heat capacity is negative for Δ > ε/2.
The stationary distribution is
![]() | (B.1) |
The quasipotential is obtained from solving the Poisson eqn (III.3),
Hence, the heat capacity is
a = (eβw+ϕ + eβ(w+ε) + eβε+ϕ + 1)3 |
For w > ε ≥ 0 and all values of ϕ and β, if eϕ(w − ε)(eβw + eβε) > (w + ε)(eβ(w+ε) + 1) and ε(cosh(βw) + cosh(ϕ)) > wsinh(ϕ), then the heat capacity is negative. Another condition that leads to negative heat capacity is eϕ(w − ε)(eβw + eβε) < (w + ε)(eβ(w+ε) + 1) and ε(cosh(βw) + cosh(ϕ)) < w
sinh(ϕ). For ϕ = 0; the heat capacity is always positive for w > ε ≥ 0.
At very low temperatures,
This journal is © the Owner Societies 2025 |