Michael E.
Cates
*ab and
Vinothan N.
Manoharan
cd
aSUPA, School of Physics and Astronomy, University of Edinburgh, Mayfield Road, Edinburgh EH9 3JZ, UK
bDAMTP, Centre for Mathematical Sciences, University of Cambridge, Cambridge CB3 OWA, UK
cHarvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138, USA
dDepartment of Physics, Harvard University, Cambridge, MA 02138, USA
First published on 15th July 2015
Defining the entropy of classical particles raises a number of paradoxes and ambiguities, some of which have been known for over a century. Several, such as Gibbs' paradox, involve the fact that classical particles are distinguishable, and in textbooks these are often ‘resolved’ by appeal to the quantum-mechanical indistinguishability of atoms or molecules of the same type. However, questions then remain of how to correctly define the entropy of large poly-atomic particles such as colloids in suspension, of which no two are exactly alike. By performing experiments on such colloids, one can establish that certain definitions of the classical entropy fit the data, while others in the literature do not. Specifically, the experimental facts point firmly to an ‘informatic’ interpretation that dates back to Gibbs: entropy is determined by the number of microstates that we as observers choose to treat as equivalent when we identify a macrostate. This approach, unlike some others, can account for the existence of colloidal crystals, and for the observed abundances of colloidal clusters of different shapes. We also address some lesser-known paradoxes whereby the physics of colloidal assemblies, which ought to be purely classical, seems to involve quantum mechanics directly. The experimental symptoms of such involvement are predicted to be ‘isotope effects’ in which colloids with different inertial masses, but otherwise identical sizes and properties, show different aggregation statistics. These paradoxes are caused by focussing one's attention on some classical degrees while neglecting others; when all are treated equally, all isotope effects are found to vanish.
For hard spheres, the interaction energy E is zero and the entropy, S, is the sole determinant of the phase diagram. In many colloids the suspended particles have a hard core, a steric or electrostatic repulsion of short range compared to the size of the core, and negligible interaction at larger distances. These indeed behave like hard spheres and their properties confirm all aspects of the theoretical phase diagram for such spheres.2 Adding attractions at larger distances complicates things (since E and S both matter) but the calculations can still be done, and agreement is just as impressive.1
There are two puzzling niggles with this success. First: what happened to the solvent? The calculations are performed for particles with hard sphere (and/or other) interactions, as though they resided in a vacuum. Is it really true that the solvent can be ignored altogether, once the effective interactions between colloidal particles (which of course may depend on what the solvent is) have been worked out? Secondly, a colloidal suspension may contain, in a typical sample, of order 1012 nearly spherical particles, each a micron in diameter. Each of these particles contains in turn around 1012 atoms. It follows that, if we allow for small variations in shape as well as in size, no two colloidal particles are exactly alike. Despite this, the phase diagrams observed for colloidal systems are the same as those calculated by statistical mechanics on the basis that the interacting particles concerned are indistinguishable.
We address the issue of the ‘missing solvent’ later in this article, but focus first on the apparent role of indistinguishability. Recall that this is a fundamentally quantum-mechanical concept, and applies only to particles that are molecularly identical. This does not hold for colloidal suspensions, for the reasons just given. Sethna has coined the term ‘undistinguished’ particles for classical systems like these in which particles could be distinguished, but in practice are not.3 It seems deeply reasonable that undistinguished and indistinguishable particles should have the same thermodynamic behaviour, at least so long as one remains firmly in the domain of classical physics. Within this domain, fermions and bosons have the same statistics, and although factors of ℏ might enter a calculation, they should all cancel out in the end. Colloids are large enough to be classical in this sense.
Such an equivalence seems particularly reasonable if one takes an ‘informatic’ definition4 of the entropy S. In this informatic view, S = kBlnW where W is the number of distinct and equiprobable microstates that we choose to treat as equivalent when defining a macrostate. (Slightly more generally, where pi is the probability of microstate i as determined within an ensemble of our choosing.) It follows that if we choose to make no distinction between colloidal particles—an attractive proposition for the experimentalist, who would likely prefer not to label each of the 1012 individual particles she can follow in her microscope—then all our calculations become the same as if there actually is no distinction. (In technical terms, the classical partition function for N labelled particles must always be divided by a factor N! to account for permutations among them:5,6 this holds whether the particles are actually indistinguishable or merely ‘undistiguished’.) The informatic view thus neatly explains why the statistical mechanics calculations capture the observed phase behaviour of colloidal suspensions.
Intriguingly however, the equivalence of indistinguishable and undistinguished particles is not accepted by all interpretations of classical statistical mechanics. In particular there is one school of thought, most clearly elucidated in an influential textbook by the distinguished physicist Shang-Keng Ma,7 that asserts that for the entropy of a system to be objectively real, it must be a dynamical quantity and not an informatic one. According to this ‘kinetic’ approach, S = kBlnW where W measures the volume in phase space that the system can explore on the time scale of an experiment. The precise definition of this time-scale dependent volume is problematic, as discussed carefully by Ma. (Specifically, only a tiny fraction of accessible states are ever sampled in practice.) Nonetheless one can sympathise with his view that the entropy ought to be a property of a thermodynamic system alone, and hence definable without reference to the informatic state of an observer. The ‘kinetic’ view of entropy has carried significant weight, for instance in the community working on glasses.8
The informatic and kinetic views of entropy would be in harmony if they always gave equivalent predictions for macroscopic behaviour such as phase equilibria. However, they do not; and wherever this issue has been looked at carefully,9–11 the informatic view has been found correct, and the kinetic one found wanting.
This point is best illustrated by a colloidal experiment that has been done innumerable times in laboratories around the world. If a suspension of monodisperse hard colloidal spheres is prepared in a homogeneous fluid state at (say) a volume fraction ϕ = 56%, this state will crystallize. (In the phase diagram, this initial state lies in the metastable fluid but has a density below the glass transition. Crystallization is therefore rapid, and the identity of the equilibrium state as a colloidal crystal is unambiguous.) In the fluid, the spheres can easily swap places whereas in the crystal, they cannot. It is only a slight oversimplification to say that the diffusivity of the colloidal particles in the crystal is negligible, so that particle swaps are entirely absent on experimental time scales.† For indistinguishable particles, the entropy gain on transforming from liquid to crystal is extensive, and positive at this density: as is well known, the ordered structure has more entropy because particles have more room to wobble about when their mean positions are localized on a lattice.1
However, within the kinetic approach, the additional entropy cost of localizing distinguishable particles onto un-swappable lattice sites contains a term Sperm = kBln(N!) where N! counts particle permutations. This term must be paid to collapse an accessible phase-space volume in which distinguishable particles can change places, into one where they cannot. This putative entropy cost is supra-extensive (kBNlnN) and thus for large N outweighs the extensive entropy on formation of the crystal. Thus the kinetic approach to entropy predicts that colloidal crystals are thermodynamically impossible. Yet they are observed every day.
How could a theory get it so wrong? The mistake of the kinetic approach is that it counts the states accessible to only one specific colloidal crystal (i.e., a single permutation of the colloidal positions). But there are N! distinct crystals that might arise, and the kinetic approach underestimates by this enormous factor the probability of finding the system in one or other of them. In the informatic approach, on the other hand, all the states that we as observers deem equivalent are counted as members of the same macrostate; the multiplicity of crystals with different particles occupying different sites then cancels exactly Sperm, recovering the same result as for indistinguishable particles. Thus we correctly predict that at ϕ = 56% (say) and N → ∞, formation of some crystal or other occurs with probability one, even though formation of any specified realization of the crystal occurs with probability zero. The kinetic approach can of course finesse this by adding on a ‘configurational entropy’ to count the number of mutually inaccessible states.8 But this concedes defeat: the original goal7 was to show that entropy counts only the states the system can actually reach, whereas the configurational term directly adds back all those—within an equivalence class set by the observer's choice of macrostate—that it can't. For thermodynamic purposes, this simply restores the informatic definition of entropy.
These arguments are not new9–11 but deserve to be more widely known (and taught!). They illustrate two interconnected points. First, not all reasonable-sounding definitions of entropy for classically distinguishable particles are equivalent: some are right and some are wrong. Second, experiments on colloidal suspensions can resolve with striking clarity what the right definitions are. This adds to the many other ways in which colloidal experiments have clarified basic concepts in classical statistical mechanics.2
In the rest of this article, we first reiterate the above discussion from a slightly more formal angle (Section 2) and then explore similar lines of reasoning in two further areas. In both of these, pitfalls in defining the classical entropy can be illuminated by laboratory experiments, or indeed thought-experiments, involving colloids. The two areas are the so-called ‘symmetry number’ in molecular partition functions (Section 3); and the paradoxical dependence of rotational entropies on particle masses (Section 4). Finally, in Section 5 we return to the first of the two niggles raised above concerning the success of statistical mechanics: why is it that the entropy of a colloidal system can be computed as though the solvent were replaced by a vacuum? This will also resolve a further paradox involving the role of particle masses in translational, rather than rotational, entropies.
(1) |
(2) |
By definition, λ (which involves ℏ) should cancel from any physical observable that involves only classical physics. Its apparent failure to do so—in calculations that are classical, but not quite correct—lies behind some of the paradoxes discussed later. But first let us focus on indistinguishability.
Gibbs was fully aware that for distinguishable particles the N! divisor in eqn (1) is in principle absent, and that this absence destroys thermodynamics as we know it.12 For instance, without this divisor the entropy of an ideal gas is supra-extensive:
SIG(N,V)/kB ≃ Nln(V/Nλd) + NlnN | (3) |
He did however pose the following question. Suppose we have two types of particle (red and blue) each indistinguishable among themselves, for which SIG(Nr,Nb,V)/kB ≃ Nrln(V/Nrλd) + Nbln(V/Nbλd). This is not equal to SIG(Nr + Nb,V) for a single species. A paradox now arises if one imagines red and blue particles to smoothly and continuously become merged into a single population of purple particles. How and when does the entropy jump from one formula to another? This is Gibbs' paradox, and one resolution of it is offered by quantum mechanics: you cannot smoothly transmute one chemical species into another, so ‘red is red and blue is blue and ne'er the twain shall meet’. This resolution asserts that the problem is a mere artefact of classical thinking.
Gibbs himself found a different resolution which, unlike the quantum-mechanical one, makes sense even for classically distinguishable particles such as colloids (which could in effect be made in a continuous range of colours, if one so desired). He considered how the probabilities of different collections of states would change if the observer chose to ‘undistinguish’ specific subsets of such particles. This act merges previously distinct macrostates, whose probabilities therefore must be added together. Gibbs found that the entropy switches from one form to another at precisely the point where we choose to give up distinguishing red particles from blue. His general analysis of this class of problems remains useful today, for example when considering colloids with a continuous distribution of sizes.13 And, in the case where all particles are undistinguished, the final result is simple: the N! divisor reappears in eqn (1), and extensive, single-species thermodynamics is restored.
Thus, although quantum indistinguishability offers a fast and convenient derivation of the N! divisor in eqn (1), it is quite wrong to believe that this factor defies classical explanation. While many others have noted this fallacy, most recently Frenkel,14 the belief is still widespread (and the supposed breakdown of classical thinking itself often confusingly referred to as the ‘Gibbs paradox’). The cause of the misunderstanding is perhaps that most of us learn statistical physics from textbooks that rely solely on the quantum mechanical explanation of the N! divisor. In a textbook context—where atoms and molecules are typically the main focus (e.g.ref. 15)—this represents sound pedagogy; but the result is that many physicists remain unaware of Gibbs' completely classical derivation of the same factor. Indeed it seems that his derivation had been almost entirely forgotten before its excavation by Jaynes about 20 years ago.16
Fig. 1 Optical microscope images (left) and structures (middle) of tri-tetrahedral and octahedral colloidal molecules. Each of these two isomers contains six micrometer-scale polystyrene spheres bound together by depletion forces, which are induced by nanoparticles that cannot be seen in the micrographs. The depletion attraction, details of which are reported in ref. 18, has a depth of about 4kBT and a range of about 80 nm, much smaller than the size of the polystyrene spheres. Experiments show that in an equilibrium ensemble, the probability of observing a tri-tetrahedron is 96%, compared to 4% for the octahedron. The difference in probabilities is dominated by rotational entropy, which is a function of the symmetry number σ (right). (Images from Guangnan Meng.) |
The preponderance of the tri-tetrahedron cannot be explained away by appeal to which structure is more likely to form first, since the isomers can freely interconvert over the timescale of its experiment. Having ruled out kinetics and potential energy as reasons for the dominance of the tri-tetrahedron, we must conclude that the tri-tetrahedron has a higher entropy than the octahedron.
Where does this entropy difference come from? We shall show, using the same statistical mechanical formalism used for molecules,5 that the dominant factor is the difference in symmetry between the two structures. For simplicity we suppose our classical colloidal ‘molecule’ to be in vacuo (the role of a solvent is considered in Section 5). In equilibrium, the probability P of observing an isomer is proportional to Zmole−βU, where U is the potential energy of the isomer (which depends only on its bond number) and Zmol, its molecular partition function. To a good approximation, Zmol can be factored into translational (T), rotational (R), and vibrational (V) parts:
Zmol = ZTZRZV | (4) |
Z T = V/λmolecule3. | (5) |
(6) |
We are still off by a factor of approximately 12, which must come from the rotational contribution. In three dimensions, the classical-limit rotational partition function is
(7) |
Though it is reassuring that the calculation reproduces the experimental results, it sheds little light on how the symmetry of the octahedron could work against it so strongly. The usual tactic to make sense of such results is to appeal to quantum mechanics. There the appearance of the symmetry number in the partition function makes perfect sense: the molecular wave function has a symmetry determined by the placement of atoms in the molecule, so that rotations commensurate with this symmetry are fundamentally indistinguishable. If we did not include the symmetry number, we would overcount the number of rotational microstates. And indeed this is where many textbooks5 leave the matter.
But this explanation is wholly unsatisfying for colloidal molecules, where the wavefunction, were we even able to compute one, would not be symmetric unless each of our particles were composed of exactly the same number and configuration of atoms. Since that is certainly not the case, and the particles can in principle be distinguished—by small differences in their size, for example—we must seek an alternative explanation.
To do so, we return to the original derivation of the symmetry number by Ehrenfest and Trkal in 1921,19,20 which made no recourse to wavefunctions or quantum mechanics in general. We illustrate this argument with a simple example inspired by Gilson and Irikura21 (see also ref. 22).
Consider a colloidal molecule with N = 4 particles that are constrained to two dimensions (Fig. 2). There are 4! = 24 possible ways they can be arranged to form a four-particle molecule, for example a rhombus or a square. Writing the partition function in terms of the coordinates of the particles (or ‘atoms’) and integrating out the momenta yields
(8) |
Why does the symmetry number appear in Zmol (eqn (4)–(7)) and not in Zatom (eqn (8))? And why does a factor of 1/N! appear in Zatom and not in Zmol? The difference is that Zmol is integrated over molecular coordinates. In writing eqn (4) we assumed that our colloidal molecule is a rigid body that can translate, rotate, and vibrate.
In molecular coordinates, permutations and rotations are not independent operations but become ‘entangled’ (in the classical sense of that word!). To see this, we distinguish the particles and assign them colours, as shown in Fig. 2. In the distinguished system, the rotational partition function is proportional to the number of permutations times the number of possible orientations of the cluster: 4!·2π. But sometimes the same configuration can be made in different ways. For the rhombus, which has two-fold symmetry in the plane, each configuration of labeled particles (Fig. 2B, top) can also be made by permuting the particles and rotating the cluster by π radians (Fig. 2B, bottom). The rotational partition function is therefore proportional to 4!·π or 4!·2π/σ, where σ = 2. For the square, which has four-fold symmetry in the plane, each configuration can be made in four different ways, corresponding to permutation plus rotation of π/2 radians, or an integer multiple thereof (Fig. 2C). The rotational partition function is therefore proportional to 4!·π/2 or 4!·2π/σ, where σ = 4.
If, as in the experiment, the particles are undistinguished, we must divide by N! = 4! in both cases, leaving a rotational partition function that is proportional to 2π/σ. We now see why the symmetry number appears in Zmol and not Zatom, and vice versa for the 1/N! term: both changes result from grouping together the atomic degrees of freedom. The proportions of the two isomers observed in an equilibrium ensemble will depend inversely on the ratio of their symmetry numbers, as well as on the differences in potential energy, vibrational entropy, and moments of inertia. (We will see in the next Section that there are some crucial cancellations between the moments of inertia and the vibrational terms.) The same argument applies to the octahedral and tri-tetrahedral colloidal molecules observed in the experiments.
What if we chose to distinguish the colloidal particles in the experiment, for example by labeling them with different fluorescent colours? In that case it would be natural to consider each different arrangement of coloured particles to be a different molecule. There are 30 arrangements for the octahedron (6!/σ, where σ = 24), and 360 for the tri-tetrahedron (σ = 2). If we were to count how often we saw each of these isomers, we would find that each of the 360 tri-tetrahedral isomers would occur twice as often as each of the 30 octahedral isomers, because of the ratios of moments of inertia and vibrational frequencies. It is only when we lump these macrostates together—by ignoring the distinctions between particles—that we obtain a result that depends on symmetry numbers.
It is no coincidence that this argument is similar to that made by Gibbs in resolving the paradox now named after him. The aim of Ehrenfest and Trkal was in fact to to understand the origin of the N! divisor in the canonical partition function, the very same factor that we discussed in Section 2. The only way to understand how the entropy of a gas depends on the number of molecules, Ehrenfest and Trkal argued, is to consider a situation where the number of molecules N can vary in a reversible process. Within the canonical ensemble, N is fixed when the system contains only a single species, and we cannot add more molecules reversibly. But when there are multiple species of molecules in the same system that can interconvert, their numbers Ni may vary reversibly. In this case, factors of Ni! appear in the partition function without any appeal to quantum mechanical argument. The symmetry number appears as a side-note in this thought experiment. It is a historical irony that most modern textbooks ascribe to it a quantum origin, given that Ehrenfest and Trkal aimed to show exactly the opposite (see ref. 21 and 23 for elegant and modern reformulations of their arguments).
The broader point here is that our statistical mechanical models must be consistent with the types of observation we make, since those observations define the macrostate to be modeled. Consider, for example, a flexible colloidal molecule that can fluctuate between symmetric and asymmetric conformations without changing its bond topology (Fig. 3). Does such a molecule have a symmetry number, given that it spends most of the time in asymmetric conformations? The answer depends on what we observe. If our experiment measures only the concentration of this molecule relative to other isomers, then the symmetry number must appear in our calculation, because it accounts for how identical conformations can be reached by permuting particles followed by either rotation or flexing.21 This example shows that the symmetry number is more generally understood as a property of the topology of the bond network of a molecule rather than its geometry. This may seem like a deep statement, but it is really a definition that follows from what experiments typically measure (and what they do not). Put differently, we have chosen to treat all three states of the chain in Fig. 3 as states of the same molecule, not different molecules, and we must calculate the symmetry number accordingly.
When used to address classical entropies, this artificial separation leads to another paradox. (Recall that a paradox is an apparent contradiction that, on close inspection, disappears.) Let us consider the rotational contribution to its partition function which in three dimensions is eqn (7). For the simplest case of a diatomic molecule, deriving this result is a standard undergraduate exercise: one examines the small β limit of the quantum partition function where En = ℏ2n(n + 1)/2I is the rotational energy and (2n + 1) the number of distinct quantum states of that energy. Eqn (7) can be derived by generalizing this quantum calculation, or by a purely classical integral over the canonical coordinates for a rigid rotor.
The paradox is as follows. Imagine we have two species of colloids of the same size and interactions but with two different particle masses. For instance, the two types could comprise inner spheres of gold and of aluminium, each coated with an outer layer of polystyrene so as to create the same final size and surface chemistry. For simplicity ignore gravity (assume the experiment to be done in the space station), and consider ‘triatomic’ linear molecules that contain one gold and two aluminium particles. Then according to eqn (7), the gold particle is more likely to be found in either one of the edge positions than in the middle position. More generally, given two species of particles, placing the heavy ones at the periphery of the molecule increases the moment of inertia and hence the partition function. According to eqn (7) this should directly influence the abundances of these different configurations, favouring those with heavy particles outermost. In atomic physics this would be called an ‘isotope effect’.
Can this really be true? A thought-experiment says no: colloids explore their configuration space by over-damped Brownian motion in which inertia is negligible, so how can the inertial mass possibly control the abundances of different cluster geometries? The paradox was raised in a commentary by one of us,24 with the tentative suggestion that eqn (7) might fail for particles embedded in a molecular solvent rather than a vacuum (see Section 5 below). But in fact the resolution of this paradox is much simpler than that.25,26 The ‘colloidal isotope effect’ is a mirage, caused by artificially separating classical rotations from vibrations and translations.
It is true that for a molecule in vacuo the full classical partition function is usually approximated by the factorized form, eqn (4). But what matters is whether this object, not just one of its factors, supports an isotope effect. It doesn't, as we can see by sidestepping the above factorization and using instead a more fundamental ‘interacting atom’ representation:
(9) |
How does this outcome square with the ‘molecular’ representation, eqn (4)? For a classical molecule of N atoms—assuming harmonic vibrational states and no vibration–rotation coupling—the molecular partition function (in d = 3) follows from eqn (4)–(7):
(10) |
(11) |
(12) |
Let's now address colloids in a solvent. A good starting point is the full semi-classical partition function for all particles present, in which we retain the Nd colloid coordinates q but integrate out the solvent ones Q. Integrating also over all momenta as usual, we have
(13) |
So long as we can determine a good approximation for U(q), then the solvent has indeed gone away; our colloids might as well be in vacuo, so long as this effective interaction replaces the one they would really have had there. The effective interaction potential can include solvation forces and all manner of other solvent-dependent interactions; nonetheless, all that we need to know about the solvent is encoded in U(q).
The paradox lies in the continuation of this argument. The colloid free energy is now F(T,V,N) = −kBTlnZ where
(14) |
One resolution sometimes offered is that, as the solvent gets integrated out, the effective phase-space measure for the colloids—which roughly defines how far you have to move a colloid before it counts as being in a ‘different’ configuration, becomes set not by λ but by a coarse-grained length-scale related to the solvent diameter.24 This is, for example, how the world might appear to a lattice modeller where the solvent size appears to set a natural discretization scale. The classical quantity is then postulated ad hoc to replace λ in eqn (14), and hence in eqn (12) and all similar results for formation of supra-dimeric clusters.
But in fact this is a specious argument, for reasons spelled out below. Instead the astute reader will have noticed that the paradox is based on the same flawed reasoning as the one discussed in the previous section: we have treated some degrees of freedom as classical, but not all. Specifically, we assumed that the binding energy Δ of a colloidal dimer was connected with formation of a fixed bond—ignoring the fact that, classically, all bonds have a vibrational entropy. Referring again to eqn (9), it is once again clear that—because masses only enter the momentum integral in the partition function, which is separable—all masses must cancel, as must ℏ, once the full set of classical degrees of freedom is treated on an equal footing.
The way this happens for colloidal bonding is easiest seen for the case of a square well potential, of depth Δ and range . We treat for simplicity the case of d = 1, for which rotations do not enter. However small may be, in the classical limit there is a finite entropy of confinement for the relative coordinate between the two particles in our dimer; and for the square well this is simply Sbond = kBln(/λ). The equilibrium constant is now (using eqn (12) for d = 1) . Thus the equilibrium constant is independent of quantum mechanics and there is, as promised, no isotope effect.
We now see why the replacement of λ with did not offer a true resolution to the paradox. If the physical behaviour is classical, none of the actual results can depend on λ; so replacing λ with can make no difference unless the calculations have been wrongly executed. As the above example shows, the length that replaces λ in (say) the dimerization constant K of eqn (12) is not some ad hoc coarse-graining scale : it is instead a precisely defined length that can be calculated directly from the effective interaction U(q). And in our chosen example, it is simply the range of the bonding interaction, .
In summary, for classical systems such as colloids, the absolute phase-space measure as set by quantum mechanics must cancel from any observable physical quantity. This remains just as true after integrating out the solvent (to get an effective interaction U(q)) as it was before doing so. It is therefore never wrong to use for the colloid partition function the measure set by the semi-classical limit of quantum mechanics, in which factors of the thermal de Broglie wavelength λ appear, eqn (14). But if such factors fail to cancel in any predicted observable, this is because a mistake has been made. The most likely pitfall is to inadvertently treat some of the classical degrees of freedom as frozen. (Indeed, freezing a classical degree of freedom carries an infinite classical entropy cost, which is cut off only by quantum mechanics.) Because the phase-space measure must cancel, replacing λ with a classical coarse-graining length such as a solvent size, while it may disguise an incorrect calculation, cannot change a correct one.
In other cases, assumptions carried over from quantum mechanics are mostly harmless, but not if they cause one to forget that the same results can also be justified classically. For example, there is little danger in assuming that colloidal particles are indistinguishable, since most experiments choose to leave them undistinguished, which gives equivalent results in the classical limit. Danger does lie in arguing that because colloids are distinguishable in principle, theory can be improved by treating them as distinguished in statistical mechanics. By defining macrostates far too narrowly, that line of thinking leads to predictions that are not borne out by experiments: colloids would never crystallize, and symmetric colloidal molecules would have the same rotational entropy as asymmetric ones.
Experiments on colloids have helped to resolve other paradoxes as well. The oldest and most famous of these is Maxwell's demon. In the 1980s Landauer and Bennett ‘exorcised’ the demon by showing that erasing classical information carries an entropy cost.27 But for decades afterward it was not unusual to find articles28 claiming that the demon must produce entropy when it makes a measurement. As Bennett noted,29 the tendency to focus on the entropy cost of measurement, rather than of erasure, may ‘have been a side effect of the great success of quantum mechanics’. The question has been settled by recent experiments on colloidal particles, which show that the cost of erasing information agrees with Landauer's prediction.30
These experiments and the others we have mentioned all support the informatic view of entropy. The main argument against this view is that it introduces subjectivity into the definition of the entropy. This is true, but we see it as a strength, not a weakness. We use the term ‘subjective’ in the same sense as Jaynes4 did: ‘depending on the observer’. For classical systems such as colloids, the observer has a choice of what to observe and what to ignore; for example, she can treat each different arrangement of six fluorescent particles as a different molecule, or she can ignore the colours and keep track of only the bonds. The macrostate therefore depends on the observer's choice. So, too, does the entropy, which is found by counting the number of microstates that make up this macrostate. It is possible to define the entropy non-subjectively—for example, by insisting that colloids always be treated as distinguishable—but we have seen that such a definition makes it difficult to reconcile theory with experiment.
A subjective entropy might appear to create new problems. After all, the heat transferred in a reversible process is proportional to the change in entropy. If the entropy depends on the observer, how can a measurement of the heat be consistent from observer to observer? We note first that experiments can measure only entropy differences, not absolute entropies. Each observer is free to add an arbitrary (subjective) constant to the entropy, and even if each chooses a different constant, the measurements are unaffected. But what if the heat were measured directly, for example by measuring the motion of the particles with an optical microscope? Would the heat flow really depend on the choices of the observer? Perhaps surprisingly, the answer is yes, because the observer can choose which degrees of freedom to measure and which to ignore. In any reversible change, the energy change associated with degrees of freedom that are measured is called work; the remainder is called heat. The first law of thermodynamics (which states that the sum of heat and work is the change of internal energy) then directly requires that the heat flow depends on which degrees of freedom are retained as state variables, and which are ignored. But if each observer makes the same choice, the heat measured will be the same from observer to observer—and in that sense it remains an objective quantity.
Thus we would argue that the informatic view is the simplest way to interpret experiments on colloids. As long as we clearly specify our choices, all paradoxes vanish. The beauty of colloidal systems is that the particles are fundamentally classical and distinguishable in principle. This allow us consciously to make choices about the macrostate in more than one way, and to observe and compare the results. In so doing, we gain a better understanding of Gibbs' arguments and the effects of entropy in the macroscopic world.
Footnote |
† Of course, colloid diffusivity is not entirely negligible in the crystal, nor is it so fast in the liquid as to explore all states on an experimental timescale. But nonetheless, the liquid samples vastly more permutations than the crystal on experimental timescales. |
This journal is © The Royal Society of Chemistry 2015 |