Potential and limits of a colloid approach to protein solutions

Looking at globular proteins with the eyes of a colloid scientist has a long tradition, in fact a signiﬁcant part of the early colloid literature was focused on protein solutions. However, it has also been recognized that proteins are much more complex than the typical hard sphere-like synthetic model colloids. Proteins are not perfect spheres, their interaction potentials are in general not isotropic, and using theories developed for such particles are thus clearly inadequate in many cases. In this perspective article, we now take a closer look at the ﬁeld. In particular, we reﬂect on the fact that modern colloid science has been undergoing a tremendous development, where a multitude of novel systems have been developed in the lab and in-silico . During the last decade we have seen a rapidly increasing number of reports on the synthesis of anisotropic, patchy and/or responsive synthetic colloids, that start to resemble their complex biological counterparts. This experimental development is also reﬂected in a corresponding theoretical and simulation effort. The experimental and theoretical toolbox of colloid science has thus rapidly expanded, and there is obviously an enormous potential for an application of these new concepts to protein solutions, which has already been realized and harvested in recent years. In this perspective article we make an attempt to critically discuss the exploitation of colloid science concepts to better understand protein solutions. We not only consider classical applications such as the attempt to understand and predict solution stability and phase behaviour, but also discuss new challenges related to the dynamics, ﬂow behaviour and liquid-solid transitions found in concentrated or crowded protein solutions. It not only aims to provide an overview on the progress in experimental and theoretical (bio)colloid science, but also discusses current shortcomings in our ability to correctly reproduce and predict the structural and dynamic properties of protein solutions based on such a colloid approach. local (or cage) diffusion coefﬁcient as obtained from NSE experiments at the nearest neighbour peak q ∗ in the respective static structure factors, corresponding to the nearest neighbour distance, and thus probing local dynamics on length scales of the protein size. The local dynamics is not affected by the vicinity to the critical concentration nor does it show any signiﬁcant temperature dependence. Reprinted and adapted with permission from ref. 144 , https://pubs.acs.org/doi/10.1021/acs.jpclett.5b02092. Further permissions related to the material excerpted should be directed to the ACS. approximately the same position in the phase diagram (i.e. a comparable distance to the critical temperature) for all three systems. This ﬁgure demonstrates the enormous effect of a short-range weak attraction on the local short-time diffusion when compared to the purely hard sphere behaviour. It also reveals the tremendous effect of attractive patches (at constant overall attraction strength), and demonstrates the considerable inﬂuence of a weak shape anisotropy on the local short-time diffusion.


Potential and limits of a colloid approach to protein solutions
Anna Stradner, * a,b and Peter Schurtenberger a,b Looking at globular proteins with the eyes of a colloid scientist has a long tradition, in fact a significant part of the early colloid literature was focused on protein solutions. However, it has also been recognized that proteins are much more complex than the typical hard sphere-like synthetic model colloids. Proteins are not perfect spheres, their interaction potentials are in general not isotropic, and using theories developed for such particles are thus clearly inadequate in many cases. In this perspective article, we now take a closer look at the field. In particular, we reflect on the fact that modern colloid science has been undergoing a tremendous development, where a multitude of novel systems have been developed in the lab and in-silico. During the last decade we have seen a rapidly increasing number of reports on the synthesis of anisotropic, patchy and/or responsive synthetic colloids, that start to resemble their complex biological counterparts. This experimental development is also reflected in a corresponding theoretical and simulation effort. The experimental and theoretical toolbox of colloid science has thus rapidly expanded, and there is obviously an enormous potential for an application of these new concepts to protein solutions, which has already been realized and harvested in recent years. In this perspective article we make an attempt to critically discuss the exploitation of colloid science concepts to better understand protein solutions. We not only consider classical applications such as the attempt to understand and predict solution stability and phase behaviour, but also discuss new challenges related to the dynamics, flow behaviour and liquid-solid transitions found in concentrated or crowded protein solutions. It not only aims to provide an overview on the progress in experimental and theoretical (bio)colloid science, but also discusses current shortcomings in our ability to correctly reproduce and predict the structural and dynamic properties of protein solutions based on such a colloid approach.

Introduction
The use of a colloid approach to understand protein solution behavior is not a very recent development. In fact, a number of the early founders of colloid science have treated synthetic and bio-colloids such as proteins alike, adhering thereby to the early definition of colloids solely based on the size of the dispersed or dissolved colloidal matter. 1 [details of any supplementary information available should be included here].
See DOI: 10.1039/cXsm00000x/ ‡ Additional footnotes to the title and authors can be included e.g. 'Present address:' or 'These authors contributed equally to this work' as above using the symbols: ‡, §, and ¶. Please place the appropriate symbol next to the author's name and include a \footnotetext entry in the the correct place in the list. ment of colloid science, or rather soft matter science, was then largely based on fundamental work related to synthetic colloids such as the ubiquitous polystyrene (PS) or poly(methyl methacrylate) (PMMA) spheres. [7][8][9][10] It is only more recently that colloid science has moved again back into the focus of the protein community. This was primarily driven by the lack of an understanding of the theoretical foundation of protein crystallization and the need to design improved approaches to crystallize proteins and obtain their high resolution structure through x-ray and neutron based crystallography. The pioneering work of George and Wilson 11 and Muschol and Rosenberger 12 linked the ability of proteins to crystallize with overall measures of their interaction potential such as the second virial coefficient B 2 , and their findings were then connected to theoretical and simulation work on the relationship between the interaction potential and the resulting phase behavior [13][14][15][16] . The strong similarities between the available experimental phase diagrams of globular proteins and those obtained for colloids interacting via a combination of a hard sphere repulsion and a short range attraction were realized, 15,17 and a generic phase diagram for globular proteins was proposed that provided general rules for finding optimal crystallization conditions. 16,18 Particularly important was also the development of an extended law of corresponding states, which further supported the universality of this generic phase diagram and provided a simple link between the location of the binodal and a reduced second virial coefficient B * 2 = B 2 /B 2,HS , where B 2,HS is the corresponding hard sphere second virial coefficient, as an effective temperature irrespective of the detailed shape of the interaction potential. 19 Since these pioneering developments in the mid 90's, we have seen a renaissance in the use of the experimental and theoretical toolbox of colloid science in order to understand various solution properties of globular proteins. While protein phase diagrams and crystallization was initially in the focus, this colloid approach to proteins was subsequently extended to a much wider range of topics that included protein self-assembly and aggregation, the stability of crowded protein solutions and mixtures, and the dynamics of strongly interacting protein solutions. [20][21][22][23][24][25][26][27][28][29][30][31] However, this approach has recently been questioned in an article by Sarangapani et al. 32 , where the authors concluded that the proteins' capacity to adjust their shape and thus the nature of the interprotein interactions in response to variations in pH and concentration severely limits the usefulness of colloid models to describe the structural and dynamic properties of protein solutions. They also pointed out that proteins are macromolecular entities rather than rigid colloidal particles, and suggested that one should try to consider links to synthetic polymer and polyelectrolyte solutions. The article was enthusiastically received in a "new and notable" article in the same journal by Prausnitz 33 , who wrote that "now, after the report of Sarangapani et al., the colloidlike theory is dead; the Sarangapani group have delivered a coup de grace. We can take comfort in the remark of Sarangapani et al. that, while scientifically erroneous, the colloidlike theory may nevertheless be useful for some purposes in biotechnology. Thank you! That's like saying even a placebo can sometimes cure an illness." Does this mean that we no longer should try to use colloid models when attempting to understand and predict various properties of protein solutions? We believe that this is not the case, and that this rather reflects a too narrow and outdated view of what colloids and colloid models are, and in particular neglects the many exciting developments that have happened in the area of synthetic colloids in recent years. While colloidal hard spheres have been very important in creating the foundation of modern colloid science, we have seen a multitude of anisotropic, patchy and/or responsive colloids emerging that we believe are highly relevant as model systems for proteins. It is common knowledge that protein interactions depend on solution parameters such as pH, ionic strength or temperature, but so do colloid interactions. Proteins are indeed biological macromolecules, but one would expect polymer-or polyelectrolyte behavior only for completely denatured proteins, which we do not consider here. While conformational changes in proteins can obviously happen as a function of pH or other external variables, the globular nature of the overall protein (tertiary) structure remains, and will require models that are colloid rather than polymer/polyelectrolyte based. The question of polymer vs colloid nature will certainly become much more important in the current focus on intrinsically disordered proteins (IDPs) 34 .
However, it is also clear that much more emphasis will have to be given to the anisotropic and patchy nature of protein interactions and its influence on the various structural and dynamic properties of protein solutions. In this article we try to give a brief overview of recent developments in colloid science that may be of importance for protein solutions, and point out a number of topics where such improved colloid models could have a considerable impact in our strive for understanding and predicting protein solutions. We restrict our discussion to a small number of selected topics where we feel a particular need for improvement, and a large potential for profiting from parallel developments in colloid and protein science: i) improved models for interparticle interactions and the resulting phase behavior; ii) characterizing and understanding colloid and protein dynamics in dense systems, and the importance of appropriate characterization and simulation tools; iii) self-association and its consequences on viscosity and dynamical arrest. These topics are chosen for their importance in understanding complex biological systems and processes as well as in view of the needs of pharmaceutical industry for the development of future high concentration drug formulations.

Current trends in colloid science -from hard spheres to anisotropic and patchy particles
Much of the progress made in understanding the structural and dynamic properties of colloidal suspensions is a result of their frequent use as ideal model systems to address fundamental issues in condensed matter physics such as liquid ordering, crystallization and glass formation, and the corresponding structural and dynamic properties of the various systems as a function of the interaction potential 8 . The initial work had primarily focused on the use of hard sphere-like colloids, which was instrumental in experimentally demonstrating the existence of an entropically driven crystallization in a purely hard sphere-like system 7 , and in developing a new research thrust focusing on glass transition, dynamic arrest and jamming [35][36][37][38] . Subsequently the field has seen a dramatic increase in the complexity of colloids and colloidal interactions, and there were in particular four developments that are especially important for the topic of this article: the importance of short range attractions on colloid phase behavior 8,13,14,39 ; equilibrium cluster formation in systems with mixed potentials, combining a long range soft repulsion and a short range attraction 25,[40][41][42] ; the development of anisotropic particles [43][44][45][46][47][48] , and finally the dramatic consequences of patchy interactions on phase behavior, self-assembly and colloid dynamics 31,43,45,[49][50][51] . As we will see below, these recent developments are also of prime importance for our understanding of protein solutions, and we therefore briefly summarize the most important features.
The investigation of non-equilibrium phenomena in colloidal suspensions has clearly emerged as one of the most important 2 | 1-18 Highlighted in red is also the binodal line denoting the metastable liquid-liquid phase separation in short range attractive systems, which can lead to interesting new arrested states when in competition with the arrest or gel-line. The dark blue line in the volume fraction/range of repulsion plane at zero attractions (blue shaded) denotes the arrest line that moves to lower volume fractions as the electrostatic repulsion is turned on. In the volume fraction/attraction strength plane at long-range electrostatic repulsions, where the range of the repulsion is significantly larger than the attraction range, we find the formation of equilibrium clusters, with a monomer-cluster transition (red dotted line) moving to lower volume fractions as the attraction strength increases, and a cluster glass transition at higher volume fractions (black line). The phase behaviour of systems where the mixed potential consists of attractions and repulsions with comparable range or longer-ranged attractions is even more complex and a recent attempt to describe this in a generic state diagram can be found in ref. 52 . Adapted and modified from ref. 53 with permission from The Royal Society of Chemistry.
fields of soft matter research. Topics such as dynamical arrest or jamming in suspensions of (weakly) short-range attractive colloids, the interplay between spinodal decomposition and glass formation or the existence of a metastable liquid-liquid binodal have had a considerable attention from the experimental and theoretical soft matter community 39,54,55 . A schematic view of our current understanding is given in Fig. 1. For ideal hard sphere particles, we first observe a transition from a liquid to a crystal phase, followed by a disordered solid phase, a glass, at volume fractions of approximately φ ≈ 0.58. 7,35,37 If a weak and short-ranged attraction is now turned on, this leads to the astonishing observation of a melting of the glass, followed by a so-called re-entrant glass or solid formation at even stronger attractions 39,54 . For very strong interparticle attractions, we reach the regime of so-called irreversible aggregation, where soft fractal gels form already at very low volume fractions [56][57][58] . At intermediate strength of the attraction, the situation is even more complicated due to the fact that phase sep-aration into a dilute (gas-like) and a concentrated (liquid-like) suspension can occur, and this can subsequently lead to the formation of a long-lived 'interaction network' of particles 59 .
Colloids frequently carry charges, and thus interact via a screened Coulomb repulsion. When combined with a short range attraction, new states appear, the metastable liquid-liquid phase separation is suppressed and we observe the formation of (transient) equilibrium clusters 25,[40][41][42] . Particularly interesting in this context is the fact that such clusters were found to undergo an arrest transition from a cluster fluid to a cluster glass that occurs at a significantly lower volume fraction when compared to the hard sphere glass at φ g ≈ 0.58. 41,60,61 Phase behavior and the occurrence of dynamical arrest upon an increase in concentration is indeed of prime importance in protein solutions in a wide range of areas, and the contents of Fig. 1 will thus reappear again below.
Another very important aspect in this context is shape anisotropy, as proteins in general are far from being perfect spheres. In recent years we have seen a wealth of colloid synthesis strategies, leading to a spectacular array of colloids of different shapes, compositions and functionalities (see Fig. 2 for some key examples). 43,46,62,[65][66][67][68][69] Together with similar advances in computer simulations of anisotropic particles, they have improved our understanding of the influence of anisotropy in both shape and interactions on aggregation, self-assembly and phase behaviour in colloidal suspensions. 43,65,[69][70][71][72] However, most of the work has been devoted to phase behavior and self assembly, and there is still a lack of knowledge on the dynamics of anisotropic particles, and the influence of shape anisotropy on dynamical arrest. Particles with moderate shape anisotropy such as ellipsoids with axial ratios of order 1-4 are particularly relevant for comparisons with globular proteins. While there are some simulation as well as experimental studies focusing on various aspects of the dynamics of ellipsoids at high concentrations, most of the experimental work is on (quasi-) 2D systems, with only a few notable exceptions on 3D bulk behavior. [73][74][75][76][77][78][79] Progress on the experimental side is primarily hindered by the fact that most existing model systems for such ellipsoidal particles are highly turbid at high concentrations, thus making an investigation of dynamic properties such as the collective or self diffusion coefficient or the particle mean square displacement very difficult or impossible with standard optical tools. It is only through the use of recently introduced techniques such as differential dynamic microscopy or X-ray photon correlation spectroscopy that these properties have become accessible at high concentrations close to the glass transition for these model ellipsoids also. 63,79 During the last 2 decades, enormous experimental and theoretical effort has also been devoted to the design and synthesis and to the understanding of the phase behaviour of so-called patchy colloidal particles, i.e. particles with site-specifically engineered surfaces (see Fig. 2 for some key examples). 43,51,[82][83][84][85] As a consequence of their surface patchiness these particles can interact via directional and specific interactions. Both experiments and simulations demonstrate that this can result in a considerably more complex phase and self-organization behaviour than in colloids exhibiting purely isotropic interactions. This has recently been  62 ; Copyright (2008) Schweizerische Chemische Gesellschaft); peanut-shaped core-shell particles (reprinted and adapted with permission from ref. 63 . Copyright (2018) WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim); faceted and bowl-shaped core-shell microgel particles (reprinted and adapted from ref 46 with permission from The Royal Society of Chemistry). Bottom: Patchy colloids and colloidal molecules. Bottom left: Colloidal clusters made of different thermoresponsive particles that form structures similar to multi-patch particles, where the different patches can change from overall repulsive to overall attractive as a function of temperature (reprinted with permission from Peng et al. 64 ). Bottom right: Examples of so-called reconfigurable colloidal molecules formed by thermoresponsive particles with complementary shapes that can reversibly assemble into molecule-like clusters, and change configuration as a function of temperature. Shown are confocal laser scanning micrographs (CLSM) and schematic representations illustrating the specific self-assembly in colloidal molecules with a valency of 4 at 20 • C (methane, CH 4 ; top). Increasing the temperature to 40 • C, the valency of the assembly decreases to 2 (carbon dioxide, CO 2 ; middle). At 48 • C, the assembly resembles a dihydrogen, H 2 , configuration (bottom) (reprinted from ref. 48 ).
discussed in a comprehensive review article on the programmed self-assembly of patchy particles by Duguet et al. 84 , where the enormous variety of available particles and the resulting phase behavior is illustrated.
However, while there is a large and still rapidly increasing number of publications focusing on patchy particles, the vast majority are either simulation studies of their self-assembly behavior, descriptions of new synthesis routes or of the self-assembly patterns found with these particles. There is a clear lack of studies providing an experimental verification of the many predictions on the phase behavior of patchy colloids, and virtual no work at all is devoted to the consequences of patchy interactions on the dynamics of these suspensions. This is particularly important for an extension of the colloid approach to protein solutions beyond classical spherical particles with centrosymmetric potentials, as this currently limits our ability to use less coarse-grained models that are based on the molecular structure of proteins for a calculation of key structural and dynamic properties and their dependence on solution parameters such as the ionic strength, temperature, pH or protein concentration.
With these recent developments in colloid science in mind, we will now survey the current trends in the application of concepts and models from colloid science to proteins, and point out the current limits and future prospects of such an approach. This discussion basically centers on the question of the optimal coarsegraining strategy necessarily required to allow for a quantitative and predictive understanding of the various aspects of protein solutions. As schematically described in Fig. 3, this requires a strategy where sufficient molecular details are retained in order to allow for a quantitative reproduction of the particular quantity aimed for, while limiting the computational costs to make the necessary calculations or simulations feasible.

Protein phase behavior
The stability of protein solutions is vital for biological functionality or drug formulation using so-called biologics (or biopharmaceuticals), and globular proteins are generally well optimized against non-functional interactions. 86 Although there is recent and exciting evidence that phase transitions in living cells may in fact represent an important mechanism in cellular organization, 87 protein phase separation has often been related to the occurrence of so-called protein condensation diseases. 88,89 The connection between liquid-liquid phase separation in solutions of eye lens proteins and the formation of cataract, still one of the leading causes of blindness worldwide, has initially spurred a systematic study of the phase diagrams of various representatives of the γ-crystallin family by Benedek and collaborators. 88,[90][91][92][93][94][95][96] At the same time, this group also extended the investigation of protein liquid-liquid phase separation to another key model protein, lysozyme, for which early work had already indicated the existence of critical phenomena, 97 and found very similar behavior. 98 In addition to providing a first systematic study of liquid-liquid phase separation in globular proteins, these studies also provided evidence that the resulting binodal is actually metastable with respect to the liquid-crystal transition. 91,99 In parallel, work on the phase behavior of colloid-polymer mixtures indicated that the gas-liquid transition could become metastable with respect to the liquid-solid transition, provided that the size ratio polymer/colloid was sufficiently small ( 0.25) to render the depletion interaction between the colloids short range. 13,14,100 The analogy between the phase behavior of colloids with short range attractions and globular proteins exhibiting metastable liquid-liquid phase separation was quickly realized, and immediately explored to advance the difficult and tedious search for solution conditions favorable for growing protein crystals. 15,16,20,21,29 This development was motivated by the pressing need for high quality protein crystals for determining protein structures through X-ray crystallography, and the notorious difficulties in finding appropriate solution conditions for the growth of such crystals. The striking similarity of the known phase diagrams of different globular proteins, and the discovery of George and Wilson 11 that the second virial coefficient B 2 of proteins that could successfully be crystallized were all found in a narrow range of slightly negative values, provided the initial scientific basis. 18,101 Another key ingredient was the "extended law of corresponding states" proposed by Noro and Frenkel,19 which indicated that the reduced second virial coefficient B * 2 could be used as an effective parameter to quantify the strength of the short range attraction and locate the gas-liquid and liquid-solid phase boundaries. 102 All these individual puzzle stones were then put together to a picture, in which globular proteins were believed to interact in a way well described by a (centrosymmetric) short range attractive potential. For such potentials, B * 2 then largely determines the location of the binodal and the liquid-solid phase boundary for a given protein system, and crystallization is favoured in a crystallization window between the liquid-solid boundary and the binodal, and at concentrations below an arrest or gel line expected above the critical concentration (Fig. 4). 16,20,29 This scenario provides a ra-tional approach to protein crystallization, where a measurement of B * 2 and its dependence upon various solution conditions would allow for a relatively rapid screening to find optimal nucleation and growth conditions. However, has this advancement in understanding of the physics of protein phase behavior and crystal formation indeed changed the picture? Unfortunately, growing high quality protein crystals still largely happens via a trial-anderror approach, and we lack predictive understanding of the link between the protein composition and the choice of the optimal solution composition in order to control the kinetics of nucleation and growth. Reality is obviously much more complex, and a lower degree of coarse-graining in the models used to describe protein solutions is clearly required. While fully atomistic simulations of a few hundred proteins have been performed, 103,104 , determining phase diagrams requires hundreds of simulations, which is not yet feasible. We therefore need to find an appropriate coarse-graining strategy that is optimized for the problem at hand.
At this point it is thus helpful to step back and look again more carefully at the existing data. While the available data on experimental phase diagrams is limited to few proteins such as lysozyme and different variants of γ-crystallin only, it covers a variety of different solution conditions. Overall, the similarities between binodals of short-range attractive colloid models such as a squarewell (SW) fluid are striking. 106 However, a closer look has revealed distinct differences between the measured and calculated binodals for a SW model, and required the use of patchy colloid models with directional interactions to achieve much more The optimal region for protein crystallization is the yellow shaded area in zone I, while zone III is prone to gelation. Redrawn after ref 16 . quantitative agreement with experimental data. 105,107,108 This is illustrated with data for lysozyme shown in Fig. 5(top), which clearly shows that the isotropic SW potential cannot reproduce the relatively wide binodal, which is much better approximated by the patchy colloid model. 105 Given the complex surface of globular proteins, this is not really surprising. Similarly, while colloids with isotropic potential form dense crystal structures such as face-centered cubic (fcc), body-centered cubic (bcc) or hexagonal close-packed (hcp), protein crystal structures are generally much more open. 109 Again, these type of structures can indeed be reproduced using patchy models. 110 On the other hand, almost quantitative agreement with the ELCS was found when using an effective protein size σ e f f that includes contributions from repulsive interactions such as screened charges in the normalization of the measured B 2 values, as shown in Fig. 5(bottom). 106 Where does that leave us in our attempt to design a coarse-graining strategy?
Given the large effects that patch number, size and strength can have on the phase diagram of patchy particles, 83 one would not expect that the corresponding binodals for different proteins and solution conditions would follow an ELCS even remotely. However, dramatic changes occur for low numbers of strongly attractive and small patches. 83,112 In proteins, this primarily corresponds to conditions where strong specific interactions lead to the formation of dimers, larger oligomers or other molecular com-  105 ). Region I corresponds to a stable fluid phase, region II to a fluid-crystal coexistence, region III a metastable gas-liquid coexistence and region IV a crystalline phase. Reprinted and adapted from ref. 105 , with the permission of AIP Publishing. Bottom: A comparison of the experimentally determined phase behavior of lysozyme for a broad set of experimental conditions with predictions for an adhesive hard sphere (AHS) and a short-range attractive square well (SW) potential, and a test of the applicability of the extended law of corresponding states, ELCS, as proposed by Noro and Frenkel 19 (NF). Note that here a reduced second virial coefficient b * 2 instead of the normalized second virial coefficient B * 2 is used for the y-axis, where the protein size is replaced by an effective diameter σ e f f in the calculation of the hard sphere value B 2,HS in order to also include other repulsive interactions caused by screened charges. Reprinted and adapted from ref. 106 , with the permission of AIP Publishing.
plexes, which may in fact suppress phase separation that would otherwise proceed via weaker non-specific attractions. 113,114 On the other hand, simulations for patchy particles with different patch interactions show that the binodals of particles with the same patch distribution, but different types of patches, indeed collapse when plotted as a function of the normalized second virial coefficient B * 2 . 110,115 This indicates that we may indeed be able to use a simple measurement of B * 2 as a quite robust parameter to estimate the location of a critical point and binodal for globular proteins, and their dependence upon variations in solution conditions. This is of course good news for example for formulation science. On the other hand, however, when looking at the behavior of patchy particles, the situation of crystallographers is much less fortunate. 29,116 We obviously need much more detail in the anisotropic interaction potential and the underlying coarse-grained structural model of the protein to successfully reproduce or predict protein crystal formation. There were a number of attempts to develop a detailed angularly dependent interaction potential based on known crystal structures in order to calculate B * 2 , investigate protein assembly, or gain insight into protein crystallization. 81,[117][118][119][120][121] However, this requires already that the protein structure is known. Moreover, using fully detailed all-atomistic simulations is still out of question for studies that require a larger number of proteins such as in attempts to calculate phase diagrams, or study structural and dynamic properties of concentrated solutions and mixtures. Here we need improved approaches to define the salient features of interactions between given proteins and translate them into an appropriately coarsegrained patchy model. Moreover, we also need much more work on various processes such as transient complex formation, nucleation and growth in patchy systems, and the influence of internal flexibility in protein crystal formation.
So far we have concentrated on equilibrium phase diagrams for colloids and proteins. As discussed previously, non-equilibrium phenomena such as glass and gel formation are of key importance in these systems, and have attracted considerable attention from the community (Fig.1). In the context of model colloid studies, it has also been suggested that the arrest line for short-range attractive colloids should also scale with B * 2 , and a measurement of B * 2 should thus allow for an estimate of the volume fraction at which arrest should occur for a given temperature or strength of attraction. 122 Dynamical arrest as a result of an arrested spinodal decomposition in globular proteins has been well studied for lysozyme. 53,59,111 However, while the binodal lines for lysozyme at different ionic strength can be rescaled according to the Noro-Frenkel ELCS, the arrest lines that characterize the location of the attractive glass or gel line below the binodal do not follow the ELCS, but seem to depend on temperature only (Fig. 6). 111 This is in contrast to both the experimental work in ref. 122 as well as Brownian and Newtonian computer simulations in ref. 123 . In both cases, the particles chosen were purely shortrange attractive, whereas lysozyme under the conditions chosen to look at an arrested spinodal decomposition has a mixed potential, with a short-range attraction as well as a screened Coulomb repulsion (Fig. 6). In fact, using idealized mode coupling theory (MCT) with such a mixed potential indeed reveals a behavior where the attractive glass transition is temperature-dependent only, and thus not exhibit a B * 2 scaling. 111 This indicates that for interactions typical for proteins in aqueous solutions we may not be able to construct and use a generic state diagram that captures both equilibrium phase behavior as well as non-equilibrium arrest transitions.
Until now we have concentrated on globular proteins only. However, other classes of proteins have gained enormous attention from the community, primarily for reasons connected to the formulation of novel biopharmaceuticals. Prime examples are monoclonal antibodies (mAbs) such as immunoglobulin gamma (IgG), which are considered as a major platform for potential drug candidates. 124,125 However, successful mAb applications require stable and low viscosity high concentration formulations, which are often difficult to achieve as mAbs are prone to exhibit reversible self-association at high concentrations that result in enhanced viscosity, and are also known to exhibit high turbidity due to the presence of liquid-liquid phase separation. 45,[126][127][128][129][130] A number of studies have made attempts to characterize cluster formation in mAb solutions, and to interpret antibody solution properties through analogies with colloids. [129][130][131][132][133][134][135][136][137] This is by no means straightforward due to the non-spherical shape and internal flexibility of mAbs, since interactions between proteins are frequently treated based on spherical approximations, and in particular the enormous effect that specific, directional interactions can have are often not considered.
The binodals observed for mAbs differ substantially from those observed from globular proteins. Their critical concentrations (C c ) and temperatures (T c ) typically are lower than those of globular proteins, and the binodal and spinodal is wider, resembling those of low valency patchy particles. This is illustrated in Fig.  7(top), which shows a comparison between the measured binodals for a mAb, two different γ-crystallins and lysozyme. 138 All binodals are rescaled with their values of T c and C c , respectively. There are indeed a number of quite successful attempts to use patchy particle theory to model protein interactions and the resulting self-assembly and phase behavior. 129,130,135 This is demonstrated in Fig. 7(bottom), which shows a comparison between calculated and measured binodals for two different mAbs. 129 However, there is still need for significant improvements in relating model parameters to the actual composition and structure of the mAb and the solution, and we also need more work on establishing the impact of the high flexibility of the mAb structure.

Protein dynamics and crowding effects in cells and pharmaceutical formulations
While analogies to colloids have primarily been used in order to understand protein interactions and characterize and predict structural properties and phase behavior of protein solutions, we have recently seen an increasing amount of work also devoted to the dynamics of protein solutions. The internal motion, the diffusion of proteins in crowded solutions and their macroscopic flow properties are key issues related to drug design, drug delivery formulations and our understanding of the cellular machinery, yet we lack both adequate theoretical models with predictive power as well as the necessary experimental data needed to validate these models on the required broad range of length and time scales. We therefore also briefly discuss the current status of investigation of protein dynamics in highly concentrated or crowded environments, and how analogies to colloids can help in this field.
Life sciences today are at a crossroads. Current approaches towards understanding the cellular machinery predominantly focus on the molecular level. 139 However, creating an inventory of detailed molecular structures of and interactions between biological macromolecules is simply not enough to fully understand complex systems such as living cells. While the molecules of biology do remarkable and surprising things, they still obey the laws of physics. It is important to realize that collective properties such as the entropy and free energy represent driving forces for chemical reactions, self-assembly, and phase transitions within cells. A quantitative understanding of a cellular system using an in- tegrative systems biology approach will thus have to start from collecting information about molecular interactions, and then devise a statistical physics-based model to arrive at a mathematical/numerical modelling with predictive power. This also requires a sound understanding of dynamic processes such as internal motion and diffusion of proteins, and their dependence on the various interactions present in a crowded environment. This not only poses enormous theoretical challenges, but their experimental characterization on the relevant time and length scales requires novel approaches. It is clear that significant progress in this area can only be achieved on the basis of a concerted longterm effort that combines traditionally disparate intellectual disciplines. It is here where analogies to colloids and the use of the experimental and theoretical toolbox developed for colloids can help.
The interior of living cells is a highly concentrated or crowded medium that can contain thousands of proteins up to very high volume fractions of 40% or more. Diffusion of proteins in cells is essential, as it strongly influences the cellular machinery through numerous processes such as signal transmission or reactions between proteins. In a dense and crowded environment such as the cell, an individual protein will feel the presence and interaction potential of all the surrounding proteins. Direct and hydrodynamic interactions strongly alter diffusion already on length scales comparable to the protein size, and we thus expect that they significantly influence reactions between different proteins compared to the situation encountered in typical in-vitro model experiments. It is thus vital to measure, understand and predict the diffusion of proteins in crowded media. 23,[140][141][142][143] There is an increasing number of studies devoted to this topic, but experimentally these studies primarily rely on the use of potentially disturbing labelling combined with techniques such as fluorescence recovery after photobleaching (FRAP) or fluorescence correlation spectroscopy (FCS). These techniques measure long time tracer diffusion over distances much larger than the protein size. 142,143 Other popular techniques to characterize the self and collective diffusion of proteins at high concentration are NMR diffusion measurements 145 and dynamic light scattering (DLS) 28,144,[146][147][148] . Once again, these techniques characterize self and collective diffusion over distances much larger than the protein size. Short and long time diffusion over distances of order the protein size that are highly important for cellular processes, however, are very difficult to investigate. Here neutron and x-ray based methods such as neutron spin echo (NSE) measurements or x-ray photon correlation spectroscopy (XPCS) in principle are ideally suited as they are capable of characterizing diffusive processes on these short characteristic length scales.
The importance of the appropriate choice of length scales is demonstrated in Fig. 8, which shows a comparison of DLS and NSE results for the concentration and temperature dependence of the short time collective diffusion coefficient of γ B -crystallin. This globular protein is known to exhibit liquid-liquid phase separation, with a critical concentration around c c ≈ 220 − 240 mg/ml, or a critical volume fraction of φ c ≈ 0.15 − 0.17. 91,92,144 The DLS data shows a dramatic effect of both temperature and concentration on the collective short time diffusion coefficient D s c (q) (Fig.  8 top). DLS measures the decay of large length scale density or concentration fluctuations, and for globular proteins not too far away from a critical point these are entirely dominated by critical fluctuations. What we thus observe in DLS is critical slowing down when approaching the critical point or a spinodal. 146,147 Due to the universal nature of critical phenomena, the only thing that matters here is the reduced temperature or distance away from the critical point or the spinodal, and simple coarse grained colloid models of particles with short range attractive potentials Top: Rescaled short-time collective diffusion coefficient in the long wavelength limit as obtained from DLS. The diffusion coefficient shows a strong temperature dependence particularly close to the critical volume fraction resulting from so-called critical slowing down that becomes more pronounced when approaching the critical point or spinodal line by decreasing the temperature. Bottom: Rescaled short-time local (or cage) diffusion coefficient as obtained from NSE experiments at the nearest neighbour peak q * in the respective static structure factors, corresponding to the nearest neighbour distance, and thus probing local dynamics on length scales of the protein size. The local dynamics is not affected by the vicinity to the critical concentration nor does it show any significant temperature dependence. Reprinted and adapted with permission from ref. 144  that can reproduce the binodal and spinodal will also yield the concentration and temperature dependence of D s c . This changes only when approaching the arrest line at very high concentrations close to the arrest line, where the measured correlation functions show an unusual almost logarithmic decay due to the competition between critical slowing down and dynamical arrest, and where more work that also takes into account the effects of patchy interactions on diffusion will be needed. 144 The situation is very different for NSE, where collective diffusion is characterized at much larger q-values or smaller characteristic length scales. The NSE data shown in Fig. 8 has been obtained at a q-value corresponding to q * , the nearest neighbor peak of the structure factor, or a length scale of the nearest neighbor distance or protein diameter. 144 At this large q-value, critical contributions to the structure factor have decayed completely, and S(q) is now dominated by the local structure, i.e., by the nearest neighbor cage formed by the surrounding proteins. On these length scales, collective dynamics characterized by D s c (q * ) corresponds to the relaxation of structural correlations between neighboring proteins or cage diffusion. 31,149 As the interaction potential is only weakly dependent or independent on temperature, the resulting rescaled cage diffusion coefficient D s c (q * )/D 0 , where D 0 is the free diffusion coefficent of γ B -crystallin in the absence of interaction effects, shows no detectable temperature dependence (Fig. 8 bottom). However, the NSE data also shows a dramatic decrease of cage diffusion with increasing concentration, with D s c (q * )/D 0 decreasing by one order of magnitude already for a volume fraction of around 15%. This slowing down of collective short time diffusion is much more pronounced than what one would expect for example for excluded volume effects, where D s c (q * )/D 0 should decrease by only about 30%. 31 There are indeed a number of attempts in the literature to use colloid theory in order to interpret protein diffusion in crowded solutions measured by neutron-based quasi-elastic scattering methods such as NSE or neutron backscattering. 27,31,60,150,151 While for proteins with dominating repulsive interactions such as described by screened Coulomb and effective hard sphere-like potentials agreement is very good, 27,31 this is clearly not the case for proteins with weak short-range attractive potentials such as γ B -crystallin. Attempts to use a coarse-grained colloid model with an isotropic short-ranged attractive potential completely failed in particular at low volume fractions, dramatically underestimating the slowing down of collective diffusion due to crowding effects. Using a patchy model where the overall strength of the effective pair potential as characterized by B * 2 is comparable, demonstrated that weak attractive patches can indeed strongly influence D s c (q * )/D 0 due to the formation of transient clusters and networks (Fig. 9). 31 The importance of transient cluster formation on short time dynamics for short range attractive particles has also been pointed out in other simulation studies. 152,153 With the extreme sensitivity of the collective short time diffusion to the exact interaction potential in patchy systems, it is clear that understanding and predicting protein dynamics in crowded environments on these short length scales requires a considerable future effort for conditions where the interactions are not primarily repulsive, but influenced by directional attractive contributions.
There is however a word of caution needed. The characteristic length scales over which motion is measured in quasi-elastic neutron scattering (QENS) experiments such as NSE or backscattering is not only relevant because of the underlying mechanisms that determine collective diffusion in strongly interacting particle systems. For proteins, this also has other consequences, related to their internal structure and flexibility. At short length scales, smaller than the protein size, QENS also becomes sensitive to rotational diffusion and internal motion. 27,[155][156][157][158][159] Analysing protein diffusion at q-values beyond the nearest neighbor peak is thus often more complex than for synthetic hard colloids due to these additional contributions that need to be taken into account. However, protein conformational dynamics as measured by QENS at larger q-vaues is interesting in its own right, as it is essential for their biological function: A protein must adopt spe- Fig. 9 Simulations of the rescaled local short-time diffusion coefficient (or short-time cage diffusion coefficient) of particles with slightly different shapes and interaction potentials (data taken from refs. 154 and 31 ) as a function of the volume fraction. Symbols denote the simulation results for hard spheres (filled black circles), short-range weakly attractive spheres (open black circles), hard ellipsoids (filled red triangles), shortrange weakly attractive ellipsoids (open red triangles), and short-range weakly attractive spheres with two additional attractive patches (filled blue circles). The dashed lines are guides to the eye. The black line is the theoretical prediction for hard spheres. 149 The axial ratio of the ellipsoids is roughly 1:2. The weakly attractive potential (for the attractive spheres, attractive ellipsoids and patchy spheres) is chosen such to maintain approximately the same position in the phase diagram (i.e. a comparable distance to the critical temperature) for all three systems. This figure demonstrates the enormous effect of a short-range weak attraction on the local short-time diffusion when compared to the purely hard sphere behaviour. It also reveals the tremendous effect of attractive patches (at constant overall attraction strength), and demonstrates the considerable influence of a weak shape anisotropy on the local short-time diffusion.
cific active conformations in order to properly function. In particular large-scale domain motions, which can be induced by ligand binding or are intrinsically related to the protein domain structure, can for example modulate the entry of substrates into active sites and control enzymatic reactions. Neutron spin echo (NSE) spectroscopy is ideally suited for probing domain motions on the required length (nm) and time (nsµs) scales without the need of any perturbing labelling, and has indeed been used successfully in a few cases already. [155][156][157][158][159] It is however also important to point out that this requires an analysis where contributions from translational and rotational diffusion and local internal domain motions can be decoupled. This can be achieved in principle using molecular dynamics simulations (MD) based on the known protein structure. 158 However, this becomes much more difficult at higher concentrations, where direct and hydrodynamic interactions between proteins complicate the resulting dynamics. 27 There is another aspect of proteins that has frequently been neglected when using a colloid analogy to understand and predict diffusion of proteins in crowded environments. Globular proteins are rarely perfect spheres, but rather posses an anisotropic shape. γ B -crystallin for example has a shape that resembles a triaxial ellipsoid, with axial ratios of about 1.7 and 2. 144 The importance of shape anisotropy in the interpretation and simulation of short time diffusion in crowded solutions has been discussed before, but the prevailing assumption has been that for small axial ratios such as those in γ B -crystallin an effective sphere approximation 160 should be adequate to treat interaction effects for these systems. 26,27 However, a recent simulation study using molecular dynamics simulations that includes hydrodynamic interactions based on a mesoscale hybrid approach has clearly shown a strong effect from shape anisotropy for shapes such as those resembling γ B -crystallin, which is particularly pronounced for weakly atractive interactions. 154 Moreover, not only are these effects quite strong, but they also contribute in a non-monotonic way as a function of concentration, as shown in Fig. 9. In order to understand and predict the dynamics of crowded protein solutions, it is thus not sufficient to perform in vitro experiments under dilute conditions and use estimates of the overall strength of interparticle interactions together with standard effective sphere colloid models. There is a clear need for an extension of the often used simple colloid models and to incorporate more molecular features into such coarse-grained models when attempting to describe local short-time diffusion in crowded solutions.
The ability to understand and predict dynamic properties of concentrated protein solutions and mixtures is not only vital for an understanding of cellular mechanisms, but also important for the formulation of future protein drugs. The development and use of mAb-based human therapeutics has for example rapidly evolved during the last 20 years and led to an almost exponential growth in market value. 161 Together with the need for high concentration formulations that allow for facile home administration, this creates the need for an advanced predictive understanding of the stability and viscosity of concentrated protein solutions. 127,162 In particular the frequently observed propensity of mAbs to reversibly aggregate, and the consequences of this concentration dependent cluster formation on the resulting sample viscosity significantly adds to the effects of particle interactions on the flow properties of concentrated formulations. 127,132,163 There is thus a need to improve our analytical capability to investigate concentrated formulations, and to disentangle the effects of interprotein interactions, concentration dependent reversible aggregation, and degradation on the long-term stability and viscosity. The question is whether we can use the different routes towards dynamical arrest discussed in connection with short-range attractive colloids and summarized in Fig. 1 for this purpose?
We can look at this by comparing the available information for three proteins that have been used as well-defined model proteins: lysozyme, γ B -crystallin and α-crystallin. Given the detrimental nature of an increased viscosity in high concentration biologics formulations due to the existence of a dynamical arrest transition, this is a question that is not only of interest for basic research on protein solutions, but of enormous practical importance for pharmaceutical industry. 127,162 Similar to attempts made to use the ELCS in order to link easily measurable experimental parameters such as B * 2 with the thermodynamic stability with respect to a possible liquid-liquid phase separation, B * 2 or its dynamic analogue k D , obtained from DLS measurements, could in principle allow for an early prediction of the location of such an arrest line. This would be particularly attractive as such mea- Fig. 10 The connection between the concentration dependence of the rescaled local short-time diffusion (or short-time cage diffusion 149 ) as obtained from Neutron Spin Echo (NSE) measurements and dynamical arrest for three different proteins (data taken from ref. 31 and 60 ). The dashed vertical lines (same colour code as for the symbols) indicate the macroscopic arrest transitions for the proteins as determined by macro-and microrheology 28,60,164 and DLS experiments 144 . The three proteins show very different arrest scenarios, as showcased in the cartoons: lysozyme, which is investigated here under conditions where the combination of a short-range attraction and a long-range repulsion (low ionic strength, positive net charge (pH ≈ 7.8), T = 5 • C) leads to the formation of equilibrium clusters 25 . These charged clusters form a cluster glass, with a liquid-solid transition at volume fractions as low as φ ≈ 0.28. 60 Solutions of the bovine eye lens protein γ B -crystallin (at T = 25 • C) macroscopically arrest at a volume fraction of φ ≈ 0.35. Computer simulations together with experimental static and dynamic light and neutron scattering data indicate that this is due to a combination of weak short-range attractions and additional patch-interactions, facilitating the formation of large, open clusters and space-spanning networks already at relatively low volume fractions. 144 Finally, the bovine eye lens protein α-crystallin shows an arrest transition at φ ≈ 0.58, reminiscent of colloidal hard sphere systems where at this high volume fraction the particles get trapped in their nearest neighbour cages. 28 surements require small amounts of sample only, which is another key concern in the early stages of drug development. However, when looking at the B * 2 values for these three proteins, the picture looks as follows: Lysozyme is strongly repulsive (B * 2 1) at these low ionic strengths; 25,60 γ B -crystallin is overall weakly attractive (B * 2 ≈ −1); 165 and α-crystallin shows hard sphere behavior, i.e. B * 2 = 1. 28 On the other hand, as shown in Fig. 10, the arrest transition occurs at φ g (Lysozyme) ≈ 0.28 ≤ φ g (γ B -crystallin) ≈ 0.35 φ g (α-crystallin ≈ 0.58. As discussed previously, there is good evidence that for these more complex mixed and/or patchy potentials found for proteins, the ELCS does not hold for the location of the arrest line, and we need to consider the actual arrest mechanisms that act for these three proteins. Lysozyme at low ionic strength at a pH value where it carries a net positive charge interacts via a combination of a short-range attraction and a long-range screened Coulomb repulsion, which leads to the formation of equilibrium clusters. 25 These charged clusters increase in size with increasing protein concentration and form a cluster glass. 60,61 Due to the relatively open structure and the long-range repulsion between the clusters, the corresponding arrest transition occurs already at volume fractions as low as φ g ≈ 0.28. 60 Increasing the temperature reduces the cluster growth slightly, and thus also shifts the arrest transition to somewhat larger volume fractions. 61 It is interesting to note that the concentration dependence of the relative (zero-shear) viscosity η r = η s /η 0 where η s is the zero shear viscosity of the sample and η 0 that of the solvent, is well described by the Quemada relation η r = (1−φ /φ g ) also found for hard sphere systems, indicating that the clusters act as effective hard spheres, with a sphere size given by the combination of the cluster radius and the Debye length. The onset of dynamical arrest is also reflected in the rescaled local short-time collective diffusion coefficient measured at the clustercluster peak in the structure factor, and we see that the cluster growth at higher concentrations results in values of D s c (q * )/D 0 that strongly decrease at higher φ -values, where cluster growth becomes important (Fig. 10). 60 At the solution conditions discussed here, the bovine eye lens protein γ B -crystallin exhibits short time collective diffusion indicative of weak patchy attractions, 31,144 where D s c (q * )/D 0 decreases dramatically with increasing concentration (Fig. 10). Computer simulations of patchy colloids have indicated that this unusually strong decrease is linked to the formation of transient clusters, which become much more pronounced for patchy than for isotropic potentials. 31 These open transient clusters are also responsible for the macroscopic arrest that occurs already at low volume fractions φ g ≈ 0.35, where these networks become space spanning, and where the network then is load-bearing despite the transient nature of the individual bond. 144 Finally, the bovine eye lens protein α-crystallin has been shown to interact via a hard-sphere-like potential, resulting in structural and dynamic properties that are quantitatively described by a colloid model of polydisperse hard spheres (Fig. 10). 28 This hard sphere nature of α-crystallin is also reflected in an arrest transition that occurs at the expected value for hard sphere colloids, φ g ≈ 0.58. Here the mechanism is due to the formation of cages by the nearest neighbor particles, resulting in a two step decay of the intermediate scattering function. 28,36 While the long-time diffusion coefficient, which is related to the escape of particles from the transient cages is not visible with NSE, the short-time D s c (q * )/D 0 is well characterized by the weaker concentration dependence of pure hard spheres. 31,149 The three examples shown in Fig. 10 not only document the dramatically different arrest lines and arrest scenarios for these globular proteins that differ in their effective interaction potential. The figure moreover shows that the measured collective short-time diffusion on length scales of the nearest neighbor distance is also strongly influenced by the potential, and that D s c (q * )/D 0 is extremely sensitive to the actual potential details. While the structural features of these protein solutions can be well reproduced by a strongly coarse-grained model with an isotropic potential (mixed potential for lysozyme, short range attractive for γ B -crystallin, hard sphere for α-crystallin), this does completely fail for the calculation of short time diffusion in the case of the patchy potential of γ B -crystallin. 31 There are two lessons to be learned, one utterly inconvenient and one rather conve-nient from the viewpoint of those who are interested in describing and predicting properties of protein solutions. On the one hand, there is unfortunately no easy way of predicting the existence and location of an arrest transition for concentrated solutions of globular proteins from simple low-concentration measurements such as a measurement of a second virial coefficient. On the other hand, however, measuring short-time collective diffusion on length scales of the nearest neighbor distance (or the protein diameter) offers an exquisitely sensitive test for specific interaction potentials through a comparison of the concentration dependence of the measured and simulated D s c (q * )/D 0 . At the same time, D s c (q * )/D 0 is providing unique insight into the underlying mechanisms of a given arrest transition. We believe that in combination with appropriate simulation schemes, the method has enormous potential for evaluating interactions between proteins and their dependence on small variations in composition such as for example induced by point mutations. The drawback is, however, that the field currently suffers from the very limited availability of the technique, with only a few neutron spin echo instruments existing worldwide.
At first sight, these findings also seem quite discouraging when considering the current challenge to develop stable high concentration and low viscosity biologics formulations. As described before, a key problem in antibody formulation is that mAbs can exhibit enhanced self-association, causing too high viscosities for administering the drug. There have again been numerous studies where researchers tried to determine the molecular origins of self-association, and to predict enhanced self-association and viscosity based on simple low concentration parameters such as B * 2 and k D , albeit mostly with limited success. 130,134,[166][167][168][169][170][171][172] Given the difficulties in trying to reproduce D s c (q * )/D 0 or the location of the arrest line for globular colloids, it seems out of question to arrive at some prediction of the location of an arrest line or the concentration dependence of the relative viscosity η r for mAbs that exhibit a concentration-dependent self-association and a corresponding strong increase of η r at higher concentrations. However, the observation previously made for lysozyme cluster fluids, where η r was found to follow the Quemada relation, 61 clearly indicates that there is hope! If we can use an effective hard sphere model to describe the concentration dependence of the zero shear viscosity for protein cluster fluids, the same may be possible also for self-associating mAbs.
This has indeed been demonstrated for an IgG system, where the problem of enhanced self-association in high concentration antibody solutions and the concomitant high viscosities was tackled using a succession of coarse graining steps. 130 In this study, mAb self-association and viscosity as a function of concentration was characterized using a combination of experiments (static and dynamic light scattering and microrheology), theory and simulations using analogies to patchy colloids. In a first step, selfassociation and cluster formation due to attractive electrostatic interactions between oppositely charged points on the arms (antigen binding or FAB domains) and on the tail (constant or FC domain) was identified as the driving force for the ionic strength dependence of the concentration-induced increase of the relative viscosity. A model that takes into account the anisotropic character of both the mAb shape (Y-shaped particle) and of the mAb-mAb interactions (patchy attraction) was then constructed, based on electrostatic calculations of a single antibody molecule in buffer, and MC simulations and analytical calculations were performed, the latter using an analytic solution based on Wertheim 173 theory. A comparison between calculated and measured values of the concentration dependence of the isothermal compressibility or apparent molecular weight provided quantitative values for the strength of the attractive patches. In combination with so-called hyperbranched polymer theory 174 this directly provided the cluster size distribution as a function of concentration.
To further predict the dynamic properties, an additional coarsegraining step was made, by considering mAb clusters rather than single mAb molecules as effective hard or sticky hard spheres. Here the cluster size distribution from the initial fit to the static light scattering data was used together with the corresponding theories for the concentration dependence of the apparent collective diffusion coefficient or hydrodynamic radius for these two models. The only additional free parameters in this step were a constant required for the rescaling of the volume fraction from the one based on individual mAbs to the one describing the cluster fluid, and the stickiness parameter for the sticky hard sphere model. In a final step, the relative viscosity η r was calculated for each concentration without any additional free parameter using the Mooney relation η r = exp[2.5φ HS /(1 − φ HS /φ g )] often applied for polydisperse hard spheres, 175 and assuming φ g = 0.71 based on literature values for hard spheres with comparable polydispersity. Remarkably, the measured data -the apparent molecular weight as obtained by static light scattering, the apparent hydrodynamic radius from dynamic light scattering and the zero-shear viscosity from microrheology measurements -were in good agreement with the model predictions, showing that excluded volume interactions between the assembled clusters are indeed at the origin of the strong increase of the viscosity at high mAb concentrations. Moreover, the attraction strength of the patch-patch interactions calculated from the Wertheim fit to the static light scattering data agreed remarkably well with an estimate based on screened Coulomb interactions and a charge distribution on the IgG based on electrostatic calculations for the specific type of immunoglobulin. This is indeed quite encouraging, and indicates that this could be a suitable approach to investigate and quantitatively assess the effects of additional excipients or chemical modifications on the antibody interaction. Such a model could then be used to estimate their effect on antibody stability and the resulting viscosity from molecular information, which would be vital for an advanced formulation strategy where unpromising candidates could be discarded at an early stage. Another interesting feature of such a successive coarse-graining procedure with appropriate patchy models is that it allows to determine the interaction strength and the cluster size distribution as a function of concentration for a given mAb system, which can subsequently be tested with DLS and (micro)rheology measurements without additional free parameters other than a rescaling of the volume fraction. The development of theoretical models to describe mAb self-association into clusters, and its consequences on other easily measurable structural and dynamic properties, has been in the focus of the community. With this approach, such models can be submitted to a critical assessment. Obviously this type of approach could easily be extended to other proteins that also display patchy interactions, and would allow for example to investigate various arrest scenarios for such systems as a function of key solution parameters.

Conclusions
This perspective article was initially largely motivated by the notion described in ref. 33 , questioning altogether the usefulness of applying colloid-like theory to protein solutions. Having spent many years in trying to gain understanding of the structural and dynamic properties of protein solutions with such an approach, this asked for a re-evaluation of the situation. With the discussion presented above, we hope that it becomes clear that the statements made in ref. 33 rather reflect a too narrow view of what colloids are than a serious game stopper. Colloids are more than simple hard spheres, and in particular the recent trends in synthesizing, characterizing and simulating patchy, responsive and anisotropic particles have provided a wealth of information and new models that can be harnessed to learn much more about protein solutions. However, we have also seen that while the currently existing models indeed allow us to make fairly precise predictions about the phase behavior and the static properties of proteins under well-defined solution compositions (keeping in mind that quantities such as the interaction potential, the size and the shape of synthetic colloids also do not necessarily remain invariant as a function of pH, temperature or concentration), the situation already changes unfavourably when aiming at processes such as crystallization where angularly resolved interaction potentials and local dynamics matter. We clearly lack well-defined procedures to extract the relevant details from a 3D structure of a particular protein and then create a maximally coarse-grained patchy model that retains just the necessary anisotropic interaction features for understanding or predicting a specific property.
This becomes even worse when interested in dynamic properties such as length-scale dependent collective and self diffusion and viscosity. Here we suffer primarily from two deficiencies: First of all, the existing work on patchy colloids has primarily focused on self-assembly and phase behavior. Very little attention has however been given to dynamic properties, and we thus find a lack of suitable theoretical models that could help us to understand the dynamics of proteins at higher concentrations. While there have been isolated attempts to harvest the significant progress made in the understanding of diffusion in concentrated colloidal suspensions and apply the same concepts and theoretical models to protein solutions, we are still unable to predict local and global dynamic properties of dense protein solutions based on the known molecular structure of the individual components, or to assess the effect of single point mutations on them. The existing theoretical models for (spherical and isotropically interacting) colloids cannot be directly transferred to protein solutions due to their complex structure and internal flexibility, and there is correspondingly a considerable lack of quantitative characterization and understanding of protein diffusion in crowded environments on the required molecular level. While for example our understanding of polymer dynamics has advanced tremendously since the early 1980's by the availability of neutron spin echo and other quasielastic neutron techniques and the simultaneous progress in polymer theory and multi-scale modelling techniques, 174,176 the corresponding field of protein dynamics has clearly lacked similar progress. There is thus an urgent need for improving our ability to characterize, interpret and predict the dynamic properties of individual proteins as well as of their concentrated solutions and mixtures.
Moreover, the experimental toolbox of colloid physics has primarily been developed for investigating larger colloids with sizes of several hundred nanometers, and the wealth of existing data on colloid dynamics comes mainly from techniques such as DLS or confocal laser scanning microscopy (CLSM). Proteins are much smaller, and there are only a few techniques that can for example deliver measurements of short and long time collective and self diffusion on length scales of the protein size. Neutron and Xray based scattering methods have the potential to deliver the required experimental data; however, we lack a theoretical and simulation toolbox similar to what already exists for structural properties and the interpretation of SAXS and SANS data. While detailed near-atomistic simulations and a comparison with neutron and X-ray based spectroscopic data is feasible for reproducing the internal dynamics of individual proteins, a similar approach is not viable for crowded solutions and mixtures. We need significant methodological development, including developing both advanced coarse-graining approaches in simulation and a combination of several experimental methods such as NSE and X-ray photon correlation spectroscopy (XPCS). 177 This needs to go hand in hand with the development of new sample environments to account for the very limited quantities of proteins typically available for such investigations, and the design of new measurement and data analysis procedures to overcome the notorious beam damage problem when working with intense synchrotron radiation.
However, having discussed all what is still missing, we should nevertheless not end without pointing out the enormous progress already made. We believe that a colloid approach to protein solutions has indeed an enormous potential in many areas as diverse as cell and systems biology, pharmaceutical formulation and materials sciences. The essential point in applications of concepts from colloid physics to proteins, however, has to be a case by case reflection on the required level of coarse graining needed for a given problem, and a critical choice of the experimental techniques and data chosen for a meaningful test of model predictions.

Table of Contents entry
We critically discuss the application of colloid science concepts to better understand protein solution properties in the entire concentration range.