I. S.
Mian
a and
C.
Rose
b
aLife Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA. E-mail: smian@lbl.gov
bWireless Information Network Laboratory, Rutgers University, North Brunswick, NJ 08902, USA. E-mail: crose@winlab.rutgers.edu
First published on 22nd March 2011
In this Perspective, we propose that communication theory—a field of mathematics concerned with the problems of signal transmission, reception and processing—provides a new quantitative lens for investigating multicellular biology, ancient and modern. What underpins the cohesive organisation and collective behaviour of multicellular ecosystems such as microbial colonies and communities (microbiomes) and multicellular organisms such as plants and animals, whether built of simple tissue layers (sponges) or of complex differentiated cells arranged in tissues and organs (members of the 35 or so phyla of the subkingdom Metazoa)? How do mammalian tissues and organs develop, maintain their architecture, become subverted in disease, and decline with age? How did single-celled organisms coalesce to produce many-celled forms that evolved and diversified into the varied multicellular organisms in existence today? Some answers can be found in the blueprints or recipes encoded in (epi)genomes, yet others lie in the generic physical properties of biological matter such as the ability of cell aggregates to attain a certain complexity in size, shape, and pattern. We suggest that Lasswell's maxim “Who says what to whom in what channel with what effect” provides a foundation for understanding not only the emergence and evolution of multicellularity, but also the assembly and sculpting of multicellular ecosystems and many-celled structures, whether of natural or human-engineered origin. We explore how the abstraction of communication theory as an organising principle for multicellular biology could be realised. We highlight the inherent ability of communication theory to be blind to molecular and/or genetic mechanisms. We describe selected applications that analyse the physics of communication and use energy efficiency as a central tenet. Whilst communication theory has and could contribute to understanding a myriad of problems in biology, investigations of multicellular biology could, in turn, lead to advances in communication theory, especially in the still immature field of network information theory.
Insight, innovation, integrationOne of the major transitions in evolution is the emergence and radiation of complex multicellular organisms from single cells. At the heart of multicellular biology, ancient and modern, lies the dynamic and reciprocal exchange of “information” within and between cells and their environment. The concept of “communication” is not restricted to evolutionary developmental biology, normal or aberrant, and pathology. Communication theory is a field of mathematics concerned with the problems of signal transmission, reception, and processing. Currently, this field is primarily the province of telecommunications engineering. In this Perspective, we propose that communication theory provides an invaluable theoretical framework for exploring problems relevant to multicellularity. A key virtue of communication theory is its ability to be blind to molecular and/or genetic mechanisms. By analysing the physics of communication and using energy efficiency as a central tenet, communication theory has the potential to enhance understanding of and yield unexpected insights about biology that are of theoretical and experimental utility. |
Who says what to whom in what channel with what effect. Harold Lasswell's working definition of communication.
The major evolutionary transitions are from replicating molecules to molecules in compartments, independent genes to chromosomes, RNA to DNA plus RNA, DNA plus RNA to the genetic code and proteins, prokaryotes to eukaryotes, asexual clones to sexual populations, single cells to complex multicellular organisms, chemically- to neurally-integrated individuals, solitary individuals to colonies and social groups, and primate groups to human linguistic communities.1,2 One broad class of transitions led to new entities constructed from previously autonomously reproducing units (protocells, chromosomes, eukaryotic cells, populations and species, multicellular organisms, and integrated social groups). The other class of transitions led to the sophistication of the internal organisation and information processing capacity of an entity (cells with DNA, cells with translation, animals with nervous systems, and linguistic societies).
A hallmark of these evolutionary transitions is changes in the way information—autocatalytic systems, (epi)genetic, behavioural, and symbolic—is stored, transmitted, and processed.1,2 Information can be stored in protocells, chromosomes, DNA, populations, differential gametes, social networks and symbolic networks, as codes, and encoded in neural impulses and circuits. Information transmission between generations can occur through reproduction (replicating molecules transmitted as groups, genes transmitted synchronously in linked groups, DNA replication, coordinated reproduction of the new entity, and segregation and recombination) or through communication (social learning, and linguistic communication). Information transmission among cells and individuals can occur via(bio)chemical communication (multicellular organisms) or neural communication (animals with nervous systems). Irrespective of the mode of storage and means of transmission, whenever there is a flow or exchange of information, there is, as embodied in Lasswell's maxim, communication.
Although both biosemiotics and communication theory consider the transmission, reception and processing of signals from one place or instant to another, they provide different perspectives on Lasswell's maxim. With its roots in philosophy and linguistics, biosemiotics is primarily a descriptive and qualitative study of the semantics of communication in living systems. The engineering origin of communication theory means that this mathematical and quantitative study of communication addresses challenges such as point-to-point communication and information collection by sensor networks, topics that are relevant to biology across a wide range of spatiotemporal scales such as from molecules to languages.
To date, information- and coding-theoretic studies have focused on molecular biology and neuroscience problems pertaining to sequencing technology (polymerase chain reaction, sequence related data-compression, DNA base calling), molecular interactions (proteins and nucleic acids, proteins and domains, biological networks), theoretical sequence analysis (DNA mutations and disease, whole genome alignment), heredity, ensembles of neurons, neural spike trains and higher functions of cognitive systems.7–19Nucleic acid- and protein-related studies of this type have the potential to shed light on organic codes because these two classes of molecules contribute to the synthesis and primary structures of the basic macromolecular components of all cells. However, scant attention has been paid to another ubiquitous class: carbohydrates (glycans or sugars)—a term that includes monosaccharides, oligosaccharides, polysaccharides, and derivatives of these compounds—as well as glycoconjugates, glycans attached covalently to other molecules such as proteins and lipids.20–24
Superficially, the paucity of information- and coding-theoretic studies of carbohydrates can be explained by glycomics being a less mature field than genomics and proteomics.25–28 A deeper explanation is the more complex and dynamic nature of the glycome—the entire complement of carbohydrates, whether free or present in more complex molecules, of an organism or a cell.29–35 In humans, the size of the glycome is orders of magnitude larger than the proteome and ∼2% of the genome is estimated to encode proteins that make, modify, localise or bind carbohydrates. The distinctive features of carbohydrates include their branched structure, the diversity of secondary modifications of monosaccharides, the importance of the spatiotemporal and molecular context in which they occur, and the indirect relationship of carbohydrates to the genome.
Communication theoretic studies of the glycome and the glycocode, the complex information conveyed by glycans and glycoconjugates, would increase understanding of the major events in macroevolution and extant multicellular biology. For example, the biological communication mediated by glycans underlies diverse molecular, cellular, and tissue functions and plays critical roles in development, health and disease.
Reliable intra- and intercellular communication is essential for not only multi-species and multi-kingdom multicellular ecosystems, but also for every multicellular plant and animal species. In metazoans for example, such communication is a component of homeostasis (the ability of tissues, organs and organ systems to maintain a relatively constant internal environment regardless of the conditions in the external environment) and resilience (the capacity of a system to recover from a stochastic or deterministic shock, insult or disturbance). The issues of homeostasis and resilience in Metazoa permeate the work of Dr Mina Bissell, the honouree of this themed issue of Integrative Biology. Her research examines microenvironmental influences on gene expression and tissue specificity in normal and malignant breast tissues.48,49 She has highlighted the need to culture mammalian cells in physiologically relevant three-dimensional (3D) microenvironments and demonstrated that such experimental techniques generate reasonable ex vivo many-celled structures that approximate the ducts, acini, and other functional units and architectural elements observed in the breast.
A fundamental question in evolutionary developmental biology is how single-celled organisms coalesced to produce many-celled forms that diversified and evolved into the varied microbiomes and multicellular organisms in existence today. We suspect that ex vivo 3D organotypic cell culture models are useful systems for probing both ancient and modern metazoans. For example, models of the basic “building blocks” of tissues and organs may provide physicochemical and structural insights about the hollow balls, tubes, many-celled forms and other morphotypes postulated to have existed during the emergence and radiation of multicellularity. Architectural and functional insights may emerge from comparative analyses: a 3D organotypic model of a mammary gland acinus—a hollow ball of cells—can be seen as but a lumen away from the rosette-shaped colony of Salpingoeca rosetta, a choanoflagellate that can exist either as single cells or ∼4–50 cells arranged in closely packed spheres.50
Here, we propose that communication theory provides a new quantitative lens for multicellular biology and illustrate how the intersection of these fields is a fertile ground for re-examining existing topics and exploring new subjects. We suggest information- and coding-theoretic approaches that could be used to analyse aspects of the glycocode, an organic code critically important for multicellularity and its (dys)functions. We probe the fundamental question “How much information is necessary to specify a physical structure?” This issue is woven through Dr Bissell's genotype-phenotype studies: what, when, where, why and how is information exchanged amongst and between cells and their microenvironment, and what is the relationship between dynamic reciprocal communication and tissue form and function, normal and aberrant. We consider the information–physical structure question from the stand point of two “structural codes” and outline some possible research avenues. The first code is the stochastic mapping between the spatiotemporal organisation of chromosome territories in the cell nucleus, an increasingly important epigenetic mechanism, and the phenotype of the cell: communication theoretic studies might help to clarify the role of nuclear architecture in evolutionary developmental biology and cell (de)differentiation. The second code is the stochastic mapping between the orientation of the mitotic spindle and the plane of cell division in the context of single cells and many-celled structures: communication theoretic studies might help to reveal what a cell could possibly know in the isolated versus collective state.
Our abstraction that communication theory is an organising principle for multicellular biology is in accord with and responsive to recent arguments for “a new kind of molecular biology, the molecular biology of organization” and studies centred on the cell, a fundamental unit of structure, function, and organisation.51 This initial foray into building bridges between two seemingly disparate fields focuses less on genes and more on principles governing the origins (evolutionary developmental biology), generation (morphogenesis), maintenance (homeostasis and resilience), subversion (infectious and chronic diseases such as cancer and immune disorders), and decline (ageing) of multicellularity. Here, we explore flows and exchanges of information that have the potential to address basic questions in the biology of Metazoa and that could illuminate (dys)functions of cells, tissues and organs.
This Perspective is organised as follows. Section 2 provides an overview of the evolutionary developmental biology of multicellular animals, and the Developmental Genetic Toolkit implicated in metazoan evolution. Section 3 summarises existing work on Dynamical Patterning Modules (DPMs), physical forces and processes mobilised by subsets of the molecules in the toolkit. A new DPM, mitotic spindle orientation, is proposed and examined with respect to the coherent growth of and (re)shaping of many-celled structures (hollow balls and tubes) and a possible role in normal and malignant epithelial tissue biology. Section 4 discusses communication theory as an organising principle for multicellular biology, provides a primer on well known and useful concepts and theorems, and describes some extant illustrative applications of information and coding theory. Section 5 provides the broad brush strokes of how concepts and ideas from information theory, coding theory, and communication theory might be utilised to address topics that are relevant to multicellular ecosystems and many-celled structures in general, and metazoan tissues and organs in particular. Section 6 explores what communication theory might contribute to multicellular biology and vice versa.
Choanoflagellates are a group of free-living unicellular and colonial flagellate eukaryotes considered to be the closest living relatives of Metazoa. Their ultrastructural and functional characteristics are conserved in the specialised “feeding cells” of sponges, eukaryotes considered to be living models of the last common metazoan ancestor. Choanoflagellates can attach to and engulf bacteria, a process that could be the single-cell antecedent to cellular adhesion. Protozoa can respond to both other organisms and their environment based on secreted proteins, a potential precursor to the kind of cell-to-cell signalling essential for multicellularity. The genome of the choanoflagellate Monosiga brevicollis harbours versions of metazoan cell signalling and cell adhesion gene families.54
Placozoa are small, flattened non-parasitic multicellular animals with a simple body that consists of an outer layer of simple epithelium enclosing a loose sheet of stellate cells resembling the mesenchyme of some more complex animals. The genome of the only named placozoan Trichoplax adhaerens harbours versions of transcription factors and components of pathways such as cell signalling and cell adhesion that are associated typically with diverse cell types and developmental processes in more complex metazoans.55
Sponges have simple body plans that lack organs, muscles, and nerve cells. They are a somewhat loose association of cells in which the internal and external environments are undifferentiated (in terms of ionic homeostasis) and possess some of the most ancient tissue (differentiated epithelia). Sponges lack abundant intercellular junctions and a basement membrane, two features that contribute to the architecture and mechanical and absorptive/transport properties of eumetazoan epithelial cells. The genome of the marine sponge Amphimedon queenslandica harbours genes associated with processes that are the hallmarks of Metazoa and whose dysfunctions can drive diseases such as cancer, a failure of social controls on multicellularity, and autoimmune disorders, the disruption of distinctions between self and non-self.56 These processes are cell–cell and cell–extracellular matrix adhesion, developmental signalling and gene regulation, regulated cell cycling and growth, programmed cell death, specialised cell types, and allorecognition (an event central to adaptive immunity, the ability to recognise and remember specific pathogens and to mount stronger attacks each time the pathogen is encountered) and innate immunity (an older, evolutionarily conserved, “non-specific” defense mechanism that defends the host from infection by other organisms). Thus, almost all/most of the key genetic prerequisites for Metazoa were in place well before sponges split from other animals.
Neither choanoflagellates nor placozoans nor sponges possess true tissues organised into germ layers and an embryo that goes through a gastrula stage. Despite this, the presence of the toolkit in these organisms attests to their value as living models of the generation, maintenance, subversion, and decline of mammalian tissues and organs. Therefore, insights about microenvironmental influences on gene expression and tissue specificity in normal and malignant human breast tissues are likely to emerge from studies of species in the order Choanoflagellida, phylum Placozoa, and phylum Porifera. Indeed, 3D organotypic models where mammalian cells are replaced by cells from such species could illuminate not only normal biology and pathology, but also the evolution and diversification of Metazoa.
All metazoan body plans and organs are hypothesized to have emerged via a process of self-organisation using a “pattern language” for multicellular form, that is, a collection of Dynamical Patterning Modules (DPMs).59,60,62,63 A DPM consists of a set of molecules (usually proteins) produced by a cluster of cells and the suite of physical processes they mobilise so as to generate an aspect or to alter the form (shape, size, topology) or pattern (specific arrangement of cell types) of the cluster. Table 1 lists DPMs that have been enunciated to date.
DPM | Molecular constituents | Physical principles | Role in evolution and development |
---|---|---|---|
ADH | Cadherins | Adhesion | Multicellularity |
LAT | Notch | Lateral inhibition | Coexistence of alternative cell states |
DAD | Cadherins | Differential adhesion | Phase separation, tissue multilayering |
POLa | Wnt | Cell surface anisotropy | Topological change, interior cavities |
POLp | Wnt | Cell shape anisotropy | Tissue elongation |
ECM | Chitin, collagen | Stiffness, dispersal | Tissue solidification, elasticity, epithelial mesenchymal transformation |
OSC | Wnt + Notch | Chemical oscillation | Segmentation, periodic patterning |
MOR | TGF-β/BMP, FGF, Hh | Diffusion | Pattern formation |
TUR | MOR + Wnt + Notch | Dissipative structure | Segmentation, periodic patterning |
ASM | FGFs | Asymmetric interaction | Induction; epithelial-mesenchymal interaction |
MIT | MAPK | Mass increase | (Differential) growth |
APO | Bcl-2 | Mass decrease | (Differential) cell loss |
DPMs connect molecules that coordinate organismal development and the physical properties and processes of form. Developmental Transcription Factors (DTFs) are gene regulatory molecules that mediate cell type- and region-specific functions. DPMs were present in unicellular organisms but assumed their physical role—rapid and exhaustive exploration of many-celled forms—only in the context of multicellularity. Although DTFs are as ancient as DPM genes, their role in the unicellular world was mediating transcriptional responses to internal and external signals. Given their highly conserved nature, DPM molecules and DTFs can be seen as two broad categories of the toolkit.
The actions and interactions of DPMs on single-celled organisms as well as transient and variable many-celled forms generated polymorphic multicellular structures such as internal body cavities, appendages, segmentation, and primitive organs such as hearts and eyes. DTFs switched on and off certain genes and their main role was mediating the cell- or tissue-specific effects of DPMs and hardwiring the molecular players of DPMs in regulatory networks. Natural selection built upon, fine-tuned, stabilised, and locked in already-emerged but highly plastic ancient many-celled forms rendering these innovations in structure robust. Although extant DPMs play a role in the development of Metazoa, the price paid for stable body plans and organs is severe constraints on the ability of these DPMs to explore morphospace freely.
Tissue mass structures are made using molecules that are predisposed to mobilising certain processes of the physical world. The adhesion DPM (ADH in Table 1) is the sine qua non of multicellularity: the single-celled ancestors of Metazoa had cadherins on their surfaces even though those cells did not use cadherins to stick to each other.65 Since the function of a cell surface cadherin as a sticky protein is environmentally dependent, only certain milieus would have allowed cadherin-bearing single-cell organisms to clump together. The integrin family of metazoan extracellular matrix adhesion molecules are critical mediators of the actions of many DPMs. Although homologues are present in choanoflagellates, true integrins are found in the sponges where they mediate both adhesion and outside-in signaling. Thus, integrins acquired a novel morphogenetic function with the emergence of multicellularity and the organisational architecture provided by an extracellular matrix.
Tubes and hollow balls are two many-celled forms that approximate key functional units and architectural elements observed in many tissues and organs. In the mammary gland for example, a luminal epithelial cell layer forms the ducts (tubular structures) and secretory acini (balls of cells containing a central lumen), and a basal myoepithelial cell layer provides the forces for secretion. This epithelium is embedded in a fat pad composed of adipocytes, blood vessels, fibroblasts and immune cells. Human cancer mortality is attributable largely to the local invasion of tumours into adjacent normal tissues and the dissemination and metastasis of tumour cells to distant sites—processes that can be viewed as epithelial morphogenesis gone awry.40,41,48,71,72
We propose that mitotic spindle orientation is a new DPM, a single-cell function that antedated the evolution of multicellularity and takes on unanticipated roles when acting in a social context. For example, Cdc42, a Rho-GTPase common to animal cells and yeast, directs the mitotic spindle in one cell toward the point of contact with another in some immune responses.80 Assuming this interaction arose early, the role of this new DPM in evolution and development is postulated to be the coherent growth of and (re)shaping of tissue mass. Fig. 1 illustrates how this new DPM might play a role in normal and aberrant tissue biology.
![]() | ||
Fig. 1 The physical consequences of mitotic spindle orientation, a single-cell effect, in many-celled forms and a possible role for this behaviour in the collective state in tissue morphogenesis and cancer. Top Acini (balls of cells containing a central lumen) and ducts (tubular structures) are building blocks of many tissues and organs. One aspect of their formation can be abstracted as cell divisions, local events, in the context of a multicellular structure whose global architecture is a tube or a hollow ball. MiddleCytokinesis in the plane of a cell layer results in expansion of the layer (left). In a hollow ball, cell division at a point on the surface along any tangent increases the ball's radius. In a tube, cell division at a point on the surface along a direction parallel to the long axis L increases the tube's length, whereas division perpendicular to L increases its radius. Cytokinesis out of the plane of a cell layer can result in growth towards the lumen (for example, the first stage in ductal carcinoma in situ) (middle) or towards the basal cell direction (for example, branch initiation) (right). Bottom Globally oriented spindles could be induced by the local biochemical milieu (for example, the presence of morphogens) and/or physical forces (for example, tension generated by cell–cell, cell–extracellular matrix, and cytoskeleton–spindle interactions) on one or both faces of a cell layer and be distributed in a uniform or localised manner (green; only one spindle orientation in one cell is shown). Other putative globalisation mechanisms include a domino-like effect or something akin to magnetisation (local cell interactions nucleated by a domain of oriented cells). |
Compact (short length) representations are desirable for reasons that can almost always be related to efficient energy usage, a key desideratum of both human-engineered and natural biological systems. Consider the following question,
Given a set of unknown outcomes {am}, with relative frequencies (probabilities) {pm}, m = 1,…,M, what is the minimum average number of alphabet symbols needed to specify unequivocally which outcome occurred?
The answer is provided by the Coding Theorem81 which states that the minimum average number of symbols R(n) necessary to code the {am} using an n-ary alphabet obeys
Consider the communication problem associated with the “signal transduction code,” for example, a cell (sender) transmits signals to its environment (receiver) seeking direction about whether to divide symmetrically or asymmetrically, how to orient its mitotic spindle, whether to self-renew or to differentiate into a specialised cell type, and so on. Suppose this direction takes the form of a signal sequence {Xi} emitted by the sender and is interpreted by the receiver as some external (or internally processed) corresponding set of signals {Yi} that are corrupted by various sources of uncertainty. It is important to note that the physics behind the {Xi} and {Yi} are assumed arbitrary. All that is necessary is some stochastic mapping X → Y from sender to receiver summarised by a probability distribution fY|X().
Given our previous description of the number of bits necessary on average to provide answers, the natural fundamental question is how many bits per “channel use” (transmission of an Xi in this case) can the sender possibly convey in a reliable fashion? The term “reliable” means “where the probability of error can be made arbitrarily small by employing codes which span multiple channel uses.” A “code” is simply a sequence of symbols which correspond to a message. The answer is provided by the powerful Channel Coding Theorem81 which states that the capacity, C, of the channel is
This question of continuous versus discrete descriptions of an underlying reality cannot be and is not ignored. Often, we cannot simply assume discrete representations, for example, a membrane voltage or a chemical concentration. Rate distortion theory81,82 specifies how much information is necessary to approximate an information source under some fidelity criterion when a complete source specification is too large. With distortion measure d(X,q(X)) where q(X) is a finite (or “quantized”) representation of X, we can then ask how many bits per source symbol, R(ε), are required on average to represent an information source X under an average acceptable fidelity criterion ε. Rate distortion theory tells us
Consider the scenario where the signal upon which a receiver acts lies at the end of a cascade of intermediate processes. If these processes are deterministic, we could imagine them summarised by Z = g(Y) where Y is, say, the first signal that impinges on the cellular sensing apparatus and Z is the signal upon which the cell acts. The Data Processing Theorem81 states
I(Z;X) = I(g(Y);X) ≤ I(Y;X) |
I(Ω;X) ≤ I(Y;X) |
Note that nowhere has a particular coding scheme been specified. The information-theoretic concepts described provide bounds on information transfer regardless of the scheme used. That is, information theory is in some sense “mechanism blind.”
The well-known signal to noise ratio (SNR) is a simple but powerful example of mechanism-blindness in telecommunications theory. Consider a channel described by
r(t) = s(t) + w(t) |
A variety of similar bounds exist for efficient coding of specific objects and networks of communicating elements.81 It is this generality and implicit reduction of complexity that constitutes the power of a communication theoretic lens. Therefore, by analysing the physics of communication and using energy efficiency as a central tenet, communication theory may help to refine and to extend our understanding of communication in multicellular systems, be they natural or human-engineered, ancient or modern.
Perhaps the most overlooked and important feature of communication theory is that it can be used to determine what a cell can possibly know based only on the physics of the problem as opposed to detailed knowledge of genes, molecules, and networks—essentially an application of the data processing theorem to determine what is and is not possible for molecular, cellular and tissue behaviour. In other words, communication theory is blind to genetic and/or molecular mechanisms. This key notion is illustrated by the following examples.
Information transmission in a molecular machine was studied87 by comparing the implicit information necessary for protein binding to a specific DNA site and the energy involved in the transformation from bound to unbound state. That is, for DNA binding proteins, the before and after states correspond to the molecule bound to a non-specific site anywhere on a DNA molecule and to a specific functional site respectively. The isothermal efficiency of binding (for which neither external energy is supplied nor is any energy extracted from the system) was computed using the minimum energy dissipation required by a molecular machine to gain one bit of information (Emin = kBTln(2)) and the information in DNA sequences bound specifically by proteins (sequence logos). For a bistable molecular machine in which the two states are distinct, the maximal isothermal efficiency was calculated to be 70%.
Although these studies pertain to molecular machines in a cell rather than multicellular systems, their basic ethos is the same as ours. We hope to identify physical quantities of interest, include explicitly energy usage and then seek to understand the limits of operation (characterised essentially by information transfer) based on available energy. The specific result that DNA binding proteins make efficient use of energy gives us hope that the operation of biological systems at higher levels of organisation can be treated similarly. That is, efficient energy usage is a hallmark of biological systems and might be used to narrow the range of potential mechanisms by excluding those which provably cannot achieve the requisite information transfer with the available energy. That said, the extent to which the results for DNA–protein macromolecular complexes can be applied to other molecular machines and to multicellular systems remains to be seen.
Dictyostelium discoideum is a eukaryote that transitions from a collection of unicellular amoebae into a multicellular slug and then into a fruiting body within its life time. Information theory was employed to investigate how this organism can chemotax even when it is unclear whether the physics of the situation (cell/receptor positioning and stochasticity) could allow reliable measurement of spatial chemoattractant gradients.88,89 The study considered two random variables: the true gradient and the direction in which a cell moves. The underlying, seemingly counterintuitive, idea was that given the direction chosen by the cell, the less information provided by the true direction, the better the cell's decision must be. That is, if the cell generally chooses something approximating the correct direction, knowledge of the true direction does not add much additional knowledge. Thus, the study sought to minimise the mutual information between the true direction and the chosen direction.
The theoretical analysis found that the organism can chemotax efficiently by temporally sampling the gradient during directed stochastic cell motion. Furthermore, this observation explained the apparently puzzling result that cells could chemotax even when they lacked direct gradient information.88,90 Overall, information theory—the mutual information between the environmental and behavioural variables of interest—allowed the bounds of what was possible to be outlined without explicit consideration of the specific molecular workings of cells.
From a true communication theory perspective, the analysis was limited by its neglect of energy. Since cell mobility requires energy expenditure, it would be interesting to consider the energetics of mobile cell-driven gradient sensing compared to the maintenance of structures that perform spatial sensing in other ways, for example, anisotropic proliferation/retraction of surface receptors or even extension of (possibly mobile) filopodia to amplify spatial gradients. Put another way, a communication theoretic approach would consider what information is potentially available and then what cost in energy is required to secure the information. Hence, communication theory has the potential to both explain existing biology and perhaps even to identify under- or unappreciated biological solutions to a given problem.
To the best of our knowledge, this is the first use of information theory in a biological system where a single observable is related to more than one “channel input” (in this case, AI-1 and AI-2 receptor occupancies). From our perspective, the most encouraging result is that the information-theoretic approach generated new insights amenable to subsequent experimental investigation without explicit knowledge of biochemical/signalling circuits and mechanisms.
Perhaps the most striking finding is that cells might have evolved the capacity to combat information suppressing mismatch by employing a feedback mechanism to adjust dose response. A known signalling protein (Sst2) was found to have a previously uncharacterised role in generating such feedback. Overall, an information theoretic approach increased understanding of cell signalling, once again, without a search for explicit mechanisms and signaling agents.
Molecular codes, mappings that relate symbols and meanings written in one molecular language into another, are components of evolutionary transitions. For example, cells with translation possess the redundant, error-correcting genetic code which reads DNA base triplets (codons) and decodes them as amino acids or a stop signal. Information theoretic concepts were used to model the genetic code as a one-way noisy information channel that receives an input of codons (symbols) and outputs the corresponding amino acids (meanings).93 The statistical physics model was used to optimise a quantity representing the conflicting needs for error tolerance, diversity, and minimal energy cost. A code emerged when the benefits of specificity, the controlled synthesis of functional proteins due to non-random associations of codons and amino acids, exceeds the cost of producing and maintaining molecules able to recognise and distinguish targets from lookalikes. A similar topological rate distortion analysis has been applied to the problem of protein folding.94 The evolution of the genetic code was been studied using rate distortion theory and non-equilibrium dynamics constrained by the availability of environmental metabolic free energy.95
In order to analyse organic codes within a communication theoretic framework, two basic issues need to be addressed: source coding, replacing an initial message by a shorter but fully equivalent one, and channel coding, protecting an initial message against transmission errors (the introduction of redundancy and replacement of the original message by a longer one). Having addressed the issues of source coding and channel coding—undoubtedly, tasks that are more readily explicated than solved—information theory concepts and theorems can be applied. Given an organic code, one starting point is developing mathematical abstractions of the information conveyed by messages, for example, a sequence of symbols from a finite-size alphabet where each letter can be distinguished unambiguously from the others.
In broad terms, glycome-mediated communication consists of encoding information in the primary structure, density and presentation of glycans and glycoconjugates, deciphering messages by, for example, carbohydrate binding-molecules, and transducing the decoded messages by multiple signalling pathways. The glycocode is not a static organic code because the complex information conveyed by glycans and glycoconjugates is susceptible to internal and external influences such as age, diet, lifestyle, xenobiotic compounds, and microbiomes. Whilst the amino acid sequence of a protein is determined by the corresponding gene, glycans lack a direct template for their biosynthesis. Since a glycan is generated by the coordinated action of many enzymes in the subcellular compartments of a cell, its final structure depends on the expression, activity, accessibility, and interactions of many molecules. Thus, the glycocode is dynamically and reciprocally linked to both an organism's genome and to environmental factors (direct effects on individual enzymatic processes or indirect induction of epigenetic changes that modify gene expression patterns).
The protein and DNA specification alphabets have 20 and 4 letters respectively and even lipids can be represented by 8 categories.20 Glycans present greater challenges because compared to amino acids and nucleic acids, the number of naturally occurring monosaccharides is larger and ill-defined, a pair of residues can be linked in several ways, and a residue can be connected to three or four others. Glycans have been proposed to be derived initially from 32, possibly more, monosaccharides20 but this includes residues with secondary modifications. Thus, the exact size and precise nature of a glycan specification alphabet is an open question—∼100 is a reasonable guess for the number of “parent” monosaccharides (residues without substitutions) present in all three kingdoms of life.
DNA and protein molecules can be represented, at least in their most basic forms, as simple linear strings but glycan chains are complex branched structures. Permutations of the phosphodiester and peptide bonds are the main source of coding capacity in nucleic acids and proteins. The superior coding capacity of glycans can be attributed to (i) linkage options for the glycosidic bond (for example, 1–2, 1–3, 1–4 or 1–6 when L-fucose is conjugated to glycans), (ii) anomeric bond configuration (α or β; the only difference between glycogen/starch and cellulose), (iii) ring size (pyranose or furanose), and (iv) branches.30 With 20 different letters, ∼107 hexapeptides are possible but there are ∼1015 theoretically possible hexasaccharides. Given five monosaccharides, there are ∼104 possible glycan isomers if the monomers are identical and ∼106 if they are different—far more that the number of proteins that could be created from five amino acids.24
Entropy and mutual information are measures that have been used for protein- and nucleic acid-related statistical machine learning tasks such as alignment and searching. Thus, information theoretic studies of glycans are likely to have many practical applications. Given a glycan specification alphabet, the primary structure of a glycan could be represented as a graph where nodes correspond to monosaccharides and edges to glycosidic bonds. Information theoretic measures for graphs have been developed96 but the entropy of two glycans with the same graph topology would be identical even if they differed in their constituent monosaccharides and anomeric bond configurations. However, this coding scheme could be modified by “colouring” both nodes and edges and a theoretical upper bound obtained for the information necessary to specify glycans.
Whilst the graph theoretic approach offers an analytic bound, it is useful also to catalogue glycan types directly so that we can begin to quantify glycan specification information empirically. By enumerating glycan structures and their relative frequencies, an entropy can be calculated which provides an empirical but still quantitative lower bound on the amount of information—from whatever sources—necessary to produce glycans. From there, with the known map from genes to proteins as a guide, one can begin to provide quantitative lower bounds on, for example, how many genes are necessary to determine glycan structure—without specific knowledge of which genes are responsible or even the enzymes that determine glycan structure. That is, the data processing theorem requires that whatever the original source for glycan specification, the original amount of information necessary is at least as large as the entropy of glycan specification. Resources for such studies include EUROCarbDB whose central database of carbohydrate structures contains ∼13500 unique glycan sequences currently.35
Mutual information has been used to examine protein–nucleic acid interactions such as the properties of protein sequences and structures that are most useful in binding DNA.18,101,102 Thus, information theoretic investigations of protein–glycan interactions in general and galectin–glycan binding in particular are likely to have many practical applications. As discussed earlier, the isothermal efficacy of information transmission in a DNA binding protein–DNA complex has been examined.87 Whilst a protein–glycan macromolecular complex can be seen also as a “molecular machine,” a similar analysis is more challenging: sequence logos can be used to compute the information in DNA sequences bound by specific proteins but there is, at present, no analogous method for computing the information in glycans bound by specific glycan binding molecules.
Here, we propose a novel structural code, the CT code—a stochastic mapping between the spatial position pattern of the CTs in a cell nucleus (CT configuration) and a cell with a particular suite of architectural and functional characteristics (cellular phenotype). We assume the existence of machinery that encodes and decodes CT configurations and cellular phenotypes. We posit that CT configurations reside in a CT configuration space where a point corresponds to a particular arrangement of CTs in a nucleus. A cellular phenotype corresponds to a region in CT configuration space, that is, many CT position patterns are compatible with a particular cellular phenotype. This tolerance captures the probabilistic and non-random nature of CT organisation so that, for example, the spatial organisation of CTs in mammary gland luminal epithelial cell nuclei cannot be described by a single CT configuration. In essence, a cellular phenotype is a probability distribution on CT configuration space. Whilst the information needed to encode a particular cellular phenotype is degenerate, this degeneracy is constrained by a functional requirement to control the spatiotemporal locations of CTs.
Energy and resources are required to construct and maintain the components of this structural coding system, and to ensure that information transmission is both energy efficient and reliable. Information is conveyed only when CT configuration and cellular phenotype are correlated, even partially, that is, there is an association between a particular non-random arrangement of CTs and a specific set of cell behaviours. One proxy for cellular phenotype is a cell's molecular fingerprint, its profile of transcripts, proteins, glycans, metabolites and so on. Thus, cellular phenotypes and differences between them could be quantified by exploiting extant repositories of experimentally determined molecular profiling data. Since different regions of CT configuration space correspond to different phenotypes, paths through this space could correspond to processes associated with normal and/or aberrant biology, for example, the differentiation, dedifferentiation and dysdifferentiation of mammary gland luminal epithelial cells.
An important open problem is the stochastic mapping between CT configurations and cellular phenotypes, in general as well as for specific species. Currently, it is not feasible to determine experimentally the spatiotemporal position patterns of CTs in the nuclei of a large number of cells of the same let alone different type. One theoretical abstraction that may be useful is to treat the problem as a geometric packing problem: the arrangement of a defined number of objects (ellipsoids representing CTs) inside a container (an ellipsoid representing the cell nucleus) subject to constraints such as the allowable overlap between objects, and the relative and absolute position of an object with respect to both the container and other objects. A simple geometric model of a generic human cell nucleus consisting of spherical CTs of different radii packed randomly inside a spherical nucleus has been developed but the underlying technique has many limitations including an inability to handle constraints.111 Methods in mathematical and robust optimisation could be used to formulate more general purpose geometric packing problems that include constraints. However, finding feasible solutions to and suitable approximations for non-convex optimisation problems will be challenging.112,113
The CT code lends itself to a variety of communication problems. For example, a mammary gland luminal epithelial cell might transmit signals to its (a)biotic environment seeking direction about whether to be “normal” or “malignant.” Assume this takes the form of a signal sequence {Xi} emitted by the cell nucleus and interpreted by the environment as the corresponding set of signals {Y} that are corrupted by uncertainty. All that we require is some stochastic mapping X → Y summarised by a probability distribution fY|X(). This problem might be elaborated to include feedback: the environment transmits information to the cell nucleus with one outcome being an altered CT configuration.
The relationship between interphase cell–substrate adhesion and mitotic spindle orientation in adherent cells has been studied using two-dimensional (2D) patterns of fibronectin generated using microfabricated stamps.78 By controlling both cellular geometry and cell adhesion, the spindle axis in cells was induced to take up a variety of shapes and a predictive mathematical model of the process was developed to determine how novel patterns of interphase cell–substrate adhesions would be interpreted by the spindle (Fig. 2). From an information theoretic perspective, a cell might choose to minimise the mutual information between what it considers to be the “correct” orientation of its mitotic spindle and the true orientation required by its environment. For any DPM, communication theory might allow the bounds of what is possible to be outlined without explicit consideration of the specific workings of cells, for example, by making use of the mutual information between variables of interest in the single-cell and collective state, the cost of producing and interpreting signals associated with the molecular components of the DPM, and the energetics of the physical forces and processes mobilised by the DPM.
![]() | ||
Fig. 2 Spindle orientation and cortical forces in adherent cells (adapted from ref. 78). Left A spherical cell during mitosis (circle) linked by retraction fibres (green) to adhesion sites. The density of retraction fibres at the cortex activates cortical force generators (blue), which exert pulling forces on astral microtubules. The resultant torque on the spindle (red) rotates it as well as the metaphase plate (cyan) until a stable orientation angle is attained. RightSpindle geometry and cortical forces. Spindle poles (red) are separated by a distance 2a in a cell of radius R. Cortical force generators exert a pulling force f tangential to the orientation of astral microtubules, the unit vector m. This force exerts a torque R × f on the spindle. R is the vector pointing to the cortical position at which the force acts. |
Assume that a given cell senses its surroundings through extensions (filaments, fibres). An information theoretic perspective would then ask how much spatial information is conveyed to the cell through such tetherings. Modulo determining typical environments (the entropy of the milieus in which a cell might find itself), the data processing theorem and channel capacity theorems should allow determination of the precision with which a spindle can be oriented.
In addition to physical tethering, cells communicate in a panoply of ways and a plethora of mechanisms could be imagined through which a cell might gather data and learn about its environment. For example, the environment could produce anisotropic distributions of signalling molecules which diffuse through the extracellular spaces. A cell might produce auto-inducing signalling agents whose local concentrations are affected by anisotropic signal uptake in the surrounding medium. That is, cellular extensions are but one of many possible ways a cell could learn about its local environment. This begs the question: why might one data and knowledge gathering method be chosen over another?
Communication theory suggests an organising principle that might provide a path towards an answer. Energy is a fundamental resource in biology and is often, if not always, conserved where possible. Hence, the milieu-sensing problem could be cast in a formal communication theory context: how might a cell use energy resources most efficiently to determine, for example, the geometry of its surroundings? Given our knowledge of protein synthesis, it is relatively straightforward to calculate energy budgets for a variety of methods such as the construction of filaments and fibres and the diffusion of signalling molecules. It would be interesting to determine the circumstances under which what seems like an energy intensive method (assembly/disassembly of cellular extensions) is more energy efficient than synthesizing diffusive signalling molecules and maintaining an appropriate number and distribution of receptors. Doing so might yield unexpected results, as was the case for interstellar communication where sending information-bearing physical artifacts was found to be far more energy efficient than communicating via electromagnetic signals.114
If each discrete action implemented by a DPM is considered to be a “primitive” operation, multicellular systems of varying size and complexity can be “built” through a sequence of discrete actions in space and time. However, the molecules and actions mobilised by DPMs are not “orthogonal”—for example, “Cadherins” are part of both the ADH and DAD DPMs (Table 1)—and the sequential application of different DPMs can generate multiple paths to the same many-celled form. Biologically, this overlap is useful because different genes can achieve the same outcome, a way perhaps to hedge evolutionary bets. However, from the analytic standpoint of determining form and function from genome/proteome/glycome specification (reliable code-to-structure/function transformation), such non-orthogonality presents obvious combinatorial difficulties. When presented with such dependencies, communication theory seeks to determine a set of independent actions and effects which through combinations can be used to describe the dependent system.
An “orthogonal” set of DPMs would allow easy quantification of the information content embodied by biological form which could then be related to the eminently quantifiable information content of relevant genome portions. Even if such DPMs or similar entities exist, they may not have physical analogues in cells, that is, there may be no set of molecular signals and/or operations that define independent actions. Nonetheless, realisable/observed actions could by definition be constructed analytically from “basis actions” to help quantify specification and thence information flow in multicellular systems. Whether such a decomposition is actually possible remains to be seen.
We might seek to decompose the operations that generate a tissue or an organ into a collection of interacting actions—some combination of DPMs, DTFs, strategies similar to those underlying chemotaxis in slime mould, quorum sensing in bacteria, and pheromone signalling in yeast, approaches that generate many-celled forms, and so on. As interesting and important as this type of “recipe” might be, this is only the beginning from a communication theory perspective. By enumerating steps, we inherently provide some measure of the information necessary to construct, say, a mammary gland, and implicitly, some measure of the genome portion size associated with this task.
Extant organisms maintain form reliably, generating the same stable structures time and again, generation after generation. Since molecular, cellular, and tissue events are inherently stochastic and incomplete knowledge is pervasive, decision-making under uncertainty is the norm. The current and anticipated future state of the environment need to be inferred from noisy data, a cost–benefit analysis of each potential response in light of the predicted future and past history is required, the time taken to decide and generate a response needs to be considered, and decisions are made in the presence of other cooperative and/or competitive decision-makers.116 Tissues, organs and organisms require a web of intercommunicating processes that are stable in their ability to maintain form and function, yet flexible in their responses to the environment. Dysfunctions of multicellularlity likely reflect changes in efficiency (throughput) and resilience (adaptability).
Ageing is characterised by a systemic decline of the capacity to respond to internal and external stresses, leading to increased mortality. In most tissues and organs, the process of progressive loss of adaptive abilities is accompanied by functional decline but the exact pattern of phenotypic changes varies from one individual to the next and depends upon genotype as well as environmental conditions. In metazoans with renewable tissues, longevity is affected by the balance between stem cell quantity and quality.117Cellular senescence, an aggregate of phenotypes that arrests rather than kills would-be tumour cells, mobilises the systemic and local tissue milieu for resolution of tissue damage, a mobilisation that can be beneficial (tumour suppression and tissue repair in young organisms) or detrimental (tumour promotion and ageing in late life).69Ageing is thought to be a time-dependent process of cell dysdifferentiation and is associated with increased stochastic deregulation of cellular gene expression.118 As cell-to-cell heterogeneity increases over time and tissues become cellular mosaics, the energy efficiency and reliability of information transmission changes—communication, ageing, and uncertainty are intimately connected.
Probability distributions are central to communication theory. For example, the stochastic mapping from sender to receiver is summarised by fY|X(), and the noise in a channel described by an additive signalling model, w(t), is usually a white Gaussian. A communication theoretic analysis of ageing might benefit from the use of different classes of probability distributions such as members of the exponential, Cauchy, uniform, and Laplace families. Instead of the same distribution, we might seek to sample from a distribution of distributions, either at random or in a time dependent manner.
The Channel Coding Theorem addresses the question of conveying signals in a reliable fashion where the probability of error can be made arbitrarily small by employing codes which span multiple channel uses. A number of questions arise if ageing is associated with increasingly less reliable signal transmission and/or changes in the cost of producing or interpreting signals. Does this apply to signals from some or all organic codes? Is it cell- and/or tissue-type dependent? Are there fewer channel uses over time? Does the mutual information between the sent and received signals change over time? With respect to rate distortion theory, is ageing associated with changes in how much information is necessary to approximate an information source such as an organic code—increasing the number of wild-goose chases (attempts to seek perpetual motion strategies)?
If a sense of the information flow necessary to determine tissue, organ, organism and multicellular structure can be determined, then an immediate question arises. Given the physics of the communication channels between cells, what limits are placed on the interaction of cells and their various actions? Answers to such questions could help illuminate the limits of stable biological systems and/or suggest the possibility of previously unexplored portions of morphospace that might be engineered artificially for specific purposes.119–124
From simple representation and coding of macromolecules to the more complicated behaviours of morphogenesis and ageing, a communication theoretic approach allows quantitative statements to be made about the processes involved. The inherent mechanism-blind nature of communication theory means that a lack of a priori detailed molecular and/or genetic knowledge of agents and mechanisms does not preclude the provision of guidance about what signalling modalities are probable or improbable under energy constraints. We suspect communication theory could be of value in “forensic” retrospective studies that seek likely evolutionary developmental pathways and in experiments whose aim is to uncover (or rule out) detailed mechanisms.
So far, we have highlighted how communication theory has and could contribute to understanding a myriad of problems in biology. In turn, we expect the study of multicellular biology could lead to advances in communication theory, especially in the still immature field of network information theory.81 It is possible, if not likely, that biological systems, having had billions of years to evolve, will provide new insights into an area that has thus far been perceived as the sole domain of telecommunications engineering and for which it is proving devilishly difficult to derive a complete theory.
Communication in biological systems may have something to contribute even to the well-understood body of “classical” communication theory. For instance, it is well known that feedback over memoryless channels does not increase channel capacity but the same cannot be said of channels with memory.81 As a matter of mathematical convenience therefore, it is not uncommon for communication theorists to recast channels with memory into memoryless analogues for subsequent analysis and engineering. That is, codes are designed for the memoryless analogue and then inverse-transformed for use on the actual channel with memory. Modern wireless systems with arrays of transmit and receive antennas (Multiple-Input Multiple-Output or MIMO for short)125,126 are an obvious example of this design ethos. However, the ubiquity of feedback in biological systems and an assumption about evolved energy-efficiency suggests that communication theory, rather than viewing channels with memory as a complication or an annoyance, should consider the potential engineering advantages that channels with memory might afford.
Footnote |
† Published as part of an Integrative Biology themed issue in honour of Mina J. Bissell: Guest Editor Mary Helen Barcellos-Hoff. |
This journal is © The Royal Society of Chemistry 2011 |