Main

Modeling Archives

February 26, 2008

A refreshing model: peppermint terpenoids

Research highlight by Doron Lancet, Crown Human Genome Center, Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel

MSB Research HighlightsLiving cells are typically asymmetric, having tens of thousands different biopolymers (proteins and polynucleotides), but merely <1000 types of small molecules, such as amino acids and lipids. An exception is certain plant cells that harbor members of ~40,000 strong group of low molecular weight terpenoids, often displaying a complex compositional balance essential for plant growth and survival (Aharoni et al, 2005). Understanding the intricacies of biosynthesis and interconversion of such unusual cellular components appears to require the full power of Systems Biology. In a recent paper, Rios-Estepa et al (2008) harness a systems approach, including iterative cycles of mathematical modeling and experimental testing, to help elucidate the metabolic dynamics of the terpenoid universe.

Specifically they ask how plants vary their monoterpene profiles in response to environmental stress – changing levels of illumination. A highlight of their results is that the variation of terpene metabolic fluxes is mediated by specific events in which members of the terpenoid repertoire exert a regulatory effect on terpene biosynthesis enzymes. Rewardingly, this is predicted by a computer simulation and subsequently verified by experiment. The broader conclusion, applicable to all living organisms, is that as the power of computing grows, it will become possible to make increasingly specific and accurate predictions, that will allow both a better global understanding and the successful engineering of cellular networks.


Aharoni A, Jongsma MA, Bouwmeester HJ (2005) Volatile science? Metabolic engineering of terpenoids in plants. Trends Plant Sci. 10:594-602.

Rios-Estepa R, Turner GW, Lee JM, Croteau RB, Lange BM (2008) A systems biology approach identifies the biochemical mechanisms regulating monoterpenoid essential oil composition in peppermint. Proc Natl Acad Sci U S A. 105:2818-2823

February 12, 2008

Information processing in signaling networks

Research highlight by Charles Auffray, Functional Genomics and Systems Biology for Health, UMR7091, CNRS and Pierre & Marie Curie University—Paris VI, Villejuif, France

MSB Research Highlights The work presented by Helikar et al. (2008) in a paper recently published in the PNAS represents a promising new step in the development of computational cellular physiology in eukaryotes. From curated cellular and biochemical data available in the literature, the authors have assembled a discrete Boolean model of signal transduction comprising 130 nodes, and examined in a systematic and controlled manner how varying combinations of external inputs translate into a range of cellular responses. The qualitative model is not only able to reproduce known input-output relationships representative of major transduction pathways, but it also provides evidence in support of the emergence of information-processing functions from the complex cellular network of molecular interactions. This is strikingly demonstrated by the fact that a large sample of randomly selected input combinations result in a very limited fraction of the possible outputs, which correspond to well-characterized global biological responses, a result which is obtained irrespective of the level of noise introduced in the inputs of the model. Moreover, similar input combinations are neatly clustered by the model into equivalence classes of global outputs, reflecting the ability of the cell to integrate complex environmental signals and translate them into robust specific responses and behaviours through common intracellular pathways. While discrete Boolean modelling makes it possible to highlight emergent properties of transduction networks, overcoming the hurdle of parameter estimation, very much as in classical physiology, it provides only high-order views in the form of black boxes with limited predictive and explanatory power. Integration with continuous models will be essential to unravel and engineer the underlying mechanisms.

Helikar T, Konvalina J, Heidel J, Rogers JA (2008). Emergent decision-making in biological signal transduction networks. PNAS 105, 1913-1918

January 18, 2008

Will probiotics bring systems biology to our table?

(via Scintilla)

thumb080118.jpgThe article on "Probiotics modulation of mammalian metabolism" published this week in Molecular Systems Biology by Jeremy Nicholson and colleagues (Martin at al, 2008) has attracted some attention (read the nice summary in Science News) in some (very) popular media (here, here, here and here).

In this follow-up study of the paper published last year (Martin et al, 2007), the team lead by Jeremy Nicholson, in collaboration with Nestlé, demonstrates clear physiological effects of oral probiotics administration on mice harbouring a humanized microbiome. The effects are intricate: both the host flora and metabolism are altered. By analyzing metabolite pools in several compartments (liver, blood, urine, feces, gut), and following in parallel the host microbiota, patterns of correlations between microbial species and metabolites start to be visible and reveal the probiotics-induced modulation of the microbial-mammalian interactions. But the actual paper is really just next door (synopsis), so have a look...

How will these results translate to humans? What will be the best way to influence our microbiome? Drugs or yoghurt? These are fascinating questions and the understanding of how our physiology depends on the microbial flora could have profound consequences, particularly in these times when we seem to be in a "rush to gene-based solutions to all our problems" (Wilson, 2007). Will personal genomics have to ultimately develop into personal metagenomics to include our "extended" microbial genome?

Even if I usually prefer to resist the temptation of a self-promoting section in this blog, I find the attention of the media for this topic interesting (despite the usual variable accuracy of newspaper reports) because it points to an area where systems biology provides insights into topics of immediate interest to the general public.

The NIH has recently started its Human Microbiome Project. In this context, this study also underscores the importance of developing model systems and tools to manipulate the microbiome and to analyze the incredibly dense and intricate interactions that connect host and microbial species. A field where top-down systems biology seems indeed a very pragmatic and promising approach.

January 14, 2008

Morphogen Paradoxes

Bicoid morphogen gradientA controversy seems to be brewing over some recent theories and quantitative analyses addressing the fundamental question of how the Bicoid morphogen gradient is established and decoded in early Drosophila embryos. The transcription factor Bicoid controls the anterior-posterior patterning of the developing embryo. It is translated from maternal mRNA localized at the anterior pole of the egg and its graded distribution activates, in a concentration-dependent manner, the expression of gap genes, thus determining their spatial domain of expression. Synthesis from a localized source combined with diffusion and uniform degradation of the Bicoid morphogen provides one of the simplest models to explain the approximately exponential shape of its gradient. While, historically, patterning has been thought to rely on the gradient at its steady state – that is when synthesis, transport and degradation processes balance each other – the question arose as to whether steady-state can be reached rapidly enough in the quickly developing embryo (Lander, 2007).

In February last year, Naama Barkai and colleagues published a study (Bergmann et al, 2007) in which they propose that the gradient would in fact be interpreted before it has reached its steady-state, when the gradient is still "moving". Experimental evidence for a dynamic evolution of Bcd profile between cleavage cycle 11 and 12 is provided using a reporter gene driven by bicoid-binding sites. These authors further show that a pre-steady-state model implies a reduced sensitivity of the gradient readout to variations in the production of morphogen at its source. One biologically relevant example of this robustness is the observation that the domain of expression of hunchback, a Bicoid target gene, shifts much less in embryos from mothers with altered bicoid gene dosage than would be predicted by a steady-state model.

A few months later, Thomas Gregor and colleagues published two papers (Gregor et al, 2007a, 2007b) reporting a detailed analysis of the profile and dynamics of the Bicoid gradient. Quantitative in vivo imaging of a transgenic bicoid-eGFP reporter revealed several paradoxes. While a stable gradient of nuclear Bicoid is quickly established (within 90min, approx. cleavage cycle 9), the (local) diffusion coefficient of Bicoid, as deduced from photobleaching experiments, appears to be far too small (D=0.3 μm2/s, much less than expected from previous estimations made by injecting labeled dextran molecules) to be compatible with such a rapid establishment of the (long-range) gradient by diffusion alone. These experiments further show that nuclear Bicoid is under a highly dynamic nuclocytoplasmic equilibrium, pointing to a fundamental role for the nucleus in gradient establishment and stability. Finally, the precision with which the Bicoid gradient is transformed into Hunchback expression (see illustration, after Gregor et al 2007b) is estimated to be around 10%. This remarkable level of precision would not only be close to the physical limits of the system, but also strikingly matches the accuracy required to detect changes of Bicoid expression between adjacent cells (10%, equivalent to a difference of only 70 Bicoid molecules per nucleus) and the level of reproducibility of the absolute morphogen concentration from embryo to embryo (10% as well).

In a Correspondence published last week, Bergmann and colleagues (2008) dispute these interpretations and claim that a "reanalysis of their [Gregor et al's] data demonstrates that their findings are consistent with the well-accepted paradigm of diffusion-based patterning and provides further support for the notion that the Bicoid profile is decoded prior to reaching its steady state". Thus, according to these authors, constant nuclear Bicoid levels are not indicative of steady-state of the gradient itself given that cytoplasmic levels may still be changing. The small diffusion coefficient of Bicoid would then be an additional argument in favor of the necessity of a pre-steady-state decoding mechanism. If this is the case, the differences in Bicoid levels between adjascent cells would be much bigger at cleavage cycle 9 (50% instead of 10% at cycle 14), thus resolving the paradox of the high precision of the hunchback response.

In their response (Bialek et al, 2008), Gregor and colleagues reply that if cells would make a decision by reading Bicoid concentration at cycle 9, the boundary between expression domains would be 5 cells wide at stage 14 (=\sqrt{2^14/2^9}), while in reality it is only a single cell wide. While they agree that the overall gradient might not be at steady-state at these early stages, they argue that the stability of nuclear Bicoid levels is functionally highly relevant given that Bicoid is a transcription factor. Finally, they also point out that the deduced local diffusion constant is so small that it is in fact incompatible with observing any Bicoid in the middle of the embryo in the first place, thus suggesting the existence of additional mechanisms to explain establishment of the gradient at the scale of the entire embryo. These and some additional arguments lead Bialek et al to conclude that "the small values of the diffusion constant for Bcd we reported are superficially consistent with their model, but the model provides no basis for understanding any of our observations."

Mmmmh... not an easy one. Those who have additional insights into these subtle but fascinating questions, please let us know!

November 22, 2007

New feedback loop in Arabidopsis circadian clock

By James CW Locke, California Institute of Technology

A new Science paper from the lab of Alex Webb (Dodd et al, Science, 2007) represents an important step forward in plant circadian research (read also commentary by Imaizumi et al, Science, 2007). The circadian (24 h) clock controls processes throughout the day and night in most organisms, and in plants is involved in multiple pathways including photosynthesis, leaf movement and floral opening. The circadian clock has evolved to consist of multiple interlocking transcriptional feedback loops (at least in eukaryotes), which generate the 24 h rhythm even under constant environmental conditions.

thumb071121.jpgUsing a series of elegant experiments Dodd et al uncover a new level of complexity to the plant clock. They first show that cytosolic signaling molecule cyclic adenosine diphosphate ribose (cADPR) is regulated by the clock and is responsible for the previously reported circadian rhythm in intracellular calcium. They go on to show that disruption of cADPR signaling by addition of nicotinamide causes a strong period lengthening of the clock. Thus they have discovered an additional feedback loop (see drawing), and revealed a new class of circadian clock components: cytosolic signaling molecules.

It is also of note that Dodd et al use an existing mathematical model of the Arabidopsis clock (Locke et al, Mol Syst Biol, 2005) to frame the expected effects of their predicted feedback loop. A large amount of systems modeling work has recently been carried out on the clock (summarized in Ueda, Mol Syst Biol, 2006). It is exciting to see how the Arabidopsis clock community has responded to this work. In some cases the models have been used to simulate experiments and to test putative mechanisms (Dodd et al, Science, 2007, Martin-Tryon et al, Plant Physiol, 2007). I believe for systems biology approaches to succeed it is crucial that models must be made easily accessible to experimentalists, which is the case in this field (eg Circadian Modelling). I look forward to seeing the next iteration of the plant circadian clock model, perhaps from the Webb lab including the cADPR feedback loop.

November 18, 2007

Glia-neuron interactions

thumb071115b.jpg Nature Neuroscience has a nice special focus on glia and disease. The featured reviews and perspective articles discuss multiple aspects of neuron-glia interactions and their role in disease. The reason why I am highlighting this collection here is that I have the feeling that this field could potentially be a nice playground for systems biology.

For example, Rossi and colleagues (2007) review the various metabolic processes affected during brain ischemia. Several of the examples discussed illustrate very well how the extent of brain damage is determined by the concurrent dynamics of both harmful and protective processes engaging complex interactions between neurons and astrocytes. A critical determinant for ischemic damage is the catastrophic loss of ATP levels caused by deficient glucose and oxygen delivery. Astrocytes have glycogen stores that can normally be converted to lactate which is exported to neurons to provide energy during phases of high activity. In absence of oxygen however, lactate can no longer be oxidized. In this case, glucose may then help delay loss of ATP levels, via anaerobic glycolysis. But this beneficial effect might be counteracted by lactic acidosis caused by continued glycolysis in the absence of O2, which is known to accentuate ischemic damage in the case of hyperglycemia. Moreover, acidosis may activate Na+-H+ exchange, cytosolic Na+ accumulation, reversal of Na+-Ca2+ exchange resulting in astrocyte Ca2+ overload, either impairing their protective functions or even killing them.

A similar complexity is seen in the events underlying ischemic glutamate release. Loss of cellular ATP levels impairs the function of the Na+-K+ ATPase and thus disrupts ionic gradients. The resulting depolarization leads to a large increase in extracellular glutamate that is amplified by positive feedback, ultimately resulting in neuronal death by excitotoxicity. Astrocytes may contribute to increased extracellular glutamate levels via direct vesicular glutamate release and vesicular ATP release that in turn activates glutamate-permeable P2X receptors. Glutamate reuptake is normally carried out by five high-affinity sodium-dependent glutamate transporters. Disruption of transmembrane potential and of ionic gradients can cause transporter reversal thus further contributing to glutamate release. This depends in turn on the intracellular glutamate concentration which is much higher in astrocytes than neurons, determining the relative kinetic of neuronal and astrocytic reuptake/release as the ischemic perturbations progress. Further details are visible on Figure 3 from Rossi et al (2007):

Even if this short overview is condensed and incomplete, it suggests to me that quantitative measurements and integrated modeling could be quite helpful, if feasible, to understand the various contributions of the many processes involved and to identify potential points of protective synergies or characterize regimes under which the stability of the astrocyte-neuron system is catastrophically compromised. Perhaps this type of model and its calibration could even serve as a starting point to investigate the involvement of astrocytes in computational aspect of neuronal functions (Wang et al, 2006).

August 9, 2007

First q-bio Conference day 1

The First q-bio Conference on cellular information processing, Santa Fe, New Mexico started today with an opening lecture by William Bialek. Here is an attempt to provide a brief account of his beautiful talk.

Let's imagine a biological system to be modeled, says William Bialek, and for which sufficient experimental data are available to determine the relevant model parameters: the model can be "located" within a given region of the parameter space. Bialek asks: should we really be satisfied with the observation that this particular set of parameters fits the experimental data or is the fact that the model is located in this particular region of the parameter space tells us something more fundamental? Given that living organisms are shaped by evolutionary processes, which acts on biological functions (well, at least in part, see Lynch, 2007...), finding an appropriate notion of biological functionality would define some "metric" on the parameter space that would tell us why the living organisms is sitting in this particular region. But how can this concept of functionality be defined and quantified?

As concrete examples, Bialek mentioned the typical sigmoidal intput-output curves observed in neurobiology (eg photoreceptor membrane voltage in function of light stimulus, action potential frequency of visual neurons in function of a stimulus feature like motion) or in molecular biology (for example the expression levels of a target gene in function of transcription factor concentration). Why do these input-output functions have this shape and what are the general rules, independent of underlying molecular mechanisms, that explains where the midpoint of the curve is, its width etc... It appears that one of the keys to this problem is to look at the distribution of inputs received by the system in its "natural context" and analyze how this distribution matches the shape of the output function, something that was initially done by Laughlin (A simple coding procedure enhances a neuron’s information capacity. Z Naturforsch 1981 36c, 910–912). One can then ask the question: "how much information does the output provide about the input?" To do so, the level of resolution with which the output can distinguish between certain number of levels has to be taken into account, given the noise in the system. As it turns out, given the observed distribution of input, imposing maximal information transmission leads precisely to the observed input-output curve. The system seems to have been optimized for transmission of information, which represents a very general constraint. Wiliam Bialek went on with additional examples to show that the input-output curve may in fact, for the same system, depend on the context (the distribution of input can change over time) and to demonstrate how this type of analysis also extends to transcriptional regulation using the Bicoid-Hunchback as a model system.

The talk was followed by a quite animated discussion on how to interpret the observation that these particular systems appear to have optimized information transmission. Is optimization of information transmission the only key? Probably not... But William Bialek's very insightful lecture illustrated what sort of fundamental and general insight can be gained from the combination of a theoretical approach to a simple biological model system. And the resulting discussion also showed that even if the idealized theory might be wrong, the confrontation between theory and reality of biological systems serves to reveal those aspects of a system that are not sufficiently deeply understood.

July 5, 2007

What would you do with a petaflop?

The Blue Brain Project of the Ecole Polytechnique Fédérale de Lausanne (EPFL) currently uses an IBM Blue Gene/L supercomputer, reaching peaks of 22.4 teraflops (flops=Floating Point Operations Per Second), to simulate a model of a mammalian neocortical column composed of 10'000 neurons (Markram, 2006). IBM recently announced the release of the new Blue Gene/P, which, in its largest configuration, would be more than 100 times more powerful than the EPFL Blue Gene/L, reaching peaks of 3000 teraflops (3 petaflops, 3×1015 flops, 100'000 times more powerful than a home PC).

Apparently, this leap in computing power represents a challenge even for software developers, in particular due to the increase in parallelism (The petaflop challenge, Nature News). Would this open up fundamentally new avenues in systems biology? What are the applications in systems biology that would benefit the most from such supercomputing power? What would you do if you had a petaflop computer...?

May 23, 2007

Proofreading, repair and robustness

THUMB070129.jpg

How do proofreading and repair activities emerge and what is their impact on system-level properties such as robustness or evolvability?

Biological networks and the Internet may share common architectural principles common to "robust yet fragile" systems (Doyle and Csete, 2007). But the Internet does not (yet) repair itself while cells or tissues do. Repair may represent an example of "downward causation" (The Music of Life, Noble 2006 or a book review) in which a high-level functional property (eg repair activity) emerging from a multi-component system (eg the DNA repair machinery) acts on a component at a lower level of organization (eg one nucleotide). In other words, could repair be considered as an example of a "cross-scale" feedback motif? Can these types of motifs be generalized and how would they evolve? The topic could in fact be extended to the field of synthetic biology: how to assemble synthetic systems with self-repair capability?

Many questions... Who has answers?

see also: Fidelity and infidelity (2001) Radman, 2001

February 28, 2007

The connectivity map of drug combinations

How does the connectivity of biochemical networks influence the response to drug combinations?
To address this question, Joseph Lehár and colleagues analyzed the response of yeast and human tumour cell lines to thousands of drug pairs, and compared these experimental data to a computational model (Lehár et al, 2007) . By systematically measuring the biological effects of drug pairs over a range of concentrations, 'response surfaces' could be constructed. This strategy enabled the authors to establish a clear relationship between the shape of the biological response surface and the type of connection linking the targeted biochemical pathways. This dependancy can be used to learn how biological systems are connected as well as to elucidate the mechanism of multi-drug treatments, which are increasingly preferred for many diseases.

It will be interesting to see how far this kind of analysis takes us in the real world: how much will 'off-target' effects confound the analysis? To what extent is the mapping between network topologies and response surfaces a one-to-one function? Further, it would be important to extend this idea to three or more drug components. (Yeh and Kishony, 2007)