Main

Genome-wide Archives

February 21, 2008

Top-down mapping of gene regulatory pathways

Trey Ideker videoIn a very recent lecture (see full video from NIH VideoCasting) given for the NIH Systems Biology Special Interest Group, Trey Ideker presents a great overview of the various strategies his group has been developing in the recent years in order to integrate multiple types of large scale datasets. While one of the most pervasive 'meme' about high-throughput measurement is that they are "notoriously unreliable" (see Hakes et al, 2008, for a recent example), Trey beautifully illustrates how predictive computational models and novel biological insights can be generated by sophisticated data integration strategies. Three types of applications are presented in his talk:

  1. mapping of transcriptional response pathways
  2. functional mapping of protein complexes
  3. disease diagnosis and stratification

In the last section, Trey presents the study recently published in Molecular Systems Biology (Chuang et al, 2007, video: 00hr:39min:15sec) where the information provided by microarray expression profiling is superposed to a protein-protein physical interaction network to identify 'subnetwork' biomarkers that classify metastatic vs non-metastatic breast tumors.

February 15, 2008

Transcription paused and poised for regulation

Research highlight by Frank C.P. Holstege, Department of Physiological Chemistry, University Medical Center Utrecht, the Netherlands.

MSB Research HighlightsFor eukaryotes, it is widely thought that transcription is primarily regulated through recruitment of the essential machinery to transcription start-sites. Previous hints challenging this paradigm have been confirmed by recent analyses showing that transcription regulation of a large number of genes actually occurs after recruitment. Mechanistically, such studies have gone furthest in Drosophila melanogaster (Muse et al, 2007; Zeitlinger et al, 2007). Here, conservative estimates indicate that more than 10% of genes are regulated through promoter-proximal pausing. On such genes, RNA polymerase II is recruited and initiates transcription, but then pauses around 50 bp downstream of the transcription start-site where it awaits further signals to resume elongation and complete transcription proper. These observations tie in with other observations made in yeast (Radonjic et al, 2005), embryonic stem cells (Bernstein et al, 2006; Lee et al, 2006) and differentiated mammalian cells (Guenther et al, 2007). There are numerous implications to these findings. For example, the widely assumed link between the presence of gene-specific transcription activators and full-length transcription appears to be much looser than expected. These results also underscore the importance of testing established models on a genome-wide scale. Indeed, other such surveys (Birney et al, 2007), indicate that to understand transcription, we may need to take into account even more surprises – such as the presence of ten times more start-sites than protein-coding genes and overlapping transcription units, etc… – than the post-recruitment mechanisms demonstrated in Drosophila.

Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, Cuff J, Fry B, Meissner A, Wernig M, Plath K, et al. (2006) A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125: 315-326

Birney E, Stamatoyannopoulos JA, Dutta A, Guigo R, Gingeras TR, Margulies EH, Weng Z, Snyder M, Dermitzakis ET, Thurman RE, et al. (2007) Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447: 799-816

Guenther MG, Levine SS, Boyer LA, Jaenisch R, and Young RA (2007) A chromatin landmark and transcription initiation at most promoters in human cells. Cell 130: 77-88

Lee TI, Jenner RG, Boyer LA, Guenther MG, Levine SS, Kumar RM, Chevalier B, Johnstone SE, Cole MF, Isono K, et al. (2006) Control of developmental regulators by Polycomb in human embryonic stem cells. Cell 125: 301-313

Muse GW, Gilchrist DA, Nechaev S, Shah R, Parker JS, Grissom SF, Zeitlinger J, and Adelman K (2007) RNA polymerase is poised for activation across the genome. Nat Genet 39: 1507-1511

Radonjic M, Andrau JC, Lijnzaad P, Kemmeren P, Kockelkorn TT, van Leenen D, van Berkum NL, and Holstege FC (2005) Genome-wide analyses reveal RNA polymerase II located upstream of genes poised for rapid response upon S. cerevisiae stationary phase exit. Mol Cell 18: 171-183

Zeitlinger J, Stark A, Kellis M, Hong JW, Nechaev S, Adelman K, Levine M, and Young RA (2007) RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo. Nat Genet 39: 1512-1516

January 18, 2008

Will probiotics bring systems biology to our table?

(via Scintilla)

thumb080118.jpgThe article on "Probiotics modulation of mammalian metabolism" published this week in Molecular Systems Biology by Jeremy Nicholson and colleagues (Martin at al, 2008) has attracted some attention (read the nice summary in Science News) in some (very) popular media (here, here, here and here).

In this follow-up study of the paper published last year (Martin et al, 2007), the team lead by Jeremy Nicholson, in collaboration with Nestlé, demonstrates clear physiological effects of oral probiotics administration on mice harbouring a humanized microbiome. The effects are intricate: both the host flora and metabolism are altered. By analyzing metabolite pools in several compartments (liver, blood, urine, feces, gut), and following in parallel the host microbiota, patterns of correlations between microbial species and metabolites start to be visible and reveal the probiotics-induced modulation of the microbial-mammalian interactions. But the actual paper is really just next door (synopsis), so have a look...

How will these results translate to humans? What will be the best way to influence our microbiome? Drugs or yoghurt? These are fascinating questions and the understanding of how our physiology depends on the microbial flora could have profound consequences, particularly in these times when we seem to be in a "rush to gene-based solutions to all our problems" (Wilson, 2007). Will personal genomics have to ultimately develop into personal metagenomics to include our "extended" microbial genome?

Even if I usually prefer to resist the temptation of a self-promoting section in this blog, I find the attention of the media for this topic interesting (despite the usual variable accuracy of newspaper reports) because it points to an area where systems biology provides insights into topics of immediate interest to the general public.

The NIH has recently started its Human Microbiome Project. In this context, this study also underscores the importance of developing model systems and tools to manipulate the microbiome and to analyze the incredibly dense and intricate interactions that connect host and microbial species. A field where top-down systems biology seems indeed a very pragmatic and promising approach.

November 20, 2007

Personal genomics for a fistful of dollars

The wave of personal genomics is progressing rapidly. A string of four papers appeared recently (Porreca et al, 2007, Albert et al, 2007, Okou et al 2007, Hodges et al, 2007) reporting on microarrray-based technologies that enable the enrichment of selected genomic fragments in a single massively multiplexed reaction, thus greatly facilitating subsequent resequencing of pre-defined portions of the human genome (eg all coding exons). These technologies are expected to reduce dramatically the cost of targeted resequencing of individual genomes.

On the commercial front, deCODE and 23andMe have launched their personal genome service offering genome-wide SNPs profiling for a little less than $1,000 (NYT articles: Nicholas Wade, Amy Harmon, or Wired, ScienceRoll, Sandra, DNA and You).

The chips used by 23andMe are the "Illumina HumanHap550+ BeadChip, which reads more than 550,000 SNPs (single nucleotide polymorphisms) plus a 23andMe custom-designed set that analyzes more than 30,000 additional SNPs." The profile provided by deCODEme includes "over one million variants across the genome."

So what do you think?

May 21, 2007

Systems biology & global warming

thumb070520c.jpg

Observations suggest that current climatic models may underestimate how quickly the climate system is changing (in particular for sea level), according to a report in Science a few weeks ago (Rahmstorf et al, 2007). Another Science paper published last week shows that the capacity of the Southern Ocean CO2 sink is weakening, which may result in increased atmospheric CO2 levels in the long run (Le Quere et al, 2007).

I remember Hiroaki Kitano calling the systems biology community, in his talk at the ICSB meeting last October in Yokohama, for ideas on how system-level approaches could contribute to address the challenge of global warming. In response to the studies above, a similar call is now sent to the microbiology community by Jonathan Eisen on his blog. Research topics suggested in his post include:

  • Marine Microbiology
  • Carbon fixation processes
  • Hydrogen production
  • Carbon sequestration
  • Methane capture
  • Microbial fuel cells
  • A similar list of priorities related to energy challenges, environmental remediation and carbon cycling and sequestration can be found on the site of the Genomics:GTL research program from the US Department of Energy.

    For all the topics listed above, systems biology and synthetic biology approaches are likely to be crucial not only to accumulate the necessary fundamental knowledge but also to find ways to translate it into technological applications. Proposals, insights and visionary suggestions are more than welcome...


    some additional links:
    Special issue on Energy and Sustainability
    ASM Report on Microbial Energy Conversion
    Microbial ecology meets electrochemistry: electricity-driven and driving communities. Rabaey et al, 2007, The ISME Journal 1:9

    March 5, 2007

    Catch me if you can: VelociMouse unleashed

    "Germline transmission": how many time have mouse geneticists prayed to the Gods to decipher this magical message from their Southern blots and PCRs when trying to generate a knockout line?

    thumb070305.jpg

    Hopefully the recent technology (the "VelociMouse method") developed by Regeneron Pharmaceuticals will definitively overcome this hurdle (Poueymirou et al, 2007): by disrupting the zona pellucida with help of a laser, ES cells could be injected into 8-cell stage embryos, resulting in F0 generation founder mice entirely derived from the ES cells, thus directly amenable to phenotypic analysis and breeding.

    Together with the large-scale gene targeting in ES cells (Valenzuela et al, 2003, Auwerx et al, 2004, Austin et al, 2004,Grimm, 2006), this technological advance may well represent a major step towards a high-throughput functional genomics of a mammalian species.

    February 14, 2007

    Open Source Biology

    /thumb070214b.jpg

    Novartis, The Broad Institute, and Lund University today announced the completion of a genome-wide map of genetic differences in humans and their relationship to type 2 diabetes and other metabolic disorders. All results of the analysis are being made accessible, free of charge on the internet to scientists around the world (Novartis Media Release, Feb 12, 2007)

    The results of this study are available at http://www.broad.mit.edu/diabetes/

    Has the increasing complexity of genome-wide studies and other large-scale systems biology datasets reached a threshold that makes the open source option more attractive to the pharma industry?

    January 19, 2007

    Analyzing time-series expression data

    tree-like Ziv Bar-Joseph and colleague describe their new method Dynamic Regulatory Events Miner (DREM) to analyze time-series gene expression data and combine them with static ChIP-chip experiments. The expression profiles are modeled using an extension of Hidden Markov Model that enforces a tree structure onto the expression profiles. The technique allows to deduce the condition-specific or time-dependent activity of transcription factors that explain the observed expression profiles.

    sharp transitionsIn their analysis of developmental time-series of gene expression in Drosophila, Peer Bork and colleagues apply a more drastic principle to identify robust groups of genes that correlate with major development phases. They required "four points of low expression and four subsequent points of high expression (or vice versa) even if the amplitude change was relatively low (see Materials and methods). This type of convolution not only requires a sharp increase or decrease of expression, but also that the change in transcript level is consistent over a period of time, thereby reducing the rate of false positives owing to individual outliers."