Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities

Guido Sanguinetti, Neil D. Lawrence, Magnus Rattray

    Research output: Contribution to journalArticlepeer-review

    Abstract

    Motivation: Quantitative estimation of the regulatory relationship between transcription factors and genes is a fundamental stepping stone when trying to develop models of cellular processes. Recent experimental high-throughput techniques, such as Chromatin Immunoprecipitation (ChIP) provide important information about the architecture of the regulatory networks in the cell. However, it is very difficult to measure the concentration levels of transcription factor proteins and determine their regulatory effect on gene transcription. It is therefore an important computational challenge to infer these quantities using gene expression data and network architecture data. Results: We develop a probabilistic state space model that allows genome-wide inference of both transcription factor protein concentrations and their effect on the transcription rates of each target gene from microarray data. We use variational inference techniques to learn the model parameters and perform posterior inference of protein concentrations and regulatory strengths. The probabilistic nature of the model also means that we can associate credibility intervals to our estimates, as well as providing a tool to detect which binding events lead to significant regulation. We demonstrate our model on artificial data and on two yeast datasets in which the network structure has previously been obtained using ChIP data. Predictions from our model are consistent with the underlying biology and offer novel quantitative insights into the regulatory structure of the yeast cell. © 2006 Oxford University Press.
    Original languageEnglish
    Pages (from-to)2775-2781
    Number of pages6
    JournalBioinformatics
    Volume22
    Issue number22
    DOIs
    Publication statusPublished - 15 Nov 2006

    Fingerprint

    Dive into the research topics of 'Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities'. Together they form a unique fingerprint.

    Cite this