Fast randomization of large genomic datasets while preserving alteration counts

A. Gobbi, F. Iorio, K.J. Dawson, D.C. Wedge, D. Tamborero, L.B. Alexandrov, N. Lopez-Bigas, M.J. Garnett, G. Jurman, J. Saez-Rodriguez

Research output: Contribution to journalConference articlepeer-review

Abstract

Motivation: Studying combinatorial patterns in cancer genomic datasets has recently emerged as a tool for identifying novel cancer driver networks. Approaches have been devised to quantify, for example, the tendency of a set of genes to be mutated in a 'mutually exclusive' manner. The significance of the proposed metrics is usually evaluated by computing P-values under appropriate null models. To this end, a Monte Carlo method (the switching-Algorithm) is used to sample simulated datasets under a null model that preserves patient-and genewise mutation rates. In this method, a genomic dataset is represented as a bipartite network, to which Markov chain updates (switchingsteps) are applied. These steps modify the network topology, and a minimal number of them must be executed to draw simulated datasets independently under the null model. This number has previously been deducted empirically to be a linear function of the total number of variants, making this process computationally expensive. Results: We present a novel approximate lower bound for the number of switching-steps, derived analytically. Additionally, we have developed the R package BiRewire, including new efficient implementations of the switching-Algorithm. We illustrate the performances of BiRewire by applying it to large real cancer genomics datasets. We report vast reductions in time requirement, with respect to existing implementations/bounds and equivalent P-value computations. Thus, we propose BiRewire to study statistical properties in genomic datasets, and other data that can be modeled as bipartite networks..
Original languageUndefined
Pages (from-to)i617-i623
Number of pages7
JournalBioinformatics
Volume30
Issue number17
DOIs
Publication statusPublished - 1 Sept 2014

Research Beacons, Institutes and Platforms

  • Manchester Cancer Research Centre

Cite this