Identification of cancer genes using a statistical framework for multiexperiment analysis of nondiscretized array CGH data

Christiaan Klijn, Henne Holstege, Jeroen de Ridder, Xiaoling Liu, Marcel Reinders, Jos Jonkers, Lodewyk Wessels

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Tumor formation is in part driven by DNA copy number alterations (CNAs), which can be measured using microarray-based Comparative Genomic Hybridization (aCGH). Multiexperiment analysis of aCGH data from tumors allows discovery of recurrent CNAs that are potentially causal to cancer development. Until now, multiexperiment aCGH data analysis has been dependent on discretization of measurement data to a gain, loss or no-change state. Valuable biological information is lost when a heterogeneous system such as a solid tumor is reduced to these states. We have developed a new approach which inputs nondiscretized aCGH data to identify regions that are significantly aberrant across an entire tumor set. Our method is based on kernel regression and accounts for the strength of a probe's signal, its local genomic environment and the signal distribution across multiple tumors. In an analysis of 89 human breast tumors, our method showed enrichment for known cancer genes in the detected regions and identified aberrations that are strongly associated with breast cancer subtypes and clinical parameters. Furthermore, we identified 18 recurrent aberrant regions in a new dataset of 19 p53-deficient mouse mammary tumors. These regions, combined with gene expression microarray data, point to known cancer genes and novel candidate cancer genes. © 2008 The Author(s).
Original languageEnglish
Article numbere13
JournalNucleic Acids Research
Volume36
Issue number2
DOIs
Publication statusPublished - 2008
Externally publishedYes

Cite this