TAGCNA: a method to identify significant consensus events of copy number alterations in cancer.
Abstract
Somatic copy number alteration (CNA) is a common phenomenon in cancer genome. Distinguishing significant consensus events (SCEs) from random background CNAs in a set of subjects has been proven to be a valuable tool to study cancer. In order to identify SCEs with an acceptable type I error rate, better computational approaches should be developed based on reasonable statistics and null distributions. In this article, we propose a new approach named TAGCNA for identifying SCEs in somatic CNAs that may encompass cancer driver genes. TAGCNA employs a peel-off permutation scheme to generate a reasonable null distribution based on a prior step of selecting tag CNA markers from the genome being considered. We demonstrate the statistical power of TAGCNA on simulated ground truth data, and validate its applicability using two publicly available cancer datasets: lung and prostate adenocarcinoma. TAGCNA identifies SCEs that are known to be involved with proto-oncogenes (e.g. EGFR, CDK4) and tumor suppressor genes (e.g. CDKN2A, CDKN2B), and provides many additional SCEs with potential biological relevance in these data. TAGCNA can be used to analyze the significance of CNAs in various cancers. It is implemented in R and is freely available at http://tagcna.sourceforge.net/.
Links
Authors
Yuan X, Zhang J, Yang L, Zhang S, Chen B, Geng Y, Wang Y
Institution
School of Computer Science and Technology, Xidian University, Xi'an, People's Republic of China.
Source
PloS one 7:7 2012 pg e41082MeSH
AdenocarcinomaAlgorithms
Computer Simulation
Databases, Genetic
Gene Deletion
Gene Dosage
Genome
Genome, Human
Humans
Lung Neoplasms
Male
Models, Genetic
Models, Statistical
Oligonucleotide Array Sequence Analysis
Prostatic Neoplasms
Reproducibility of Results
Pub Type(s)
Journal ArticleResearch Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Language
eng
PubMed ID
22815924
Log In

