DMCM: a Data-adaptive Mutation Clustering Method to identify cancer-related mutation clusters

Cited by: 26
Authors
Lu, Xinguo [1]
Qian, Xin [1]
Li, Xing [1]
Miao, Qiumai [1]
Peng, Shaoliang [1, 2]
Affiliations
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410082, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Hunan, Peoples R China
Keywords
SOMATIC MUTATIONS; GENOMES
DOI
10.1093/bioinformatics/bty624
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Functional somatic mutations within coding amino acid sequences confer a growth advantage in the pathogenic process. Most existing methods for identifying cancer-related mutations focus on either the single amino acid level or the entire gene level. However, gain-of-function mutations often cluster in specific protein regions rather than occurring independently along the amino acid sequence. Several approaches that identify mutation clusters from the mutation density along the amino acid chain have been proposed recently, but their performance in identifying mutation clusters remains to be improved.
Results: Here we present a Data-adaptive Mutation Clustering Method (DMCM), in which a kernel density estimate (KDE) with a data-adaptive bandwidth is applied to estimate the mutation density and to find clusters of varying length on amino acid sequences. We apply this approach to the mutation data of 571 genes in over twenty cancer types from The Cancer Genome Atlas (TCGA). We compare DMCM with M2C, OncodriveCLUST and Pfam Domain and find that DMCM tends to identify more significant clusters. The cross-validation analysis shows that DMCM is robust, and the cluster cancer type enrichment analysis shows that specific cancer types are enriched for specific mutation clusters.
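The idea of a KDE with a data-adaptive bandwidth can be illustrated with a minimal sketch. The abstract does not state the bandwidth rule DMCM uses, so the code below assumes a common textbook choice (an Abramson-style square-root law driven by a fixed-bandwidth pilot estimate) and a simple density threshold for cluster calling. The function names, the toy mutation positions, the pilot bandwidth `h0` and the threshold are all hypothetical and are not taken from the paper.

```python
import numpy as np

SQRT_2PI = np.sqrt(2.0 * np.pi)

def fixed_kde(points, positions, h):
    """Gaussian KDE with one global bandwidth h, evaluated at `points`."""
    z = (points[:, None] - positions[None, :]) / h
    return np.exp(-0.5 * z**2).sum(axis=1) / (len(positions) * h * SQRT_2PI)

def adaptive_kde(grid, positions, h0, alpha=0.5):
    """Adaptive KDE (assumed rule): mutation i gets bandwidth h0 * lambda_i,
    where lambda_i shrinks in dense regions and grows in sparse ones."""
    pilot = fixed_kde(positions, positions, h0)   # pilot density at each mutation
    g = np.exp(np.mean(np.log(pilot)))            # geometric mean of the pilot
    h = h0 * (pilot / g) ** (-alpha)              # per-mutation bandwidths
    z = (grid[:, None] - positions[None, :]) / h[None, :]
    kernels = np.exp(-0.5 * z**2) / (h[None, :] * SQRT_2PI)
    return kernels.mean(axis=1)

def find_clusters(grid, density, threshold):
    """Return (start, end) residue ranges where density exceeds threshold."""
    clusters, start, prev = [], None, None
    for pos, above in zip(grid, density > threshold):
        if above and start is None:
            start = pos                            # a new cluster opens here
        elif not above and start is not None:
            clusters.append((int(start), int(prev)))
            start = None
        prev = pos
    if start is not None:                          # cluster runs to the last residue
        clusters.append((int(start), int(grid[-1])))
    return clusters

# Toy data (hypothetical): a tight hotspot near residue 12, a broader
# cluster near residue 60, and one isolated mutation at residue 90.
positions = np.array([12] * 6 + [13] * 4 + [55, 58, 60, 60, 62, 65] + [90], dtype=float)
protein_length = 100
grid = np.arange(1, protein_length + 1, dtype=float)

density = adaptive_kde(grid, positions, h0=3.0)
# Assumed threshold: the density of mutations spread uniformly over the protein.
clusters = find_clusters(grid, density, threshold=1.0 / protein_length)
print(clusters)
```

Because each mutation carries its own bandwidth, the tight hotspot is resolved with narrow kernels while the isolated mutation is smoothed into a low, wide bump that stays below the threshold. This is one way clusters of different lengths can emerge from a single density estimate; a real analysis would also need a significance test for each candidate cluster.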
Pages: 389-397
Page count: 9