Prediction and Prioritization of Rare Oncogenic Mutations in the Cancer Kinome Using Novel Features and Multiple Classifiers

被引:27
作者
ManChon, U. [1 ]
Talevich, Eric [2 ]
Katiyar, Samiksha [3 ]
Rasheed, Khaled [1 ]
Kannan, Natarajan [3 ,4 ]
机构
[1] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
[2] Univ Calif San Francisco, Dept Dermatol, San Francisco, CA 94143 USA
[3] Univ Georgia, Dept Biochem & Mol Biol, Athens, GA 30602 USA
[4] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
关键词
SOMATIC MUTATIONS; LUNG-CANCER; MISSENSE MUTATIONS; DRIVER MUTATIONS; PROTEIN-KINASES; HUMAN BREAST; PHOSPHORYLATION; PATTERNS; DISEASE; GENOME;
D O I
10.1371/journal.pcbi.1003545
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Cancer is a genetic disease that develops through a series of somatic mutations, a subset of which drive cancer progression. Although cancer genome sequencing studies are beginning to reveal the mutational patterns of genes in various cancers, identifying the small subset of causative mutations from the large subset of non-causative mutations, which accumulate as a consequence of the disease, is a challenge. In this article, we present an effective machine learning approach for identifying cancer-associated mutations in human protein kinases, a class of signaling proteins known to be frequently mutated in human cancers. We evaluate the performance of 11 well known supervised learners and show that a multiple-classifier approach, which combines the performances of individual learners, significantly improves the classification of known cancer-associated mutations. We introduce several novel features related specifically to structural and functional characteristics of protein kinases and find that the level of conservation of the mutated residue at specific evolutionary depths is an important predictor of oncogenic effect. We consolidate the novel features and the multiple-classifier approach to prioritize and experimentally test a set of rare unconfirmed mutations in the epidermal growth factor receptor tyrosine kinase (EGFR). Our studies identify T725M and L861R as rare cancer-associated mutations inasmuch as these mutations increase EGFR activity in the absence of the activating EGF ligand in cell-based assays. Author Summary Cancer progresses by accumulation of mutations in a subset of genes that confer growth advantage. The 518 protein kinase genes encoded in the human genome, collectively called the kinome, represent one of the largest families of oncogenes. Targeted sequencing studies of many different cancers have shown that the mutational landscape comprises both cancer-causing driver mutations and harmless passenger mutations. While the frequent recurrence of some driver mutations in human cancers helps distinguish them from the large number of passenger mutations, a significant challenge is to identify the rare driver mutations that are less frequently observed in patient samples and yet are causative. Here we combine computational and experimental approaches to identify rare cancer-associated mutations in Epidermal Growth Factor receptor kinase (EGFR), a signaling protein frequently mutated in cancers. Specifically, we evaluate a novel multiple-classifier approach and features specific to the protein kinase super-family in distinguishing known cancer-associated mutations from benign mutations. We then apply the multiple classifier to identify and test the functional impact of rare cancer-associated mutations in EGFR. We report, for the first time, that the EGFR mutations T725M and L861R, which are infrequently observed in cancers, constitutively activate EGFR in a manner analogous to the frequently observed driver mutations.
引用
收藏
页数:12
相关论文
共 86 条
[71]   Somatic mutations of epidermal growth factor receptor signaling pathway in lung cancers [J].
Shigematsu, H ;
Gazdar, AF .
INTERNATIONAL JOURNAL OF CANCER, 2006, 118 (02) :257-262
[72]   The structural impact of cancer-associated missense mutations in oncogenes and tumor suppressors [J].
Stehr, Henning ;
Jang, Seon-Hi J. ;
Duarte, Jose M. ;
Wierling, Christoph ;
Lehrach, Hans ;
Lappe, Michael ;
Lange, Bodo M. H. .
MOLECULAR CANCER, 2011, 10
[73]   A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer [J].
Stephens, P ;
Edkins, S ;
Davies, H ;
Greenman, C ;
Cox, C ;
Hunter, C ;
Bignell, G ;
Teague, J ;
Smith, R ;
Stevens, C ;
O'Meara, S ;
Parker, A ;
Tarpey, P ;
Avis, T ;
Barthorpe, A ;
Brackenbury, L ;
Buck, G ;
Butler, A ;
Clements, J ;
Cole, J ;
Dicks, E ;
Edwards, K ;
Forbes, S ;
Gorton, M ;
Gray, K ;
Halliday, K ;
Harrison, R ;
Hills, K ;
Hinton, J ;
Jones, D ;
Kosmidou, V ;
Laman, R ;
Lugg, R ;
Menzies, A ;
Perry, J ;
Petty, R ;
Raine, K ;
Shepherd, R ;
Small, A ;
Solomon, H ;
Stephens, Y ;
Tofts, C ;
Varian, J ;
Webb, A ;
West, S ;
Widaa, S ;
Yates, A ;
Brasseur, F ;
Cooper, CS ;
Flanagan, AM .
NATURE GENETICS, 2005, 37 (06) :590-592
[74]   The cancer genome [J].
Stratton, Michael R. ;
Campbell, Peter J. ;
Futreal, P. Andrew .
NATURE, 2009, 458 (7239) :719-724
[75]   UniRef: comprehensive and non-redundant UniProt reference clusters [J].
Suzek, Baris E. ;
Huang, Hongzhan ;
McGarvey, Peter ;
Mazumder, Raja ;
Wu, Cathy H. .
BIOINFORMATICS, 2007, 23 (10) :1282-1288
[76]   Coding sing le-nucleotide polymorphisms associated with complex vs. Mendelian disease: Evolutionary evidence for differences in molecular effects [J].
Thomas, PD ;
Kejariwal, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (43) :15398-15403
[77]   Prediction of cancer driver mutations in protein kinases [J].
Torkamani, Ali ;
Schork, Nicholas J. .
CANCER RESEARCH, 2008, 68 (06) :1675-1682
[78]   Accurate prediction of deleterious protein kinase polymorphisms [J].
Torkamani, Ali ;
Schork, Nicholas J. .
BIOINFORMATICS, 2007, 23 (21) :2918-2925
[79]   Identification of rare cancer driver mutations by network reconstruction [J].
Torkamani, Ali ;
Schork, Nicholas J. .
GENOME RESEARCH, 2009, 19 (09) :1570-1578
[80]   The molecular basis of targeting protein kinases in cancer therapeutics [J].
Tsai, Chung-Jung ;
Nussinov, Ruth .
SEMINARS IN CANCER BIOLOGY, 2013, 23 (04) :235-242