Support vector machine model of developmental brain gene expression data for prioritization of Autism risk gene candidates

被引:49
作者
Cogill, S. [1 ]
Wang, L. [1 ]
机构
[1] Clemson Univ, Dept Biochem & Genet, Clemson, SC 29634 USA
关键词
LONG NONCODING RNAS; SPECTRUM DISORDERS; PREDICTION; KNOWLEDGEBASE; IMPLICATE; EVOLUTION; CHILDREN; INSIGHTS; GENCODE; DNA;
D O I
10.1093/bioinformatics/btw498
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Autism spectrum disorders (ASD) are a group of neurodevelopmental disorders with clinical heterogeneity and a substantial polygenic component. High-throughput methods for ASD risk gene identification produce numerous candidate genes that are time-consuming and expensive to validate. Prioritization methods can identify high-confidence candidates. Previous ASD gene prioritization methods have focused on a priori knowledge, which excludes genes with little functional annotation or no protein product such as long non-coding RNAs (lncRNAs). Results: We have developed a support vector machine (SVM) model, trained using brain developmental gene expression data, for the classification and prioritization of ASD risk genes. The selected feature model had a mean accuracy of 76.7%, mean specificity of 77.2% and mean sensitivity of 74.4%. Gene lists comprised of an ASD risk gene and adjacent genes were ranked using the model's decision function output. The known ASD risk genes were ranked on average in the 77.4th, 78.4th and 80.7th percentile for sets of 101, 201 and 401 genes respectively. Of 10,840 lncRNA genes, 63 were classified as ASD-associated candidates with a confidence greater than 0.95. Genes previously associated with brain development and neurodevelopmental disorders were prioritized highly within the lncRNA gene list.
引用
收藏
页码:3611 / 3618
页数:8
相关论文
共 53 条
[1]   SFARI Gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs) [J].
Abrahams, Brett S. ;
Arking, Dan E. ;
Campbell, Daniel B. ;
Mefford, Heather C. ;
Morrow, Eric M. ;
Weiss, Lauren A. ;
Menashe, Idan ;
Wadkins, Tim ;
Banerjee-Basu, Sharmila ;
Packer, Alan .
MOLECULAR AUTISM, 2013, 4
[2]   Individual common variants exert weak effects on the risk for autism spectrum disorderspi [J].
Anney, Richard ;
Klei, Lambertus ;
Pinto, Dalila ;
Almeida, Joana ;
Bacchelli, Elena ;
Baird, Gillian ;
Bolshakova, Nadia ;
Boelte, Sven ;
Bolton, Patrick F. ;
Bourgeron, Thomas ;
Brennan, Sean ;
Brian, Jessica ;
Casey, Jillian ;
Conroy, Judith ;
Correia, Catarina ;
Corsello, Christina ;
Crawford, Emily L. ;
de Jonge, Maretha ;
Delorme, Richard ;
Duketis, Eftichia ;
Duque, Frederico ;
Estes, Annette ;
Farrar, Penny ;
Fernandez, Bridget A. ;
Folstein, Susan E. ;
Fombonne, Eric ;
Gilbert, John ;
Gillberg, Christopher ;
Glessner, Joseph T. ;
Green, Andrew ;
Green, Jonathan ;
Guter, Stephen J. ;
Heron, Elizabeth A. ;
Holt, Richard ;
Howe, Jennifer L. ;
Hughes, Gillian ;
Hus, Vanessa ;
Igliozzi, Roberta ;
Jacob, Suma ;
Kenny, Graham P. ;
Kim, Cecilia ;
Kolevzon, Alexander ;
Kustanovich, Vlad ;
Lajonchere, Clara M. ;
Lamb, Janine A. ;
Law-Smith, Miriam ;
Leboyer, Marion ;
Le Couteur, Ann ;
Leventhal, Bennett L. ;
Liu, Xiao-Qing .
HUMAN MOLECULAR GENETICS, 2012, 21 (21) :4781-4792
[3]  
[Anonymous], 2012, Diagnostic and statistical manual of mental disorders, V4th
[4]   Transcriptome sequencing during mouse brain development identifies long non-coding RNAs functionally involved in neurogenic commitment [J].
Aprea, Julieta ;
Prenninger, Silvia ;
Dori, Martina ;
Ghosh, Tanay ;
Monasor, Laura Sebastian ;
Wessendorf, Elke ;
Zocher, Sara ;
Massalini, Simone ;
Alexopoulou, Dimitra ;
Lesche, Mathias ;
Dahl, Andreas ;
Groszer, Matthias ;
Hiller, Michael ;
Calegari, Federico .
EMBO JOURNAL, 2013, 32 (24) :3145-3160
[5]   Psychiatric disorders in adolescents and adults with autism and intellectual disability: A representative study in one county in Norway [J].
Bakken, Trine L. ;
Helverschou, Sissel B. ;
Eilertsen, Dag E. ;
Heggelund, Trond ;
Myrbakk, Even ;
Martinsen, Harald .
RESEARCH IN DEVELOPMENTAL DISABILITIES, 2010, 31 (06) :1669-1677
[6]   A long nuclear-retained non-coding RNA regulates synaptogenesis by modulating gene expression [J].
Bernard, Delphine ;
Prasanth, Kannanganattu V. ;
Tripathi, Vidisha ;
Colasse, Sabrina ;
Nakamura, Tetsuya ;
Xuan, Zhenyu ;
Zhang, Michael Q. ;
Sedel, Frederic ;
Jourdren, Laurent ;
Coulpier, Fanny ;
Triller, Antoine ;
Spector, David L. ;
Bessis, Alain .
EMBO JOURNAL, 2010, 29 (18) :3082-3093
[7]   Behavioral signatures related to genetic disorders in autism [J].
Bruining, Hilgo ;
Eijkemans, Marinus J. C. ;
Kas, Martien J. H. ;
Curran, Sarah R. ;
Vorstman, Jacob A. S. ;
Bolton, Patrick F. .
MOLECULAR AUTISM, 2014, 5
[8]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[9]   Large-Scale Use of the Modified Checklist for Autism in Low-Risk Toddlers [J].
Chlebowski, Colby ;
Robins, Diana L. ;
Barton, Marianne L. ;
Fein, Deborah .
PEDIATRICS, 2013, 131 (04) :E1121-E1127
[10]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411