CNVDeep: deep association of copy number variants with neurocognitive disorders

被引:2
作者
Rahaie, Zahra [1 ]
Rabiee, Hamid R. [1 ]
Alinejad-Rokny, Hamid [2 ]
机构
[1] Sharif Univ Technol, Dept Comp Engn, BCB Grp, DML, Tehran, Iran
[2] UNSW Sydney, Sch Biomed Engn, UNSW Biomed Machine Learning Lab BML, Sydney, Australia
基金
美国国家科学基金会;
关键词
Copy number variant; Association; Deep learning; Proximal gradient descent; Group LASSO; Autism; Schizophrenia; Developmental delay; GLOBAL DEVELOPMENTAL DELAY; AUTISM SPECTRUM; GENES; MAP;
D O I
10.1186/s12859-024-05874-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundCopy number variants (CNVs) have become increasingly instrumental in understanding the etiology of all diseases and phenotypes, including Neurocognitive Disorders (NDs). Among the well-established regions associated with ND are small parts of chromosome 16 deletions (16p11.2) and chromosome 15 duplications (15q3). Various methods have been developed to identify associations between CNVs and diseases of interest. The majority of methods are based on statistical inference techniques. However, due to the multi-dimensional nature of the features of the CNVs, these methods are still immature. The other aspect is that regions discovered by different methods are large, while the causative regions may be much smaller.ResultsIn this study, we propose a regularized deep learning model to select causal regions for the target disease. With the help of the proximal [20] gradient descent algorithm, the model utilizes the group LASSO concept and embraces a deep learning model in a sparsity framework. We perform the CNV analysis for 74,811 individuals with three types of brain disorders, autism spectrum disorder (ASD), schizophrenia (SCZ), and developmental delay (DD), and also perform cumulative analysis to discover the regions that are common among the NDs. The brain expression of genes associated with diseases has increased by an average of 20 percent, and genes with homologs in mice that cause nervous system phenotypes have increased by 18 percent (on average). The DECIPHER data source also seeks other phenotypes connected to the detected regions alongside gene ontology analysis. The target diseases are correlated with some unexplored regions, such as deletions on 1q21.1 and 1q21.2 (for ASD), deletions on 20q12 (for SCZ), and duplications on 8p23.3 (for DD). Furthermore, our method is compared with other machine learning algorithms.ConclusionsOur model effectively identifies regions associated with phenotypic traits using regularized deep learning. Rather than attempting to analyze the whole genome, CNVDeep allows us to focus only on the causative regions of disease.
引用
收藏
页数:25
相关论文
共 30 条
[1]   FANTOM enters 20th year: expansion of transcriptomic atlases and functional annotation of non-coding RNAs [J].
Abugessaisa, Imad ;
Ramilowski, Jordan A. ;
Lizio, Marina ;
Severin, Jesicca ;
Hasegawa, Akira ;
Harshbarger, Jayson ;
Kondo, Atsushi ;
Noguchi, Shuhei ;
Yip, Chi Wai ;
Ooi, Jasmine Li Ching ;
Tagami, Michihira ;
Hori, Fumi ;
Agrawal, Saumya ;
Hon, Chung Chau ;
Cardon, Melissa ;
Ikeda, Shuya ;
Ono, Hiromasa ;
Bono, Hidemasa ;
Kato, Masaki ;
Hashimoto, Kosuke ;
Bonetti, Alessandro ;
Kobayashi, Norio ;
Shin, Jay ;
de Hoon, Michiel ;
Hayashizakil, Yoshihide ;
Carninci, Piero ;
Kawaji, Hideya ;
Kasukawa, Takeya .
NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) :D892-D898
[2]   Brain-Enriched Coding and Long Non-coding RNA Genes Are Overrepresented in Recurrent Neurodevelopmental Disorder CNVs [J].
Alinejad-Rokny, Hamid ;
Heng, Julian I. T. ;
Forrest, Alistair R. R. .
CELL REPORTS, 2020, 33 (04)
[3]   Permutation importance: a corrected feature importance measure [J].
Altmann, Andre ;
Tolosi, Laura ;
Sander, Oliver ;
Lengauer, Thomas .
BIOINFORMATICS, 2010, 26 (10) :1340-1347
[4]   AutDB: a gene reference resource for autism research [J].
Basu, Saumyendra N. ;
Kollu, Ravi ;
Banerjee-Basu, Sharmila .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D832-D836
[5]   Refining analyses of copy number variation identifies specific genes associated with developmental delay [J].
Coe, Bradley P. ;
Witherspoon, Kali ;
Rosenfeld, Jill A. ;
van Bon, Bregje W. M. ;
Vulto-van Silfhout, Anneke T. ;
Bosco, Paolo ;
Friend, Kathryn L. ;
Baker, Carl ;
Buono, Serafino ;
Vissers, Lisenka E. L. M. ;
Schuurs-Hoeijmakers, Janneke H. ;
Hoischen, Alex ;
Pfundt, Rolph ;
Krumm, Nik ;
Carvill, Gemma L. ;
Li, Deana ;
Amaral, David ;
Brown, Natasha ;
Lockhart, Paul J. ;
Scheffer, Ingrid E. ;
Alberti, Antonino ;
Shaw, Marie ;
Pettinato, Rosa ;
Tervo, Raymond ;
de Leeuw, Nicole ;
Reijnders, Margot R. F. ;
Torchia, Beth S. ;
Peeters, Hilde ;
O'Roak, Brian J. ;
Fichera, Marco ;
Hehir-Kwa, Jayne Y. ;
Shendure, Jay ;
Mefford, Heather C. ;
Haan, Eric ;
Gecz, Jozef ;
de Vries, Bert B. A. ;
Romano, Corrado ;
Eichler, Evan E. .
NATURE GENETICS, 2014, 46 (10) :1063-1071
[6]   A copy number variation morbidity map of developmental delay [J].
Cooper, Gregory M. ;
Coe, Bradley P. ;
Girirajan, Santhosh ;
Rosenfeld, Jill A. ;
Vu, Tiffany H. ;
Baker, Carl ;
Williams, Charles ;
Stalker, Heather ;
Hamid, Rizwan ;
Hannig, Vickie ;
Abdel-Hamid, Hoda ;
Bader, Patricia ;
McCracken, Elizabeth ;
Niyazov, Dmitriy ;
Leppig, Kathleen ;
Thiese, Heidi ;
Hummel, Marybeth ;
Alexander, Nora ;
Gorski, Jerome ;
Kussmann, Jennifer ;
Shashi, Vandana ;
Johnson, Krys ;
Rehder, Catherine ;
Ballif, Blake C. ;
Shaffer, Lisa G. ;
Eichler, Evan E. .
NATURE GENETICS, 2011, 43 (09) :838-U44
[7]   DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources [J].
Firth, Helen V. ;
Richards, Shola M. ;
Bevan, A. Paul ;
Clayton, Stephen ;
Corpas, Manuel ;
Rajan, Diana ;
Van Vooren, Steven ;
Moreau, Yves ;
Pettett, Roger M. ;
Carter, Nigel P. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (04) :524-533
[8]   Updates in the Genetic Evaluation of the Child with Global Developmental Delay or Intellectual Disability [J].
Flore, Leigh Anne ;
Milunsky, Jeff M. .
SEMINARS IN PEDIATRIC NEUROLOGY, 2012, 19 (04) :173-180
[9]   CONAN: copy number variation analysis software for genome-wide association studies [J].
Forer, Lukas ;
Schoenherr, Sebastian ;
Weissensteiner, Hansi ;
Haider, Florian ;
Kluckner, Thomas ;
Gieger, Christian ;
Wichmann, Heinz-Erich ;
Specht, Guenther ;
Kronenberg, Florian ;
Kloss-Brandstaetter, Anita .
BMC BIOINFORMATICS, 2010, 11
[10]   Toward evidence-based medical statistics.: 1:: The P value fallacy [J].
Goodman, SN .
ANNALS OF INTERNAL MEDICINE, 1999, 130 (12) :995-1004