Integration of transcriptomic data identifies key hallmark genes in hypertrophic cardiomyopathy

被引:10
作者
Xu, Jing [1 ]
Liu, Xiangdong [2 ]
Dai, Qiming [3 ]
机构
[1] Southeast Univ, ZhongDa Hosp, Dept Clin Lab, Nanjing, Peoples R China
[2] Southeast Univ, Inst Life Sci, Nanjing, Peoples R China
[3] Southeast Univ, ZhongDa Hosp, Dept Cardiol, Nanjing, Peoples R China
关键词
Hypertrophic cardiomyopathy; Microarray; RNA-Seq; Classification; JAK2; FEATURE-SELECTION; EXPRESSION; CYTOSCAPE; PATIENT; MODELS;
D O I
10.1186/s12872-021-02147-7
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Hypertrophic cardiomyopathy (HCM) represents one of the most common inherited heart diseases. To identify key molecules involved in the development of HCM, gene expression patterns of the heart tissue samples in HCM patients from multiple microarray and RNA-seq platforms were investigated. Methods The significant genes were obtained through the intersection of two gene sets, corresponding to the identified differentially expressed genes (DEGs) within the microarray data and within the RNA-Seq data. Those genes were further ranked using minimum-Redundancy Maximum-Relevance feature selection algorithm. Moreover, the genes were assessed by three different machine learning methods for classification, including support vector machines, random forest and k-Nearest Neighbor. Results Outstanding results were achieved by taking exclusively the top eight genes of the ranking into consideration. Since the eight genes were identified as candidate HCM hallmark genes, the interactions between them and known HCM disease genes were explored through the protein-protein interaction (PPI) network. Most candidate HCM hallmark genes were found to have direct or indirect interactions with known HCM diseases genes in the PPI network, particularly the hub genes JAK2 and GADD45A. Conclusions This study highlights the transcriptomic data integration, in combination with machine learning methods, in providing insight into the key hallmark genes in the genetic etiology of HCM.
引用
收藏
页数:10
相关论文
共 50 条
[1]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[2]   Inhibition of Jak2 phosphorylation attenuates pressure overload cardiac hypertrophy [J].
Beckles, Daniel L. ;
Mascareno, Eduardo ;
Siddiqui, M. A. Q. .
VASCULAR PHARMACOLOGY, 2006, 45 (06) :350-357
[3]   ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks [J].
Bindea, Gabriela ;
Mlecnik, Bernhard ;
Hackl, Hubert ;
Charoentong, Pornpimol ;
Tosolini, Marie ;
Kirilovsky, Amos ;
Fridman, Wolf-Herman ;
Pages, Franck ;
Trajanoski, Zlatko ;
Galon, Jerome .
BIOINFORMATICS, 2009, 25 (08) :1091-1093
[4]  
Boateng E.Y., 2020, J. Data Anal. Inf. Process., V8, P341, DOI DOI 10.4236/JDAIP.2020.84020
[5]   Transcriptome Profiling in Human Diseases: New Advances and Perspectives [J].
Casamassimi, Amelia ;
Federico, Antonio ;
Rienzo, Monica ;
Esposito, Sabrina ;
Ciccodicola, Alfredo .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (08)
[6]   Integration of RNA-Seq data with heterogeneous microarray data for breast cancer profiling [J].
Castillo, Daniel ;
Manuel Galvez, Juan ;
Javier Herrera, Luis ;
San Roman, Belen ;
Rojas, Fernando ;
Rojas, Ignacio .
BMC BIOINFORMATICS, 2017, 18
[7]   5′RNA-Seq identifies Fhl1 as a genetic modifier in cardiomyopathy [J].
Christodoulou, Danos C. ;
Wakimoto, Hiroko ;
Onoue, Kenji ;
Eminaga, Seda ;
Gorham, Joshua M. ;
DePalma, Steve R. ;
Herman, Daniel S. ;
Teekakirikul, Polakit ;
Conner, David A. ;
McKean, David M. ;
Domenighetti, Andrea A. ;
Aboukhalil, Anton ;
Chang, Stephen ;
Srivastava, Gyan ;
McDonough, Barbara ;
De Jager, Philip L. ;
Chen, Ju ;
Bulyk, Martha L. ;
Muehlschlege, Jochen D. ;
Seidman, Christine E. ;
Seidman, J. G. .
JOURNAL OF CLINICAL INVESTIGATION, 2014, 124 (03) :1364-1370
[8]   Gene selection and classification of microarray data using random forest -: art. no. 3 [J].
Díaz-Uriarte, R ;
de Andrés, SA .
BMC BIOINFORMATICS, 2006, 7 (1)
[9]  
Ding Chris, 2005, Journal of Bioinformatics and Computational Biology, V3, P185, DOI 10.1142/S0219720005001004
[10]   STAR: ultrafast universal RNA-seq aligner [J].
Dobin, Alexander ;
Davis, Carrie A. ;
Schlesinger, Felix ;
Drenkow, Jorg ;
Zaleski, Chris ;
Jha, Sonali ;
Batut, Philippe ;
Chaisson, Mark ;
Gingeras, Thomas R. .
BIOINFORMATICS, 2013, 29 (01) :15-21