Dimension Reduction of Microarray Data Using Gene Ontology and Correlation Filter

被引:1
作者
Banerjee, Ayan [1 ]
Pati, Soumen Kumar [2 ]
Gupta, Manan Kumar [2 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci, Jalpaiguri, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Bioinformat, Nadia, W Bengal, India
来源
COMPUTATIONAL INTELLIGENCE IN PATTERN RECOGNITION, CIPR 2020 | 2020年 / 1120卷
关键词
Gene ontology; Protein-protein interaction; Pearson's correlation coefficient; Gene selection;
D O I
10.1007/978-981-15-2449-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis of every variable at a microscopic level is not a feasible way. It might take a long time to perform any meaningful analysis. So, the valuable time and money are wasted for analysis of high dimensional data. In this paper, a better way is given to deal with high dimensional data and proposed a novel dimension reduction technique based on gene ontology and Pearson's correlation coefficient. The names of genes are extracted from microarray data and classify them based on biological processes of gene ontology and find their relation with cellular components. Next, the gene correlation factor is identified on every network of each of the group. Lastly, the independent component (IC) value of the genes related to the gene ontology is calculated and selects only those genes having the highest IC value of their corresponding network and eliminates the rest of the genes. It reduces size of the dataset at least 40% from its original size.
引用
收藏
页码:303 / 313
页数:11
相关论文
共 50 条