Dimension Reduction of Microarray Data Using Gene Ontology and Correlation Filter

被引:1
作者
Banerjee, Ayan [1 ]
Pati, Soumen Kumar [2 ]
Gupta, Manan Kumar [2 ]
机构
[1] Jalpaiguri Govt Engn Coll, Dept Comp Sci, Jalpaiguri, W Bengal, India
[2] Maulana Abul Kalam Azad Univ Technol, Dept Bioinformat, Nadia, W Bengal, India
来源
COMPUTATIONAL INTELLIGENCE IN PATTERN RECOGNITION, CIPR 2020 | 2020年 / 1120卷
关键词
Gene ontology; Protein-protein interaction; Pearson's correlation coefficient; Gene selection;
D O I
10.1007/978-981-15-2449-3_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis of every variable at a microscopic level is not a feasible way. It might take a long time to perform any meaningful analysis. So, the valuable time and money are wasted for analysis of high dimensional data. In this paper, a better way is given to deal with high dimensional data and proposed a novel dimension reduction technique based on gene ontology and Pearson's correlation coefficient. The names of genes are extracted from microarray data and classify them based on biological processes of gene ontology and find their relation with cellular components. Next, the gene correlation factor is identified on every network of each of the group. Lastly, the independent component (IC) value of the genes related to the gene ontology is calculated and selects only those genes having the highest IC value of their corresponding network and eliminates the rest of the genes. It reduces size of the dataset at least 40% from its original size.
引用
收藏
页码:303 / 313
页数:11
相关论文
共 50 条
[41]   Gene Ontology Assisted Exploratory Microarray Clustering and Its Application to Cancer [J].
Macintyre, Geoff ;
Bailey, James ;
Gustafsson, Daniel ;
Boussioutas, Alex ;
Haviv, Izhak ;
Kowalczyk, Adam .
PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2008, 5265 :400-+
[42]   Data mining powered by the gene ontology [J].
Manda, Prashanti .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 10 (03)
[43]   Mining Gene Ontology Data with AGENDA [J].
Ovezmyradov, Guvanch ;
Lu, Qianhao ;
Goepfert, Martin C. .
BIOINFORMATICS AND BIOLOGY INSIGHTS, 2012, 6 :63-67
[44]   A multi-filter enhanced genetic ensemble system for gene selection and sample classification of microarray data [J].
Pengyi Yang ;
Bing B Zhou ;
Zili Zhang ;
Albert Y Zomaya .
BMC Bioinformatics, 11
[45]   Assessing gene stability and gene affinity in microarray data classification using an extended relieff algorithm [J].
Srivastava, Neha ;
Tayal, Devendra K. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (15) :45761-45776
[46]   Assessing gene stability and gene affinity in microarray data classification using an extended relieff algorithm [J].
Neha Srivastava ;
Devendra K. Tayal .
Multimedia Tools and Applications, 2024, 83 :45761-45776
[47]   Targeted cellular process profiling approach for uterine leiomyoma using cDNA microarray, proteomics and gene ontology analysis [J].
Ahn, WS ;
Kim, KW ;
Bae, SM ;
Yoon, JH ;
Lee, JM ;
Namkoong, SE ;
Kim, JH ;
Kim, CK ;
Lee, YJ ;
Kim, YW .
INTERNATIONAL JOURNAL OF EXPERIMENTAL PATHOLOGY, 2003, 84 (06) :267-279
[48]   A hybrid filter/wrapper gene selection method for microarray classification [J].
Ni, B ;
Liu, J .
PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, :2537-2542
[49]   Gene selection of microarray data using Heatmap Analysis and Graph Neural Network [J].
Pati, Soumen Kumar ;
Banerjee, Ayan ;
Manna, Sweta .
APPLIED SOFT COMPUTING, 2023, 135
[50]   Gene Selection for Microarray Data Classification Using Hybrid Meta-Heuristics [J].
Dif, Nassima ;
Attaoui, Mohamed Walid ;
Elberrichi, Zakaria .
MODELLING AND IMPLEMENTATION OF COMPLEX SYSTEMS, 2019, 64 :119-132