An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray

Cited by: 0
Authors
Juan Ramos
José A. Castellanos-Garzón
Alfonso González-Briones
Juan F. de Paz
Juan M. Corchado
Affiliations
[1] University of Salamanca
[2] IBSAL/BISITE Research Group
[3] University of Coimbra
[4] CISUC
[5] ECOS Research Group
[6] Osaka Institute of Technology
Source
Interdisciplinary Sciences: Computational Life Sciences | 2017 / Vol. 9
Keywords
Gene selection; Filter method; Multi-agent system; Clustering; Classification; Machine learning; Visual analytics; DNA-microarray
DOI
Not available
Abstract
Gene selection is a major research area in microarray analysis that seeks to discover genes that are differentially expressed for a particular target annotation. Such genes, often called informative genes, can differentiate tissue samples belonging to different classes of the studied disease. Although a large number of proposals exist, the complexity of this problem remains a challenge today. This research proposes a gene selection approach based on a clustering-based multi-agent system. The proposal manages different filter methods and gene clustering through coordinated agents to discover informative gene subsets. To assess the reliability of our approach, we used four important public gene expression datasets: two lung cancer datasets, a colon cancer dataset, and a leukemia dataset. The results were validated through cluster validity measures, visual analytics, and a classifier, and were compared with other gene selection methods, demonstrating the reliability of our proposal.
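
The abstract does not specify which filter methods the agents manage or how their outputs are combined, so the following is only a minimal sketch of the general idea of combining several filter methods, not the authors' system. The two scoring functions, the top-k intersection rule, the function names, and the toy expression values are all assumptions made for illustration; the gene clustering stage and the agent coordination are omitted.

```python
from statistics import mean, pstdev


def split_by_class(values, labels):
    """Split one gene's expression values into the two sample classes (0 and 1)."""
    a = [v for v, y in zip(values, labels) if y == 0]
    b = [v for v, y in zip(values, labels) if y == 1]
    return a, b


def snr_score(values, labels):
    """Signal-to-noise filter: |mean0 - mean1| / (sd0 + sd1)."""
    a, b = split_by_class(values, labels)
    denom = pstdev(a) + pstdev(b)
    return abs(mean(a) - mean(b)) / denom if denom > 0 else 0.0


def mean_diff_score(values, labels):
    """Simple filter: absolute difference between the class means."""
    a, b = split_by_class(values, labels)
    return abs(mean(a) - mean(b))


def top_k(expr, labels, score_fn, k):
    """Return the k genes ranked highest by one filter."""
    scores = {gene: score_fn(vals, labels) for gene, vals in expr.items()}
    return set(sorted(scores, key=scores.get, reverse=True)[:k])


def select_genes(expr, labels, filters, k):
    """Hypothetical combination rule: keep genes that every filter ranks in its top k."""
    return set.intersection(*(top_k(expr, labels, f, k) for f in filters))


# Toy data (invented): four genes measured on six samples, three per class.
expr = {
    "g1": [1.0, 1.2, 0.9, 5.0, 5.1, 4.8],
    "g2": [2.0, 2.1, 1.9, 2.0, 2.2, 1.8],
    "g3": [0.5, 0.4, 0.6, 3.0, 3.2, 2.9],
    "g4": [7.0, 3.0, 5.0, 6.0, 4.0, 5.5],
}
labels = [0, 0, 0, 1, 1, 1]

chosen = select_genes(expr, labels, [snr_score, mean_diff_score], k=2)
print(sorted(chosen))
```

In this toy example both filters agree on the two genes whose class means differ most, so the intersection keeps both. A real pipeline would presumably handle disagreement between filters differently, and would add the clustering and validation steps the abstract mentions.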
Pages: 1-13
Page count: 12