The Extraction Method of DNA Microarray Features Based on Modified F Statistics vs. Classifier Based on Rough Mereology

Cited by: 0
Authors
Artiemjew, Piotr [1]
Affiliation
[1] Univ Warmia & Mazury, Dept Math & Comp Sci, Olsztyn, Poland
Source
FOUNDATIONS OF INTELLIGENT SYSTEMS | 2011 / Vol. 6804
Keywords
rough mereology; granular computing; rough sets; DNA microarrays; feature extraction
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The paradigm of Granular Computing has emerged quite recently as an area of research in its own right; in particular, it is pursued within rough set theory, initiated by Zdzislaw Pawlak. Granules of knowledge can be used for the approximation of knowledge. Another natural application of granular structures is their use in the classification process. In this work we apply the granular classifier based on rough mereology, recently studied by Polkowski and Artiemjew (the 8_v1_w4 algorithm), to the exploration of DNA microarrays. Gene extraction methods are an indispensable element of DNA microarray analysis, because such data have a high number of attributes and a relatively small number of objects, which in turn leads to overfitting during classification. In this paper we present one of our approaches to gene separation, based on modified F statistics. F statistics are widely used in binary decision systems; our modification extends them to multiple decision classes and applies a particular method for choosing the best genes after the statistics have been calculated for particular pairs of decision classes. The results obtained with modified F statistics are comparable to, or even better than, the results obtained by other methods on data from the Advanced Track of the recent DNA Microarray data mining competition.
Pages: 33-42
Number of pages: 10