Feature selection and classification of leukocytes using random forest

被引:0
作者
Mukesh Saraswat
K. V. Arya
机构
[1] ABV-Indian Institute of Information Technology and Management,
来源
Medical & Biological Engineering & Computing | 2014年 / 52卷
关键词
Leukocytes classification; Random forest; Gini importance; Dimensionality reduction; Feature selection;
D O I
暂无
中图分类号
学科分类号
摘要
In automatic segmentation of leukocytes from the complex morphological background of tissue section images, a vast number of artifacts/noise are also extracted causing large amount of multivariate data generation. This multivariate data degrades the performance of a classifier to discriminate between leukocytes and artifacts/noise. However, the selection of prominent features plays an important role in reducing the computational complexity and increasing the performance of the classifier as compared to a high-dimensional features space. Therefore, this paper introduces a novel Gini importance-based binary random forest feature selection method. Moreover, the random forest classifier is used to classify the extracted objects into artifacts, mononuclear cells, and polymorphonuclear cells. The experimental results establish that the proposed method effectively eliminates the irrelevant features, maintaining the high classification accuracy as compared to other feature reduction methods.
引用
收藏
页码:1041 / 1052
页数:11
相关论文
共 146 条
[1]  
Alon U(1999)Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays Proc Natl Acad Sci 96 6745-6750
[2]  
Barkai N(2014)Automatic feature selection of motor imagery EEG signals using differential evolution and learning automata Med Biol Eng Comput 52 131-139
[3]  
Notterman DA(2001)Random forests Mach Learn 45 5-32
[4]  
Gish K(2001)Backward sequential elimination for sparse vector subset selection Signal Process 81 1849-1864
[5]  
Ybarra S(2010)Feature selection on movement imagery discrimination and attention detection Med Biol Eng Comput 48 331-341
[6]  
Mack D(2006)Gene selection and classification of microarray data using random forest BMC Bioinform 7 3-770
[7]  
Levine AJ(2010)Sensitivity versus accuracy in multiclass problems using memetic pareto evolutionary neural networks IEEE Trans Neural Netw 21 750-3145
[8]  
Bhattacharyya S(2005)Proteomic mass spectra classification using decision tree based ensemble methods Bioinformatics 21 3138-537
[9]  
Sengupta A(1999)Molecular classification of cancer: class discovery and class prediction by gene expression monitoring Science 286 531-171
[10]  
Chakraborti T(2009)Histopathological image analysis: a review IEEE Rev Biomed Eng 2 147-422