Clustering Based Feature Selection using Extreme Learning Machines for Text Classification

被引:0
作者
Roul, Rajendra Kumar [1 ]
Gugnani, Shashank [1 ]
Kalpeshbhai, Shah Mit [1 ]
机构
[1] BITS Pilani, Dept Comp Sci, KK Birla Goa Campus, Pilani 403726, Goa, India
来源
2015 ANNUAL IEEE INDIA CONFERENCE (INDICON) | 2015年
关键词
Classification; ELM; K-Means; ML-ELM; SVM; Wordnet;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The expansion of the dynamic Web increases the digital documents, which has attracted many researchers to work in the field of text classification. It is an important and well studied area of machine learning with a variety of modern applications. A good feature selection is of paramount importance to increase the efficiency of the classifiers working on text data. Choosing the most relevant features out of what can be an incredibly large set of data, is particularly important for accurate text classification. This paper is a motivation in that direction where we propose a new clustering based feature selection technique that reduces the feature size. Traditional k-means clustering technique along with TF-IDF and Wordnet helps us to form a quality and reduced feature vector to train the Extreme Learning Machine (ELM) and Multi-layer ELM (ML-ELM) which have been used as the classifiers for text classification. The experimental work has been carried out on 20-Newsgroups and DMOZ datasets. Results on these two standard datasets demonstrate the efficiency of our approach using ELM and ML-ELM as the classifiers over the state-of-the-art classifiers.
引用
收藏
页数:6
相关论文
共 16 条
[11]   Learning capability and storage capacity of two-hidden-layer feedforward networks [J].
Huang, GB .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (02) :274-281
[12]   Extreme learning machine: Theory and applications [J].
Huang, Guang-Bin ;
Zhu, Qin-Yu ;
Siew, Chee-Kheong .
NEUROCOMPUTING, 2006, 70 (1-3) :489-501
[13]  
Kasun H. G. V., 2013, IEEE INTELI IN PRESS
[14]   A fast and accurate online sequential learning algorithm for feedforward networks [J].
Liang, Nan-Ying ;
Huang, Guang-Bin ;
Saratchandran, P. ;
Sundararajan, N. .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (06) :1411-1423
[15]  
Qiu XP, 2011, LECT NOTES ARTIF INT, V6634, P50, DOI 10.1007/978-3-642-20841-6_5
[16]   Machine learning in automated text categorization [J].
Sebastiani, F .
ACM COMPUTING SURVEYS, 2002, 34 (01) :1-47