Clustering Based Feature Selection using Extreme Learning Machines for Text Classification

Cited by: 0
Authors
Roul, Rajendra Kumar [1]
Gugnani, Shashank [1]
Kalpeshbhai, Shah Mit [1]
Affiliation
[1] BITS Pilani, Dept Comp Sci, KK Birla Goa Campus, Pilani 403726, Goa, India
Source
2015 ANNUAL IEEE INDIA CONFERENCE (INDICON) | 2015
Keywords
Classification; ELM; K-Means; ML-ELM; SVM; Wordnet;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
The growth of the dynamic Web keeps increasing the volume of digital documents, which has drawn many researchers to text classification, an important and well-studied area of machine learning with a variety of modern applications. Good feature selection is essential to the efficiency of classifiers working on text data: choosing the most relevant features from what can be a very large feature set is particularly important for accurate text classification. Motivated by this, we propose a new clustering-based feature selection technique that reduces the feature size. Traditional k-means clustering, together with TF-IDF and WordNet, is used to form a reduced, high-quality feature vector for training the Extreme Learning Machine (ELM) and Multi-layer ELM (ML-ELM), which serve as the classifiers for text classification. Experiments were carried out on the 20-Newsgroups and DMOZ datasets. Results on these two standard datasets demonstrate the efficiency of our approach, with ELM and ML-ELM as classifiers, compared with state-of-the-art classifiers.
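The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that terms are clustered by their TF-IDF columns with plain k-means, that the term nearest each centroid is kept as the cluster's representative, and that a basic single-hidden-layer ELM (random hidden weights, output weights solved by pseudo-inverse) is trained on the reduced matrix. The WordNet step and ML-ELM are omitted, and every function name, parameter, and toy value here is hypothetical.

```python
import numpy as np

def tfidf(counts):
    """TF-IDF from a (documents x terms) count matrix (smoothed IDF)."""
    counts = np.asarray(counts, dtype=float)
    tf = counts / np.maximum(counts.sum(axis=1, keepdims=True), 1.0)
    df = (counts > 0).sum(axis=0)
    idf = np.log((1.0 + counts.shape[0]) / (1.0 + df)) + 1.0
    return tf * idf

def assign(points, centers):
    """Index of the nearest center (squared Euclidean) for each point."""
    dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1)

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = assign(points, centers)
        for j in range(k):
            members = points[labels == j]
            if len(members):            # leave empty clusters where they are
                centers[j] = members.mean(axis=0)
    return assign(points, centers), centers

def select_features(X, k, seed=0):
    """Assumed selection rule: keep the term closest to each centroid."""
    terms = X.T                          # one row per term
    labels, centers = kmeans(terms, k, seed=seed)
    keep = []
    for j in range(k):
        idx = np.flatnonzero(labels == j)
        if idx.size:
            d = ((terms[idx] - centers[j]) ** 2).sum(axis=1)
            keep.append(int(idx[d.argmin()]))
    return sorted(keep)

class ELM:
    """Basic ELM: random sigmoid hidden layer, least-squares output layer."""
    def __init__(self, n_hidden=50, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        targets = np.eye(int(y.max()) + 1)[y]          # one-hot labels
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        self.beta = np.linalg.pinv(self._hidden(X)) @ targets
        return self

    def predict(self, X):
        return (self._hidden(X) @ self.beta).argmax(axis=1)

# Toy data (made up): 6 documents, 6 terms, 2 classes.
counts = np.array([[3, 2, 0, 0, 1, 0],
                   [2, 3, 1, 0, 0, 0],
                   [4, 1, 0, 1, 0, 0],
                   [0, 0, 3, 2, 0, 1],
                   [0, 1, 2, 3, 0, 0],
                   [1, 0, 4, 2, 0, 1]])
y = np.array([0, 0, 0, 1, 1, 1])

X = tfidf(counts)
selected = select_features(X, k=3)       # indices of retained terms
model = ELM(n_hidden=20).fit(X[:, selected], y)
preds = model.predict(X[:, selected])
```

Keeping one representative term per cluster is only one possible selection rule; ranking the terms inside each cluster by TF-IDF weight, or grouping them by WordNet similarity as the abstract suggests, would slot into `select_features` without changing the classifier.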
Pages: 6