Efficient classifiers for multi-class classification problems

Times cited: 22
Authors
Lin, Hung-Yi [1]
Affiliations
[1] Natl Taichung Univ Sci & Technol, Dept Distribut Management, Taichung, Taiwan
Keywords
Multivariate analysis; Multi-class problems; Feature evaluation; Feature selection; Feature extraction; Inductive learning; FEATURE-SELECTION; INFORMATION; DISCOVERY; CRITERIA
DOI
10.1016/j.dss.2012.02.014
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification problems have become more complex and intricate in modern applications in the face of continuous data explosion. In addition to great quantities of features and large numbers of instances, modern classification applications are continuously developed with multiple classes (objectives). The ever-increasing growth in data quantity and computation complexity has largely deteriorated the performance and accuracy of classification models. In order to deal with such situations, multivariate statistical analyses are adopted in this paper. Multivariate statistical analyses have two advantages. First, they can explore the relationships between variables and find the most characterizing features of the observed data. Second, they can solve problems which are stalled by high dimensionality. In this paper, the first advantage is applied to the selection of relevant features and the second is employed to generate the multivariate classifier. Experimental results show that our model can significantly improve classification training time by combining a compact subset of relevant features without the loss of accuracy in multi-class classification problems. In addition, the discrimination degree of our classifier outperforms other conventional classifiers. (C) 2012 Elsevier B.V. All rights reserved.
Pages: 473-481
Number of pages: 9