Multiclass MTS for Simultaneous Feature Selection and Classification

Cited by: 51
Authors
Su, Chao-Ton [1]
Hsiao, Yu-Hsiang [1]
Affiliations
[1] Natl Tsing Hua Univ, Dept Ind Engn & Engn Management, Hsinchu 30013, Taiwan
Keywords
Classification; feature selection; multiclass problem; Mahalanobis-Taguchi system (MTS); weighted Mahalanobis distance; Gram-Schmidt orthogonalization process; gestational diabetes mellitus; MAHALANOBIS DISTANCE; VECTOR MACHINES; ROBUSTNESS; WOMEN; RISK;
DOI
10.1109/TKDE.2008.128
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The multiclass Mahalanobis-Taguchi system (MMTS), an extension of MTS, is developed for simultaneous multiclass classification and feature selection. In MMTS, the multiclass measurement scale is constructed by establishing an individual Mahalanobis space for each class. To increase the validity of the measurement scale, the Gram-Schmidt process is applied to mutually orthogonalize the features and eliminate multicollinearity. The important features are identified using orthogonal arrays and the signal-to-noise ratio, and are then used to construct a reduced-model measurement scale. The contribution of each important feature to classification is derived from its effect gain and used to build a weighted Mahalanobis distance, which serves as the distance metric for classification in MMTS. Using the reduced-model measurement scale, an unknown example is assigned to the class with the minimum weighted Mahalanobis distance, considering only the important features. To evaluate the effectiveness of MMTS, a numerical experiment is conducted; the results show that MMTS outperforms other well-known algorithms in both classification accuracy and feature selection efficiency. Finally, a real case of gestational diabetes mellitus is studied, and the results indicate the practicality of MMTS in real-world applications.
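The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`fit_class_space`, `weighted_md`, `classify`), the toy data, and the equal feature weights are all assumptions made for illustration. In particular, the abstract derives the weights from effect gains obtained with orthogonal arrays and signal-to-noise ratios, a step this sketch does not attempt; here the weights are simply passed in by the caller.

```python
from math import sqrt

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def fit_class_space(rows):
    """Build one Mahalanobis space (hypothetical structure) from a class's rows."""
    n, k = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(k)]
    stds = [sqrt(sum((r[j] - means[j]) ** 2 for r in rows) / n) for j in range(k)]
    # Standardized feature columns z_j.
    cols = [[(r[j] - means[j]) / stds[j] for r in rows] for j in range(k)]
    # Gram-Schmidt: u_j = z_j - sum_{i<j} c_ji * u_i, so the u_j are mutually orthogonal.
    ortho, coefs = [], []
    for j in range(k):
        u, c_row = cols[j][:], []
        for ui in ortho:
            c = dot(cols[j], ui) / dot(ui, ui)
            c_row.append(c)
            u = [a - c * b for a, b in zip(u, ui)]
        ortho.append(u)
        coefs.append(c_row)
    variances = [dot(u, u) / n for u in ortho]
    return {"means": means, "stds": stds, "coefs": coefs, "vars": variances}

def weighted_md(space, x, weights):
    """Weighted Mahalanobis distance of x to one class space.

    Assumption: weights are non-negative and sum to 1; a weight of 0
    ignores that orthogonalized feature.
    """
    z = [(x[j] - space["means"][j]) / space["stds"][j] for j in range(len(x))]
    u = []
    for j, zj in enumerate(z):
        # Apply the stored Gram-Schmidt coefficients to the new example.
        u.append(zj - sum(c * ui for c, ui in zip(space["coefs"][j], u)))
    return sum(w * uj * uj / v for w, uj, v in zip(weights, u, space["vars"]))

def classify(spaces, x, weights):
    """Assign x to the class with the minimum weighted Mahalanobis distance."""
    return min(spaces, key=lambda label: weighted_md(spaces[label], x, weights))

# Toy two-class, two-feature training data (made up for illustration).
TRAIN = {
    "A": [(1.0, 2.0), (2.0, 1.0), (3.0, 4.0), (4.0, 3.0)],
    "B": [(10.0, 12.0), (11.0, 10.0), (12.0, 13.0), (13.0, 11.0)],
}
spaces = {label: fit_class_space(rows) for label, rows in TRAIN.items()}
weights = [0.5, 0.5]  # equal weights as a placeholder for effect-gain weights

print(classify(spaces, (2.0, 2.5), weights))
print(classify(spaces, (12.0, 11.0), weights))
```

With equal weights that sum to 1, the distance reduces to the usual scaled Mahalanobis distance computed through Gram-Schmidt, whose average over a class's own training examples is 1. The sketch keeps every feature; a reduced model would rebuild each class space on the selected features only.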
Pages: 192-205
Page count: 14
Related papers
48 in total
  • [1] EFFICIENT CLASSIFICATION FOR MULTICLASS PROBLEMS USING MODULAR NEURAL NETWORKS
    ANAND, R
    MEHROTRA, K
    MOHAN, CK
    RANKA, S
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) : 117 - 124
  • [2] [Anonymous], [No title captured]
  • [3] [Anonymous], 2001, Pattern Classification
  • [4] Asharaf S, 2007, P 24 INT C MACH LEAR
  • [5] Caring for a woman at high risk for type 2 diabetes
    Barger, MK
    Bidgood-Wilson, M
    [J]. JOURNAL OF MIDWIFERY & WOMENS HEALTH, 2006, 51 (03) : 222 - 226
  • [6] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [7] Support vector machines for histogram-based image classification
    Chapelle, O
    Haffner, P
    Vapnik, VN
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) : 1055 - 1064
  • [8] Waist circumference is the key risk factor for diabetes in Korean women with history of gestational diabetes
    Cho, NH
    Jang, HC
    Park, HK
    Cho, YW
    [J]. DIABETES RESEARCH AND CLINICAL PRACTICE, 2006, 71 (02) : 177 - 183
  • [9] Exploring the effects of chemical composition in hot rolled steel product using Mahalanobis distance scale under Mahalanobis-Taguchi system
    Das, Prasun
    Datta, Shubhabrata
    [J]. COMPUTATIONAL MATERIALS SCIENCE, 2007, 38 (04) : 671 - 677
  • [10] Dietterich T. G., 1995, Journal of Artificial Intelligence Research, V2, P263