Feature Selection Based on Pairwise Classification Performance

被引:0
作者
Dreiseitl, Stephan [1 ]
Osl, Melanie [2 ]
机构
[1] Upper Austria Univ Appl Sci, Dept Software Engn, A-4232 Hagenberg, Austria
[2] Univ Hlth Sci, Med Informat & Technol, Dept Biomed Engn, A-6060 Halle, Germany
来源
COMPUTER AIDED SYSTEMS THEORY - EUROCAST 2009 | 2009年 / 5717卷
关键词
Feature selection; feature ranking; pairwise evaluation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The process of feature selection is an important first step in building machine learning models. Feature selection algorithms can be grouped into wrappers and filters: the former use machine learning models to evaluate feature sets, the latter use other criteria to evaluate features individually. We present a new approach to feature selection that combines advantages of both wrapper as well as filter approaches, by using logistic regression and the area, under the ROC curve (AUC) to evaluate pairs of features. After choosing as starting feature the one with the highest individual discriminatory power, we incrementally rank features by choosing as next feature the one that achieves the highest, AUC in combination with an already chosen feature. To evaluate our approach, we compared it to standard filter and wrapper algorithms. Using two data sets from the biomedical domain, we are able to demonstrate that the performance of our approach exceeds that of filter methods, while being comparable to wrapper methods at smaller computational cost.
引用
收藏
页码:769 / +
页数:3
相关论文
共 16 条
  • [1] [Anonymous], Journal of machine learning research
  • [2] Bo TH, 2002, GENOME BIOL, V3
  • [3] Dreiseitl S, 2001, J BIOMED INFORM, V34, P28, DOI 10.1006/jbin.2001.10004
  • [4] Benchmarking attribute selection techniques for discrete class data mining
    Hall, MA
    Holmes, G
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (06) : 1437 - 1447
  • [5] THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE
    HANLEY, JA
    MCNEIL, BJ
    [J]. RADIOLOGY, 1982, 143 (01) : 29 - 36
  • [6] Pairwise feature evaluation for constructing reduced representations
    Harol, Artsiom
    Lai, Carmen
    Pezkalska, Elzbieta
    Duin, Robert P. W.
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2007, 10 (01) : 55 - 68
  • [7] Hosmer W., 2000, Applied Logistic Regression, VSecond
  • [8] JOHN G, 1994, P 11 INT C MACH LEAR
  • [9] Kohavi R., 1998, The Springer International Series in Engineering and Computer Science, V453, P33, DOI [10.1007/978-1-4615-5725-8, DOI 10.1007/978-1-4615-5725-8]
  • [10] Kononenko I., 1994, LECT NOTES COMPUT SC, V784, P171, DOI [10.1007/3-540-57868-4_57/COVER, DOI 10.1007/3-540-57868-457, 10.1007/3-540-57868-4_57, DOI 10.1007/3-540-57868-4_57]