Improving Recall of k-Nearest Neighbor Algorithm for Classes of Uneven Size

被引:1
作者
Boiculese, Vasile Lucian [1 ]
Dimitriu, Gabriel [1 ]
Moscalu, Mihaela [1 ]
机构
[1] Grigore T Popa Univ Med & Pharm, Dept Med Informat & Biostat, Iasi, Romania
来源
2013 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB) | 2013年
关键词
k-nearest neighbor; classification; uneven size classes; RULE;
D O I
10.1109/EHB.2013.6707403
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The k-nearest neighbor algorithm is one of the most suitable method of classification for its simplicity, adaptability and performance. The real problem arises when classes do overlap and when samples size is unevenly distributed between categories. Many studies present optimization techniques on discriminant metrics, on weighting the features, on using probabilistic measures or adjusting the prototypes position. Classes that are represented by a small sample size are overwhelmed by the large number of prototypes of dominated groups. In this paper we describe a method of weighting the prototypes for each class of the k nearest neighbors to cope with the uneven distribution of data. The proposed method increases the classification rate in terms of recall measure.
引用
收藏
页数:4
相关论文
共 10 条
  • [1] [Anonymous], 2014, Discovering Knowledge in Data, DOI [10.1002/9781118874059.CH7, DOI 10.1002/9781118874059.CH7]
  • [2] Boiculese LV, 2009, P ROMANIAN ACAD A, V10, P205
  • [3] Duda R. O., 2000, PATTERN CLASSIFICATI, P174
  • [4] A probabilistic approach for semi-supervised nearest neighbor classification
    Ghosh, Anil K.
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (09) : 1127 - 1133
  • [5] Learning weighted metrics to minimize nearest-neighbor classification error
    Paredes, R
    Vidal, E
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (07) : 1100 - 1110
  • [6] Learning prototypes and distances (LPD). A prototype reduction technique based on nearest neighbor error minimization
    Paredes, R
    Vidal, E
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 442 - 445
  • [7] Neighbor-weighted K-nearest neighbor for unbalanced text corpus
    Tan, SB
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) : 667 - 671
  • [8] Dimensionality reduction by minimizing nearest-neighbor classification error
    Villegas, Mauricio
    Paredes, Roberto
    [J]. PATTERN RECOGNITION LETTERS, 2011, 32 (04) : 633 - 639
  • [9] Neighborhood size selection in the k-nearest-neighbor rule using statistical confidence
    Wang, JG
    Neskovic, P
    Cooper, LN
    [J]. PATTERN RECOGNITION, 2006, 39 (03) : 417 - 423
  • [10] Improving nearest neighbor rule with a simple adaptive distance measure
    Wang, Jigang
    Neskovic, Predrag
    Cooper, Leon N.
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (02) : 207 - 213