Locally Adaptive Text Classification based k-nearest Neighbors

被引:0
|
作者
Yu, Xiao-gao [1 ]
Yu, Xiao-peng [2 ]
机构
[1] Hubei Univ Econ, Dept Informat Management, Wuhan, Peoples R China
[2] Wuhan Inst Technol, Dept Econ Management, Wuhan, Peoples R China
关键词
K-nearest neighbor; kernel density estimation; sum-of-squared-error criterion; text classification;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Due to the exponential growth of documents on the Internet and the emergent need to organize them, the automated categorization of documents into predefined labels has received an ever-increased attention in the recent years. Among all these classifiers, K-Nearest Neighbors (KNNC) is a widely used classifier in text categorization community because of its simplicity and efficiency. However, KNNC still suffers from inductive biases or model misfits that result from its assumptions, such as the presumption that training data are evenly distributed among all categories. In this paper, we propose a new refinement strategy (LAKNNC) for the KNN Classifier, which adopts sum-of-squared-error criterion to adaptively select the contributing part from these neighbors and classifies the input document in term of the disturbance degree which it brings to the kernel densities of these selected neighbors. The experimental results indicate that our algorithm LAKNNC is not sensitive to the parameter k and achieves significant classification performance improvement on imbalanced corpora.
引用
收藏
页码:5651 / +
页数:2
相关论文
共 50 条
  • [1] Classification with learning k-nearest neighbors
    Laaksonen, J
    Oja, E
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1480 - 1483
  • [2] Adaptive condensed fuzzy monotonic K-nearest neighbors for monotonic classification
    Chen, Jiankai
    Li, Zhongyan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [3] Adaptive Global k-Nearest Neighbors for Hierarchical Classification of Data Streams
    Tieppo, Eduardo
    Barddal, Jean Paul
    Nievola, Julio Cesar
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 631 - 636
  • [4] The research on an adaptive k-nearest neighbors classifier
    Yu, Xiao-Gao
    Yu, Xiao-Peng
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1241 - 1246
  • [5] AutoML for Stream k-Nearest Neighbors Classification
    Bahri, Maroua
    Veloso, Bruno
    Bifet, Albert
    Gama, Joao
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 597 - 602
  • [6] The research on an adaptive k-nearest neighbors classifier
    Yu, Xiaopeng
    Yu, Xiaogao
    PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 535 - 540
  • [7] Novel text classification based on K-nearest neighbor
    Yu, Xiao-Peng
    Yu, Xiao-Gao
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3425 - +
  • [8] Adaptive geometric median prototype selection method for k-nearest neighbors classification
    Kasemtaweechok, Chatchai
    Suwannik, Worasait
    INTELLIGENT DATA ANALYSIS, 2019, 23 (04) : 855 - 876
  • [9] A new locally adaptive K-nearest centroid neighbor classification based on the average distance
    Wang, Benqiang
    Zhang, Shunxiang
    CONNECTION SCIENCE, 2022, 34 (01) : 2084 - 2107
  • [10] Machine learning classification based on k-Nearest Neighbors for PolSAR data
    Ferreira, Jodavid A.
    Rodrigues, Anny K. G.
    Ospina, Raydonal
    Gomez, Luis
    ANAIS DA ACADEMIA BRASILEIRA DE CIENCIAS, 2024, 96 (01):