A Two-Stage Methodology Using K-NN and False-Positive Minimizing ELM for Nominal Data Classification

被引:32
|
作者
Akusok, Anton [1 ]
Miche, Yoan [1 ]
Hegedus, Jozsef [1 ]
Nian, Rui [2 ]
Lendasse, Amaury [1 ,3 ,4 ]
机构
[1] Aalto Univ, Dept Informat & Comp Sci, Aalto 00076, Finland
[2] Ocean Univ China, Coll Informat & Engn, Qingdao 266003, Peoples R China
[3] Basque Fdn Sci, IKERBASQUE, Bilbao 48011, Spain
[4] Arcada Univ Appl Sci, Helsinki 00550, Finland
关键词
ELM; K-NN; Malware detection; False positives; EXTREME LEARNING-MACHINE; NETWORKS;
D O I
10.1007/s12559-014-9253-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on the problem of making decisions in the context of nominal data under specific constraints. The underlying goal driving the methodology proposed here is to build a decision-making model capable of classifying as many samples as possible while avoiding false positives at all costs, all within the smallest possible computational time. Under such constraints, one of the best type of model is the cognitive-inspired extreme learning machine (ELM), for the final decision process. A two-stage decision methodology using two types of classifiers, a distance-based one, K-NN, and the cognitive-based one, ELM, provides a fast means of obtaining a classification decision on a sample, keeping false positives as low as possible while classifying as many samples as possible (high coverage). The methodology only has two parameters, which, respectively, set the precision of the distance approximation and the final trade-off between false-positive rate and coverage. Experimental results using a specific dataset provided by F-Secure Corporation show that this methodology provides a rapid decision on new samples, with a direct control over the false positives and thus on the decision capabilities of the model.
引用
收藏
页码:432 / 445
页数:14
相关论文
共 5 条
  • [1] A Two-Stage Methodology Using K-NN and False-Positive Minimizing ELM for Nominal Data Classification
    Anton Akusok
    Yoan Miche
    Jozsef Hegedus
    Rui Nian
    Amaury Lendasse
    Cognitive Computation, 2014, 6 : 432 - 445
  • [2] Classification of File Data Based on Confidentiality in Cloud Computing using K-NN Classifier
    Zardari, Munwar Ali
    Jung, Low Tang
    INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2016, 3 (02) : 61 - 78
  • [3] K-NN DATA CLASSIFICATION TECHNIQUE USING SEMANTIC SEARCH ON ENCRYPTED RELATIONAL DATA BASE
    Uttarwar, Nikita
    Pradhan, M. A.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2016,
  • [4] Gene selection for enhanced classification on microarray data using a weighted k-NN based algorithm
    Ventura-Molina, Elias
    Alarcon-Paredes, Antonio
    Aldape-Perez, Mario
    Yanez-Marquez, Cornelio
    Adolfo Alonso, Gustavo
    INTELLIGENT DATA ANALYSIS, 2019, 23 (01) : 241 - 253
  • [5] Data Classification with k-NN using Novel Character Frequency-Direct Word Frequency (CF-DWF) Similarity Formula
    Zardari, Munwar Ali
    Jung, Low Tang
    2015 INTERNATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES AND COMPUTING RESEARCH (ISMSC), 2015, : 280 - 285