Efficient feature selection for logical analysis of large-scale multi-class datasets

被引:3
|
作者
Yan, Kedong [1 ]
Miao, Dongjing [2 ]
Guo, Cui [3 ]
Huang, Chanying [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, 200 Xiaolingwei, Nanjing 210094, Peoples R China
[2] Harbin Inst Technol, Fac Comp, 92 Xidazhi, Harbin 150001, Peoples R China
[3] Shantou Univ, Business Sch, 243 Daxue Rd, Shantou 515063, Peoples R China
基金
中国国家自然科学基金;
关键词
Logical Analysis of Data; Supervised Learning; Feature Selection; Multi-classification; Set Covering;
D O I
10.1007/s10878-021-00732-2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Feature selection in logical analysis of data (LAD) can be cast into a set covering problem. In this paper, extending the results on feature selection for binary classification using LAD, we present a mathematical model that selects a minimum set of necessary features for multi-class datasets and develop a heuristic algorithm that is both memory and time efficient for this model correspondingly. The utility of the algorithm is illustrated on a small example and the superiority of our work is demonstrated through experiments on 6 real-life multi-class datasets from UCI repository.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 50 条
  • [1] Efficient feature selection for logical analysis of large-scale multi-class datasets
    Kedong Yan
    Dongjing Miao
    Cui Guo
    Chanying Huang
    Journal of Combinatorial Optimization, 2021, 42 : 1 - 23
  • [2] A RANDOMIZED HEURISTIC FOR KERNEL PARAMETER SELECTION WITH LARGE-SCALE MULTI-CLASS DATA
    Hansen, Toke Jansen
    Abrahamsen, Trine Julie
    Hansen, Lars Kai
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [3] Efficient large-scale multi-class image classification by learning balanced trees
    Tien-Dung Mai
    Thanh Duc Ngo
    Duy-Dinh Le
    Duc Anh Duong
    Kiem Hoang
    Satoh, Shin'ichi
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 156 : 151 - 161
  • [4] Incremental Parallel Support Vector Machines for Classifying Large-Scale Multi-class Image Datasets
    Thanh-Nghi Do
    Tran-Nguyen, Minh-Thu
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2016, 2016, 10018 : 20 - 39
  • [5] Consistent Matrix: A Feature Selection Framework for Large-Scale Datasets
    Yang, Tian
    Li, Yuan-Jiang
    Qian, Yuhua
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2023, 31 (11) : 4024 - 4038
  • [6] Latent-lSVM classification of very high-dimensional and large-scale multi-class datasets
    Thanh-Nghi Do
    Poulet, Francois
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (02):
  • [7] Logical Analysis of Multi-Class Data
    Felix Avila-Herrera, Juan
    Subasi, Munevver Mine
    2015 XLI LATIN AMERICAN COMPUTING CONFERENCE (CLEI), 2015, : 276 - 285
  • [8] Efficient Algorithms for Feature Selection in Multi-class Support Vector Machine
    Hoai An Le Thi
    Manh Cuong Nguyen
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, 2013, 479 : 41 - 52
  • [9] Model for the recognition of large-scale multi-class diseases and pests
    Wen C.
    Wang Q.
    Chen H.
    Wu J.
    Ni J.
    Yang C.
    Su H.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (08): : 169 - 177
  • [10] Design and implementation of a large-scale multi-class text classifier
    于水
    张亮
    马范援
    Journal of Harbin Institute of Technology, 2005, (06) : 690 - 695