Hypergraph-based importance assessment for binary classification data

Cited by: 1
Authors
Misiorek, Pawel [1]
Janowski, Szymon [1]
Affiliations
[1] Poznan Univ Tech, Inst Comp Sci, Piotrowo 3, PL-60965 Poznan, Poland
Keywords
Hypergraphs; Machine learning; Imbalanced data; Random undersampling; Feature selection; Graph edit distance; Computation; Algorithm; Network
DOI
10.1007/s10115-022-01786-2
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel hypergraph-based framework for assessing the importance of elements of binary classification data. Specifically, we apply the hypergraph model to rate the relevance of data samples and categorical feature values to classification labels. The proposed Hypergraph-based Importance ratings are theoretically grounded in the concept of hypergraph cut conductance minimization. Because the hypergraph representation is lossless with respect to higher-order relationships in data, our approach allows more precise exploitation of the information on feature and sample coincidences. The solution was tested in two scenarios: undersampling for imbalanced classification data and feature selection. In both scenarios, the experimental results, measured with the average precision metric, show that the new approach performs well compared with other state-of-the-art and baseline methods.
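To make the notion of hypergraph cut conductance concrete, the following is a minimal sketch, not the authors' implementation. It assumes one common construction: samples are vertices, each (feature, value) pair is an unweighted hyperedge containing the samples that share that value, a hyperedge is cut when it has members on both sides of the partition, and conductance is the cut size divided by the smaller side's volume (sum of vertex degrees). The function names `build_hyperedges` and `cut_conductance` and the toy data are hypothetical; the exact definition used in the paper may differ.

```python
from collections import defaultdict


def build_hyperedges(rows):
    """Map each (feature index, value) pair to the set of sample indices sharing it."""
    edges = defaultdict(set)
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            edges[(j, value)].add(i)
    return dict(edges)


def cut_conductance(edges, subset, n_vertices):
    """Conductance of the cut (subset, complement) under the assumptions above."""
    subset = set(subset)
    complement = set(range(n_vertices)) - subset

    # A hyperedge is cut if it has at least one vertex on each side.
    cut = sum(
        1 for members in edges.values()
        if members & subset and members & complement
    )

    # Vertex degree = number of hyperedges containing the vertex.
    degree = defaultdict(int)
    for members in edges.values():
        for v in members:
            degree[v] += 1

    vol_s = sum(degree[v] for v in subset)
    vol_c = sum(degree[v] for v in complement)
    denom = min(vol_s, vol_c)
    return cut / denom if denom else float("inf")


# Toy categorical data: feature 0 (colour) matches the label, feature 1 (size) does not.
rows = [("red", "small"), ("red", "large"), ("blue", "small"), ("blue", "large")]
labels = [1, 1, 0, 0]

edges = build_hyperedges(rows)
positives = [i for i, y in enumerate(labels) if y == 1]
phi = cut_conductance(edges, positives, len(rows))
print(phi)
```

In this toy case the two colour hyperedges lie entirely on one side of the label cut, while both size hyperedges cross it, so only the size feature contributes to the cut. Lower conductance for the label-induced cut indicates that the hyperedges align more closely with the class labels.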
Pages: 1657-1683
Number of pages: 27