Hypergraph-based importance assessment for binary classification data

被引:1
作者
Misiorek, Pawel [1 ]
Janowski, Szymon [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, Piotrowo 3, PL-60965 Poznan, Poland
关键词
Hypergraphs; Machine learning; Imbalanced data; Random undersampling; Feature selection; GRAPH EDIT DISTANCE; COMPUTATION; ALGORITHM; NETWORK;
D O I
10.1007/s10115-022-01786-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel hypergraph-based framework enabling an assessment of the importance of binary classification data elements. Specifically, we apply the hypergraph model to rate data samples' and categorical feature values' relevance to classification labels. The proposed Hypergraph-based Importance ratings are theoretically grounded on the hypergraph cut conductance minimization concept. As a result of using hypergraph representation, which is a lossless representation from the perspective of higher-order relationships in data, our approach allows for more precise exploitation of the information on feature and sample coincidences. The solution was tested using two scenarios: undersampling for imbalanced classification data and feature selection. The experimentation results have proven the good quality of the new approach when compared with other state-of-the-art and baseline methods for both scenarios measured using the average precision evaluation metric.
引用
收藏
页码:1657 / 1683
页数:27
相关论文
共 50 条
  • [41] Spark based classification of microarray data using scalable artificial neural network
    Kumar, Mukesh
    Ray, Ransingh B.
    Rath, Santanu K.
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2017, 19 (04) : 312 - 339
  • [42] Rank Based Binary Particle Swarm Optimisation for Feature Selection in Classification
    Mafarja, Majdi
    Sabar, Nasser R.
    ICFNDS'18: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON FUTURE NETWORKS AND DISTRIBUTED SYSTEMS, 2018,
  • [43] A Binary Grey Wolf Optimization based Hybrid Convolutional Neural Network (BGWOHCNN) framework for hyperspectral image classification
    Kumar, Deepak
    Kumar, Dharmender
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10091 - 10114
  • [44] Imbalanced Data Classification Method Based on LSSASMOTE
    Wang, Zhi
    Liu, Qicheng
    IEEE ACCESS, 2023, 11 : 32252 - 32260
  • [45] A Hybrid Cancer Classification Model Based Recursive Binary Gravitational Search Algorithm in Microarray Data
    Han, Xiao Hong
    Li, Deng Ao
    Wang, Li
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY [ICICT-2019], 2019, 154 : 274 - 282
  • [46] Detection of Cyberattacks in Industrial Control Systems Using Enhanced Principal Component Analysis and Hypergraph-Based Convolution Neural Network (EPCA-HG-CNN)
    Priyanga, S.
    Krithivasan, Kannan
    Pravinraj, S.
    Sriram, Shankar V. S.
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2020, 56 (04) : 4394 - 4404
  • [47] A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
    Kalyvas, Christos
    Maragoudakis, Manolis
    COMPUTATION, 2020, 8 (03)
  • [48] Development of Hypergraph Based Improved Random Forest Algorithm for Partial Discharge Pattern Classification
    Govindarajan, Suganya
    Ardila-Rey, Jorge Alfredo
    Krithivasan, Kannan
    Subbaiah, Jayalalitha
    Sannidhi, Nikhith
    Balasubramanian, M.
    IEEE ACCESS, 2021, 9 : 96 - 109
  • [49] Mutation-based Binary Aquila optimizer for gene selection in cancer classification
    Pashaei, Elham
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 101
  • [50] A Classification Method Based on Feature Selection for Imbalanced Data
    Liu, Yi
    Wang, Yanzhen
    Ren, Xiaoguang
    Zhou, Hao
    Diao, Xingchun
    IEEE ACCESS, 2019, 7 : 81794 - 81807