Multi-Label Feature Selection with Feature-Label Subgraph Association and Graph Representation Learning

Citations: 0
Authors
Ruan, Jinghou [1]
Wang, Mingwei [1]
Liu, Deqing [1]
Chen, Maolin [2]
Gao, Xianjun [3]
Affiliations
[1] Hubei Univ Technol, Sch Comp Sci, Wuhan 430068, Peoples R China
[2] Chongqing Jiaotong Univ, Sch Smart City, Chongqing 400074, Peoples R China
[3] Yangtze Univ, Sch Geosci, Wuhan 430100, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
multi-label data; feature selection; feature-label subgraph association; graph representation learning; optimal feature subset; CLASSIFICATION; OPTIMIZATION; ALGORITHM;
DOI
10.3390/e26110992
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
In multi-label data, each sample is associated with several labels simultaneously. The high-dimensional feature space, together with the interdependence and imbalanced distribution of labels, raises computational complexity and makes feature selection challenging. To address this, a multi-label feature selection method based on feature-label subgraph association with graph representation learning (SAGRL) is proposed to represent the complex correlations among features and labels, in particular the relationships between features and labels. Specifically, features and labels are mapped to nodes in a graph structure, and connections between nodes are established to form feature sets and label sets, respectively, which increases intra-class correlation and decreases inter-class correlation. Feature-label subgraphs are then constructed from the feature and label sets to provide abundant feature combinations. The relationships among the subgraphs are adjusted by graph representation learning, the crucial features for the different label sets are selected, and the optimal feature subset is obtained by ranking. Experiments on 11 datasets with six evaluation metrics show that the proposed method outperforms several state-of-the-art multi-label feature selection methods.
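The pipeline outlined in the abstract can be illustrated with a small sketch. This is not the authors' SAGRL implementation: the abstract does not specify how edges, sets, or scores are computed, so everything below is an assumption made for illustration. Absolute Pearson correlation stands in for the edge weights, connected components of a thresholded graph stand in for the feature and label sets, and a plain mean-relevance score replaces the graph representation learning step entirely. The function names (`abs_corr`, `connected_groups`, `rank_features`) and the thresholds `feat_thr` and `label_thr` are hypothetical.

```python
import numpy as np


def abs_corr(A, B):
    """Absolute Pearson correlation between the columns of A and the columns of B."""
    A = (A - A.mean(0)) / (A.std(0) + 1e-12)
    B = (B - B.mean(0)) / (B.std(0) + 1e-12)
    return np.abs(A.T @ B) / A.shape[0]


def connected_groups(adj):
    """Connected components of a boolean adjacency matrix, as sorted index lists."""
    n = adj.shape[0]
    seen = [False] * n
    groups = []
    for start in range(n):
        if seen[start]:
            continue
        seen[start] = True
        stack, comp = [start], []
        while stack:
            u = stack.pop()
            comp.append(u)
            for v in np.flatnonzero(adj[u]):
                if not seen[v]:
                    seen[v] = True
                    stack.append(int(v))
        groups.append(sorted(comp))
    return groups


def rank_features(X, Y, feat_thr=0.7, label_thr=0.3):
    """Rank features (best first) from feature-label blocks.

    Assumed stand-in for the method in the abstract:
    1. nodes = features and labels; edges = thresholded |correlation|;
    2. feature sets / label sets = connected components of those graphs;
    3. each (feature set, label set) pair is treated as one subgraph;
    4. the most relevant feature of each subgraph collects its relevance;
    5. features are ranked by their accumulated relevance.
    """
    feature_sets = connected_groups(abs_corr(X, X) >= feat_thr)
    label_sets = connected_groups(abs_corr(Y, Y) >= label_thr)
    fl = abs_corr(X, Y)  # feature-label association, shape (d, q)

    scores = np.zeros(X.shape[1])
    for ls in label_sets:
        for fs in feature_sets:
            # Mean association of each feature in the set with this label set.
            rel = fl[np.ix_(fs, ls)].mean(axis=1)
            best = fs[int(np.argmax(rel))]
            scores[best] += rel.max()
    return np.argsort(-scores)
```

Calling `rank_features(X, Y)` with a samples-by-features matrix `X` and a samples-by-labels binary matrix `Y` returns feature indices ordered from most to least relevant; taking the first `k` entries gives a candidate subset. Keeping only one representative per correlated feature set is a crude way to suppress redundancy, chosen here for brevity rather than taken from the paper.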
Pages: 24