Partial label feature selection via label disambiguation and neighborhood mutual information

被引:2
作者
Ding, Jinfei [1 ]
Qian, Wenbin [1 ,2 ]
Li, Yihui [1 ]
Yang, Wenji [2 ]
Huang, Jintao [3 ]
机构
[1] Jiangxi Agr Univ, Sch Comp & Informat Engn, Nanchang 330045, Peoples R China
[2] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Peoples R China
[3] Hong Kong Baptist Univ, Dept Comp Sci, Hong Kong 266237, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Granular computing; Label disambiguation; Partail label learning; Neighborhood mutual information;
D O I
10.1016/j.ins.2024.121163
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Partial label learning aims to learn from training instances, each of which is associated with a set of candidate labels but only one is a ground-truth label. Feature selection is an effective method to improve the generalization capability of the learning model; however, partial label feature selection work is exceptionally challenging due to the limitation and ambiguity of label information. Therefore, this paper proposes a partial label feature selection algorithm based on label disambiguation and neighborhood mutual information. Firstly, neighborhood granularity is utilized to determine the neighborhoods of instances to disambiguate the candidate labels. Secondly, based on label confidence induced by disambiguation, feature relevance and redundancy are measured by neighborhood mutual information, which avoids the negative impact of data discretization on feature selection and directly handles continuous features. Concurrently, the kappa coefficient is employed to estimate the label consistency for describing the influences of feature changes on the label space. Then, the significance of each feature is evaluated by fusing feature relevance, feature redundancy, and label consistency. Finally, the effectiveness of the proposed algorithm is verified by comparing the proposed algorithm with four base classifiers and other feature selection methods. Furthermore, the feasibility of the proposed disambiguation method is demonstrated through comparison with four state-of-the-art disambiguation methods.
引用
收藏
页数:23
相关论文
共 40 条
[1]   Submodular Feature Selection for Partial Label Learning [J].
Bao, Wei-Xuan ;
Hang, Jun-Yi ;
Zhang, Min-Ling .
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, :26-34
[2]   Partial Label Dimensionality Reduction via Confidence-Based Dependence Maximization [J].
Bao, Wei-Xuan ;
Hang, Jun-Yi ;
Zhang, Min-Ling .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :46-54
[3]   Dimensionality Reduction for Categorical Data [J].
Bera, Debajyoti ;
Pratap, Rameshwar ;
Verma, Bhisham Dev .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) :3658-3671
[4]   Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach [J].
Briggs, Forrest ;
Lakshminarayanan, Balaji ;
Neal, Lawrence ;
Fern, Xiaoli Z. ;
Raich, Raviv ;
Hadley, Sarah J. K. ;
Hadley, Adam S. ;
Betts, Matthew G. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (06) :4640-4650
[5]   Rough set-based feature selection for weakly labeled data [J].
Campagner, Andrea ;
Ciucci, Davide ;
Huellermeier, Eyke .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2021, 136 :150-167
[6]   Secure Detection of Image Manipulation by Means of Random Feature Selection [J].
Chen, Zhipeng ;
Tondi, Benedetta ;
Li, Xiaolong ;
Ni, Rongrong ;
Zhao, Yao ;
Barni, Mauro .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (09) :2454-2469
[7]  
Cour T, 2011, J MACH LEARN RES, V12, P1501
[8]   Partial Label Learning via GANs With Multiclass SVMs and Information Maximization [J].
Fan, Jinfu ;
Wang, Zhongjie .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) :8409-8421
[9]  
Feng L, 2020, 37 INT C MACH LEAR F
[10]   Measuring relevance between discrete and continuous features based on neighborhood mutual information [J].
Hu, Qinghua ;
Zhang, Lei ;
Zhang, David ;
Pan, Wei ;
An, Shuang ;
Pedrycz, Witold .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (09) :10737-10750