Active partial label learning based on adaptive sample selection

被引：4

作者：

Li, Yan ^{[1
,2
,3
]}

Liu, Chang ^{[2
,3
]}

Zhao, Suyun ^{[4
,5
]}

Hua, Qiang ^{[2
,3
]}

机构：

[1] Beijing Normal Univ Zhuhai, Res Ctr Appl Math & Interdisciplinary Sci, Zhuhai 519087, Peoples R China

[2] Hebei Univ, Key Lab Machine Learning & Computat Intelligence, Baoding 071002, Peoples R China

[3] Hebei Univ, Coll Math & Informat Sci, Baoding 071002, Peoples R China

[4] Renmin Univ China, Key Lab Data Engn & Knowledge Engn MOE, Beijing, Peoples R China

[5] Renmin Univ China, Sch Informat, Beijing, Peoples R China

来源：

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS | 2022年 / 13卷 / 06期

关键词：

Partial label learning; Active learning; Sample selection strategy; Label transfer ability;

D O I：

10.1007/s13042-021-01470-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Partial label learning is a type of weak supervised learning which uses samples with candidate label sets to train a classifier. Most of the related researches assume that there are a lot of available training samples with partial labels in advance, that is to assume that the candidate label set is easy to obtain. In many practical problems, however, there are still a large number of unlabeled samples, and obtaining their partial labels is costly. In this paper, we consider using a small number of partially labeled samples and a large number of unlabeled samples to form the training set, and propose a partial label learning method based on active learning mechanism to construct an effective classifier. Firstly, the weak supervised information in candidate label set is used to determine the possible labels of the partially labeled samples by using iterative label transfer process; then an adaptive sample selection strategy in active learning framework is proposed to comprehensively measure the labeling value of each unlabeled sample based on its uncertainty, graph density and label transfer ability, and the most valuable samples are selected from unlabeled sample set for manual labeling. Finally, the labeled samples are used to re-optimize the existing partially labeled samples, and the final classifier is trained. The experimental results on some benchmark datasets show that the proposed active partial label learning method has higher classification accuracy than the representative similar methods, and only needs to label a small number of samples to achieve stable performance.

引用

页码：1603 / 1617

页数：15

共 33 条

[1] Partial Label Dimensionality Reduction via Confidence-Based Dependence Maximization
Bao, Wei-Xuan
Hang, Jun-Yi
Zhang, Min-Ling
[J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 46 - 54
[2] Briggs Forrest, 2012, P 18 ACM SIGKDD INT, P534, DOI 10.1145/2339530.2339616
[3] Ambiguously Labeled Learning Using Dictionaries
Chen, Yi-Chen
Patel, Vishal M.
Chellappa, Rama
Phillips, P. Jonathon
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (12) : 2076 - 2088
[4] Cour T, 2011, J MACH LEARN RES, V12, P1501
[5] Dong HC, 2018, AAAI CONF ARTIF INTE, P2926
[6] Du P., 2021, P IEEE CVF INT C COM, P8927
[7] EBERT S, 2012, PROC CVPR IEEE, P3626
[8] Feng L, 2019, P 28 INT JOINT C ART
[9] Freytag A, 2014, LECT NOTES COMPUT SC, V8692, P562, DOI 10.1007/978-3-319-10593-2_37
[10] Guillaumin M, 2010, LECT NOTES COMPUT SC, V6311, P634, DOI 10.1007/978-3-642-15549-9_46

← 1 2 3 4 →