Sample diversity selection strategy based on label distribution morphology for active label distribution learning

被引：2

作者：

Li, Weiwei ^{[1
]}

Qian, Wei ^{[1
]}

Chen, Lei ^{[2
]}

Jia, Xiuyi ^{[3
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210016, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Sch Comp Sci, Nanjing 210023, Peoples R China

[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

来源：

PATTERN RECOGNITION | 2024年 / 150卷

基金：

中国国家自然科学基金;

关键词：

Label distribution learning; Active learning; Representativeness; Diversity; Label distribution morphology;

D O I：

10.1016/j.patcog.2024.110322

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Labeling a sample in label distribution learning is highly expensive because it involves several labels at the same time and also requires assigning an exact value as the significance of each label. Therefore, active learning, which lowers the labeling cost by actively querying the labels of the most useful data, becomes especially critical for label distribution learning. Most of the known active query algorithms are for multilabel learning, and applying them directly to label distribution learning will lose some key supervisory information and thus fail to yield satisfactory experimental results. In this paper, we propose a sample diversity selection strategy based on the label distribution morphology, which can select diverse samples with different distribution morphologies from a large number of unlabeled samples for active querying. First, we use the feature space to construct a dissimilarity matrix that describes the pairwise dissimilarity among the unlabeled samples in order to pick a subset of samples that are representative of the unlabeled dataset. Second, using the information about the label distribution morphologies provided by the predicted labels of the unlabeled samples, we design a diversity loss score for each unlabeled sample. This score reflects the degree of difference between the sample and the labeled training sample. Finally, we use a convex optimization method to select valuable samples that are diverse from the labeled samples and represent the distribution of the unlabeled samples. The results of the comparison experiments demonstrate the effectiveness of our approach.

引用

页数：14

共 40 条

[1] Label-dependent feature exploration for label distribution learning [J].

Bai, Run-Ting ;

Zhang, Heng-Ru ;

Min, Fan .

INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (11) :3685-3704

[2] Distributed optimization and statistical learning via the alternating direction method of multipliers [J].

Boyd S. ;

Parikh N. ;

Chu E. ;

Peleato B. ;

Eckstein J. .

Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122

[3] Manifold Adaptive Experimental Design for Text Categorization [J].

Cai, Deng ;

He, Xiaofei .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (04) :707-719

[4]

Dasgupta S., 2008, P 25 INT C MACHINE L, P208

[5] Active label distribution learning via kernel maximum mean discrepancy [J].

Dong, Xinyue ;

Luo, Tingjin ;

Fan, Ruidong ;

Zhuge, Wenzhang ;

Hou, Chenping .

FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)

[6] Active label distribution learning [J].

Dong, Xinyue ;

Gu, Shilin ;

Zhuge, Wenzhang ;

Luo, Tingjin ;

Hou, Chenping .

NEUROCOMPUTING, 2021, 436 :12-21

[7] Robust and Discriminative Labeling for Multi-Label Active Learning Based on Maximum Correntropy Criterion [J].

Du, Bo ;

Wang, Zengmao ;

Zhang, Lefei ;

Zhang, Liangpei ;

Tao, Dacheng .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) :1694-1707

[8] Cluster analysis and display of genome-wide expression patterns [J].

Eisen, MB ;

Spellman, PT ;

Brown, PO ;

Botstein, D .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :14863-14868

[9] Dissimilarity-Based Sparse Subset Selection [J].

Elhamifar, Ehsan ;

Sapiro, Guillermo ;

Sastry, S. Shankar .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (11) :2182-2197

[10] A Convex Optimization Framework for Active Learning [J].

Elhamifar, Ehsan ;

Sapiro, Guillermo ;

Yang, Allen ;

Sastry, S. Shankar .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :209-216

← 1 2 3 4 →