Robust Positive-Unlabeled Learning via Noise Negative Sample Self-correction
被引:2
|
作者:
Zhu, Zhangchi
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai, Peoples R China
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Zhu, Zhangchi
[1
,2
]
Wang, Lu
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Wang, Lu
[2
]
Zhao, Pu
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Zhao, Pu
[2
]
Du, Chao
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Du, Chao
[2
]
Zhang, Wei
论文数: 0引用数: 0
h-index: 0
机构:
East China Normal Univ, Shanghai, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Zhang, Wei
[1
]
Dong, Hang
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Dong, Hang
[2
]
Qiao, Bo
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Qiao, Bo
[2
]
Lin, Qingwei
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Lin, Qingwei
[2
]
Rajmohan, Saravan
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft 365, Seattle, WA USAEast China Normal Univ, Shanghai, Peoples R China
Rajmohan, Saravan
[3
]
Zhang, Dongmei
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Beijing, Peoples R ChinaEast China Normal Univ, Shanghai, Peoples R China
Zhang, Dongmei
[2
]
机构:
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
[3] Microsoft 365, Seattle, WA USA
来源:
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023
|
2023年
基金:
中国国家自然科学基金;
关键词:
positive-unlabeled learning;
curriculum learning;
D O I:
10.1145/3580305.3599491
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Learning from positive and unlabeled data is known as positive-unlabeled (PU) learning in literature and has attracted much attention in recent years. One common approach in PU learning is to sample a set of pseudo-negatives from the unlabeled data using ad-hoc thresholds so that conventional supervised methods can be applied with both positive and negative samples. Owing to the label uncertainty among the unlabeled data, errors of misclassifying unlabeled positive samples as negative samples inevitably appear and may even accumulate during the training processes. Those errors often lead to performance degradation and model instability. To mitigate the impact of label uncertainty and improve the robustness of learning with positive and unlabeled data, we propose a new robust PU learning method with a training strategy motivated by the nature of human learning: easy cases should be learned first. Similar intuition has been utilized in curriculum learning to only use easier cases in the early stage of training before introducing more complex cases. Specifically, we utilize a novel "hardness" measure to distinguish unlabeled samples with a high chance of being negative from unlabeled samples with large label noise. An iterative training strategy is then implemented to fine-tune the selection of negative samples during the training process in an iterative manner to include more "easy" samples in the early stage of training. Extensive experimental validations over a wide range of learning tasks show that this approach can effectively improve the accuracy and stability of learning with positive and unlabeled data. Our code is available at https://github.com/woriazzc/Robust-PU.
机构:
Hebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Zhang, Zhiqiang
Wang, Gongwen
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Geosci, Sch Earth Sci & Resources, Beijing 100083, Peoples R China
China Univ Geosci, MNR Key Lab Explorat Theory & Technol Crit Minera, Beijing 100083, Peoples R China
Beijing Key Lab Land & Resources Informat Res & D, Beijing 100083, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Wang, Gongwen
Carranza, Emmanuel John M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Free State, Fac Nat & Agr Sci, Dept Geol, Bloemfontein, South AfricaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Carranza, Emmanuel John M.
Fan, Junjie
论文数: 0引用数: 0
h-index: 0
机构:
China Geol Survey, Geophys Survey Ctr, Langfang 065000, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Fan, Junjie
Liu, Xinxing
论文数: 0引用数: 0
h-index: 0
机构:
Hebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Hebei GEO Univ, Sch Earth Sci, Shijiazhuang 050031, Hebei, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Liu, Xinxing
Zhang, Xiang
论文数: 0引用数: 0
h-index: 0
机构:
China Geol Survey, Geophys Survey Ctr, Langfang 065000, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Zhang, Xiang
Dong, Yulong
论文数: 0引用数: 0
h-index: 0
机构:
China Geol Survey, Geophys Survey Ctr, Langfang 065000, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Dong, Yulong
Chang, XiaoPeng
论文数: 0引用数: 0
h-index: 0
机构:
China Geol Survey, Geophys Survey Ctr, Langfang 065000, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China
Chang, XiaoPeng
Sha, Deming
论文数: 0引用数: 0
h-index: 0
机构:
China Geol Survey, Shenyang 110000, Peoples R ChinaHebei GEO Univ, Hebei Key Lab Strateg Crit Mineral Resources, Shijiazhuang 050031, Hebei, Peoples R China