Active learning for regression using greedy sampling

被引:112
作者
Wu, Dongrui [1 ]
Lin, Chin-Teng [2 ]
Huang, Jian [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Key Lab, Minist Educ Image Proc & Intelligent Control, Wuhan 430074, Hubei, Peoples R China
[2] Univ Technol, Fac Engn & Informat Technol, Sydney, NSW, Australia
基金
中国国家自然科学基金;
关键词
Active learning; Regression; Greedy sampling; Driver drowsiness estimation; MULTIPLE COMPARISONS; REGULARIZATION; SELECTION;
D O I
10.1016/j.ins.2018.09.060
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Regression problems are pervasive in real-world applications. Generally a substantial amount of labeled samples are needed to build a regression model with good generalization ability. However, many times it is relatively easy to collect a large number of unlabeled samples, but time-consuming or expensive to label them. Active learning for regression (ALR) is a methodology to reduce the number of labeled samples, by selecting the most beneficial ones to label, instead of random selection. This paper proposes two new ALR approaches based on greedy sampling (GS). The first approach (GSy) selects new samples to increase the diversity in the output space, and the second (iGS) selects new samples to increase the diversity in both input and output spaces. Extensive experiments on 10 UCI and CMU StatLib datasets from various domains, and on 15 subjects on EEG-based driver drowsiness estimation, verified their effectiveness and robustness. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:90 / 105
页数:16
相关论文
共 34 条
[1]  
[Anonymous], P IEEE 13 INT C DAT
[2]  
[Anonymous], 2006, IEEE T NEURAL NETWOR
[3]  
[Anonymous], P IEEE INT C SYST MA
[4]  
[Anonymous], 1648 U WISC MAD
[5]  
[Anonymous], IEEE T NEURAL NETW L
[6]  
[Anonymous], 2010, P INTERSPEECH
[7]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[8]  
Bradley MM, 2007, Tech Rep, pB
[9]  
Bujrbidge R, 2007, LECT NOTES COMPUT SC, V4881, P209
[10]   Batch Mode Active Learning for Regression With Expected Model Change [J].
Cai, Wenbin ;
Zhang, Muhan ;
Zhang, Ya .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (07) :1668-1681