Subsampling oriented active learning method for multi-category classification problem

被引:0
作者
Shi W. [1 ]
Huang H. [1 ]
Feng Y. [1 ]
Liu Z. [1 ]
机构
[1] College of Systems Engineering, National University of Defense Technology, Changsha
来源
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics | 2021年 / 43卷 / 03期
关键词
Active learning; Multi-category classification problem; Subsampling; Unsupervised clustering;
D O I
10.12305/j.issn.1001-506X.2021.03.13
中图分类号
学科分类号
摘要
Because the computational amount of the traditional active learning method increases exponentially with the increase of problem size, it is difficult to apply to the large-scale multi-category data classification tasks. To solve this problem, a subsampling-based active learning (SBAL) algorithm is designed. This algorithm integrates unsupervised clustering algorithm with traditional active learning method, and adds subsampling operation between them. This operation can significantly reduce the time complexity of the algorithm, reduce the experimental time-consuming on the basis of ensuring the accuracy of the experiment, so as to deal with the classification problem of large-scale data sets more efficiently. The experimental results show that the experimental performance of the SBAL algorithm is better than that of the traditional active learning algorithm, which proves that the proposed method can break through the limitation that the traditional active learning method can not deal with multi-category classification of large-scale data sets. © 2021, Editorial Office of Systems Engineering and Electronics. All right reserved.
引用
收藏
页码:700 / 708
页数:8
相关论文
共 32 条
[1]  
FENG Y T, ZHANG H J, HAO W N., Named entity recognition for military text, Computer Science, 42, 7, pp. 15-18, (2015)
[2]  
CHENG G, YANG C Y, YAO X W, Et al., When deep learning meets metric learning: remote sensing image scene classification via learning discriminative CNNs, IEEE Trans. on Geoence and Remote Sensing, 56, 5, pp. 2811-2821, (2018)
[3]  
WANG G Y, HE X., A self-learning model under uncertain condition, Journal of Software, 14, 6, pp. 68-74, (2003)
[4]  
YANG Y, MA Z G, NIE F P, Et al., Multi-class active learning by uncertainty sampling with diversity maximization, International Journal of Computer Vision, 113, 2, pp. 113-127, (2015)
[5]  
JENNIFER V, ALDEA E., Evidential query-by-committee active learning for pedestrian detection in high-density crowds, International Journal of Approximate Reasoning, 104, 1, pp. 166-184, (2019)
[6]  
HAO S., Query by diverse committee in transfer active learning, Frontiers of Computer Science, 13, 2, pp. 280-291, (2019)
[7]  
WU W N, LIU Y, GUO M Z, Et al., Advances in active learning algorithms based on sampling strategy, Journal of Computer Research and Development, 49, 6, pp. 1162-1173, (2012)
[8]  
TOMASZ G, MACIEJ L., Stacked regression with a generalization of the moore-penrose pseudoinverse, Statistics in Transition New Series, 18, 3, pp. 443-458, (2017)
[9]  
MENG Q, WANG Y, CHEN W, Et al., Generalization error bounds for optimization algorithms via stability, Proc. of the 31st AAAI Conference on Artificial Intelligence, pp. 2336-2342, (2017)
[10]  
ZHANG C, CHEN T., An active learning framework for content-based information retrieval, IEEE Trans. on Multimedia, 4, 2, pp. 260-268, (2002)