Distributed Active Learning

Cited by: 10
Authors
Shen, Pengcheng
Li, Chunguang [1]
Zhang, Zhaoyang
Affiliations
[1] Zhejiang Univ, Zhejiang Prov Key Lab Informat Proc Commun & Netw, Hangzhou 310027, Zhejiang, Peoples R China
Source
IEEE ACCESS | 2016 / Vol. 4
Keywords
Active learning; uncertainty sampling; representative sampling; diversity; multi-class logistic regression; CENSORED REGRESSION; LEAST-SQUARES; CONVERGENCE;
DOI
10.1109/ACCESS.2016.2572198
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Active learning aims to obtain high-accuracy models with as few labeled data as possible by iteratively and carefully selecting the most valuable data for label queries during the learning process, thereby reducing the cost of labeling. Most previous active learning approaches assume centralized processing, where all the unlabeled data are gathered in one place. With the development of distributed applications, distributed processing has attracted considerable interest for settings where data are distributed across different nodes of a network. In this paper, we focus on distributed active learning (DAL) for the classification problem. We propose a fully decentralized active learning approach that consists of two parts: a distributed sample selection strategy and a distributed classification algorithm. The former helps nodes cooperatively select data based on the uncertainty, diversity, and representativeness of the data. By introducing a randomized preselection method into the strategy, we achieve diversity of the selected data without any information exchange among nodes. The latter helps each node train its local multi-class classification model in a global sense without transmitting original data among nodes. We demonstrate the effectiveness of the proposed DAL approach on several real data sets. Simulation results show that the proposed approach can significantly reduce the number of labeled data needed to obtain a high-accuracy classifier in the distributed case.
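The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that each node draws a random candidate subset of its own unlabeled pool (the "randomized preselection") and then queries the candidate with the highest predictive entropy under a multi-class logistic regression model. The function names, the entropy criterion, and the toy data are all assumptions made for illustration, and the representativeness term and the distributed training step are omitted.

```python
import math
import random


def softmax(scores):
    """Turn raw class scores into probabilities (numerically stable)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy of a class distribution; higher means less certain."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def predict_proba(weights, x):
    """Multi-class logistic regression with one weight vector per class."""
    scores = [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in weights]
    return softmax(scores)


def select_query(weights, pool, preselect_size, rng):
    """Return the index of one local sample to send for labeling.

    Step 1 (assumed): draw a random candidate subset of the local pool,
    which needs no communication with other nodes.
    Step 2 (assumed): pick the candidate with the highest predictive entropy.
    """
    size = min(preselect_size, len(pool))
    candidates = rng.sample(range(len(pool)), size)
    return max(candidates, key=lambda i: entropy(predict_proba(weights, pool[i])))


# Toy data (hypothetical): 2 features, 3 classes, one node's unlabeled pool.
weights = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
pool = [[5.0, 0.0], [0.0, 5.0], [0.0, 0.0], [-4.0, -4.0]]

# Preselecting the whole pool makes the result independent of the random draw.
chosen = select_query(weights, pool, preselect_size=len(pool), rng=random.Random(0))
print(chosen)  # prints 2: the origin gets equal scores for all three classes
```

With a smaller `preselect_size`, different nodes would tend to examine different regions of their pools, which is one plausible reading of how random preselection encourages diversity without message passing.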
Pages: 2572 - 2579
Page count: 8
Related papers
50 in total
  • [41] Active Nearest-Neighbor Learning in Metric Spaces
    Kontorovich, Aryeh
    Sabato, Sivan
    Urner, Ruth
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [42] Incremental Multi-Label Learning with Active Queries
    Huang, Sheng-Jun
    Li, Guo-Xiang
    Huang, Wen-Yu
    Li, Shao-Yuan
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2020, 35 (02) : 234 - 246
  • [43] Active Learning Through Multi-Standard Optimization
    Wang, Min
    Zhang, Ying-Yi
    Min, Fan
    IEEE ACCESS, 2019, 7 : 56772 - 56784
  • [44] Incremental Multi-Label Learning with Active Queries
    Sheng-Jun Huang
    Guo-Xiang Li
    Wen-Yu Huang
    Shao-Yuan Li
    Journal of Computer Science and Technology, 2020, 35 : 234 - 246
  • [45] Exponential Savings in Agnostic Active Learning Through Abstention
    Puchkin, Nikita
    Zhivotovskiy, Nikita
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (07) : 4651 - 4665
  • [46] Design of Active Learning Framework for Collaborative Anomaly Detection
    Cai, He
    Hua, Cunqing
    Xu, Wenchao
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [47] How to measure uncertainty in uncertainty sampling for active learning
    Vu-Linh Nguyen
    Mohammad Hossein Shaker
    Eyke Hüllermeier
    Machine Learning, 2022, 111 : 89 - 122
  • [48] Active Learning With Sampling by Uncertainty and Density for Data Annotations
    Zhu, Jingbo
    Wang, Huizhen
    Tsou, Benjamin K.
    Ma, Matthew
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06) : 1323 - 1331
  • [49] Evidence-based uncertainty sampling for active learning
    Sharma, Manali
    Bilgic, Mustafa
    DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (01) : 164 - 202
  • [50] Active Learning in Cooperative Communication Based Cognitive Radio
    Zhang, Xian
    Wu, Qihui
    Wang, Jinlong
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (06) : 2151 - 2162