Global multiclass classification and dataset construction via heterogeneous local experts

Cited by: 1
Authors
Ahn S. [1 ]
Özgür A. [1 ]
Pilanci M. [1 ]
Affiliation
[1] Department of Electrical Engineering, Stanford University, Stanford, CA 94305
Source
IEEE Journal on Selected Areas in Information Theory | 2020 / Vol. 1 / No. 3
Funding
National Science Foundation (United States)
Keywords
Crowdsourcing; Dataset construction; Ensemble learning; Federated learning; Heterogeneous data; Multiclass classification; Set cover problem;
DOI
10.1109/JSAIT.2020.3041804
Abstract
In the domains of dataset construction and crowdsourcing, a notable challenge is to aggregate labels from a heterogeneous set of labelers, each of whom is potentially an expert in some subset of tasks (and less reliable in others). To reduce the costs of hiring human labelers or training automated labeling systems, it is of interest to minimize the number of labelers while ensuring the reliability of the resulting dataset. We model this as the problem of performing K-class classification using the predictions of smaller classifiers, each trained on a subset of [K], and derive bounds on the number of classifiers needed to accurately infer the true class of an unlabeled sample under both adversarial and stochastic assumptions. By exploiting a connection to the classical set cover problem, we produce a near-optimal scheme for designing such configurations of classifiers, which recovers the well-known one-vs.-one classification approach as a special case. Experiments with the MNIST and CIFAR-10 datasets demonstrate the favorable accuracy (compared to a centralized classifier) of our aggregation scheme applied to classifiers trained on subsets of the data. These results suggest a new way to automatically label data or adapt an existing set of local classifiers to larger-scale multiclass problems. © 2020 IEEE Journal on Selected Areas in Information Theory. All rights reserved.
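The aggregation idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: it assumes a simplified noiseless model in which a local classifier outputs the true class whenever that class lies in its training subset and may output anything from its subset otherwise. The function names `covers_all_pairs` and `consistent_classes`, and the example configuration, are hypothetical. Under this assumption, if every pair of classes appears together in at least one classifier's subset, only the true class is consistent with all predictions; one-vs.-one is the case where the subsets are exactly the pairs.

```python
from itertools import combinations


def covers_all_pairs(num_classes, subsets):
    """Return True if every pair of classes appears together in some subset."""
    return all(
        any(a in s and b in s for s in subsets)
        for a, b in combinations(range(num_classes), 2)
    )


def consistent_classes(num_classes, subsets, predictions):
    """Return the classes k such that every classifier trained on k predicted k.

    Assumption (illustrative only): a classifier is always correct when the
    true class is in its subset, so any class that some "expert" on it did
    not predict can be ruled out.
    """
    return [
        k
        for k in range(num_classes)
        if all(pred == k for s, pred in zip(subsets, predictions) if k in s)
    ]


# Hypothetical configuration: K = 4 classes, three local classifiers.
subsets = [{0, 1, 2}, {0, 3}, {1, 2, 3}]

# Suppose the true class is 2. Classifiers 0 and 2 were trained on class 2
# and predict it; classifier 1 was not, so its output (here 3) is arbitrary.
predictions = [2, 3, 2]

candidates = consistent_classes(4, subsets, predictions)

# One-vs.-one as a special case: one classifier per pair of classes.
one_vs_one = [set(pair) for pair in combinations(range(4), 2)]
```

Here `covers_all_pairs(4, subsets)` holds with only three classifiers, whereas one-vs.-one needs six for four classes, which is the kind of saving the set cover formulation aims at.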
Pages: 870-883
Number of pages: 13