Robust distributed multicategory angle-based classification for massive data

被引:0
|
作者
Sun, Gaoming [1 ]
Wang, Xiaozhou [1 ,2 ]
Yan, Yibo [1 ]
Zhang, Riquan [3 ]
机构
[1] East China Normal Univ, Sch Stat, Shanghai 200062, Peoples R China
[2] East China Normal Univ, Key Lab Adv Theory & Applicat Stat & Data Sci, MOE, Shanghai 200062, Peoples R China
[3] Shanghai Univ Int Business & Econ, Sch Stat & Informat, Shanghai 200062, Peoples R China
基金
中国国家自然科学基金;
关键词
Multicategory classification; Distributed setting; Robust distributed algorithms; MOM-based gradient estimation; Weighted-based gradient estimation; REGRESSION; FRAMEWORK; ALGORITHM;
D O I
10.1007/s00184-023-00915-3
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multicategory classification problems are frequently encountered in practice. Considering that the massive data sets are increasingly common and often stored locally, we first provide a distributed estimation in the multicategory angle-based classification framework and obtain its excess risk under general conditions. Further, under varied robustness settings, we develop two robust distributed algorithms to provide robust estimations of the multicategory classification. The first robust distributed algorithm takes advantage of median-of-means (MOM) and is designed by the MOM-based gradient estimation. The second robust distributed algorithm is implemented by constructing the weighted-based gradient estimation. The theoretical guarantees of our algorithms are established via the non-asymptotic error bounds of the iterative estimations. Some numerical simulations demonstrate that our methods can effectively reduce the impact of outliers.
引用
收藏
页码:299 / 323
页数:25
相关论文
共 50 条
  • [31] Multi-Armed Angle-Based Direct Learning for Estimating Optimal Individualized Treatment Rules With Various Outcomes
    Qi, Zhengling
    Liu, Dacheng
    Fu, Haoda
    Liu, Yufeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (530) : 678 - 691
  • [32] Robust linear classification from limited training data
    Chakrabarti, Deepayan
    MACHINE LEARNING, 2022, 111 (05) : 1621 - 1649
  • [33] Robust semi-supervised classification based on data augmented online ELMs with deep features
    Hu, Xiaochang
    Zeng, Yujun
    Xu, Xin
    Zhou, Sihang
    Liu, Li
    KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [34] A Unified Classification Model Based on Robust Optimization
    Takeda, Akiko
    Mitsugi, Hiroyuki
    Kanamori, Takafumi
    NEURAL COMPUTATION, 2013, 25 (03) : 759 - 804
  • [35] Distributed independent vector machine for big data classification problems
    Almaspoor, Mohammad Hassan
    Safaei, Ali A.
    Salajegheh, Afshin
    Minaei-Bidgoli, Behrouz
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (06): : 7207 - 7244
  • [36] Data gravitation based classification
    Peng, Lizhi
    Yang, Bo
    Chen, Yuehui
    Abraham, Ajith
    INFORMATION SCIENCES, 2009, 179 (06) : 809 - 819
  • [37] Robust classification model for PMU-based on-line power system DSA with missing data
    Zhang, Yuchen
    Xu, Yan
    Dong, Zhao Yang
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2017, 11 (18) : 4484 - 4491
  • [38] A Decentralized Technique for Robust Probabilistic Mixture Modelling of a Distributed Data Set
    El Attar, Ali
    Pigeau, Antoine
    Gelgon, Marc
    INTELLIGENT DISTRIBUTED COMPUTING V, 2011, 382 : 271 - 276
  • [39] Intraoperative Diagnosis Support Tool for Serous Ovarian Tumors Based on Microarray Data Using Multicategory Machine Learning
    Park, Jee Soo
    Choi, Soo Beom
    Kim, Hee Jung
    Cho, Nam Hoon
    Kim, Sang Wun
    Kim, Young Tae
    Nam, Eun Ji
    Chung, Jai Won
    Kim, Deok Won
    INTERNATIONAL JOURNAL OF GYNECOLOGICAL CANCER, 2016, 26 (01) : 104 - 113
  • [40] An algorithm for classification in data mining based on classification codes
    Sankar, H. Ravi
    Naidu, M. M.
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 766 - +