Efficient Divide-and-Conquer Classification Based on Parallel Feature-Space Decomposition for Distributed Systems

被引:2
|
作者
Guo, Qi [1 ,2 ]
Chen, Bo-Wei [3 ]
Rho, Seungmin [4 ]
Ji, Wen [5 ]
Jiang, Feng [6 ]
Ji, Xianyang [7 ]
Kung, Sun-Yuan [3 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[2] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[3] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[4] Sungkyul Univ, Dept Multimedia, Gyeonggi 430742, South Korea
[5] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[6] Harbin Inst Technol, Dept Comp Sci, Harbin 150001, Heilongjiang, Peoples R China
[7] Tsinghua UniversityBeijing, Dept Automat, Beijing 100084, Peoples R China
来源
IEEE SYSTEMS JOURNAL | 2018年 / 12卷 / 02期
关键词
Classification; divide and conquer (DC); feature-space decomposition; feature-space division; fusion; SUPPORT VECTOR MACHINES;
D O I
10.1109/JSYST.2015.2478800
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a divide-and-conquer (DC) approach based on feature-space decomposition for classification. When large-scale data sets are present, typical approaches usually employed truncated kernel methods on the feature space or DC approaches on the sample space. However, this did not guarantee separability between classes, owing to overfitting. To overcome such problems, this paper proposes a novel DC approach on feature spaces consisting of three steps. First, we divide the feature space into several subspaces using the decomposition method proposed in this paper. Subsequently, these feature subspaces are sent into individual local classifiers for training. Finally, the outcome of local classifiers are fused together to generate the final classification results. We also propose a Cascade-TRBFKRR classifier to reweight training samples for data refinement. Experiments on large-scale data sets are carried out for performance evaluation. The results show that the error rates of the proposed DC method decreased compared with the state-of-the-art fast support vector machine solvers, e.g., reducing error rates by 10.53% and 7.53% on RCV1 and covtype data sets, respectively.
引用
收藏
页码:1492 / 1498
页数:7
相关论文
共 50 条
  • [11] An simple and efficient fault tolerance mechanism for divide-and-conquer systems
    Wrzesinska, G
    van Nieuwpoort, RV
    Maassen, J
    Bal, HE
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID - CCGRID 2004, 2004, : 735 - 735
  • [12] Divide-and-Conquer Determinization of Buchi Automata Based on SCC Decomposition
    Li, Yong
    Turrini, Andrea
    Feng, Weizhi
    Vardi, Moshe Y.
    Zhang, Lijun
    COMPUTER AIDED VERIFICATION (CAV 2022), PT II, 2022, 13372 : 152 - 173
  • [13] An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures
    Li, Shengguo
    Rouet, Francois-Henry
    Liu, Jie
    Huang, Chun
    Gao, Xingyu
    Chi, Xuebin
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2018, 344 : 512 - 520
  • [14] A divide-and-conquer parallel computing scheme for the optimization analysis of tribological systems
    Wang, NZ
    Chen, LW
    TRIBOLOGY TRANSACTIONS, 2004, 47 (03) : 313 - 320
  • [15] A divide-and-conquer parallel computing scheme for the optimization analysis of tribological systems
    Wang, N
    Chen, LW
    TRIBOLOGY & LUBRICATION TECHNOLOGY, 2005, 61 (01) : 38 - 46
  • [16] Parallel out-of-core divide-and-conquer techniques with application to classification trees
    Sreenivas, MK
    AlSabti, K
    Ranka, S
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 555 - 562
  • [17] Parallel out-of-core divide-and-conquer techniques with application to classification trees
    Univ of Florida, Gainesville, United States
    Proc Int Parall Process Symp IPPS, (555-562):
  • [18] Divide-and-Conquer Kronecker Product Decomposition for Memory-Efficient Graph Approximation
    Maringanti, Venkata Suhas
    Shao, Ming
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3766 - 3773
  • [19] An efficient divide-and-conquer technique for parallel computation of modular multi-exponentiation
    Lou, DC
    Chang, CC
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2000, 15 (02): : 111 - 117
  • [20] Research on parallel collision detection algorithm based on pipelining and divide-and-conquer
    Zhao, Wei
    He, Yanshuang
    Li, Wenhui
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 277 - 282