Distributed Semi-Supervised Learning With Missing Data

被引:19
作者
Xu, Zhen [1 ,2 ]
Liu, Ying [1 ,2 ]
Li, Chunguang [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Zhejiang Prov Key Lab Informat Proc Commun & Netw, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
chi(2) kernel; classification; distributed; missing data; random feature map; semi-supervised learning; subspace learning; STRATEGIES; MACHINE;
D O I
10.1109/TCYB.2020.2967072
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data classification is usually challenged by the difficulty and/or high cost in collecting sufficient labeled data, and unavoidability of data missing. Besides, most of the existing algorithms belong to centralized processing, in which all of the training data must be stored and processed at a fusion center. But in many real applications, data are distributed over multiple nodes, and cannot be centralized to one node for processing due to various reasons. Considering this, in this article, we focus on the problem of distributed classification of missing data with a small proportion of labeled data samples, and develop a distributed semi-supervised missing-data classification (dS(2)MDC) algorithm. The proposed algorithm is a distributed joint subspace/classifier learning, that is, a latent subspace representation for missing feature imputation is learned jointly with the training of nonlinear classifiers modeled by the chi(2) kernel using a semi-supervised learning strategy. Theoretical performance analysis and simulations on several datasets clearly validate the effectiveness of the proposed dS(2)MDC algorithm from different perspectives.
引用
收藏
页码:6165 / 6178
页数:14
相关论文
共 40 条
[1]  
[Anonymous], 2004, Nonlinear Programming
[2]   ON THE CONVERGENCE OF BLOCK COORDINATE DESCENT TYPE METHODS [J].
Beck, Amir ;
Tetruashvili, Luba .
SIAM JOURNAL ON OPTIMIZATION, 2013, 23 (04) :2037-2060
[3]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[4]  
Blake C., 1998, UCI repository of machine learning databases
[5]  
Boskos D, 2016, IEEE DECIS CONTR P, P81, DOI 10.1109/CDC.2016.7798250
[6]   Semi-supervised graph-based hyperspectral image classification [J].
Camps-Valls, Gustavo ;
Bandos, Tatyana V. ;
Zhou, Dengyong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (10) :3044-3054
[7]   Diffusion LMS Strategies for Distributed Estimation [J].
Cattivelli, Federico S. ;
Sayed, Ali H. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (03) :1035-1048
[8]  
Chapelle O., 2006, Semi-Supervised
[9]   Diffusion Adaptation Strategies for Distributed Optimization and Learning Over Networks [J].
Chen, Jianshu ;
Sayed, Ali H. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (08) :4289-4305
[10]  
Demsar J, 2006, J MACH LEARN RES, V7, P1