Weakly Paired Multimodal Fusion for Object Recognition

被引:74
作者
Liu, Huaping [1 ,2 ,3 ]
Wu, Yupei [1 ,2 ,3 ]
Sun, Fuchun [1 ,2 ,3 ]
Fang, Bin [1 ,2 ,3 ]
Guo, Di [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol, State Key Lab Intelligent Technol & Syst, Beijing 100084, Peoples R China
[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Intelligent robot system; manipulation and grasping; multimodal data; projective dictionary learning; weakly paired data;
D O I
10.1109/TASE.2017.2692271
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ever-growing development of sensor technology has led to the use of multimodal sensors to develop robotics and automation systems. It is therefore highly expected to develop methodologies capable of integrating information from multimodal sensors with the goal of improving the performance of surveillance, diagnosis, prediction, and so on. However, real multimodal data often suffer from significant weak-pairing characteristics, i.e., the full pairing between data samples may not be known, while pairing of a group of samples from one modality to a group of samples in another modality is known. In this paper, we establish a novel projective dictionary learning framework for weakly paired multimodal data fusion. By introducing a latent pairing matrix, we realize the simultaneous dictionary learning and the pairing matrix estimation, and therefore improve the fusion effect. In addition, the kernelized version and the optimization algorithms are also addressed. Extensive experimental validations on some existing data sets are performed to show the advantages of the proposed method. Note to Practitioners-In many industrial environments, we usually use multiple heterogeneous sensors, which provide multimodal information. Such multimodal data usually lead to two technical challenges. First, different sensors may provide different patterns of data. Second, the full-pairing information between modalities may not be known. In this paper, we develop a unified model to tackle such problems. This model is based on a projective dictionary learning method, which efficiently produces the representation vector for the original data by an explicit form. In addition, the latent pairing relation between samples can be learned automatically and be used to improve the classification performance. Such a method can be flexibly used for multimodal fusion with full-pairing, partial-pairing and weak-pairing cases.
引用
收藏
页码:784 / 795
页数:12
相关论文
共 59 条
[1]   Projective dictionary pair learning for EEG signal classification in brain computer interface applications [J].
Ameri, Rasool ;
Pouyan, Ali ;
Abolghasemi, Vahid .
NEUROCOMPUTING, 2016, 218 :382-389
[2]  
[Anonymous], JOINTLY LEARNING NON
[3]  
[Anonymous], P 2016 ACM MULT C
[4]  
[Anonymous], 2010, ACAD MANAGEMENT ANN
[5]  
[Anonymous], 2013, AAAI C ART INT
[6]   Multimodal Task-Driven Dictionary Learning for Image Classification [J].
Bahrampour, Soheil ;
Nasrabadi, Nasser M. ;
Ray, Asok ;
Jenkins, William Kenneth .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) :24-38
[7]  
Beksi WJ, 2015, IEEE INT CONF ROBOT, P1880, DOI 10.1109/ICRA.2015.7139443
[8]   Heterogeneous Sensor Data Fusion Approach for Real-time Monitoring in Ultraprecision Machining (UPM) Process Using Non-Parametric Bayesian Clustering and Evidence Theory [J].
Beyca, Omer F. ;
Rao, Prahalad K. ;
Kong, Zhenyu ;
Bukkapatnam, Satish T. S. ;
Komanduri, Ranga .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2016, 13 (02) :1033-1044
[9]   Gaussian Process Decentralized Data Fusion and Active Sensing for Spatiotemporal Traffic Modeling and Prediction in Mobility-on-Demand Systems [J].
Chen, Jie ;
Low, Kian Hsiang ;
Yao, Yujian ;
Jaillet, Patrick .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2015, 12 (03) :901-921
[10]   Robotic learning of haptic adjectives through physical interaction [J].
Chu, Vivian ;
McMahon, Ian ;
Riano, Lorenzo ;
McDonald, Craig G. ;
He, Qin ;
Perez-Tejada, Jorge Martinez ;
Arrigo, Michael ;
Darrell, Trevor ;
Kuchenbecker, Katherine J. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2015, 63 :279-292