Online Learning for Multimodal Data Fusion With Application to Object Recognition

Cited by: 11

Authors:

Shahrampour, Shahin [1]

Noshad, Mohammad [2]

Ding, Jie [1]

Tarokh, Vahid [1]

Affiliations:

[1] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA

[2] VLNComm, Charlottesville, VA 22911 USA

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | 2018, Vol. 65, No. 9

Keywords:

Online learning; mirror descent; tactile sensing; object recognition; PREDICTION; FEATURES; SETS

DOI:

10.1109/TCSII.2017.2754141

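Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809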

Abstract:

We consider online multimodal data fusion, where the goal is to combine information from multiple modes to identify an element in a large dictionary. We address this problem in the context of object recognition by focusing on tactile sensing as one of the modes. Using a tactile glove with seven sensors, various individuals grasp different objects to obtain 7-D time series, where each component represents the pressure sequence applied to one sensor. The pressure data of all objects is stored in a dictionary as a reference. The objective is to match a streaming vector time series from grasping an unknown object to a dictionary object. We propose an algorithm that may start with prior knowledge provided by other modes. Receiving pressure data sequentially, the algorithm uses a dissimilarity metric to modify the prior and form a probability distribution over the dictionary. When the dictionary objects are dissimilar in shape, we empirically show that our algorithm recognizes the unknown object even with a uniform prior. If the dictionary contains an object similar to the unknown object, our algorithm needs the prior from other modes to detect the unknown object. Notably, our algorithm maintains performance similar to that of standard offline classification techniques, such as support vector machines, with significantly lower computational time.
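The update described in the abstract (start from a prior, receive pressure samples one at a time, reweight dictionary entries by a dissimilarity measure) can be illustrated with a generic exponentiated-weights step, which is the entropic form of mirror descent named in the keywords. This is only a minimal sketch and not the authors' algorithm: the squared Euclidean dissimilarity, the fixed step size `eta`, the function name `online_fusion`, and the synthetic data are all assumptions made for illustration.

```python
import numpy as np

def online_fusion(stream, dictionary, prior=None, eta=0.5):
    """Exponentiated-weights update over dictionary entries (illustrative only).

    stream:     array (T, d), samples from grasping the unknown object
    dictionary: array (K, T, d), reference time series for K objects
    prior:      length-K probability vector from other modes (uniform if None)
    eta:        hypothetical step size, not taken from the paper
    """
    stream = np.asarray(stream, dtype=float)
    dictionary = np.asarray(dictionary, dtype=float)
    K = dictionary.shape[0]
    p = np.full(K, 1.0 / K) if prior is None else np.asarray(prior, dtype=float)
    # Work in log space so that long streams do not underflow.
    log_w = np.log(np.clip(p, 1e-12, None))
    for t, x in enumerate(stream):
        # Assumed dissimilarity: squared Euclidean distance at time t.
        loss = np.sum((dictionary[:, t, :] - x) ** 2, axis=1)
        log_w -= eta * loss
        log_w -= log_w.max()  # constant shift; leaves the distribution unchanged
    w = np.exp(log_w)
    return w / w.sum()

# Synthetic demo: 3 dictionary objects, 20 time steps, 7 sensors.
rng = np.random.default_rng(0)
dictionary = rng.normal(size=(3, 20, 7))
stream = dictionary[1] + 0.05 * rng.normal(size=(20, 7))
posterior = online_fusion(stream, dictionary)
print(posterior.argmax(), posterior.round(3))
```

Each sample costs one pass over the dictionary, which is the sense in which such an update is cheap compared with retraining an offline classifier. When two dictionary entries are nearly identical their losses stay close, so the returned distribution remains close to whatever `prior` was supplied, matching the abstract's remark that similar objects require a prior from other modes.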


Pages: 1259-1263

Page count: 5

