Online Learning for Multimodal Data Fusion With Application to Object Recognition

被引:11
|
作者
Shahrampour, Shahin [1 ]
Noshad, Mohammad [2 ]
Ding, Jie [1 ]
Tarokh, Vahid [1 ]
机构
[1] Harvard Univ, John A Paulson Sch Engn & Appl Sci, Cambridge, MA 02138 USA
[2] VLNComm, Charlottesville, VA 22911 USA
关键词
Online learning; mirror descent; tactile sensing; object recognition; PREDICTION; FEATURES; SETS;
D O I
10.1109/TCSII.2017.2754141
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider online multimodal data fusion, where the goal is to combine information from multiple modes to identify an element in a large dictionary. We address this problem in the context of object recognition by focusing on tactile sensing as one of the modes. Using a tactile glove with seven sensors, various individuals grasp different objects to obtain 7-D time series, where each component represents the pressure sequence applied to one sensor. The pressure data of all objects is stored in a dictionary as a reference. The objective is to match a streaming vector time series from grasping an unknown object to a dictionary object. We propose an algorithm that may start with prior knowledge provided by other modes. Receiving pressure data sequentially, the algorithm uses a dissimilarity metric to modify the prior and form a probability distribution over the dictionary. When the dictionary objects are dissimilar in shape, we empirically show that our algorithm recognize the unknown object even with a uniform prior. If there exists a similar object to the unknown object in the dictionary, our algorithm needs the prior from other modes to detect the unknown object. Notably, our algorithm maintains a similar performance to standard offline classification techniques, such as support vector machine, with a significantly lower computational time.
引用
收藏
页码:1259 / 1263
页数:5
相关论文
共 50 条
  • [1] Multimodal data fusion for object recognition
    Knyaz, Vladimir
    MULTIMODAL SENSING: TECHNOLOGIES AND APPLICATIONS, 2019, 11059
  • [2] Learning behaviour recognition method of English online course based on multimodal data fusion
    Li, Liangjie
    International Journal of Business Intelligence and Data Mining, 2024, 25 (3-4) : 336 - 349
  • [3] Weakly Paired Multimodal Fusion for Object Recognition
    Liu, Huaping
    Wu, Yupei
    Sun, Fuchun
    Fang, Bin
    Guo, Di
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (02) : 784 - 795
  • [4] Deep Learning-based Multimodal Fusion for Improved Object Recognition Accuracy
    Wang, Qi
    Cheng, Xiaohan
    Gao, Zijun
    Gu, Wenjun
    Mei, Taiyuan
    Xia, Haohao
    2024 3RD INTERNATIONAL CONFERENCE ON ROBOTICS, ARTIFICIAL INTELLIGENCE AND INTELLIGENT CONTROL, RAIIC 2024, 2024, : 471 - 474
  • [5] Data Fusion and its Application in Object Recognition in Stereo Sequences
    Wei, Yi
    Tan, Chunyan
    Wang, Wei
    FIRST IITA INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, : 330 - 332
  • [6] A Deep Reinforcement Learning Method For Multimodal Data Fusion in Action Recognition
    Guo, Jiale
    Liu, Qiang
    Chen, Enqing
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 120 - 124
  • [7] Multimodal Deep-Learning for Object Recognition Combining Camera and LIDAR Data
    Melotti, Gledson
    Premebida, Cristiano
    Goncalves, Nuno
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 177 - 182
  • [8] Multimodal Physiological Signals Fusion for Online Emotion Recognition
    Pan, Tongjie
    Ye, Yalan
    Cai, Hecheng
    Huang, Shudong
    Yang, Yang
    Wang, Guoqing
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5879 - 5888
  • [9] Cosmo: Contrastive Fusion Learning with Small Data for Multimodal Human Activity Recognition
    Ouyang, Xiaomin
    Shuai, Xian
    Zhou, Jiayu
    Shi, Ivy Wang
    Xie, Zhiyuan
    Xing, Guoliang
    Huang, Jianwei
    PROCEEDINGS OF THE 2022 THE 28TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, ACM MOBICOM 2022, 2022, : 324 - 337
  • [10] Improving Unimodal Object Recognition with Multimodal Contrastive Learning
    Meyer, Johannes
    Eitel, Andreas
    Brox, Thomas
    Burgard, Wolfram
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 5656 - 5663