Multimodal Convolutional Neural Network for Object Detection Using RGB-D Images

被引:0
作者
Mocanu, Irina [1 ]
Clapon, Cosmin [1 ]
机构
[1] Univ Politehn Bucuresti, Comp Sci Dept, Bucharest, Romania
来源
2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2018年
基金
欧盟地平线“2020”;
关键词
object detection; convolutional neural network; RGB-D images;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a new convolutional neural network architecture for performing object detection based on RGB-D images. The network is an extension of the Faster-RCNN network where we add an additional input network branch for processing the depth image. The network was evaluated on the SUN RGB-D dataset for object detection and we obtained a positive difference in mAP score of about 4%, compared to the original one.
引用
收藏
页码:307 / 310
页数:4
相关论文
共 50 条
  • [31] Self-Supervised Pretraining With Multimodality Representation Enhancement for Salient Object Detection in RGB-D Images
    Gao, Lina
    Liu, Bing
    Fu, Ping
    Xu, Mingzhu
    Zhang, Yonggang
    Huang, Yulong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [32] SASD: A Shape-Aware Saliency Object Detection Approach for RGB-D Images
    Zi, Lingling
    Cong, Xin
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 179 - 190
  • [33] Advancing in RGB-D Salient Object Detection: A Survey
    Chen, Ai
    Li, Xin
    He, Tianxiang
    Zhou, Junlin
    Chen, Duanbing
    APPLIED SCIENCES-BASEL, 2024, 14 (17):
  • [34] Oriented object detection in satellite images using convolutional neural network based on ResNeXt
    Haryono, Asep
    Jati, Grafika
    Jatmiko, Wisnu
    ETRI JOURNAL, 2024, 46 (02) : 307 - 322
  • [35] Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images
    Wang, Xiaoqiang
    Zhu, Lei
    Tang, Siliang
    Fu, Huazhu
    Li, Ping
    Wu, Fei
    Yang, Yi
    Zhuang, Yueting
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1107 - 1119
  • [36] Robust Localization Using RGB-D Images
    Oh, Yoonseon
    Oh, Songhwai
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1023 - 1026
  • [37] 3D Object Tracking in RGB-D Images Using Particle Swarm Optimization
    dos Santos Junior, Jose Guedes
    Silva do Monte Lima, Joao Paulo
    2017 19TH SYMPOSIUM ON VIRTUAL AND AUGMENTED REALITY (SVR), 2017, : 107 - 115
  • [38] Transparent Object Detection Using Convolutional Neural Network
    Khaing, May Phyo
    Masayuki, Mukunoki
    BIG DATA ANALYSIS AND DEEP LEARNING APPLICATIONS, 2019, 744 : 86 - 93
  • [39] ResFusion: deeply fused scene parsing network for RGB-D images
    Dai, Juting
    Tang, Xinyi
    IET COMPUTER VISION, 2018, 12 (08) : 1171 - 1178
  • [40] ASIF-Net: Attention Steered Interweave Fusion Network for RGB-D Salient Object Detection
    Li, Chongyi
    Cong, Runmin
    Kwong, Sam
    Hou, Junhui
    Fu, Huazhu
    Zhu, Guopu
    Zhang, Dingwen
    Huang, Qingming
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (01) : 88 - 100