Semantic R-CNN for Natural Language Object Detection

被引:0
|
作者
Ye, Shuxiong [1 ]
Qin, Zheng [1 ]
Xu, Kaiping [1 ]
Huang, Kai [1 ]
Wang, Guolong [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II | 2018年 / 10736卷
关键词
Object detection; Natural language; RPN;
D O I
10.1007/978-3-319-77383-4_10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a simple and effective framework for natural language object detection, to localize a target within an image based on description of the target. The method, called semantic R-CNN, extends RPN (Region Proposal Network) [1] by adding LSTM [20] module for processing natural language query text. LSTM [20] module take encoded query text and image descriptors as input and output the probability of the query text conditioned on visual features of candidate box and whole image. Those candidate boxes are generated by RPN and their local features are extracted by ROI pooling. RPN can be initialized from pre-trained Faster R-CNN model [1], transfers object visual knowledge from traditional object detection domain to our task. Experimental results demonstrate that our method significantly outperform previous baseline SCRC (Spatial Context Recurrent ConvNet) [7] model on Referit dataset [8], moreover, our model is simple to train similar to Faster R-CNN.
引用
收藏
页码:98 / 107
页数:10
相关论文
共 50 条
  • [31] Design and Implementation of an Object Detection System Using Faster R-CNN
    Wang Cheng
    Peng Zhihao
    2019 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS 2019), 2019, : 204 - 206
  • [32] Nested object detection using mask R-CNN: application to bee and varroa detection
    Kriouile, Yassine
    Ancourt, Corinne
    Wegrzyn-Wolska, Katarzyna
    Bougueroua, Lamine
    Neural Computing and Applications, 2024, 36 (35) : 22587 - 22609
  • [33] Object detection and recognition using contour based edge detection and fast R-CNN
    Rani, Shilpa
    Ghai, Deepika
    Kumar, Sandeep
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42183 - 42207
  • [34] Object detection and recognition using contour based edge detection and fast R-CNN
    Shilpa Rani
    Deepika Ghai
    Sandeep Kumar
    Multimedia Tools and Applications, 2022, 81 : 42183 - 42207
  • [35] Research on abnormal object detection in specific region based on Mask R-CNN
    Xiong, Haitao
    Wu, Jiaqing
    Liu, Qing
    Cai, Yuanyuan
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (03)
  • [36] Mask R-CNN Object Detection Method Based on Improved Feature Pyramid
    Ren Zhijun
    Lin Suzhen
    Li Dawei
    Wang Lifang
    Zuo Jianhong
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (04)
  • [37] IoU-aware feature fusion R-CNN for dense object detection
    Jixuan Hong
    Xueqin He
    Zhaoli Deng
    Chenhui Yang
    Machine Vision and Applications, 2024, 35
  • [38] Privacy-Preserving Object Detection for Medical Images With Faster R-CNN
    Liu, Yang
    Ma, Zhuo
    Liu, Ximeng
    Ma, Siqi
    Ren, Kui
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2022, 17 : 69 - 84
  • [39] Mask R-CNN Based Object Detection for Intelligent Wireless Power Transfer
    Wu, Aozhou
    Zhang, Qingqing
    Fang, Wen
    Deng, Hao
    Jiang, Sai
    Liu, Qingwen
    Xia, Pengfei
    2018 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2018,
  • [40] Multiscale anchor box and optimized classification with faster R-CNN for object detection
    Wang, Sheng-Ye
    Qu, Zhong
    IET IMAGE PROCESSING, 2023, 17 (05) : 1322 - 1333