Semantic R-CNN for Natural Language Object Detection

Cited by: 0
Authors
Ye, Shuxiong [1]
Qin, Zheng [1]
Xu, Kaiping [1]
Huang, Kai [1]
Wang, Guolong [1]
Affiliations
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
Source
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II | 2018 / Vol. 10736
Keywords
Object detection; Natural language; RPN
DOI
10.1007/978-3-319-77383-4_10
CLC number (Chinese Library Classification)
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
In this paper, we present a simple and effective framework for natural language object detection, which localizes a target within an image based on a description of the target. The method, called semantic R-CNN, extends RPN (Region Proposal Network) [1] by adding an LSTM [20] module for processing natural language query text. The LSTM [20] module takes the encoded query text and image descriptors as input and outputs the probability of the query text conditioned on the visual features of a candidate box and of the whole image. The candidate boxes are generated by the RPN, and their local features are extracted by ROI pooling. The RPN can be initialized from a pre-trained Faster R-CNN model [1], which transfers visual knowledge of objects from the traditional object detection domain to our task. Experimental results demonstrate that our method significantly outperforms the previous baseline, the SCRC (Spatial Context Recurrent ConvNet) [7] model, on the Referit dataset [8]. Moreover, our model is simple to train, similar to Faster R-CNN.
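The abstract says the LSTM module outputs the probability of the query text conditioned on each candidate box and the whole image, but it does not spell out how a box is then selected. The following is a minimal sketch of one plausible reading, under two stated assumptions: the query probability factorizes over words by the chain rule, as is usual for LSTM language models, and the predicted box is the candidate with the highest query probability. The function names, box coordinates, and per-word probabilities are all hypothetical; a real system would obtain the probabilities from the trained LSTM rather than from a hand-written table.

```python
import math
from typing import Dict, List, Tuple

# Hypothetical box format: (x1, y1, x2, y2) in pixels.
Box = Tuple[int, int, int, int]


def query_log_prob(word_probs: List[float]) -> float:
    """Chain rule (assumed): log p(query | box, image) = sum_t log p(w_t | w_<t, box, image)."""
    return sum(math.log(p) for p in word_probs)


def rank_boxes(per_box_word_probs: Dict[Box, List[float]]) -> List[Tuple[Box, float]]:
    """Sort candidate boxes by query log-probability, best first."""
    scored = [(box, query_log_prob(probs)) for box, probs in per_box_word_probs.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical per-word probabilities that an LSTM might assign to a
# three-word query for three candidate boxes from a region proposal stage.
candidates = {
    (10, 20, 110, 220): [0.60, 0.50, 0.40],
    (200, 40, 320, 180): [0.20, 0.30, 0.10],
    (0, 0, 640, 480): [0.30, 0.30, 0.30],
}

ranking = rank_boxes(candidates)
best_box, best_score = ranking[0]
print("best box:", best_box, "log-probability:", best_score)
```

Summing log-probabilities instead of multiplying raw probabilities avoids numerical underflow on long queries and leaves the ranking unchanged, since the logarithm is monotonic.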
Pages: 98-107
Page count: 10