Feature Fusion for Scene Text Detection

被引:4
作者
Zhu, Zhen [1 ]
Liao, Minghui [1 ]
Shi, Baoguang [1 ]
Bai, Xiang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
来源
2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS) | 2018年
基金
中国国家自然科学基金;
关键词
Oriented; Faster R-CNN; Feature fusion;
D O I
10.1109/DAS.2018.60
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A significant challenge in scene text detection is the large variation in text sizes. In particular, small text are usually hard to detect. This paper presents an accurate oriented text detector based on Faster R-CNN. We observe that Faster R-CNN is suitable for general object detection but inadequate for scene text detection due to the large variation in text size. We apply feature fusion both in RPN and Fast R-CNN to alleviate this problem and furthermore, enhance model's ability to detect relatively small text. Our text detector achieves comparable results to those state of the art methods on ICDAR 2015 and MSRA-TD500, showing its advantage and applicability.
引用
收藏
页码:193 / 198
页数:6
相关论文
共 41 条
  • [21] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [22] Du W, 2016, 2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), P1, DOI 10.1109/ITNEC.2016.7560307
  • [23] HE D., 2016, Asian Conference on Computer Vision, P280
  • [24] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [25] He P, 2017, P ICCV
  • [26] He W., 2017, P ICCV
  • [27] Hu H., 2017, P ICCV
  • [28] Huang W., 2016, P 2016 ACM MULTIMEDI, P551, DOI 10.1145/2964284.2967282
  • [29] Reading Text in the Wild with Convolutional Neural Networks
    Jaderberg, Max
    Simonyan, Karen
    Vedaldi, Andrea
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) : 1 - 20
  • [30] Li H., 2017, P ICCV