TKDN: Scene Text Detection via Keypoints Detection

被引:0
作者
Cui, Yuanshun [1 ,2 ]
Li, Jie [1 ,2 ]
Han, Hu [1 ,3 ]
Shan, Shiguang [1 ,2 ,4 ]
Chen, Xilin [1 ,2 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Peng Cheng Lab, Shenzhen, Peoples R China
[4] CAS Ctr Excellence Brain Sci & Intelligence Techn, Shanghai, Peoples R China
来源
COMPUTER VISION - ACCV 2018, PT V | 2019年 / 11365卷
关键词
RECOGNITION;
D O I
10.1007/978-3-030-20873-8_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past few years, great efforts have been devoted to scene text detection. Nevertheless, efficient text detection in the wild remains a challenging problem. Methods for general object detection usually have limitations in handling the arbitrary orientations and large aspect ratios of scene text. In this paper, we present a novel scene text detection method which treats text detection as a text keypoint detection task performed in a coarse-to-fine scheme (text keypoint detection network, TKDN). Specifically, in TKDN we first generate the coarse text instance regions using feature pyramid network (FPN) as well as region proposal network (RPN) and ResNet50. Within the coarse text regions, we then perform text keypoint detection, bounding box classification and regression, and text region segmentation in a multi-task way. In the inference stage, an effective post-processing algorithm is designed to combine the outputs from three branches and obtain the final text keypoint detection results. The proposed TKDN approach outperforms the state-of-the-art approach and achieves an F-measure of 82.0% on the public-domain ICDAR2015 database.
引用
收藏
页码:231 / 246
页数:16
相关论文
共 38 条
  • [1] [Anonymous], IEEE CVPR
  • [2] [Anonymous], IEEE BTAS
  • [3] [Anonymous], IEEE ICCV
  • [4] [Anonymous], AAAI
  • [5] [Anonymous], IEEE CVPR
  • [6] [Anonymous], IEEE CVPR
  • [7] [Anonymous], IEEE CVPR
  • [8] [Anonymous], IEEE CVPR
  • [9] [Anonymous], 2018, AAAI
  • [10] [Anonymous], 2015, ICDAR