Delaunay triangulation based text detection from multi-view images of natural scene

Cited by: 26
Authors
Roy, Soumyadip [1]
Shivakumara, Palaiahnakote [2]
Pal, Umapada [3]
Lu, Tong [4]
Kumar, Govindaraj Hemantha [5]
Affiliations
[1] Heritage Inst Technol, Comp Sci & Engn, Kolkata, India
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[4] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[5] Univ Mysore, Dept Studies Comp Sci, Mysore, Karnataka, India
Keywords
Clustering algorithms
DOI
10.1016/j.patrec.2019.11.021
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text detection in the wild remains a challenging problem for researchers because of its many real-time applications, such as forensics, where CCTV cameras capture images of the same scene from different angles. Unlike existing methods, which detect text in a single view captured orthogonally, this paper uses multiple views (view-1 and view-2 of the same spot) of the same scene, captured at different angles or from different heights and distances. For each image pair of the same scene, the proposed method extracts features that describe the characteristics of text components based on Delaunay Triangulation (DT), namely the corner points, area and cavity of the DT. The features of corresponding DTs in view-1 and view-2 are compared using the cosine distance measure to estimate the similarity between two components, one from each view. If the pair satisfies the similarity condition, the components are considered Candidate Text Components (CTC); in other words, CTCs are the components common to view-1 and view-2 that satisfy the similarity condition. From each CTC of view-1 and view-2, the proposed method finds nearest-neighbor components to restore the components of the same text line, estimating the degree of similarity between the CTC and its neighbor components using Chi-square and cosine distance measures. Furthermore, the proposed method uses a recognition step to detect correct text by comparing the recognition results of view-1 and view-2. The same recognition step is used to remove false positives and improve the performance of the proposed method. Experimental results on our own dataset, which contains pairs of images of different situations, and on the standard datasets ICDAR 2013, MSRA-TD500, CTW1500, Total-Text, ICDAR 2017 MLT and COCO-Text show that the proposed method outperforms existing methods. (C) 2019 Published by Elsevier B.V.
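
The cross-view comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-element feature layout (normalized corner count, area, cavity), the best-match pairing rule, the threshold `max_dist`, and all function names are assumptions made for illustration. Only the use of cosine and Chi-square distance measures comes from the abstract.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0  # treat an all-zero vector as maximally dissimilar
    return 1.0 - dot / (norm_u * norm_v)


def chi_square_distance(u, v):
    """Symmetric Chi-square distance for non-negative feature vectors."""
    total = 0.0
    for a, b in zip(u, v):
        if a + b > 0:  # skip bins that are empty in both vectors
            total += (a - b) ** 2 / (a + b)
    return 0.5 * total


def candidate_text_pairs(feats_view1, feats_view2, max_dist=0.05):
    """Pair each view-1 component with its closest view-2 component.

    A pair is kept as a candidate text component only when its cosine
    distance is at most max_dist (a hypothetical threshold).
    """
    pairs = []
    for i, f1 in enumerate(feats_view1):
        j, dist = min(
            ((j, cosine_distance(f1, f2)) for j, f2 in enumerate(feats_view2)),
            key=lambda item: item[1],
        )
        if dist <= max_dist:
            pairs.append((i, j))
    return pairs


# Hypothetical normalized features: [corner count, area, cavity].
view1 = [[0.8, 0.3, 0.1], [0.2, 0.9, 0.0], [0.5, 0.5, 0.5]]
view2 = [[0.2, 0.85, 0.05], [0.75, 0.35, 0.1], [0.1, 0.1, 0.9]]

print(candidate_text_pairs(view1, view2))
```

With these made-up vectors, the first two view-1 components find a close match in view-2, while the third has no sufficiently similar counterpart and is dropped. The Chi-square helper is included because the abstract names it for the neighbor-grouping step; how the two measures are combined there is not stated, so the sketch leaves that open.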
Pages: 92-100
Number of pages: 9