A Novel Approach for Video Text Detection and Recognition Based on a Corner Response Feature Map and Transferred Deep Convolutional Neural Network

被引:22
作者
Lu, Wei [1 ]
Sun, Hongbo [1 ]
Chu, Jinghui [1 ]
Huang, Xiangdong [1 ]
Yu, Jiexiao [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
基金
中国国家自然科学基金;
关键词
Video text detection and recognition; corner response feature map; transferred convolutional neural network; fuzzy c-means clustering; IMAGES; ALGORITHM; CAPTION; FCM;
D O I
10.1109/ACCESS.2018.2851942
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The text presented in videos contains important information for content analysis, indexing, and retrieval of videos. The key technique for extracting this information is to find, verify, and recognize video text in various languages and fonts against complex backgrounds. In this paper, we propose a novel method that combines a corner response feature map and transferred deep convolutional neural networks for detecting and recognizing video text. First, we use a corner response feature map to detect candidate text regions with a high recall. Next, we partition the candidate text regions into candidate text lines by projection analysis using two alternative methods. We then construct classification networks transferred from VGG16, ResNet50, and InceptionV3 to eliminate false positives. Finally, we develop a novel fuzzy c-means clustering-based separation algorithm to obtain a clean text layer from complex backgrounds so that the text is correctly recognized by commercial optical character recognition software. The proposed method is robust and has good performance on video text detection and recognition, which was evaluated on three publicly available test data sets and on the high-resolution test data set we constructed.
引用
收藏
页码:40198 / 40211
页数:14
相关论文
共 62 条
[1]  
[Anonymous], 2013, ARXIV PREPRINT ARXIV
[2]  
[Anonymous], 2016, ARXIV160502688
[3]   Techniques and systems for image and video retrieval [J].
Aslandogan, YA ;
Yu, CT .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (01) :56-63
[4]   FCM - THE FUZZY C-MEANS CLUSTERING-ALGORITHM [J].
BEZDEK, JC ;
EHRLICH, R ;
FULL, W .
COMPUTERS & GEOSCIENCES, 1984, 10 (2-3) :191-203
[5]  
Bhaskar H., 2010, P 13 C INF FUS FUSIO, P1
[6]  
Cai M, 2002, IEEE IMAGE PROC, P117
[7]  
de Jesus A M., 2011, Proceedings of the 2011 IEEE International Symposium on Multimedia (ISM 2011), P305, DOI 10.1109/ISM.2011.55
[8]  
Delakis M, 2008, VISAPP 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, P290
[9]  
Dozat T., 2016, Incorporating nesterov momentum into adam, P1
[10]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041