Deep-CNNTL: Text Localization from Natural Scene Images Using Deep Convolution Neural Network with Transfer Learning

被引:4
作者
Chaitra, Y. L. [1 ]
Dinesh, R. [1 ]
Gopalakrishna, M. T. [2 ]
Prakash, B. V. Ajay [2 ]
机构
[1] Jain Univ, Bengaluru, India
[2] Visvesvaraya Technol Univ, SJB Inst Technol, Bengaluru, India
关键词
Text localization; Deep learning; Transfer Learning; Scene Images; VGG16; architecture; SEARCH;
D O I
10.1007/s13369-021-06309-9
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Text localization from natural images plays an essential role in reading the text content present in the illustration. It is complex to localize the textual content because the text in natural scene images will be scattered. Prior information about the location of the text, size of the text, the orientation of the text, and the number of text present in the images are not available. These factors have posed a challenge to localize text in natural scene images. We have proposed a comprehensive solution for localizing text using Deep Convolution Neural Network (DCNN) and Transfer Learning (TL). DCNN layers such as convolution, dense layers, dropout, and learning rate are optimized using a random search. A combination of DCNN+TL is more effective in processing complex text images using VGG16 architecture. The proposed method has experimented on the standard ICDAR 2015 dataset, and the obtained results proved to be more effective with accuracy and an F-score of 0.8279 compared to state-of-art methods.
引用
收藏
页码:9629 / 9640
页数:12
相关论文
共 54 条
[1]  
Agrahari Anurag, 2020, Procedia Computer Science, V171, P322, DOI 10.1016/j.procs.2020.04.033
[2]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[3]  
Chaitra Y, 2022, ICT INTELLIGENT APPL, V1, P563, DOI [10.1007/978-981-16-4177-0_55, DOI 10.1007/978-981-16-4177-0_55]
[4]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
[5]  
FOGEL I, 1989, BIOL CYBERN, V61, P103, DOI 10.1007/BF00204594
[6]   Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition [J].
Harizi, Riadh ;
Walha, Rim ;
Drira, Fadoua ;
Zaied, Mourad .
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (03) :3091-3106
[7]  
Huang WL, 2014, LECT NOTES COMPUT SC, V8692, P497, DOI 10.1007/978-3-319-10593-2_33
[8]   Reading Text in the Wild with Convolutional Neural Networks [J].
Jaderberg, Max ;
Simonyan, Karen ;
Vedaldi, Andrea ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) :1-20
[9]  
Jaderberg M, 2014, LECT NOTES COMPUT SC, V8692, P512, DOI 10.1007/978-3-319-10593-2_34
[10]   Text information extraction in images and video: a survey [J].
Jung, K ;
Kim, KI ;
Jain, AK .
PATTERN RECOGNITION, 2004, 37 (05) :977-997