Scene Based Text Recognition From Natural Images and Classification Based on Hybrid CNN Models with Performance Evaluation

被引:0
作者
Dasari, Sunil Kumar [1 ]
Mehta, Shilpa [1 ]
机构
[1] Presidency Univ, Dept ECE, SOE, Bangalore 560054, India
关键词
CNN; Scene based text detection; RESNET50; Text Detection; YOLO;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Similar to the recognition of captions, pictures, or overlapped text that typically appears horizontally, multi-oriented text recognition in video frames is challenging since it has high contrast related to its background. Multi-oriented form of text normally denotes scene text which makes text recognition further stimulating and remarkable owing to the disparaging features of scene text. Hence, predictable text detection approaches might not give virtuous outcomes for multi-oriented scene text detection. Text detection from any such natural image has been challenging since earlier times, and significant enhancement has been made recently to execute this task. While coming to blurred, low-resolution, and small-sized images, most of the previous research conducted doesn't work well; hence, there is a research gap in that area. Scene-based text detection is a key area due to its adverse applications. One such primary reason for the failure of earlier methods is that the existing methods could not generate precise alignments across feature areas and targets for those images. This research focuses on scene-based text detection with the aid of YOLO based object detector and a CNN-based classification approach. The experiments were conducted in MATLAB 2019A, and the packages used were RESNET50, INCEPTIONRESNETV2, and DENSENET201. The efficiency of the proposed methodology -Hybrid resnet-YOLO procured maximum accuracy of 91%, Hybrid inceptionresnetv2-YOLO of 81.2%, and Hybrid densenet201-YOLO of 83.1% and was verified by comparing it with the existing research works Resnet50 of 76.9%, ResNet-101 of 79.5%, and ResNet-152 of 82%.
引用
收藏
页码:293 / 300
页数:8
相关论文
共 30 条
  • [1] Akin L. T, 2021, IMAGE VIDEO PROCESSI, V15, P885
  • [2] An improved faster-RCNN model for handwritten character recognition
    Albahli, Saleh
    Nawaz, Marriam
    Javed, Ali
    Irtaza, Aun
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2021, 46 (09) : 8509 - 8523
  • [3] Ali Asghar, 2019, 2019 IEEE 4th International Conference on Image, Vision and Computing (ICIVC), P321, DOI 10.1109/ICIVC47709.2019.8980992
  • [4] Urdu-Text Detection and Recognition in Natural Scene Images Using Deep Learning
    Arafat, Syed Yasser
    Iqbal, Muhammad Javed
    [J]. IEEE ACCESS, 2020, 8 : 96787 - 96803
  • [5] Attention-Based CNN-RNN Arabic Text Recognition from Natural Scene Images
    Butt, Hanan
    Raza, Muhammad Raheel
    Ramzan, Muhammad Javed
    Ali, Muhammad Junaid
    Haris, Muhammad
    [J]. FORECASTING, 2021, 3 (03): : 520 - 540
  • [6] Text Recognition in the Wild: A Survey
    Chen, Xiaoxue
    Jin, Lianwen
    Zhu, Yuanzhi
    Luo, Canjie
    Wang, Tianwei
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (02)
  • [7] Natural Scene Text Detection and Segmentation Using Phase-Based Regions and Character Retrieval
    Diaz-Escobar, Julia
    Kober, Vitaly
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [8] A pooling based scene text proposal technique for scene text reading in the wild
    Dinh NguyenVan
    Lu, Shijian
    Tian, Shangxuan
    Ouarti, Nizar
    Mokhtari, Mounir
    [J]. PATTERN RECOGNITION, 2019, 87 : 118 - 129
  • [9] Gupta Neeraj, 2020, 2020 International Conference on Contemporary Computing and Applications (IC3A), P150, DOI 10.1109/IC3A48958.2020.233287
  • [10] Jiang F., 2017, ARXIV170805133