Word Level Script Identification Using Convolutional Neural Network Enhancement for Scenic Images

被引:11
|
作者
Mahajan, Shilpa [1 ]
Rani, Rajneesh [1 ]
机构
[1] Natl Inst Technol, Jalandhar 144011, Punjab, India
关键词
Natural scene images; script identification; convolutional neural network; transfer learning; benchmarked datasets;
D O I
10.1145/3506699
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Script identification from complex and colorful images is an integral part of the text recognition and classification system. Such images may contain twofold challenges: (1) Challenges related to the camera like blurring effect, non-uniform illumination and noisy background, and so on, and (2) Challenges related to the text shape, orientation, and text size. The present work in this area is much focused on non-Indian scripts. In contrast, Gurumukhi, Hindi, and English scripts play a vital role in communication among Indians and foreigners. In this article, we focus on the above said challenges in the field of identifying the script. Additionally, we have introduced a new dataset that contains Hindi, Gurumukhi, and English scripts from scenic images collected from different sources. We also proposed a CNN-based model, which is capable of distinguishing between the scripts with good accuracy. Performance of the method has been evaluated for own dataset, i.e., NITJDATASET and other benchmarked datasets available for Indian scripts, i.e., CVSI-2015 (Task-1 and Task 4) and ILST. This work is an extension to find the script from strict text background.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] Neuropsychiatric Disorders Identification Using Convolutional Neural Network
    Lin, Chih-Wei
    Ding, Qilu
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 315 - 327
  • [32] Retinal biometric identification using convolutional neural network
    Rodiah
    Madenda, Sarifuddin
    Susetianingtias, Diana Tri
    Fitrianingsih
    Adlina, Dea
    Arianty, Rini
    COMPUTER OPTICS, 2021, 45 (06) : 865 - 872
  • [33] Identification of Functional piRNAs Using a Convolutional Neural Network
    Ali, Syed Danish
    Alam, Waleed
    Tayara, Hilal
    Chong, Kil To
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1661 - 1669
  • [34] The Enhancement of WiFi Fingerprint Positioning Using Convolutional Neural Network
    Zhang, Ting
    Man, Yi
    2018 INTERNATIONAL CONFERENCE ON COMPUTER, COMMUNICATION AND NETWORK TECHNOLOGY (CCNT 2018), 2018, 291 : 479 - 483
  • [35] Speech Enhancement using Convolutional Neural Network with Skip Connections
    Shi, Yupeng
    Rong, Weicong
    Zheng, Nengheng
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 6 - 10
  • [36] Image Enhancement and Exposure Correction Using Convolutional Neural Network
    Parab M.
    Bhanushali A.
    Ingle P.
    Pavan Kumar B.N.
    SN Computer Science, 4 (2)
  • [37] Single channel speech enhancement using convolutional neural network
    Kounovsky, Tomas
    Malek, Jiri
    2017 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2017,
  • [38] Blur Detection in Identity Images Using Convolutional Neural Network
    Khajuria, Karan
    Mehrotra, Kapil
    Gupta, Manish Kumar
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 332 - 337
  • [39] Feature Extraction for Histopathological Images Using Convolutional Neural Network
    Hatipoglu, Nuh
    Bilgin, Gokhan
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 645 - 648
  • [40] Characterizing soot in TEM images using a convolutional neural network
    Sipkens, Timothy A.
    Frei, Max
    Baldelli, Alberto
    Kirchen, Patrick
    Kruis, Frank E.
    Rogak, Steven N.
    POWDER TECHNOLOGY, 2021, 387 : 313 - 324