Scene Text Script Identification with Convolutional Recurrent Neural Networks

被引:0
|
作者
Mei, Jieru [1 ]
Dai, Luo [2 ]
Shi, Baoguang [2 ]
Bai, Xiang [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Automat, Wuhan 430074, Hubei, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Hubei, Peoples R China
来源
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2016年
基金
中国国家自然科学基金;
关键词
FEATURES;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Script identification for scene text images is a challenging task. This paper describes a novel deep neural network structure that efficiently identifies scripts of images. In our design, we exploit two important factors, namely the image representation, and the spatial dependencies within text lines. To this end, we bring together a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) into one end-to-end trainable network. The former generates rich image representations, while the latter effectively analyzes long-term spatial dependencies. Besides, on top of the structure, we adopt an average pooling structure in order to deal with input images of arbitrary sizes. Experiments on several datasets, including SIW-13 and CVSI2015, demonstrate that our approach achieves superior performance, compared with previous approaches.
引用
收藏
页码:4053 / 4058
页数:6
相关论文
共 50 条
  • [31] Automated identification of diverse Neotropical pollen samples using convolutional neural networks
    Punyasena, Surangi W.
    Haselhorst, Derek S.
    Kong, Shu
    Fowlkes, Charless C.
    Moreno, J. Enrique
    METHODS IN ECOLOGY AND EVOLUTION, 2022, 13 (09): : 2049 - 2064
  • [32] Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery
    Hu, Fan
    Xia, Gui-Song
    Hu, Jingwen
    Zhang, Liangpei
    REMOTE SENSING, 2015, 7 (11) : 14680 - 14707
  • [33] Integration Convolutional Neural Network for Person Re-Identification in Camera Networks
    Zhang, Zhong
    Si, Tongzhen
    Liu, Shuang
    IEEE ACCESS, 2018, 6 : 36887 - 36896
  • [34] Open writer identification from handwritten text fragments using lite convolutional neural network
    Briber, Amina
    Chibani, Youcef
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2024, 27 (04) : 529 - 551
  • [35] Robust Approach Based on Convolutional Neural Networks for Identification of Focal EEG Signals
    Bajaj, Varun
    Taran, Sachin
    Tanyildizi, Erkan
    Sengur, Abdulkadir
    IEEE SENSORS LETTERS, 2019, 3 (05)
  • [36] An automatic approach for heart failure typing based on heart sounds and convolutional recurrent neural networks
    Wang, Hui
    Guo, Xingming
    Zheng, Yineng
    Yang, Yang
    PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2022, 45 (02) : 475 - 485
  • [37] MRI to MGMT: predicting methylation status in glioblastoma patients using convolutional recurrent neural networks
    Han, Lichy
    Kamdar, Maulik R.
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018 (PSB), 2018, : 331 - 342
  • [38] Focal Cosine Metric and Adaptive Attention Module for Remote Sensing Scene Classification With Siamese Convolutional Neural Networks
    Min, Lei
    Gao, Kun
    Wang, Hong
    Liu, Yutong
    Zhang, Zhenzhou
    Hu, Zibo
    Zhang, Xiaodian
    IEEE ACCESS, 2022, 10 : 84212 - 84226
  • [39] Construction activity recognition with convolutional recurrent networks
    Slaton, Trevor
    Hernandez, Carlos
    Akhavian, Reza
    AUTOMATION IN CONSTRUCTION, 2020, 113
  • [40] Recent advances in convolutional neural networks
    Gu, Jiuxiang
    Wang, Zhenhua
    Kuen, Jason
    Ma, Lianyang
    Shahroudy, Amir
    Shuai, Bing
    Liu, Ting
    Wang, Xingxing
    Wang, Gang
    Cai, Jianfei
    Chen, Tsuhan
    PATTERN RECOGNITION, 2018, 77 : 354 - 377