Improved Ring Radius Transform-Based Reconstruction for Video Character Recognition

被引:3
|
作者
Huang, Zhiheng [1 ]
Shivakumara, Palaiahnakote [2 ]
Lu, Tong [1 ]
Pal, Umapada [3 ]
Blumenstein, Michael [4 ]
Chetty, Bhaarat [5 ,6 ]
Kumar, G. Hemantha [7 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[3] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata, India
[4] Univ Technol Sydney, Australian Artificial Intelligence Inst, Sydney, NSW, Australia
[5] Google Developers Grp, Bangalore, Karnataka, India
[6] NASDAQ, Bangalore, Karnataka, India
[7] Univ Mysore, Dept Studies Comp Sci, Mysore, Karnataka, India
关键词
Video character recognition; reconstruction; ring radius transform;
D O I
10.1142/S0218001421500233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Character shape reconstruction in video is challenging due to low contrast, complex backgrounds and arbitrary orientation of characters. This work proposes an Improved Ring Radius Transform (IRRT) for reconstructing impaired characters through medial axis prediction. At first, the technique proposes a novel idea based on the Tangent Vector (TV) concept that identifies each actual pair of end pixels caused by gaps in impaired character components. Next, the actual direction to predict medial axis pixels using IRRT for each pair of end pixels is proposed with a new normal vector concept. The process of prediction repeats iteratively to find all the medial axis pixels for every gap in question. Further, medial axis pixels with their radii are used to reconstruct the shapes of impaired characters. The proposed technique is tested on benchmark datasets consisting of video, natural scenes, objects and multi-lingual data to demonstrate that it reconstructs shapes well, even for heterogeneous data. Comparative studies with different binarization and character recognition methods show that the proposed technique is effective, useful and outperforms existing methods.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] A novel ring radius transform for video character reconstruction
    Shivakumara, Palaiahnakote
    Trung Quy Phan
    Bhowmick, Souvik
    Tan, Chew Lim
    Pal, Umapada
    PATTERN RECOGNITION, 2013, 46 (01) : 131 - 140
  • [2] A new ring radius transform-based thinning method for multi-oriented video characters
    Yirui Wu
    Palaiahnakote Shivakumara
    Wang Wei
    Tong Lu
    Umapada Pal
    International Journal on Document Analysis and Recognition (IJDAR), 2015, 18 : 137 - 151
  • [3] A new ring radius transform-based thinning method for multi-oriented video characters
    Wu, Yirui
    Shivakumara, Palaiahnakote
    Wei, Wang
    Lu, Tong
    Pal, Umapada
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (02) : 137 - 151
  • [4] Complex Wavelet Transform-based Approach for Human Action Recognition in Video
    Khare, Manish
    Gwak, Jeonghwan
    Jeon, Moongu
    2017 INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2017, : 157 - 162
  • [5] An Improved Hough Transform-Based Method for Transformer Blower Target Recognition
    Sun Fengjie
    Liao Huifen
    Fan Jieqing
    PROCEEDINGS OF 2009 CONFERENCE ON COMMUNICATION FACULTY, 2009, : 398 - +
  • [6] Complex Wavelet Transform-Based Face Recognition
    Eleyan, Alaa
    Ozkaramanli, Huseyin
    Demirel, Hasan
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2008, 2008 (1)
  • [7] Complex Wavelet Transform-Based Face Recognition
    Alaa Eleyan
    Hüseyin Özkaramanli
    Hasan Demirel
    EURASIP Journal on Advances in Signal Processing, 2008
  • [8] Transform-based Arabic sign language recognition
    Sidig, Ala Addin I.
    Luqman, Hamzah
    Mahmoud, Sabri A.
    ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 2 - 9
  • [9] Adaptive lapped transform-based image and video coding
    Klausutis, TJ
    Madisetti, VK
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 117 - 128
  • [10] Performance Measurement for a Wavelet Transform-based Video Compression
    Dhungel, Abinashi
    Weeks, Michael
    PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 216 - 220