Multi-Spectral Fusion Based Approach for Arbitrarily Oriented Scene Text Detection in Video Images

Cited by: 42
Authors
Liang, Guozhu [1]
Shivakumara, Palaiahnakote [2]
Lu, Tong [1]
Tan, Chew Lim [3]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[3] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
Funding
National Science Foundation (United States);
Keywords
Laplacian-wavelet; multi spectral fusion; maxima stable extreme regions; stroke width transform; arbitrarily oriented video text detection; EXTRACTION; GRADIENT; RECOGNITION; SCHEME;
DOI
10.1109/TIP.2015.2465169
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Scene text detection from video as well as natural scene images is challenging due to the variations in background, contrast, text type, font type, font size, and so on. Besides, arbitrary orientations of texts with multi-scripts add more complexity to the problem. The proposed approach introduces a new idea of convolving Laplacian with wavelet sub-bands at different levels in the frequency domain for enhancing low resolution text pixels. Then, the results obtained from different sub-bands (spectral) are fused for detecting candidate text pixels. We explore maxima stable extreme regions along with stroke width transform for detecting candidate text regions. Text alignment is done based on the distance between the nearest neighbor clusters of candidate text regions. In addition, the approach presents a new symmetry driven nearest neighbor for restoring full text lines. We conduct experiments on our collected video data as well as several benchmark data sets, such as ICDAR 2011, ICDAR 2013, and MSRA-TD500 to evaluate the proposed method. The proposed approach is compared with the state-of-the-art methods to show its superiority to the existing methods.
Pages: 4488-4501
Page count: 14
Related papers
50 in total
  • [41] Fusion of multi-spectral and panchromatic images by complex steerable pyramids
    College of Electrical and Information Engineering, Hunan University, Changsha, 410082, China
    Adv Model Anal B, 2006, 1-2 (13-21):
  • [42] Active Region Detection in Multi-spectral Solar Images
    Almahasneh, Majedaldein
    Paiement, Adeline
    Xie, Xianghua
    Aboudarham, Jean
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM), 2021, : 452 - 459
  • [43] A New Corner Detection Operator for Multi-Spectral Images
    El Houari, Hassan
    El Ouafdi, Ahmed Fouad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (04) : 746 - 751
  • [44] Weed detection in multi-spectral images of cotton fields
    Alchanatis, V
    Ridel, L
    Hetzroni, A
    Yaroslavsky, L
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2005, 47 (03) : 243 - 260
  • [45] A Robust Approach for Scene Text Detection and Tracking in Video
    Wang, Yang
    Wang, Lan
    Su, Feng
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 303 - 314
  • [46] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [47] Comparison of data fusion methods with high preservation based on multi-spectral and panchromatic images
    Li, Wenbo
    Zhang, Qiuwen
    Hu, Guangyi
    MIPPR 2007: MULTISPECTRAL IMAGE PROCESSING, 2007, 6787
  • [48] A fusion method of panchromatic and multi-spectral remote sensing images based on wavelet transform
    Xue X.
    Xiang F.
    Wang H.
    Journal of Computational and Theoretical Nanoscience, 2016, 13 (02) : 1479 - 1485
  • [49] Research on Detection of Ship Target at Sea Based on Multi-Spectral Infrared Images
    Qiu Rong-chao
    Lou Shu-li
    Li Ting-jun
    Gong Jian
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2019, 39 (03) : 698 - 704
  • [50] A neural network based approach for multi-spectral snowfall detection and estimation
    Mejia, Yajaira
    Ghedira, Hosni
    Mahani, Shayesteh
    Khanbilvardi, Reza
    IGARSS: 2007 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-12: SENSING AND UNDERSTANDING OUR PLANET, 2007, : 2276 - 2279