Met-MLTS: Leveraging Smartphones for End-to-End Spotting of Multilingual Oriented Scene Texts and Traffic Signs in Adverse Meteorological Conditions

被引:8
作者
Bagi, Randheer [1 ]
Dutta, Tanima [1 ]
Nigam, Nitika [1 ]
Verma, Deepali [1 ]
Gupta, Hari Prabhat [1 ]
机构
[1] Banaras Hindu Univ, Dept Comp Sci & Engn, Indian Inst Technol, Varanasi 221005, Uttar Pradesh, India
关键词
Text recognition; Image edge detection; Semantics; Feature extraction; Vehicles; Convolution; Roads; Text spotting; scene text detection; deep learning; text recognition; noisy images;
D O I
10.1109/TITS.2021.3117793
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Intelligent systems, like driver assistance systems, remain within vehicles and help drivers by providing essential information about traffic, blockage of roads, and possible routes for safe driving. The objective of scene text spotting in a driver assistance system is to localize and recognize scene texts, signs of milestones, traffic panels, and road marks in natural scene images. However, text edges get fainted due to adverse weather conditions, like fog, rain, smog, or poor contrast. This makes the task of spotting more challenging. In this paper, we propose an end-to-end trainable deep neural network, known as Met-MLTS, that can address the issue of spotting multi-oriented text instances in scene images captured in adverse meteorological conditions. It localizes words, predicts script class, and performs word spotting for every rotated bounding box. It is a fast multilingual scene text spotter that utilizes hierarchical spatial context, channel-wise inter-dependencies, and semantic edge supervision to localize and recognize words and predict script class in scene images using smartphones. We explore inter-class interference to reduce the misclassification problem. A light-weight recognition module for multilingual character segmentation, word-level recognition, and script identification is incorporated. We demonstrate the efficacy of our spotting network on resource-constraint devices.
引用
收藏
页码:12801 / 12810
页数:10
相关论文
共 51 条
[1]  
[Anonymous], P ASS ADV ART INT
[2]  
Baek, 2020, P IEEE CVF C COMP VI, P564
[3]   Cost-Effective and Smart Text Sensing and Spotting in Blurry Scene Images Using Deep Networks [J].
Bagi, Randheer ;
Dutta, Tanima .
IEEE SENSORS JOURNAL, 2021, 21 (22) :25307-25314
[4]   Cluttered TextSpotter: An End-to-End Trainable Light-Weight Scene Text Spotter for Cluttered Environment [J].
Bagi, Randheer ;
Dutta, Tanima ;
Gupta, Hari Prabhat .
IEEE ACCESS, 2020, 8 :111433-111447
[5]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[6]   E2E-MLT - An Unconstrained End-to-End Method for Multi-language Scene Text [J].
Busta, Michal ;
Patel, Yash ;
Matas, Jiri .
COMPUTER VISION - ACCV 2018 WORKSHOPS, 2019, 11367 :127-143
[7]   Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework [J].
Busta, Michal ;
Neumann, Lukas ;
Matas, Jiri .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2223-2231
[8]   Total-Text: A Comprehensive Dataset for Scene Text Detection and Recognition [J].
Ch'ng, Chee Kheng ;
Chan, Chee Seng .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :935-942
[9]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848