Multi-task Learning for License Plate Recognition in Unconstrained Scenarios

Citations: 0
Authors
Mo, Zhen-Lun [1]
Chen, Song-Lu [1]
Liu, Qi [1]
Chen, Feng [2]
Yin, Xu-Cheng [1]
Affiliations
[1] Univ Sci & Technol Beijing, Beijing, Peoples R China
[2] EEasy Technol Co Ltd, Zhuhai, Peoples R China
Source
DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2024, PT I | 2024 / Vol. 14804
Keywords
License plate recognition; Multi-task; Multi-directional; Multi-line; End-to-end; NETWORK;
DOI
10.1007/978-3-031-70533-5_3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The recognition of license plates in natural scenes often faces challenges such as multi-directional and multi-line variations. Additionally, previous studies have treated license plate detection and recognition as separate tasks, resulting in inefficiencies and error accumulation. To address these challenges, we propose an end-to-end method for license plate detection and recognition using multi-task learning. First, we introduce two parallel branches to detect the horizontal bounding box and the four corners of the license plate, enabling multi-directional license plate detection in a multi-task manner. The outputs from these branches are combined to enhance recognition accuracy. Second, we extract global features to perceive character layout and use reading order to spatially attend to characters, which allows multi-line license plates to be recognized. Finally, we combine detection and recognition on a shared backbone, with the detection branch based on multiple deep layers and the recognition branch based on multiple shallow layers, thereby constructing an end-to-end detection and recognition network. Comparative experiments on the CCPD and RodoSol datasets validate that our method significantly outperforms state-of-the-art methods, particularly in scenarios involving multi-directional and multi-line license plates.
Pages: 34-50
Page count: 17