Multi-task Learning for License Plate Recognition in Unconstrained Scenarios

被引：0

作者：

Mo, Zhen-Lun ^{[1
]}

Chen, Song-Lu ^{[1
]}

Liu, Qi ^{[1
]}

Chen, Feng ^{[2
]}

Yin, Xu-Cheng ^{[1
]}

机构：

[1] Univ Sci & Technol Beijing, Beijing, Peoples R China

[2] EEasy Technol Co Ltd, Zhuhai, Peoples R China

来源：

DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT I | 2024年 / 14804卷

关键词：

License plate recognition; Multi-task; Multi-directional; Multi-line; End-to-end; NETWORK;

D O I：

10.1007/978-3-031-70533-5_3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The recognition of license plates in natural scenes often face challenges such as multi-directional and multi-line variations. Additionally, previous studies have treated license plate detection and recognition as separate tasks, resulting in inefficiencies and error accumulation. To address these challenges, we propose an end-to-end method for license plate detection and recognition using multi-task learning. Firstly, we introduce two parallel branches to detect the horizontal bounding box and the four corners of the license plate, enabling multi-directional license plate detection in a multi-task manner. The outputs from these branches are combined to enhance recognition accuracy. Secondly, we propose to extract global features to perceive character layout and utilize reading order to spatially attend to characters for recognizing multi-line license plates. Finally, we combine detection and recognition using the same backbone, with the detection branch based on multiple deep layers and the recognition branch based on multiple shallow layers, thereby constructing an end-to-end detection and recognition network. Comparative experiments on CCPD and RodoSol datasets validate that our method significantly outperforms state-of-the-art methods, particularly in scenarios involving multi-directional and multi-line license plates.

引用

页码：34 / 50

页数：17

共 50 条

[41] Multi-view representation learning in multi-task scene
Run-kun Lu
Jian-wei Liu
Si-ming Lian
Xin Zuo
Neural Computing and Applications, 2020, 32 : 10403 - 10422
[42] Multi-view representation learning in multi-task scene
Lu, Run-kun
Liu, Jian-wei
Lian, Si-ming
Zuo, Xin
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (14) : 10403 - 10422
[43] Parallel processing of FBG spectral distortion recognition and temperature demodulation based on multi-task learning
Tang, Rui
Jiang, Hong
Cao, Zepu
OPTICS COMMUNICATIONS, 2025, 578
[44] Multimodal Sentiment Analysis With Two-Phase Multi-Task Learning
Yang, Bo
Wu, Lijun
Zhu, Jinhua
Shao, Bo
Lin, Xiaola
Liu, Tie-Yan
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2015 - 2024
[45] License plate recognition technology in the Chinese license plates
Su, Tong
Liu, Shan
ENERGY SCIENCE AND APPLIED TECHNOLOGY (ESAT 2016), 2016, : 705 - 708
[46] Using Synthetic Images for Deep Learning Recognition Process on Automatic License Plate Recognition
Barreto, Saulo Cardoso
Lambert, Jorge Albuquerque
Vidal, Flavio de Barros
PATTERN RECOGNITION, MCPR 2019, 2019, 11524 : 115 - 126
[47] Improving sentiment analysis with multi-task learning of negation
Barnes, Jeremy
Velldal, Erik
Ovrelid, Lilja
NATURAL LANGUAGE ENGINEERING, 2021, 27 (02) : 249 - 269
[48] Multi-task Learning for Stance and Early Rumor Detection
Chen, Yongheng
Yin, Chunyan
Zuo, Wanli
OPTICAL MEMORY AND NEURAL NETWORKS, 2021, 30 (02) : 131 - 139
[49] Multi-task Representation Learning for Travel Time Estimation
Li, Yaguang
Fu, Kun
Wang, Zheng
Shahabi, Cyrus
Ye, Jieping
Liu, Yan
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1695 - 1704
[50] Multi-task Multimodal Learning for Disaster Situation Assessment
Wang, Tianyi
Tao, Yudong
Chen, Shu-Ching
Shyu, Mei-Ling
THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 209 - 212

← 1 2 3 4 5 →