Multi-task Learning for License Plate Recognition in Unconstrained Scenarios

Authors
Mo, Zhen-Lun [1]
Chen, Song-Lu [1]
Liu, Qi [1]
Chen, Feng [2]
Yin, Xu-Cheng [1]
Affiliations
[1] Univ Sci & Technol Beijing, Beijing, Peoples R China
[2] EEasy Technol Co Ltd, Zhuhai, Peoples R China
Source
DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT I | 2024 / Vol. 14804
Keywords
License plate recognition; Multi-task; Multi-directional; Multi-line; End-to-end; NETWORK;
DOI
10.1007/978-3-031-70533-5_3
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The recognition of license plates in natural scenes often faces challenges such as multi-directional and multi-line variations. Additionally, previous studies have treated license plate detection and recognition as separate tasks, resulting in inefficiencies and error accumulation. To address these challenges, we propose an end-to-end method for license plate detection and recognition using multi-task learning. First, we introduce two parallel branches that detect the horizontal bounding box and the four corners of the license plate, enabling multi-directional license plate detection in a multi-task manner. The outputs of these branches are combined to enhance recognition accuracy. Second, we propose extracting global features to perceive the character layout and using the reading order to spatially attend to characters, so that multi-line license plates can be recognized. Finally, we combine detection and recognition on a shared backbone, with the detection branch based on multiple deep layers and the recognition branch based on multiple shallow layers, thereby constructing an end-to-end detection and recognition network. Comparative experiments on the CCPD and RodoSol datasets validate that our method significantly outperforms state-of-the-art methods, particularly in scenarios involving multi-directional and multi-line license plates.
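The abstract pairs a horizontal bounding box with four corner points for multi-directional plates. The sketch below is not the authors' network; it is only a small geometric illustration, under assumed toy coordinates, of why a horizontal box alone is a loose fit for a tilted plate. The function names (`enclosing_box`, `quad_area`, `fill_ratio`) and the sample corner lists are hypothetical and do not come from the paper.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def enclosing_box(corners: List[Point]) -> Tuple[float, float, float, float]:
    """Axis-aligned (horizontal) box that encloses the four corners."""
    xs = [x for x, _ in corners]
    ys = [y for _, y in corners]
    return min(xs), min(ys), max(xs), max(ys)


def quad_area(corners: List[Point]) -> float:
    """Area of the quadrilateral via the shoelace formula.

    Corners must be listed in order around the plate.
    """
    total = 0.0
    n = len(corners)
    for i in range(n):
        x1, y1 = corners[i]
        x2, y2 = corners[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def fill_ratio(corners: List[Point]) -> float:
    """Fraction of the horizontal box actually covered by the plate."""
    x0, y0, x1, y1 = enclosing_box(corners)
    return quad_area(corners) / ((x1 - x0) * (y1 - y0))


# Assumed toy plates: one upright, one sheared as if seen from the side.
upright = [(0.0, 0.0), (4.0, 0.0), (4.0, 1.0), (0.0, 1.0)]
tilted = [(0.0, 1.0), (4.0, 0.0), (4.0, 1.0), (0.0, 2.0)]

print(fill_ratio(upright))  # 1.0
print(fill_ratio(tilted))   # 0.5
```

In this toy case the tilted plate covers only half of its horizontal box, so a recognizer fed the box crop would see as much background as plate; corner points describe the plate region more tightly.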
Pages: 34-50
Page count: 17