CROSS-LINGUAL LEARNING IN MULTILINGUAL SCENE TEXT RECOGNITION

被引:0
作者
Baek, Jeonghun [1 ]
Matsui, Yusuke [1 ]
Aizawa, Kiyoharu [1 ]
机构
[1] Univ Tokyo, Tokyo, Japan
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年
关键词
Cross-lingual learning; transfer learning; scene text recognition; multilingual;
D O I
10.1109/ICASSP48485.2024.10445946
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we investigate cross-lingual learning (CLL) for multilingual scene text recognition (STR). CLL transfers knowledge from one language to another. We aim to find the condition that exploits knowledge from high-resource languages for improving performance in low-resource languages. To do so, we first examine if two general insights about CLL discussed in previous works are applied to multilingual STR: (1) Joint learning with high- and low-resource languages may reduce performance on low-resource languages, and (2) CLL works best between typologically similar languages. Through extensive experiments, we show that two general insights may not be applied to multilingual STR. After that, we show that the crucial condition for CLL is the dataset size of high-resource languages regardless of the kind of high-resource languages. Our code, data, and models are available at https://github.com/ku21fan/CLL-STR.
引用
收藏
页码:6770 / 6774
页数:5
相关论文
共 23 条
[1]   What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis [J].
Baek, Jeonghun ;
Kim, Geewook ;
Lee, Junyeop ;
Park, Sungrae ;
Han, Dongyoon ;
Yun, Sangdoo ;
Oh, Seong Joon ;
Lee, Hwalsuk .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4714-4722
[2]  
Baek Jeonghun, 2021, CVPR
[3]  
Bautista Darwin, 2022, ECCV
[4]  
Busta M., 2018, ACCV
[5]  
Chee Kheng Chng, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P1571, DOI 10.1109/ICDAR.2019.00252
[6]  
Conneau Alexis, 2020, P 58 ANN M ASS COMPU, P8440, DOI DOI 10.18653/V1/2020.ACL-MAIN.747
[7]  
Dosovitskiy A., 2021, INT C LEARNING REPRE
[8]  
Dryer Matthew S., 2013, WALS ONLINE V2020 3
[9]  
Du Yifan, 2022, IJCAI
[10]  
Etter David, 2023, ICDAR