SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

Cited by: 1

Authors:

Ty, Mark Vincent [1]

Atienza, Rowel [1, 2]

Affiliations:

[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines

[2] Univ Philippines, AI Grad Program, Quezon City, Philippines

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023

Keywords:

Computer Vision; Scene Text Recognition; Explainable AI

DOI:

10.1109/ICIP49359.2023.10222406

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Explainable AI (XAI) is the study of how humans can understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) explainability: using XAI to understand the cause of an STR model's prediction. The recent XAI literature on STR provides only a simple analysis and does not fully explore other XAI methods. In this study, we work specifically on data explainability frameworks, called attribution-based methods, that explain which parts of the input data are important to a deep learning model. However, integrating them into STR produces inconsistent and ineffective explanations, because they explain the model only in the global context. To solve this problem, we propose a new method, STRExp, which takes into consideration the local explanations, i.e., the explanations of the individual character predictions. STRExp is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.
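
The distinction the abstract draws between a global explanation (one attribution map for the whole predicted word) and local explanations (one map per predicted character) can be illustrated with a minimal sketch. This is not the authors' STRExp method. It assumes a toy linear recognizer and uses gradient-times-input as the attribution rule, and every name in it (`predict`, `local_attributions`, `global_attribution`, `make_demo`) is hypothetical.

```python
import numpy as np

def predict(weights, image):
    """Toy linear recognizer (an assumption, not a real STR model).

    weights has shape (T, C, H*W); the result is logits[t, c] for
    character position t and character class c.
    """
    return weights @ image.ravel()

def local_attributions(weights, image):
    """One gradient-times-input map per predicted character."""
    logits = predict(weights, image)
    predicted = logits.argmax(axis=1)  # predicted class at each position
    flat = image.ravel()
    # For a linear model, d logits[t, c] / d input is simply weights[t, c].
    maps = [weights[t, c] * flat for t, c in enumerate(predicted)]
    return np.stack(maps).reshape(len(predicted), *image.shape)

def global_attribution(weights, image):
    """A single map for the whole word: attribution of the summed predicted logits."""
    return local_attributions(weights, image).sum(axis=0)

def make_demo(seed=0):
    """Hypothetical data: 2 characters, 3 classes, a 2x4 'image'.

    Character 0 depends only on the left half of the image and
    character 1 only on the right half.
    """
    rng = np.random.default_rng(seed)
    weights = rng.normal(size=(2, 3, 2, 4))
    weights[0, :, :, 2:] = 0.0  # position 0 ignores the right half
    weights[1, :, :, :2] = 0.0  # position 1 ignores the left half
    image = rng.random((2, 4)) + 0.1
    return weights.reshape(2, 3, 8), image

weights, image = make_demo()
per_char = local_attributions(weights, image)   # shape (2, 2, 4)
whole_word = global_attribution(weights, image)  # shape (2, 4)
```

In this toy setup each per-character map is nonzero only over the image half that its character depends on, while the single whole-word map merges both halves and no longer shows which region drove which character.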


Pages: 645-649

Page count: 5


