SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

Cited by: 1
Authors
Ty, Mark Vincent [1]
Atienza, Rowel [1, 2]
Affiliations
[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines
[2] Univ Philippines, AI Grad Program, Quezon City, Philippines
Source
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023
Keywords
Computer Vision; Scene Text Recognition; Explainable AI
DOI
10.1109/ICIP49359.2023.10222406
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Explainable AI (XAI) is the study of how humans can understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) explainability: using XAI to understand the cause of an STR model's prediction. Recent XAI literature on STR provides only a simple analysis and does not fully explore other XAI methods. In this study, we focus on data explainability frameworks called attribution-based methods, which explain the important parts of the input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations because they explain the model only in the global context. To solve this problem, we propose a new method, STRExp, which takes into consideration the local explanations, i.e., the explanations of the individual character predictions. STRExp is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.
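The distinction the abstract draws between a global explanation and per-character (local) explanations can be illustrated with a small sketch. This is not the STRExp method, whose details the abstract does not give. It is a toy example under stated assumptions: `toy_str_scores` is a hypothetical linear stand-in for an STR model, each character is assumed to depend only on its own 4-column slice of the image, and simple occlusion is used as the attribution method.

```python
import numpy as np

def toy_str_scores(image, weights):
    """Hypothetical stand-in for an STR model: one score per character position."""
    # weights has shape (num_chars, H, W); the result has shape (num_chars,)
    return np.tensordot(weights, image, axes=([1, 2], [0, 1]))

def occlusion_attribution(image, score_fn, patch=2):
    """Occlusion attribution: the drop in a scalar score when each patch is zeroed."""
    base = score_fn(image)
    heat = np.zeros_like(image, dtype=float)
    h, w = image.shape
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            occluded = image.copy()
            occluded[top:top + patch, left:left + patch] = 0.0
            heat[top:top + patch, left:left + patch] = base - score_fn(occluded)
    return heat

rng = np.random.default_rng(0)
num_chars, h, w = 3, 4, 12
image = rng.random((h, w))

# Assumption for illustration: character k depends only on columns 4k..4k+3.
weights = np.zeros((num_chars, h, w))
for k in range(num_chars):
    weights[k, :, 4 * k:4 * (k + 1)] = 1.0

# Global explanation: attribute the summed score of the whole predicted string.
global_map = occlusion_attribution(
    image, lambda x: toy_str_scores(x, weights).sum()
)

# Local explanations: attribute each character's score separately.
local_maps = [
    occlusion_attribution(image, lambda x, k=k: toy_str_scores(x, weights)[k])
    for k in range(num_chars)
]
```

In this toy setting the global map mixes the evidence for all characters into a single heatmap, while each local map is nonzero only over the image region that drives its own character. A real STR model is nonlinear, so the local maps would generally not sum to the global one as they do here.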
Pages: 645-649
Page count: 5