SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES

被引:1
|
作者
Ty, Mark Vincent [1 ]
Atienza, Rowel [1 ,2 ]
机构
[1] Univ Philippines, Elect & Elect Engn Inst, Quezon City, Philippines
[2] Univ Philippines, AI Grad Program, Quezon City, Philippines
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
Computer Vision; Scene Text Recognition; Explainable AI;
D O I
10.1109/ICIP49359.2023.10222406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explains the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.
引用
收藏
页码:645 / 649
页数:5
相关论文
共 50 条
  • [31] Adaptive Adversarial Attack on Scene Text Recognition
    Yuan, Xiaoyong
    He, Pan
    Li, Xiaolin
    Wu, Dapeng
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 358 - 363
  • [32] Triggered Attention Model for Scene Text Recognition
    Zhang, Churong
    Ming, Yue
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [33] Scene Text Recognition with Cascade Attention Network
    Zhang, Min
    Ma, Meng
    Wang, Ping
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 385 - 393
  • [34] A Feature Learning Method for Scene Text Recognition
    Ho Vu Duong
    Quoc Ngoc Ly
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2012, : 176 - 180
  • [35] Representative Batch Normalization for Scene Text Recognition
    Sun, Yajie
    Cao, Xiaoling
    Sun, Yingying
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (07): : 2390 - 2406
  • [36] Scene Text Recognition with Multi-decoders
    Wang, Yao
    Ha, Jong-Eun
    2021 21ST INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2021), 2021,
  • [37] Lightweight Scene Text Recognition Based on Transformer
    Luan, Xin
    Zhang, Jinwei
    Xu, Miaomiao
    Silamu, Wushouer
    Li, Yanbing
    SENSORS, 2023, 23 (09)
  • [38] SCENE TEXT RECOGNITION WITH TEMPORAL CONVOLUTIONAL ENCODER
    Du, Xiangcheng
    Ma, Tianlong
    Zheng, Yingbin
    Ye, Hao
    Wu, Xingjiao
    He, Liang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2383 - 2387
  • [39] An extended attention mechanism for scene text recognition
    Xiao, Zheng
    Nie, Zhenyu
    Song, Chao
    Chronopoulos, Anthony Theodore
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 203
  • [40] Text recognition in scene image and video frame using Color Channel selection
    Bhunia, Ayan Kumar
    Kumar, Gautam
    Roy, Partha Pratim
    Balasubramanian, R.
    Pal, Umapada
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (07) : 8551 - 8578