REO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning

被引:0
|
作者
Jiang, Ming [1 ]
Hu, Junjie [2 ]
Huang, Qiuyuan [3 ]
Zhang, Lei [3 ]
Diesner, Jana [1 ]
Gao, Jianfeng [3 ]
机构
[1] Univ Lllinois Urbana Champaign, Champaign, IL 61820 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Microsoft Res, Redmond, WA USA
关键词
GENERATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Popular metrics used for evaluating image captioning systems, such as BLEU and CIDEr, provide a single score to gauge the system's overall effectiveness. This score is often not informative enough to indicate what specific errors are made by a given system. In this study, we present a fine-grained evaluation method REO for automatically measuring the performance of image captioning systems. REO assesses the quality of captions from three perspectives: 1) Relevance to the ground truth, 2) Extraness of the content that is irrelevant to the ground truth, and 3) Omission of the elements in the images and human references. Experiments on three benchmark datasets demonstrate that our method achieves a higher consistency with human judgments and provides more intuitive evaluation results than alternative metrics.(1)
引用
收藏
页码:1475 / 1480
页数:6
相关论文
共 50 条
  • [21] Attention-Guided Hierarchical Parsing for Fine-Grained Person-Centric Image Captioning
    Gu, Zhengcheng
    Jin, Jing
    IEEE ACCESS, 2024, 12 : 86293 - 86301
  • [22] Attribute-Driven Filtering: A new attributes predicting approach for fine-grained image captioning
    Hossen, Md. Bipul
    Ye, Zhongfu
    Abdussalam, Amr
    Ul Hassan, Shabih
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 137
  • [23] Fine-Grained Length Controllable Video Captioning With Ordinal Embeddings
    Nitta, Tomoya
    Fukuzawa, Takumi
    Tamaki, Toru
    IEEE ACCESS, 2024, 12 : 189667 - 189688
  • [24] A Survey of Fine-Grained Image Categorization
    Zheng, Min
    Li, Qingyong
    Geng, Yangli-ao
    Yu, Haomin
    Wang, Jianzhu
    Gan, Jinrui
    Xue, Wenyuan
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 533 - 538
  • [25] Lifelong Fine-Grained Image Retrieval
    Chen, Wei
    Xu, Haoyang
    Pu, Nan
    Liu, Yu
    Lao, Mingrui
    Wang, Weiping
    Liu, Li
    Lew, Michael S.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 7533 - 7544
  • [26] IGINet: integrating geometric information to enhance inter-modal interaction for fine-grained image captioning
    Hossain, Md. Shamim
    Aktar, Shamima
    Liu, Weiyong
    Gu, Naijie
    Huang, Zhangjin
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [27] SAM-GUIDED ENHANCED FINE-GRAINED ENCODING WITH MIXED SEMANTIC LEARNING FOR MEDICAL IMAGE CAPTIONING
    Zhang, Zhenyu
    Wang, Benlu
    Liang, Weijie
    Li, Yizhi
    Guo, Xuechen
    Wang, Guanhong
    Li, Shiyan
    Wang, Gaoang
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 1731 - 1735
  • [28] END-TO-END SPATIALLY-CONSTRAINED MULTI-PERSPECTIVE FINE-GRAINED IMAGE CAPTIONING
    Zhang, Yifan
    Lin, Chunzhen
    Cao, Donglin
    Lin, Dazhen
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3360 - 3364
  • [29] A Fine-Grained Spatial-Temporal Attention Model for Video Captioning
    Liu, An-An
    Qiu, Yurui
    Wong, Yongkang
    Su, Yu-Ting
    Kankanhalli, Mohan
    IEEE ACCESS, 2018, 6 : 68463 - 68471
  • [30] EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
    Shi, Yaya
    Yang, Xu
    Xu, Haiyang
    Yuan, Chunfeng
    Li, Bing
    Hu, Weiming
    Zha, Zheng-Jun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17908 - 17917