Flexible Visual Grounding

Cited by: 0
Authors
Kim, Yongmin [1]
Chu, Chenhui [1]
Kurohashi, Sadao [1]
Affiliations
[1] Kyoto Univ, Kyoto, Japan
Source
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): STUDENT RESEARCH WORKSHOP | 2022
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing visual grounding datasets are artificially constructed so that every query about an entity can be grounded to a corresponding image region, i.e., is answerable. However, in real-world multimedia data such as news articles and social media, many entities in the text cannot be grounded to the image, i.e., are unanswerable, because the text does not necessarily describe the accompanying image directly. A robust visual grounding model should handle both answerable and unanswerable queries flexibly. To study this flexible visual grounding problem, we construct a pseudo dataset and a social media dataset that include both answerable and unanswerable queries. To handle unanswerable visual grounding, we propose a novel method that adds a pseudo image region corresponding to a query that cannot be grounded. The model is then trained to ground to ground-truth regions for answerable queries and to pseudo regions for unanswerable queries. In our experiments, we show that our model can flexibly process both answerable and unanswerable queries with high accuracy on our datasets.
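The pseudo-region idea described in the abstract can be illustrated with a minimal sketch. The abstract does not say how region scores are computed or which loss is used, so everything here is an assumption made for illustration: the function names `grounding_loss` and `predict` are hypothetical, the scores are plain query-region similarity values, and a softmax cross-entropy objective stands in for whatever the authors actually train with.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def grounding_loss(region_scores, pseudo_score, target_region=None):
    """Cross-entropy over the real regions plus one appended pseudo region.

    target_region is the index of the ground-truth region for an answerable
    query, or None for an unanswerable query (assumed convention), in which
    case the pseudo region becomes the training target.
    """
    scores = list(region_scores) + [pseudo_score]
    pseudo_index = len(scores) - 1
    target = pseudo_index if target_region is None else target_region
    return -math.log(softmax(scores)[target])


def predict(region_scores, pseudo_score):
    """Return the best region index, or None if the pseudo region wins."""
    scores = list(region_scores) + [pseudo_score]
    best = max(range(len(scores)), key=scores.__getitem__)
    return None if best == len(scores) - 1 else best


if __name__ == "__main__":
    # Answerable query: a real region has the highest score.
    print(predict([0.2, 2.5, 0.1], pseudo_score=0.3))
    # Unanswerable query: the pseudo region has the highest score.
    print(predict([0.2, 0.4, 0.1], pseudo_score=1.7))
```

Under these assumptions, a query is treated as unanswerable exactly when the appended pseudo region outscores every real region, so a single classifier covers both cases without a separate answerability threshold.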
Pages: 285-299
Page count: 15