Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

Cited by: 0
Authors
Amizadeh, Saeed [1]
Palangi, Hamid [2]
Polozov, Oleksandr [2]
Huang, Yichen [2]
Koishida, Kazuhito [1]
Affiliations
[1] Microsoft Appl Sci Grp ASG, Redmond, WA 98052 USA
[2] Microsoft Res AI, Redmond, WA USA
Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119 | 2020 / Vol. 119
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Visual reasoning tasks such as visual question answering (VQA) require an interplay of visual perception with reasoning about the question semantics grounded in perception. However, recent advances in this area are still primarily driven by perception improvements (e.g. scene graph generation) rather than reasoning. Neuro-symbolic models such as Neural Module Networks bring the benefits of compositional reasoning to VQA, but they are still entangled with visual representation learning, and thus neural reasoning is hard to improve and assess on its own. To address this, we propose (1) a framework to isolate and evaluate the reasoning aspect of VQA separately from its perception, and (2) a novel top-down calibration technique that allows the model to answer reasoning questions even with imperfect perception. To this end, we introduce a differentiable first-order logic formalism for VQA that explicitly decouples question answering from visual perception. On the challenging GQA dataset, this framework is used to perform in-depth, disentangled comparisons between well-known VQA models leading to informative insights regarding the participating models as well as the task.
Pages: 12