Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

被引：0

作者：

Amizadeh, Saeed ^{[1
]}

Palangi, Hamid ^{[2
]}

Polozov, Oleksandr ^{[2
]}

Huang, Yichen ^{[2
]}

Koishida, Kazuhito ^{[1
]}

机构：

[1] Microsoft Appl Sci Grp ASG, Redmond, WA 98052 USA

[2] Microsoft Res AI, Redmond, WA USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119 | 2020年 / 119卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual reasoning tasks such as visual question answering (VQA) require an interplay of visual perception with reasoning about the question semantics grounded in perception. However, recent advances in this area are still primarily driven by perception improvements (e.g. scene graph generation) rather than reasoning. Neuro-symbolic models such as Neural Module Networks bring the benefits of compositional reasoning to VQA, but they are still entangled with visual representation learning, and thus neural reasoning is hard to improve and assess on its own. To address this, we propose (1) a framework to isolate and evaluate the reasoning aspect of VQA separately from its perception, and (2) a novel top-down calibration technique that allows the model to answer reasoning questions even with imperfect perception. To this end, we introduce a differentiable first-order logic formalism for VQA that explicitly decouples question answering from visual perception. On the challenging GQA dataset, this framework is used to perform in-depth, disentangled comparisons between well-known VQA models leading to informative insights regarding the participating models as well as the task.

引用

页数：12

共 50 条

[1] NeuSyRE: Neuro-symbolic visual understanding and reasoning framework based on scene graph enrichment
Khan, M. Jaleed
Breslin, John G.
Curry, Edward
SEMANTIC WEB, 2024, 15 (04) : 1389 - 1413
[2] Conversational Neuro-Symbolic Commonsense Reasoning
Arabshahi, Forough
Lee, Jennifer
Gawarecki, Mikayla
Mazaitis, Kathryn
Azaria, Amos
Mitchell, Tom
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 4902 - 4911
[3] A probabilistic approximate logic for neuro-symbolic learning and reasoning
Stehr, Mark-Oliver
Kim, Minyoung
Talcott, Carolyn L.
JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING, 2022, 124
[4] Neuro-Symbolic Integration for Reasoning and Learning on Knowledge Graphs
Werner, Luisa
THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23429 - 23430
[5] The Role of Foundation Models in Neuro-Symbolic Learning and Reasoning
Cunnington, Daniel
Law, Mark
Lobo, Jorge
Russo, Alessandra
NEURAL-SYMBOLIC LEARNING AND REASONING, PT I, NESY 2024, 2024, 14979 : 84 - 100
[6] Interaction with Industrial Digital Twin Using Neuro-Symbolic Reasoning
Siyaev, Aziz
Valiev, Dilmurod
Jo, Geun-Sik
SENSORS, 2023, 23 (03)
[7] Neuro-Symbolic Techniques for Description Logic Reasoning (Student Abstract)
Singh, Gunjan
Mondal, Sutapa
Bhatia, Sumit
Mutharaju, Raghava
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15891 - 15892
[8] An Interpretable Neuro-symbolic Model for Raven’s Progressive Matrices Reasoning
Shukuo Zhao
Hongzhi You
Ru-Yuan Zhang
Bailu Si
Zonglei Zhen
Xiaohong Wan
Da-Hui Wang
Cognitive Computation, 2023, 15 : 1703 - 1724
[9] Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces
Shah, Vishwa
Sharma, Aditya
Shroff, Gautam
Vig, Lovekesh
Dash, Tirtharaj
Srinivasan, Ashwin
NEURAL-SYMBOLIC LEARNING AND REASONING, NESY 2022, 2022, : 142 - 154
[10] The KANDY benchmark: Incremental neuro-symbolic learning and reasoning with Kandinsky patterns
Luca Salvatore Lorello
Marco Lippi
Stefano Melacci
Machine Learning, 2025, 114 (7)

← 1 2 3 4 5 →