As a pivotal branch of intelligent human-computer interaction, visual dialog is a technically challenging task that requires artificial intelligence (AI) agents to answer consecutive questions based on image content and history dialog. Despite considerable progresses, visual dialog still suffers from two major problems: (1) how to design flexible cross-modal interaction patterns instead of over-reliance on expert experience and (2) how to infer underlying semantic dependencies between dialogues effectively. To address these issues, an end-to-end framework employing dynamic interaction and hybrid graph reasoning is proposed in this work. Specifically, three major components are designed and the practical benefits are demonstrated by extensive experiments. First, a dynamic interaction module is developed to automatically determine the optimal modality interaction route for multifarious questions, which consists of three elaborate functional interaction blocks endowed with dynamic routers. Second, a hybrid graph reasoning module is designed to explore adequate semantic associations between dialogues from multiple perspectives, where the hybrid graph is constructed by aggregating a structured coreference graph and a context-aware temporal graph. Third, a unified one-stage visual dialog model with an end-to-end structure is developed to train the dynamic interaction module and the hybrid graph reasoning module in a collaborative manner. Extensive experiments on the benchmark datasets of VisDial v0.9 and VisDial v1.0 demonstrate the effectiveness of the proposed method compared to other state-of-the-art approaches.
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Li, Pengfei
Zhou, Guangyou
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Zhou, Guangyou
Xie, Zhiwen
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Xie, Zhiwen
Xie, Penghui
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Xie, Penghui
Huang, Jimmy Xiangji
论文数: 0引用数: 0
h-index: 0
机构:
York Univ, Sch Informat Technol, Informat Retrieval & Knowledge Management Res Lab, Toronto M3J 1P3, ON, CanadaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Lin, Liang
Gao, Yiming
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Gao, Yiming
Gong, Ke
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Gong, Ke
Wang, Meng
论文数: 0引用数: 0
h-index: 0
机构:
Hefei Univ Technol, Hefei 230000, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Wang, Meng
Liang, Xiaodan
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Gao, Yiming
Kuang, Zhanghui
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Kuang, Zhanghui
Li, Guanbin
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Li, Guanbin
Luo, Ping
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hong Kong, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Luo, Ping
Chen, Yimin
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Chen, Yimin
Lin, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Lin, Liang
Zhang, Wayne
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R China
Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai 200240, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
机构:
Incept Inst Artificial Intelligence, Abu Dhabi, U Arab EmiratesIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Liu, Nian
Li, Long
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Li, Long
Zhao, Wangbo
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Zhao, Wangbo
Han, Junwei
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Han, Junwei
Shao, Ling
论文数: 0引用数: 0
h-index: 0
机构:
Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab EmiratesIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Li, Pengfei
Zhou, Guangyou
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Zhou, Guangyou
Xie, Zhiwen
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Cent China Normal Univ, Hubei Prov Key Lab Artificial Intelligence & Smart, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Xie, Zhiwen
Xie, Penghui
论文数: 0引用数: 0
h-index: 0
机构:
Cent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R ChinaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
Xie, Penghui
Huang, Jimmy Xiangji
论文数: 0引用数: 0
h-index: 0
机构:
York Univ, Sch Informat Technol, Informat Retrieval & Knowledge Management Res Lab, Toronto M3J 1P3, ON, CanadaCent China Normal Univ, Sch Comp Sci, China, Wuhan 430079, Peoples R China
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Lin, Liang
Gao, Yiming
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Gao, Yiming
Gong, Ke
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Gong, Ke
Wang, Meng
论文数: 0引用数: 0
h-index: 0
机构:
Hefei Univ Technol, Hefei 230000, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
Wang, Meng
Liang, Xiaodan
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Guangzhou 510006, Peoples R China
Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
Minist Educ, Engn Res Ctr Adv Comp Engn Software, Guangzhou, Peoples R ChinaSun Yat Sen Univ, Guangzhou 510006, Peoples R China
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Gao, Yiming
Kuang, Zhanghui
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Kuang, Zhanghui
Li, Guanbin
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Li, Guanbin
Luo, Ping
论文数: 0引用数: 0
h-index: 0
机构:
Univ Hong Kong, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Luo, Ping
Chen, Yimin
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Chen, Yimin
Lin, Liang
论文数: 0引用数: 0
h-index: 0
机构:
Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
Lin, Liang
Zhang, Wayne
论文数: 0引用数: 0
h-index: 0
机构:
Sensetime Reasearch, Hong Kong, Peoples R China
Shanghai Jiao Tong Univ, Qing Yuan Res Inst, Shanghai 200240, Peoples R ChinaSun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
机构:
Incept Inst Artificial Intelligence, Abu Dhabi, U Arab EmiratesIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Liu, Nian
Li, Long
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Li, Long
Zhao, Wangbo
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Zhao, Wangbo
Han, Junwei
论文数: 0引用数: 0
h-index: 0
机构:
Northwestern Polytech Univ, Sch Automat, Xian 710060, Peoples R ChinaIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Han, Junwei
Shao, Ling
论文数: 0引用数: 0
h-index: 0
机构:
Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab EmiratesIncept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates