Context-Aware Zero-Shot Recognition

被引:0
作者
Luo, Ruotian [1 ]
Zhang, Ning [2 ]
Han, Bohyung [3 ]
Yang, Linjie [4 ]
机构
[1] TTI Chicago, Chicago, IL 60637 USA
[2] Vaitl Inc, Palo Alto, CA USA
[3] Seoul Natl Univ, Seoul, South Korea
[4] ByteDance AI Lab, Beijing, Peoples R China
来源
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2020年 / 34卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel problem setting in zero-shot learning, zero-shot object recognition and detection in the context. Contrary to the traditional zero-shot learning methods, which simply infers unseen categories by transferring knowledge from the objects belonging to semantically similar seen categories, we aim to understand the identity of the novel objects in an image surrounded by the known objects using the inter-object relation prior. Specifically, we leverage the visual context and the geometric relationships between all pairs of objects in a single image, and capture the information useful to infer unseen categories. We integrate our context-aware zero-shot learning framework into the traditional zero-shot learning techniques seamlessly using a Conditional Random Field (CRF). The proposed algorithm is evaluated on both zero-shot region classification and zero-shot detection tasks. The results on Visual Genome (VG) dataset show that our model significantly boosts performance with the additional visual context compared to traditional methods.
引用
收藏
页码:11709 / 11716
页数:8
相关论文
共 48 条
[11]  
Changpinyo S., 2018, ARXIV181206423
[12]   Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Sha, Fei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3496-3505
[13]   Synthesized Classifiers for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5327-5336
[14]   Iterative Visual Reasoning Beyond Convolutions [J].
Chen, Xinlei ;
Li, Li-Jia ;
Li Fei-Fei ;
Gupta, Abhinav .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7239-7248
[15]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[16]  
Demirel B., 2018, P BRIT MACHINE VISIO
[17]   Discriminative Models for Multi-Class Object Layout [J].
Desai, Chaitanya ;
Ramanan, Deva ;
Fowlkes, Charless C. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 95 (01) :1-12
[18]  
Divvala SK, 2009, PROC CVPR IEEE, P1271, DOI 10.1109/CVPRW.2009.5206532
[19]  
Duvenaudt D, 2015, ADV NEUR IN, V28
[20]  
Frome Andrea, 2013, Adv. Neural Inf. Process. Syst.