Learning to Localize Objects with Noisy Labeled Instances

被引:0
作者
Zhang, Xiaopeng [1 ]
Yang, Yang [2 ,3 ]
Feng, Jiashi [1 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Univ Elect Sci & Technol China, Ctr Future Media, Hefei, Anhui, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Hefei, Anhui, Peoples R China
来源
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses Weakly Supervised Object Localization (WSOL) with only image-level supervision. We model the missing object locations as latent variables, and contribute a novel self-directed optimization strategy to infer them. With the strategy, our developed Self-Directed Localization Network (SD-LocNet) is able to localize object instance whose initial location is noisy. The self-directed inference hinges on an adaptive sampling method to identify reliable object instance via measuring its localization stability score. In this way, the resulted model is robust to noisy initialized object locations which we find is important in WSOL. Furthermore, we introduce a reliability induced prior propagation strategy to transfer object priors of the reliable instances to those unreliable ones by promoting their feature similarity, which effectively refines the unreliable object instances for better localization. The proposed SD-LocNet achieves 70.9% Cor-Loc and 51.3% mAP on PASCAL VOC 2007, surpassing the state-of-the-arts by a large margin.
引用
收藏
页码:9219 / 9226
页数:8
相关论文
共 32 条
[11]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[12]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[13]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[14]   Deep Self-Taught Learning for Weakly Supervised Object Localization [J].
Jie, Zequn ;
Wei, Yunchao ;
Jin, Xiaojie ;
Feng, Jiashi ;
Liu, Wei .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4294-4302
[15]  
Joulin A., 2012, ARXIV12066413
[16]   ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization [J].
Kantorov, Vadim ;
Oquab, Maxime ;
Cho, Minsu ;
Laptev, Ivan .
COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 :350-365
[17]  
Kumar M. P., 2010, Adv. Neural Inf. Process. Syst., V23, P1189
[18]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[19]   Weakly supervised discriminative localization and classification: a joint learning process [J].
Minh Hoai Nguyen ;
Torresani, Lorenzo ;
de la Torre, Fernando ;
Rother, Carsten .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :1925-1932
[20]   Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework [J].
Ouyang, Deqiang ;
Shao, Jie ;
Zhang, Yonghui ;
Yang, Yang ;
Shen, Heng Tao .
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :1562-1570