Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames

Cited by: 65
Authors
Gan, Chuang [1]
Sun, Chen [2]
Duan, Lixin [3]
Gong, Boqing [4]
Affiliations
[1] Tsinghua Univ, IIIS, Beijing, Peoples R China
[2] Google Res, Mountain View, CA USA
[3] Amazon, Seattle, WA USA
[4] Univ Cent Florida, CRCV, Orlando, FL 32816 USA
Source
COMPUTER VISION - ECCV 2016, PT III | 2016 / Vol. 9907
Funding
National Science Foundation (USA);
Keywords
DOI
10.1007/978-3-319-46487-9_52
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Video recognition usually requires a large number of training samples, which are expensive to collect. A cheap alternative is to draw on the large-scale images and videos available on the Web. With modern search engines, the top-ranked images or videos are usually highly correlated with the query, which suggests the potential to harvest labeling-free Web images and videos for video recognition. However, two key difficulties prevent us from using the Web data directly. First, the data are typically noisy and may come from a domain completely different from the one of interest to users (e.g., cartoons). Second, Web videos are usually untrimmed and very lengthy, so the query-relevant frames are often hidden among irrelevant ones. A question thus naturally arises: to what extent can such noisy Web images and videos be utilized for labeling-free video recognition? In this paper, we propose a novel approach that mutually votes for relevant Web images and video frames, balancing two forces: aggressive matching and passive video frame selection. We validate our approach on three large-scale video recognition datasets.
Pages: 849-866
Number of pages: 18