Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues

Cited by: 14

Authors:

Chen, Zhi-Neng [1,2]

Ngo, Chong-Wah [2]

Zhang, Wei [2]

Cao, Juan [3]

Jiang, Yu-Gang [4]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China

[4] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China

Source:

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2014 / Vol. 29 / Issue 05

Funding:

National Natural Science Foundation of China;

Keywords:

Web video; celebrity; name-face association; dataset construction; community analysis; SEARCH;

DOI:

10.1007/s11390-014-1468-z

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

Associating faces appearing in Web videos with names presented in the surrounding context is an important task in many applications. However, the problem has not been well investigated, particularly in large-scale realistic scenarios, mainly due to the scarcity of datasets constructed under such conditions. In this paper, we introduce a Web video dataset of celebrities, named WebV-Cele, for name-face association. The dataset consists of 75 073 Internet videos totaling over 4 000 hours, covering 2 427 celebrities and 649 001 faces. This is, to our knowledge, the most comprehensive dataset for this problem. We describe the details of dataset construction, discuss several interesting findings from analyzing this dataset, such as celebrity community discovery, and provide experimental results of name-face association using five existing techniques. We also outline important and challenging research problems that could be investigated in the future.


Pages: 785-798

Page count: 14
