MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

Cited by: 1210
Authors
Guo, Yandong [1]
Zhang, Lei [1]
Hu, Yuxiao [1]
He, Xiaodong [1]
Gao, Jianfeng [1]
Affiliations
[1] Microsoft Res, Redmond, WA 98052 USA
Source
COMPUTER VISION - ECCV 2016, PT III | 2016 / Vol. 9907
Keywords
Face recognition; Large scale; Benchmark; Training data; Celebrity recognition; Knowledge base;
DOI
10.1007/978-3-319-46487-9_6
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to the corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, using as training data all the face images of each individual that can be collected from the web. The rich information provided by the knowledge base helps with disambiguation, improves recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. For this task, we design and provide a concrete measurement set, an evaluation protocol, and training data. We also present our experimental setup in detail and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.
Pages: 87-102
Page count: 16