Divergence-Driven Consistency Training for Semi-Supervised Facial Age Estimation

被引:7
作者
Bao, Zenghao [1 ,2 ]
Tan, Zichang [3 ,4 ]
Wan, Jun [1 ,2 ,5 ]
Ma, Xibo [1 ,2 ]
Guo, Guodong [6 ]
Lei, Zhen [1 ,2 ,7 ]
机构
[1] Chinese Acad Sci CASIA, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci UCAS, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Baidu Res, Inst Deep Learning, Beijing 100085, Peoples R China
[4] Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100101, Peoples R China
[5] Macau Univ Sci & Technol MUST, Fac Innovat Engn, Macau, Peoples R China
[6] Ant Grp, Beijing 100026, Peoples R China
[7] Chinese Acad Sci, Hong Kong Inst Sci & Innovat, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China
关键词
Facial age estimation; semi-supervised; efficient sample selection; identity consistency; RECOGNITION;
D O I
10.1109/TIFS.2022.3218431
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Facial age estimation has attracted considerable attention owing to its great potential in applications. However, it still falls short of reliable age estimation due to the lack of sufficient training data with accurate age labels. Using conventional semi-supervised methods to exploit unlabeled data appears to be a good solution, but it does not yield sufficient performance gains while significantly increasing training time. Therefore, to tackle these problems, we present a Divergence-driven Consistency Training (DCT) method for enhancing both efficiency and performance in this paper. Following the idea of pseudo-labeling and consistency regularization, we assign pseudo labels predicted by the teacher model to unlabeled samples and then train the student model on labeled and unlabeled samples based on consistency regularization. Based on this, we propose two main promotions. The first is the Efficient Sample Selection (ESS) strategy, which is based on the Divergence Score to select effective samples from massive unlabeled images to reduce the training time and improve efficiency. The second is Identity Consistency (IC) regularization as the additional loss function, which introduces a high dependency of aging traits on a person. Moreover, we propose Local Prediction (LP), which is a plug-and-play component, to capture local semantics. Extensive experiments on multiple age benchmark datasets, including CACD, Morph II, MIVIA, and Chalearn LAP 2015, indicate DCT outperforms the state-of-the-art approaches significantly.
引用
收藏
页码:221 / 232
页数:12
相关论文
共 67 条
[1]   Anchored Regression Networks applied to Age Estimation and Super Resolution [J].
Agustsson, Eirikur ;
Timofte, Radu ;
Van Gool, Luc .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1652-1661
[2]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[3]  
Bao Z., 2021, P 19 INT C COMP AN I, P308
[4]  
Bengio Y, 2009, P 26 ANN INT C MACH, P41, DOI [10.1145/1553374.1553380, DOI 10.1145/1553374.1553380]
[5]  
Berthelot D., 2019, Remixmatch: Semi-supervised learning with distribution alignment and augmentation anchoring
[6]  
Berthelot D, 2019, ADV NEUR IN, V32
[7]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74
[8]   Age from Faces in the Deep Learning Revolution [J].
Carletti, Vincenzo ;
Greco, Antonio ;
Percannella, Gennaro ;
Vento, Mario .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (09) :2113-2132
[9]  
Chen BC, 2014, LECT NOTES COMPUT SC, V8694, P768, DOI 10.1007/978-3-319-10599-4_49
[10]   Using Ranking-CNN for Age Estimation [J].
Chen, Shixing ;
Zhang, Caojin ;
Dong, Ming ;
Le, Jialiang ;
Rao, Mike .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :742-751