Model Level Ensemble for Facial Action Unit Recognition at the 3rd ABAW Challenge

被引:4
作者
Jiang, Wenqiang [1 ]
Wu, Yannan [1 ]
Qiao, Fengsheng [1 ]
Meng, Liyu [1 ]
Deng, Yuanyuan [1 ]
Liu, Chuanhe [1 ]
机构
[1] Beijing Seek Truth Data Technol Co Ltd, Beijing, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022 | 2022年
关键词
D O I
10.1109/CVPRW56347.2022.00260
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we present our latest work on Action Unit Detection, which is a part of the Affective Behavior Analysis in-the-wild (ABAW) 2022 Competition [15]. Our proposed network is based on the IResnet100 [6]. First of all, We utilize feature pyramid networks (FPN) [25] and single stage headless (SSH) [29] to enlarge the receptive field and extract more facial texture features. Then we employ the ML-ROS data balancing [4] and the BCE Loss plus Multi-label Loss to solve the multi-label imbalance problem. We also use three different models as the base model to fine-tune the Aff-Wild2 dataset. The pre-train backbones are the AU detection model, expression model and face recognition model. Finally, we adopt an ensemble methodology to get the final result. Our f1 score achieved 49.82 on the AU test set and ranked second in this challenge with a very small difference from the first team 49.89.
引用
收藏
页码:2336 / 2343
页数:8
相关论文
共 41 条
[1]   Partial FC: Training 10 Million Identities on a Single Machine [J].
An, Xiang ;
Zhu, Xuhan ;
Gao, Yuan ;
Xiao, Yang ;
Zhao, Yongle ;
Feng, Ziyong ;
Wu, Lan ;
Qin, Bin ;
Zhang, Ming ;
Zhang, Debing ;
Fu, Ying .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :1445-1449
[2]  
BARSOUM E, 2016, ACM INT C MULT INT I, P279, DOI DOI 10.1145/2993148.2993165
[3]  
Buitinck Lars., 2013, API DESIGN MACHINE L
[4]   Addressing imbalance in multilabel classification: Measures and random resampling algorithms [J].
Charte, Francisco ;
Rivera, Antonio J. ;
del Jesus, Maria J. ;
Herrera, Francisco .
NEUROCOMPUTING, 2015, 163 :3-16
[5]   RetinaFace: Single-shot Multi-level Face Localisation in the Wild [J].
Deng, Jiankang ;
Guo, Jia ;
Ververas, Evangelos ;
Kotsia, Irene ;
Zafeiriou, Stefanos .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5202-5211
[6]   Improved Residual Networks for Image and Video Recognition [J].
Duta, Ionut Cosmin ;
Liu, Li ;
Zhu, Fan ;
Shao, Ling .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :9415-9422
[7]  
Ekman P., 1978, MANUAL FACIAL ACTION, DOI DOI 10.1037/T27734-000
[8]  
Fan YR, 2020, AAAI CONF ARTIF INTE, V34, P12701
[9]  
He Kaiming, 2016, Lecture Notes in Computer Science, V9908, P630, DOI [DOI 10.1109/CVPR.2016.90, 10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38]
[10]  
Hoai Duy Le, 2022, ARXIV220312428