Geometric prior guided hybrid deep neural network for facial beauty analysis

被引:5
作者
Peng, Tianhao [1 ,2 ]
Li, Mu [3 ]
Chen, Fangmei [4 ]
Xu, Yong [3 ]
Zhang, David [5 ]
机构
[1] Guizhou Univ, Sch Comp Sci & Technol, Guiyang, Peoples R China
[2] Moutai Inst, Dept Automat, Renhuai, Guizhou, Peoples R China
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Shenzhen, Peoples R China
[4] Dalian Minzu Univ, Informat & Commun Engn Dept, Dalian, Peoples R China
[5] Chinese Univ Hong Kong, Sch Data Sci, Shenzhen, Peoples R China
关键词
deep neural networks; face analysis; face biometrics; image analysis; ATTRACTIVENESS; COMPUTATION; SYMMETRY; CANONS; FACE;
D O I
10.1049/cit2.12197
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial beauty analysis is an important topic in human society. It may be used as a guidance for face beautification applications such as cosmetic surgery. Deep neural networks (DNNs) have recently been adopted for facial beauty analysis and have achieved remarkable performance. However, most existing DNN-based models regard facial beauty analysis as a normal classification task. They ignore important prior knowledge in traditional machine learning models which illustrate the significant contribution of the geometric features in facial beauty analysis. To be specific, landmarks of the whole face and facial organs are introduced to extract geometric features to make the decision. Inspired by this, we introduce a novel dual-branch network for facial beauty analysis: one branch takes the Swin Transformer as the backbone to model the full face and global patterns, and another branch focuses on the masked facial organs with the residual network to model the local patterns of certain facial parts. Additionally, the designed multi-scale feature fusion module can further facilitate our network to learn complementary semantic information between the two branches. In model optimisation, we propose a hybrid loss function, where especially geometric regulation is introduced by regressing the facial landmarks and it can force the extracted features to convey facial geometric features. Experiments performed on the SCUT-FBP5500 dataset and the SCUT-FBP dataset demonstrate that our model outperforms the state-of-the-art convolutional neural networks models, which proves the effectiveness of the proposed geometric regularisation and dual-branch structure with the hybrid network. To the best of our knowledge, this is the first study to introduce a Vision Transformer into the facial beauty analysis task.
引用
收藏
页码:467 / 480
页数:14
相关论文
共 56 条
  • [1] [Anonymous], 2008, Proceedings of the Twenty-Fifth International Conference on Machine Learning
  • [2] History and current concepts in the analysis of facial attractiveness
    Bashour, Mounir
    [J]. PLASTIC AND RECONSTRUCTIVE SURGERY, 2006, 118 (03) : 741 - 756
  • [3] Carion N, 2020, European conference on computer vision, P213, DOI DOI 10.1007/978-3-030-58452-813
  • [4] Adaptive Feature Aggregation in Deep Multi-Task Convolutional Neural Networks
    Cui, Chaoran
    Shen, Zhen
    Huang, Jin
    Chen, Meng
    Xu, Mingliang
    Wang, Meng
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2133 - 2144
  • [5] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [6] TDD-net: a tiny defect detection network for printed circuit boards
    Ding, Runwei
    Dai, Linhui
    Li, Guangpeng
    Liu, Hong
    [J]. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2019, 4 (02) : 110 - 116
  • [7] Hierarchical Feature Aggregation Based on Transformer for Image-Text Matching
    Dong, Xinfeng
    Zhang, Huaxiang
    Zhu, Lei
    Nie, Liqiang
    Liu, Li
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6437 - 6447
  • [8] Dosovitskiy A., 2020, ICLR, V20, DOI 10.48550/arXiv.2010.11929
  • [9] Facial attractiveness: Beauty and the machine
    Eisenthal, Y
    Dror, G
    Ruppin, E
    [J]. NEURAL COMPUTATION, 2006, 18 (01) : 119 - 142
  • [10] Label Distribution-Based Facial Attractiveness Computation by Deep Residual Learning
    Fan, Yang-Yu
    Liu, Shu
    Li, Bo
    Guo, Zhe
    Samal, Ashok
    Wan, Jun
    Li, Stan Z.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (08) : 2196 - 2208