Multi-threshold deep metric learning for facial expression recognition

Cited by: 0
Authors
Yang, Wenwu [1]
Yu, Jinyi [1]
Chen, Tuo [1]
Liu, Zhenguang [2]
Wang, Xun [1]
Shen, Jianbing [3]
Affiliations
[1] Zhejiang GongShang Univ, Hangzhou 310018, Peoples R China
[2] Zhejiang Univ, Hangzhou 310012, Peoples R China
[3] Univ Macau, Taipa 999078, Macau, Peoples R China
Keywords
Facial expression recognition; Triplet loss learning; Multiple thresholds
DOI
10.1016/j.patcog.2024.110711
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature representations generated through triplet-based deep metric learning offer significant advantages for facial expression recognition (FER). Each threshold in triplet loss inherently shapes a distinct distribution of inter-class variations, leading to unique representations of expression features. Nonetheless, pinpointing the optimal threshold for triplet loss presents a formidable challenge, as the ideal threshold varies not only across different datasets but also among classes within the same dataset. In this paper, we propose a novel multi-threshold deep metric learning approach that bypasses the complex process of threshold validation and markedly improves the effectiveness in creating expression feature representations. Instead of choosing a single optimal threshold from a valid range, we comprehensively sample thresholds throughout this range, which ensures that the representation characteristics exhibited by the thresholds within this spectrum are fully captured and utilized for enhancing FER. Specifically, we segment the embedding layer of the deep metric learning network into multiple slices, with each slice representing a specific threshold sample. We subsequently train these embedding slices in an end-to-end fashion, applying triplet loss at its associated threshold to each slice, which results in a collection of unique expression features corresponding to each embedding slice. Moreover, we identify the issue that the traditional triplet loss may struggle to converge when employing the widely-used Batch Hard strategy for mining informative triplets, and introduce a novel loss termed dual triplet loss to address it. Extensive evaluations demonstrate the superior performance of the proposed approach on both posed and spontaneous facial expression datasets.
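The slicing idea in the abstract can be illustrated with a minimal sketch. It assumes equal-width slices, Euclidean distance, and a simple average over slices; none of these details are stated in the abstract. It also uses the standard triplet loss with Batch Hard mining, not the paper's dual triplet loss, whose definition is not given here. The function names are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def batch_hard_triplet_loss(vectors, labels, margin):
    """Standard triplet loss with Batch Hard mining.

    For each anchor, take the farthest same-class sample and the
    closest other-class sample in the batch. This is NOT the paper's
    dual triplet loss.
    """
    losses = []
    for i, anchor in enumerate(vectors):
        pos = [euclidean(anchor, v) for j, v in enumerate(vectors)
               if j != i and labels[j] == labels[i]]
        neg = [euclidean(anchor, v) for j, v in enumerate(vectors)
               if labels[j] != labels[i]]
        if not pos or not neg:
            continue  # no valid triplet for this anchor
        losses.append(max(0.0, max(pos) - min(neg) + margin))
    return sum(losses) / len(losses) if losses else 0.0


def multi_threshold_triplet_loss(embeddings, labels, thresholds):
    """Split each embedding into one slice per threshold and apply the
    triplet loss with that threshold to the matching slice.

    Assumption: slices have equal width and slice losses are averaged.
    """
    n_slices = len(thresholds)
    dim = len(embeddings[0])
    if dim % n_slices != 0:
        raise ValueError("embedding size must be divisible by the number of thresholds")
    width = dim // n_slices
    total = 0.0
    for k, margin in enumerate(thresholds):
        sliced = [e[k * width:(k + 1) * width] for e in embeddings]
        total += batch_hard_triplet_loss(sliced, labels, margin)
    return total / n_slices


# Toy batch: 4 samples, 4-dimensional embeddings, 2 threshold samples.
batch = [[0, 0, 0, 0], [1, 0, 0, 0], [3, 0, 0, 0], [4, 0, 0, 0]]
classes = [0, 0, 1, 1]
loss = multi_threshold_triplet_loss(batch, classes, thresholds=[0.5, 1.0])
```

In this toy batch the first slice already separates the two classes by more than its threshold, so only the second slice (all zeros) contributes to the loss. In a real network, the slices would be parts of a learned embedding layer trained end to end.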
Pages: 12
Related papers
50 in total
  • [31] A Deep Learning Approach to Facial Expression Recognition in the Presence of Masked Occlusion
    Thavarekere, Shree R.
    Hebbar, Anushka
    Uma, D.
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [32] Occlusion-aware facial expression recognition: A deep learning approach
    Naveen, Palanichamy
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 32895 - 32921
  • [33] A Continuous Facial Expression Recognition Model based on Deep Learning Method
    Lin, Szu-Yin
    Tseng, Yi-Wen
    Wu, Chang-Rong
    Kung, Yun-Ching
    Chen, Yi-Zhen
    Wu, Chao-Ming
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
  • [34] Adaptive Deep Disturbance-Disentangled Learning for Facial Expression Recognition
    Ruan, Delian
    Mo, Rongyun
    Yan, Yan
    Chen, Si
    Xue, Jing-Hao
    Wang, Hanzi
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (02) : 455 - 477
  • [35] HistNet: Histogram-based convolutional neural network with Chi-squared deep metric learning for facial expression recognition
    Sadeghi, Hamid
    Raie, Abolghasem-A
    INFORMATION SCIENCES, 2022, 608 : 472 - 488
  • [36] FACIAL EXPRESSION RECOGNITION WITH DEEP AGE
    Luo, Zhaojie
    Chen, Jinhui
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [37] Learning Affective Video Features for Facial Expression Recognition via Hybrid Deep Learning
    Zhang, Shiqing
    Pan, Xianzhang
    Cui, Yueli
    Zhao, Xiaoming
    Liu, Limei
    IEEE ACCESS, 2019, 7 : 32297 - 32304
  • [38] Deep Facial Expression Recognition: A Survey
    Li, Shan
    Deng, Weihong
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) : 1195 - 1215
  • [39] Transfer Model Collaborating Metric Learning and Dictionary Learning for Cross-Domain Facial Expression Recognition
    Ni, Tongguang
    Zhang, Cong
    Gu, Xiaoqing
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2021, 8 (05) : 1213 - 1222
  • [40] Accurate Facial Parts Localization and Deep Learning for 3D Facial Expression Recognition
    Jan, Asim
    Ding, Huaxiong
    Meng, Hongying
    Chen, Liming
    Li, Huibin
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 466 - 472