Unlocking Human-Like Facial Expressions in Humanoid Robots: A Novel Approach for Action Unit Driven Facial Expression Disentangled Synthesis

被引:0
作者
Liu, Xiaofeng [1 ]
Ni, Rongrong [2 ]
Yang, Biao [2 ]
Song, Siyang [3 ]
Cangelosi, Angelo [4 ]
机构
[1] Hohai Univ, Coll Artificial Intelligence & Automat, Changzhou 213200, Peoples R China
[2] Changzhou Univ, Sch Microelect & Control Engn, Changzhou 213000, Peoples R China
[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England
[4] Univ Manchester, Cognit Robot Lab, Manchester M13 9PL, England
关键词
Affective human-robot interaction; facial action units (AUs); facial expression generation; humanoid robots; motor command; CHILDREN;
D O I
10.1109/TRO.2024.3422051
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Humanoid robots often struggle to express the intricate and authentic facial expressions characteristic of humans, potentially hampering user engagement. To address this challenge, we introduce a comprehensive two-stage methodology to empower our autonomous affective robot with the capacity to exhibit rich and natural facial expressions. In the initial stage, we present an innovative action unit (AU) driven facial expression disentangled synthesis method, enabling the generation of nuanced robot facial expression images guided by AUs. By harnessing facial AUs within a framework of weakly supervised learning, we effectively surmount the scarcity of paired training data (comprising source and target facial expression images). To preserve the integrity of AUs while mitigating identity interference, we leverage a latent facial attribute space to disentangle expression-related and expression-unrelated cues, employing solely the former for expression synthesis. In the subsequent phase, we actualize an affective robot endowed with multifaceted degrees of freedom for facial movements, facilitating the embodiment of the synthesized fine-grained facial expressions. We devise a specialized motor command mapping network that serves as a conduit between the generated expression images and the robot's realistic facial responses. By utilizing the physical motor positions as constraints, we refine the prediction of precise motor commands from the robot's generated facial expressions. This refinement process ensures that the robot's facial movements authentically express accurate and natural expressions. Finally, qualitative and quantitative evaluations on the benchmarking Emotionet dataset verify the effectiveness of the proposed generation method. Results on the self-developed affective robot indicate that our method achieves a promising generation of specific facial expressions with given AUs, significantly enhancing the affective human-robot interaction.
引用
收藏
页码:3850 / 3865
页数:16
相关论文
共 77 条
  • [1] Ahn HS, 2012, IEEE-RAS INT C HUMAN, P799, DOI 10.1109/HUMANOIDS.2012.6651611
  • [2] Amos B., 2016, CMUCS16118, V6
  • [3] Humanoid Head Face Mechanism with Expandable Facial Expressions
    Asheber, Wagshum Techane
    Lin, Chyi-Yeu
    Yen, Shih Hsiang
    [J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13
  • [4] Bringing Portraits to Life
    Averbuch-Elor, Hadar
    Cohen-Or, Daniel
    Kopf, Johannes
    Cohen, Michael F.
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2017, 36 (06):
  • [5] A Step Towards Developing Adaptive Robot-Mediated Intervention Architecture (ARIA) for Children With Autism
    Bekele, Esubalew T.
    Lahiri, Uttama
    Swanson, Amy R.
    Crittendon, Julie A.
    Warren, Zachary E.
    Sarkar, Nilanjan
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2013, 21 (02) : 289 - 299
  • [6] EmotioNet: An accurate, real-time algorithm for the automatic annotation of a million facial expressions in the wild
    Benitez-Quiroz, C. Fabian
    Srinivasan, Ramprakash
    Martinez, Aleix M.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5562 - 5570
  • [7] Facial Performance Enhancement Using Dynamic Shape Space Analysis
    Bermano, Amit H.
    Bradley, Derek
    Beeler, Thabo
    Zund, Fabio
    Nowrouzezahrai, Derek
    Baran, Ilya
    Sorkine-Hornung, Olga
    Pfister, Hanspeter
    Sumner, Robert W.
    Bickel, Bernd
    Gross, Markus
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2014, 33 (02):
  • [8] A morphable model for the synthesis of 3D faces
    Blanz, V
    Vetter, T
    [J]. SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, : 187 - 194
  • [9] Reanimating faces in images and video
    Blanz, V
    Basso, C
    Poggio, T
    Vetter, T
    [J]. COMPUTER GRAPHICS FORUM, 2003, 22 (03) : 641 - 650
  • [10] Real-time Facial Animation with Image-based Dynamic Avatars
    Cao, Chen
    Wu, Hongzhi
    Weng, Yanlin
    Shao, Tianjia
    Zhou, Kun
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04):