Unlocking Human-Like Facial Expressions in Humanoid Robots: A Novel Approach for Action Unit Driven Facial Expression Disentangled Synthesis

Cited by: 0

Authors:

Liu, Xiaofeng [1]

Ni, Rongrong [2]

Yang, Biao [2]

Song, Siyang [3]

Cangelosi, Angelo [4]

Affiliations:

[1] Hohai Univ, Coll Artificial Intelligence & Automat, Changzhou 213200, Peoples R China

[2] Changzhou Univ, Sch Microelect & Control Engn, Changzhou 213000, Peoples R China

[3] Univ Leicester, Sch Comp & Math Sci, Leicester LE1 7RH, England

[4] Univ Manchester, Cognit Robot Lab, Manchester M13 9PL, England

Source:

IEEE TRANSACTIONS ON ROBOTICS | 2024 / Vol. 40

Keywords:

Affective human-robot interaction; facial action units (AUs); facial expression generation; humanoid robots; motor command; CHILDREN;

DOI:

10.1109/TRO.2024.3422051

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification codes:

080202 ; 1405 ;

Abstract:

Humanoid robots often struggle to express the intricate and authentic facial expressions characteristic of humans, potentially hampering user engagement. To address this challenge, we introduce a comprehensive two-stage methodology to empower our autonomous affective robot with the capacity to exhibit rich and natural facial expressions. In the initial stage, we present an innovative action unit (AU) driven facial expression disentangled synthesis method, enabling the generation of nuanced robot facial expression images guided by AUs. By harnessing facial AUs within a framework of weakly supervised learning, we effectively surmount the scarcity of paired training data (comprising source and target facial expression images). To preserve the integrity of AUs while mitigating identity interference, we leverage a latent facial attribute space to disentangle expression-related and expression-unrelated cues, employing solely the former for expression synthesis. In the subsequent phase, we actualize an affective robot endowed with multifaceted degrees of freedom for facial movements, facilitating the embodiment of the synthesized fine-grained facial expressions. We devise a specialized motor command mapping network that serves as a conduit between the generated expression images and the robot's realistic facial responses. By utilizing the physical motor positions as constraints, we refine the prediction of precise motor commands from the robot's generated facial expressions. This refinement process ensures that the robot's facial movements authentically express accurate and natural expressions. Finally, qualitative and quantitative evaluations on the benchmarking EmotioNet dataset verify the effectiveness of the proposed generation method. Results on the self-developed affective robot indicate that our method achieves a promising generation of specific facial expressions with given AUs, significantly enhancing the affective human-robot interaction.

Pages: 3850-3865

Page count: 16
