A modular machine learning tool for holistic and fine-grained behavioral analysis

被引:1
作者
Michelot, Bruno [1 ]
Corneyllie, Alexandra [1 ]
Thevenet, Marc [1 ]
Duffner, Stefan [2 ]
Perrin, Fabien [1 ]
机构
[1] Ctr Rech Neurosci Lyon, CAP Team, INSERM U1028, CNRS UMR 5292,UCBL,UJM, 95 Blvd Pinel, F-69675 Bron, France
[2] Univ Claude Bernard Lyon 1, Univ Lumiere Lyon 2, Ecole Cent Lyon,UMR 5205 CNRS,INSA Lyon, IMAGINE Team,Lab InfoRmat Image & Syst Informat, Lyon, France
关键词
Behavior; Computer vision; Machine learning; Explainability; EXPLAINABLE ARTIFICIAL-INTELLIGENCE; MOVEMENT; PERCEPTION; ATTENTION; TRACKING; SENSORS; BLINK;
D O I
10.3758/s13428-024-02511-3
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
Artificial intelligence techniques offer promising avenues for exploring human body features from videos, yet no freely accessible tool has reliably provided holistic and fine-grained behavioral analyses to date. To address this, we developed a machine learning tool based on a two-level approach: a first lower-level processing using computer vision for extracting fine-grained and comprehensive behavioral features such as skeleton or facial points, gaze, and action units; a second level of machine learning classification coupled with explainability providing modularity, to determine which behavioral features are triggered by specific environments. To validate our tool, we filmed 16 participants across six conditions, varying according to the presence of a person ("Pers"), a sound ("Snd"), or silence ("Rest"), and according to emotional levels using self-referential ("Self") and control ("Ctrl") stimuli. We demonstrated the effectiveness of our approach by extracting and correcting behavior from videos using two computer vision software (OpenPose and OpenFace) and by training two algorithms (XGBoost and long short-term memory [LSTM]) to differentiate between experimental conditions. High classification rates were achieved for "Pers" conditions versus "Snd" or "Rest" (AUC = 0.8-0.9), with explainability revealing actions units and gaze as key features. Additionally, moderate classification rates were attained for "Snd" versus "Rest" (AUC = 0.7), attributed to action units, limbs and head points, as well as for "Self" versus "Ctrl" (AUC = 0.7-0.8), due to facial points. These findings were consistent with a more conventional hypothesis-driven approach. Overall, our study suggests that our tool is well suited for holistic and fine-grained behavioral analysis and offers modularity for extension into more complex naturalistic environments.
引用
收藏
页数:17
相关论文
共 92 条
[81]   Perceiving Crowd Attention: Ensemble Perception of a Crowd's Gaze [J].
Sweeny, Timothy D. ;
Whitney, David .
PSYCHOLOGICAL SCIENCE, 2014, 25 (10) :1903-1913
[82]   A Survey on Explainable Artificial Intelligence (XAI): Toward Medical XAI [J].
Tjoa, Erico ;
Guan, Cuntai .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (11) :4793-4813
[83]   Body Movement Synchrony Predicts Degrees of Information Exchange in a Natural Conversation [J].
Tsuchiya, Ayaka ;
Ora, Hiroki ;
Hao, Qiao ;
Ono, Yumi ;
Sato, Hikari ;
Kameda, Kohei ;
Miyake, Yoshihiro .
FRONTIERS IN PSYCHOLOGY, 2020, 11
[84]   Social Postural Coordination [J].
Varlet, Manuel ;
Marin, Ludovic ;
Lagarde, Julien ;
Bardy, Benoit G. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2011, 37 (02) :473-483
[85]  
Vaswani A, 2017, ADV NEUR IN, V30
[86]   Deep Learning for Computer Vision: A Brief Review [J].
Voulodimos, Athanasios ;
Doulamis, Nikolaos ;
Doulamis, Anastasios ;
Protopapadakis, Eftychios .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
[87]   FACIAL EXPRESSION IN THE PRESENCE OF FRIENDS AND STRANGERS [J].
WAGNER, HL ;
SMITH, J .
JOURNAL OF NONVERBAL BEHAVIOR, 1991, 15 (04) :201-214
[88]   The mid-level vision toolbox for computing structural properties of real-world images [J].
Walther, Dirk B. ;
Farzanfar, Delaram ;
Han, Seohee ;
Rezanejad, Morteza .
FRONTIERS IN COMPUTER SCIENCE, 2023, 5
[89]   Clinical applications of sensors for human posture and movement analysis: A review [J].
Wong, Wai Yin ;
Wong, Man Sang ;
Lo, Kam Ho .
PROSTHETICS AND ORTHOTICS INTERNATIONAL, 2007, 31 (01) :62-75
[90]   Robust and Accurate 3D Head Pose Estimation through 3DMM and Online Head Model Reconstruction [J].
Yu, Yu ;
Mora, Kenneth Alberto Funes ;
Odobez, Jean-Marc .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :711-718