Detection of eye contact with deep neural networks is as accurate as human experts

被引:29
作者
Chong, Eunji [1 ]
Clark-Whitney, Elysha [2 ]
Southerland, Audrey [1 ]
Stubbs, Elizabeth [1 ]
Miller, Chanel [1 ]
Ajodan, Eliana L. [2 ]
Silverman, Melanie R. [2 ]
Lord, Catherine [3 ]
Rozga, Agata [1 ]
Jones, Rebecca M. [2 ]
Rehg, James M. [1 ]
机构
[1] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30332 USA
[2] Weill Cornell Med, Ctr Autism & Developing Brain, New York, NY USA
[3] Univ Calif Los Angeles, Sch Med, Los Angeles, CA USA
关键词
AUTISM; GAZE; COMMUNICATION; ATTENTION; CLASSIFICATION; DISORDER; TRACKING; CHILDREN; PLAY;
D O I
10.1038/s41467-020-19712-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Eye contact is among the most primary means of social communication used by humans. Quantification of eye contact is valuable as a part of the analysis of social roles and communication skills, and for clinical screening. Estimating a subject's looking direction is a challenging task, but eye contact can be effectively captured by a wearable point-of-view camera which provides a unique viewpoint. While moments of eye contact from this viewpoint can be hand-coded, such a process tends to be laborious and subjective. In this work, we develop a deep neural network model to automatically detect eye contact in egocentric video. It is the first to achieve accuracy equivalent to that of human experts. We train a deep convolutional network using a dataset of 4,339,879 annotated images, consisting of 103 subjects with diverse demographic backgrounds. 57 subjects have a diagnosis of Autism Spectrum Disorder. The network achieves overall precision of 0.936 and recall of 0.943 on 18 validation subjects, and its performance is on par with 10 trained human coders with a mean precision 0.918 and recall 0.946. Our method will be instrumental in gaze behavior analysis by serving as a scalable, objective, and accessible tool for clinicians and researchers. Eye contact is a key social behavior and its measurement could facilitate the diagnosis and treatment of autism. Here the authors show that a deep neural network model can detect eye contact as accurately has human experts.
引用
收藏
页数:10
相关论文
共 64 条
[21]   FRAGILE-X CHECKLIST [J].
HAGERMAN, RJ ;
AMIRI, K ;
CRONISTER, A .
AMERICAN JOURNAL OF MEDICAL GENETICS, 1991, 38 (2-3) :283-287
[22]   Head Movement Dynamics during Play and Perturbed Mother-Infant Interaction [J].
Hammal, Zakia ;
Cohn, Jeffrey F. ;
Messinger, Daniel S. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (04) :361-370
[23]   Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network [J].
Hannun, Awni Y. ;
Rajpurkar, Pranav ;
Haghpanahi, Masoumeh ;
Tison, Geoffrey H. ;
Bourn, Codie ;
Turakhia, Mintu P. ;
Ng, Andrew Y. .
NATURE MEDICINE, 2019, 25 (01) :65-+
[24]  
Hashemi J., 2014, AUTISM RES TREAT 201, V2014
[25]  
Hashemi J, 2021, IEEE T AFFECT COMPUT, V12, P215, DOI [10.1109/taffc.2018.2868196, 10.1109/TAFFC.2018.2868196]
[26]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[27]  
Hernandez J, 2012, UBICOMP'12: PROCEEDINGS OF THE 2012 ACM INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING, P301
[28]   Emotional Valence, Arousal, and Threat Ratings of 160 Chinese Words among Adolescents [J].
Ho, Samuel M. Y. ;
Mak, Christine W. Y. ;
Yeung, Dannii ;
Duan, Wenjie ;
Tang, Sandy ;
Yeung, June C. ;
Ching, Rita .
PLOS ONE, 2015, 10 (07)
[29]   Unaddressed participants' gaze in multi-person interaction: optimizing recipiency [J].
Holler, Judith ;
Kendrick, Kobin H. .
FRONTIERS IN PSYCHOLOGY, 2015, 6
[30]   The Autism Diagnostic Observation Schedule, Module 4: Revised Algorithm and Standardized Severity Scores [J].
Hus, Vanessa ;
Lord, Catherine .
JOURNAL OF AUTISM AND DEVELOPMENTAL DISORDERS, 2014, 44 (08) :1996-2012