The Automatic Identification of the Producers of Co-occurring Communicative Behaviours

被引:5
作者
Navarretta, Costanza [1 ]
机构
[1] Univ Copenhagen, DK-2300 Copenhagen S, Denmark
关键词
Multimodal corpora; Machine learning; Communicative behaviours; Co-occurring behaviours; INDIVIDUAL-DIFFERENCES; ICONIC GESTURES; HEAD MOVEMENTS; RECOGNITION; FEEDBACK;
D O I
10.1007/s12559-014-9269-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal communicative behaviours depend on numerous factors such as the communicative situation, the task, the culture and respective relationship of the people involved, their role, age, and background. This paper addresses the identification of the producers of co-occurring communicative non-verbal behaviours in a manually annotated multimodal corpus of spontaneous conversations. The work builds upon a preceding study in which a support vector machine was trained to identify the producers of communicative body behaviours using the annotations of individual behaviour types. In the present work, we investigate to which extent classification results can be improved adding to the training data the shape description of co-occurring body behaviours and temporal information. The inclusion of co-occurring behaviours reflects the fact that people often use more body behaviours at the same time when they communicate. The results of the classification experiments show that the identification of the producers of communicative behaviours improves significantly if co-occurring behaviours are added to the training data. Classification performance further improves when it also uses temporal information. Even though the results vary from body type to body type, they all show that the individual variation of communicative behaviours is large even in a very homogeneous group of people and that this variation is better modelled using information on co-occurring behaviours than individual behaviours. Being able to identify and then react correctly to individual behaviours of people is extremely important in the field of social robotics which involves the use of robots in private homes where they must interact in a natural way with different types of persons having varying needs.
引用
收藏
页码:689 / 698
页数:10
相关论文
共 65 条
[1]   The MUMIN coding scheme for the annotation of feedback, turn management and sequencing phenomena [J].
Allwood, Jens ;
Cerrato, Loredana ;
Jokinen, Kristiina ;
Navarretta, Costanza ;
Paggio, Patrizia .
LANGUAGE RESOURCES AND EVALUATION, 2007, 41 (3-4) :273-287
[2]  
[Anonymous], 2004, 6 INT C MULTIMODAL I
[3]  
[Anonymous], 2012, Journal of the Association for Laboratory Phonology, DOI DOI 10.1515/LP-2012-0006
[4]  
Beigi H, 2011, FUNDAMENTALS OF SPEAKER RECOGNITION, P1, DOI 10.1007/978-0-387-77592-0
[5]   MODELING THE PRODUCTION OF COVERBAL ICONIC GESTURES BY LEARNING BAYESIAN DECISION NETWORKS [J].
Bergmann, Kirsten ;
Kopp, Stefan .
APPLIED ARTIFICIAL INTELLIGENCE, 2010, 24 (06) :530-551
[6]  
Bergmann K, 2009, LECT NOTES ARTIF INT, V5773, P76, DOI 10.1007/978-3-642-04380-2_12
[7]  
Boersma P., 2013, Praat: doing phonetics by computer, DOI DOI 10.1097/AUD.0B013E31821473F7
[8]   Extracting and Associating Meta-features for Understanding People's Emotional Behaviour: Face and Speech [J].
Bourbakis, Nikolaus ;
Esposito, Anna ;
Kavraki, Despina .
COGNITIVE COMPUTATION, 2011, 3 (03) :436-448
[9]   AUTOMATIC PERSON RECOGNITION BY ACOUSTIC AND GEOMETRIC FEATURES [J].
BRUNELLI, R ;
FALAVIGNA, D ;
POGGIO, T ;
STRINGA, L .
MACHINE VISION AND APPLICATIONS, 1995, 8 (05) :317-325
[10]  
Cerrato L., 2007, THESIS KTH TOCKHOLM