共 2 条
Speech, Voice, Text, and Meaning A Multidisciplinary Approach to Interview Data through the use of digital tools
被引:0
|作者:
van Hessen, Arjan
[1
]
Calamai, Silvia
[2
]
van den Heuvel, Henk
[3
]
Scagliola, Stefania
[4
]
Karrouche, Norah
[5
]
Beeken, Jeannine
[6
]
Corti, Louise
[7
]
Draxler, Christoph
[8
]
机构:
[1] Univ Twente, Enschede, Netherlands
[2] Univ Siena, Siena, Italy
[3] Radboud Univ Nijmegen, Nijmegen, Netherlands
[4] Univ Luxemburg, Luxembourg, Luxembourg
[5] Erasmus Univ, Rotterdam, Netherlands
[6] Univ Essex, Colchester, Essex, England
[7] UK Data Arch, Colchester, Essex, England
[8] Ludwig Maximilians Univ Munchen, Munich, Germany
来源:
COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION)
|
2020年
关键词:
interview data;
speech processing;
emotion detection;
transcription;
annotation;
NLP;
D O I:
10.1145/3395035.3425657
中图分类号:
TP3 [计算技术、计算机技术];
学科分类号:
0812 ;
摘要:
Interview data is multimodal data: it consists of speech sound, facial expression and gestures, captured in a particular situation, and containing textual information and emotion. This workshop shows how a multidisciplinary approach may exploit the full potential of interview data. The workshop first gives a systematic overview of the research fields working with interview data. It then presents the speech technology currently available to support transcribing and annotating interview data, such as automatic speech recognition, speaker diarization, and emotion detection. Finally, scholars who work with interview data and tools may present their work and discover how to make use of existing technology.
引用
收藏
页码:454 / 455
页数:2
相关论文