Paralinguistic profiling using speech recognition

Cited by: 5
Authors
Johar, Swati [1]
Affiliations
[1] Defense Inst Psychol Res, DRDO, Timarpur, New Delhi, India
Keywords
Emotive vocalization; Paralanguage; Speech disruption; Speech recognition
DOI
10.1007/s10772-013-9222-4
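Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809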
Abstract
This research explores indicators of non-verbal cues in speech and presents a method for building a paralinguistic profile of these speech characteristics, from which the emotional state of the speaker is determined. Since a major part of human communication consists of vocalization, a robust approach is presented that classifies and segments an audio stream into silent and voiced regions and develops a paralinguistic profile for it. The data, which contain speech disruptions, are first segmented into frames and then analyzed using short-term acoustic features, temporal characteristics of speech, and measures of verbal productivity. Finally, a matrix is developed relating the paralinguistic properties of average pitch, energy, rate of speech, silence duration, and loudness to their respective contexts. Happy and confident states showed high energy and rate of speech and short silence durations, whereas tense and sad states showed low energy and speech rate and long periods of silence. Paralanguage was found to be an important cue for deciphering the implicit meaning in a speech sample.
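The frame-based segmentation into silent and voiced regions mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's method: it assumes a simple fixed threshold on short-time energy, and the function names, frame length, hop size, and threshold value are all hypothetical choices made for illustration.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a sequence of samples into overlapping fixed-length frames."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)

def label_frames(samples, frame_len=400, hop=160, threshold=1e-3):
    """Return True for frames above the energy threshold (treated as voiced).

    The threshold is an assumed value, not one taken from the paper.
    """
    frames = frame_signal(samples, frame_len, hop)
    return [short_time_energy(f) > threshold for f in frames]

def silence_duration(labels, hop, sample_rate):
    """Approximate total silence in seconds from the count of silent frames."""
    silent_frames = sum(1 for voiced in labels if not voiced)
    return silent_frames * hop / sample_rate

# Synthetic input: 0.5 s of a 200 Hz tone followed by 0.5 s of silence at 16 kHz.
sample_rate = 16000
tone = [0.5 * math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(8000)]
signal = tone + [0.0] * 8000

labels = label_frames(signal)
silence_seconds = silence_duration(labels, hop=160, sample_rate=sample_rate)
```

In this sketch, silence duration is one of the profile properties named in the abstract; pitch, rate of speech, and loudness would need separate estimators that are not shown here.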
Pages: 205-209
Page count: 5