Use of Vowels in Discriminating Speech-laugh from Laughter and Neutral Speech

Cited by: 3

Authors:

Dumpala, Harsha ^{[1]}

Gangamohan, P. ^{[1]}

Gangashetty, Suryakanth V. ^{[1]}

Yegnanarayana, B. ^{[1]}

Affiliations:

[1] Int Inst Informat Technol, Hyderabad, Andhra Pradesh, India

Source:

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016

Keywords:

Speech-laugh; laughter; vowels; epochs; excitation source;

DOI:

10.21437/Interspeech.2016-1114

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In natural conversations, a significant part of laughter co-occurs with speech, which is referred to as speech-laugh. Hence, speech-laugh has characteristics of both laughter and neutral speech. However, it is not clear how the acoustic properties of neutral speech are influenced by co-occurring laughter. The objective of this study is to analyze the acoustic variations between vowel regions of laughter, speech-laugh and neutral speech. Features based on excitation source characteristics, extracted at epochs, are considered in this study. Features extracted in the vowel regions of speech-laugh exhibit deviations from those of laughter and neutral speech. These deviations in feature values are exploited to discriminate speech-laugh from laughter and neutral speech. Two different datasets, consisting of conversational speech and meeting recordings, are used in this analysis. Experimental results show that the discrimination between the three classes obtained by considering vowel regions is better than that obtained by considering the complete utterance.


Pages: 1437-1441

Page count: 5

References (17 in total):

[1] [Anonymous], 2005, P INT C METH TECHN B
[2] The acoustic features of human laughter
Bachorowski, JA
Smoski, MJ
Owren, MJ
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03) : 1581 - 1597
[3] Dumpala Sri Harsha, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P975, DOI 10.1109/ICASSP.2014.6853742
[4] Approximating the Kullback Leibler Divergence between Gaussian Mixture Models
Hershey, John R.
Olsen, Peder A.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 317 - 320
[5] Ishi C, 2014, INTERSPEECH, P1043
[6] Kennedy L., 2004, NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, P118
[7] Krikke TF, 2013, INTERSPEECH, P163
[8] Menezes C., 2006, P 6 INT SEM SPEECH P, P157
[9] Mittal VK, 2014, INTERSPEECH, P504
[10] Analysis of production characteristics of laughter
Mittal, Vinay Kumar
Yegnanarayana, Bayya
[J]. COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01) : 99 - 115
