Use of Vowels in Discriminating Speech-laugh from Laughter and Neutral Speech

被引:3
作者
Dumpala, Harsha [1 ]
Gangamohan, P. [1 ]
Gangashetty, Suryakanth V. [1 ]
Yegnanarayana, B. [1 ]
机构
[1] Int Inst Informat Technol, Hyderabad, Andhra Pradesh, India
来源
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年
关键词
Speech-laugh; laughter; vowels; epochs; excitation source;
D O I
10.21437/Interspeech.2016-1114
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In natural conversations, significant part of laughter co-occurs with speech which is referred to as speech-laugh. Hence, speech-laugh will have characteristics of both laughter and neutral speech. But it is not clearly evident how acoustic properties of neutral speech are influenced by its co-occurring laughter. The objective of this study is to analyze the acoustic variations between vowel regions of laughter, speech-laugh and neutral speech. The features based on excitation source characteristics extracted at epochs are considered in this study. Features extracted in the vowel regions of speech-laugh exhibit deviations from that of laughter and neutral speech. These deviations in feature values are exploited to discriminate speech-laugh from laughter and neutral speech. Two different datasets consisting of conversational speech and meeting recordings are used in this analysis. Experimental results show that the discrimination between the three classes obtained by considering vowel regions is better than that of considering the complete utterance.
引用
收藏
页码:1437 / 1441
页数:5
相关论文
共 17 条
  • [1] [Anonymous], 2005, P INT C METH TECHN B
  • [2] The acoustic features of human laughter
    Bachorowski, JA
    Smoski, MJ
    Owren, MJ
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03) : 1581 - 1597
  • [3] Dumpala Sri Harsha, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P975, DOI 10.1109/ICASSP.2014.6853742
  • [4] Approximating the Kullback Leibler Divergence between Gaussian Mixture Models
    Hershey, John R.
    Olsen, Peder A.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 317 - 320
  • [5] Ishi C, 2014, INTERSPEECH, P1043
  • [6] Kennedy L., 2004, NIST ICASSP 2004 Meeting Recognition Workshop, Montreal, P118
  • [7] Krikke TF, 2013, INTERSPEECH, P163
  • [8] Menezes C., 2006, P 6 INT SEM SPEECH P, P157
  • [9] Mittal VK, 2014, INTERSPEECH, P504
  • [10] Analysis of production characteristics of laughter
    Mittal, Vinay Kumar
    Yegnanarayana, Bayya
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01) : 99 - 115