Pitch Contour Modelling and Modification for Expressive Marathi Speech Synthesis

被引:0
作者
Deo, Rohit S. [1 ]
Deshpande, Pallavi S. [2 ]
机构
[1] SKN Coll Engn, Dept E&TC, Pune, Maharashtra, India
[2] BVDU Coll Engn, Dept E&TC, Pune, Maharashtra, India
来源
2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2014年
关键词
Text-to-speech; Expressive Speech Synthesis; Prosody;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we have measured and analyzed features of speech signal such as fundamental frequency, jitter and shimmer its statistical modeling for Marathi. These models can be used for modifying prosody of the neutral speech further. Jitter and shimmer are measures of cycle-to-cycle variations of fundamental frequency and amplitude respectively. It characterizes the emotion and differs in values as emotion varies. An emotion or target model mentioned here is in the form of interrogate. A pitch target model is developed to model and modify the prosody of the Marathi words. The study comprises the study of existing pitch contour of words whose prosody is to be modified and target pitch contour. Its statistical analysis is done. At the end Gaussian normalization is employed to modify the prosody with help of analyzed data. Result of the subjective experiments satisfies the native listeners.
引用
收藏
页码:2455 / 2458
页数:4
相关论文
共 13 条
  • [1] Ceyysens Tim, 2002, P 3 IEEE BEN SIGN PR, pS02
  • [2] Chappel David T., 1988, SPEAKER SPECIFIC PIT, P885
  • [3] Dellaert Frank, RECOGNIZING EMOTION, P3
  • [4] Jan P. H., PROSODIC MODELLING T, P2
  • [5] Kameoka Hirokazu, STAT MODEL SPEECH F, P1
  • [6] Kang YG, 2006, INT CONF ACOUST SPEE, P733
  • [7] Kochkmann Marcel, SLT 2008, P45
  • [8] Lee Ki Young, STAT CONVERSION ALGO, P1
  • [9] Li Chunrong, ISCSLP 2012, P93
  • [10] Li Xi, ICASSP 2007, P1082