Comparison of perceived prosodic boundaries and global characteristics of voice fundamental frequency contours in Mandarin speech

Cited by: 0
Authors
Gu, Wentao [1 ,2 ]
Hirose, Keikichi [1 ]
Fujisaki, Hiroya [1 ]
Affiliations
[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[2] Chinese Univ Hong Kong, Hong Hom, Hong Kong, Peoples R China
Source
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS | 2006 / Vol. 4274
Keywords
prosodic hierarchy; perceived prosodic boundary; F0 contour; phrase; command-response model; Mandarin; perception; production
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Although there have been many studies on the prosodic structure of spoken Mandarin, as well as many proposals for labeling its prosody, the labeling of prosodic boundaries in all existing annotation systems relies on auditory perception and lacks a direct relation to the acoustic process of prosody generation. Moreover, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, using an acoustic model for generating F0 contours. We then reveal the relationship between perceived prosodic boundaries at various layers and the phrase commands derived from the model-based analysis of F0 contours. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F0 contour generation.
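For readers unfamiliar with the command-response model named in the abstract, the following is a minimal sketch of how such a model is commonly formulated: the logarithm of F0 is a baseline plus the responses to impulse-like phrase commands and stepwise tone commands. It is not the authors' implementation. The function names, the constants `alpha = 3.0`, `beta = 20.0`, `gamma = 0.9`, and all command timings and amplitudes are assumptions chosen only for illustration; this record does not state them.

```python
import math

def phrase_response(t, alpha=3.0):
    """Response to a unit phrase command (impulse): alpha^2 * t * exp(-alpha * t) for t >= 0."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)

def tone_response(t, beta=20.0, gamma=0.9):
    """Response to a unit step command, capped at the ceiling gamma, for t >= 0."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def f0_contour(times, fb, phrase_cmds, tone_cmds):
    """Sum the baseline and all command responses in the log-F0 domain.

    times       : list of time points in seconds
    fb          : baseline frequency in Hz
    phrase_cmds : list of (onset, magnitude) pairs
    tone_cmds   : list of (onset, offset, amplitude) triples; the amplitude
                  may be negative, as is usual for tone languages
    """
    contour = []
    for t in times:
        log_f0 = math.log(fb)
        for onset, magnitude in phrase_cmds:
            log_f0 += magnitude * phrase_response(t - onset)
        for onset, offset, amplitude in tone_cmds:
            log_f0 += amplitude * (tone_response(t - onset) - tone_response(t - offset))
        contour.append(math.exp(log_f0))
    return contour

# Hypothetical commands, for illustration only.
times = [i / 100.0 for i in range(201)]
contour = f0_contour(times, 100.0, [(0.5, 0.4)], [(0.8, 1.0, 0.3)])
```

In a production-oriented analysis of the kind the abstract describes, the direction is reversed: phrase command onsets and magnitudes are estimated from an observed F0 contour and then compared with the perceived boundary labels.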
Pages: 31 / +
Page count: 2