Analysis and synthesis of intonation using the Tilt model

被引:145
作者
Taylor, P [1 ]
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH1 1HN, Midlothian, Scotland
关键词
D O I
10.1121/1.428453
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces the Tilt intonational model and describes how this model can be used to automatically analyze and synthesize intonation. In the model, intonation is represented as a linear sequence of events, which can be pitch accents or boundary tones. Each event is characterized by continuous parameters representing amplitude, duration, and tilt (a measure of the shape of the event). The paper describes an event detector, in effect an intonational recognition system, which produces a transcription of an utterance's intonation. The features and parameters of the event detector are discussed and performance figures are shown on a variety of read and spontaneous speaker independent conversational speech databases. Given the event locations, algorithms are described which produce an automatic analysis of each event in terms of the Tilt parameters. Synthesis algorithms are also presented which generate F0 contours from Tilt representations. The accuracy of these is shown by comparing synthetic F0 contours to real F0 contours. The paper concludes with an extensive discussion on linguistic representations of intonation and gives evidence that the Tilt model goes a long way to satisfying the desired goals of such a representation in that it has the right number of degrees of freedom to be able to describe and synthesize intonation accurately. (C) 2000 Acoustical Society of America. [S0001-4966(00)01802-6].
引用
收藏
页码:1697 / 1714
页数:18
相关论文
共 49 条
[1]  
[Anonymous], P EUR C SPEECH TECHN
[2]  
[Anonymous], J PHONETICS
[3]  
[Anonymous], [No title captured]
[4]  
BAGSHAW PC, 1993, P EUR C SPEECH COMM, P1003
[5]  
BARD EG, 1995, P ESCA NATO TUT WORK, P25
[6]  
Baum L.E., 1972, Inequalities III: Proceedings of the Third Symposium on Inequalities, page, V3, P1
[7]  
BECKMAN ME, 1993, GUIDLINES TOBI LABEL
[8]  
BLACK AW, 1997, COMPUTING PROSODY, P117
[9]  
BRUCE G, 1977, THESIS U LUND
[10]   DECLINATION - CONSTRUCT OR INTRINSIC FEATURE OF SPEECH PITCH [J].
COHEN, A ;
COLLIER, R ;
THART, J .
PHONETICA, 1982, 39 (4-5) :254-273