Comparing languages using hierarchical prosodic analysis

Cited by: 1
Authors
Simko, Juraj [1]
Suni, Antti [1]
Hiovain, Katri [1, 2]
Vainio, Martti [1]
Affiliations
[1] Univ Helsinki, Helsinki, Finland
[2] Univ Tampere, Tampere, Finland
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Funding
Academy of Finland;
Keywords
language comparison; prosodic typology; wavelet transform; statistical modelling; QUANTITY;
DOI
10.21437/Interspeech.2017-1044
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a novel, data-driven approach to assessing mutual similarities and differences among a group of languages, based on purely prosodic characteristics, namely f0 and energy envelope signals. These signals are decomposed using the continuous wavelet transform; the components represent f0 and energy patterns on three levels of the prosodic hierarchy, roughly corresponding to syllables, words and phrases. Unigram language models with states derived from a combination of Delta-features obtained from these components are trained and compared using a mutual perplexity measure. In this pilot study we apply this approach to a small corpus of spoken material from seven languages (Estonian, Finnish, Hungarian, German, Swedish, Russian and Slovak) with a rich history of mutual language contacts. We present similarity trees (dendrograms) derived from the models using the hierarchically decomposed prosodic signals separately as well as combined, and compare them with patterns obtained from non-decomposed signals. We show that (1) plausible similarity patterns, reflecting language family relationships and the known contact history, can be obtained even from a relatively small data set, and (2) the hierarchical decomposition approach using both f0 and energy provides the most comprehensive results.
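The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the binary rise/fall discretisation in `delta_states`, the add-one smoothing, and the symmetrisation by averaging the two cross-perplexities are all assumptions made for illustration, and every function name is hypothetical.

```python
import math
from collections import Counter


def delta_states(components):
    """Turn several aligned signals into a sequence of discrete states.

    Assumption: each state is a tuple with one bit per component,
    1 if that component rose since the previous frame, else 0.
    """
    length = len(components[0])
    return [
        tuple(1 if c[t] > c[t - 1] else 0 for c in components)
        for t in range(1, length)
    ]


def train_unigram(states, vocab, alpha=1.0):
    """Unigram model with additive smoothing over a shared vocabulary."""
    counts = Counter(states)
    total = len(states) + alpha * len(vocab)
    return {s: (counts[s] + alpha) / total for s in vocab}


def perplexity(model, states):
    """Perplexity of a state sequence under a unigram model."""
    log_sum = sum(math.log(model[s]) for s in states)
    return math.exp(-log_sum / len(states))


def mutual_perplexity(states_a, states_b, alpha=1.0):
    """Assumed symmetric measure: mean of the two cross-perplexities."""
    vocab = set(states_a) | set(states_b)
    model_a = train_unigram(states_a, vocab, alpha)
    model_b = train_unigram(states_b, vocab, alpha)
    return 0.5 * (perplexity(model_a, states_b) + perplexity(model_b, states_a))
```

Computing `mutual_perplexity` for every pair of languages would give a distance-like matrix that a standard hierarchical clustering routine could turn into a dendrogram of the kind the abstract mentions.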
Pages: 1213-1217
Page count: 5