Language Proficiency Assessment of English L2 Speakers Based on Joint Analysis of Prosody and Native Language

被引:3
作者
Zhang, Yue [1 ]
Weninger, Felix [2 ]
Batliner, Anton [3 ]
Hoenig, Florian [4 ]
Schuller, Bjorn [1 ]
机构
[1] Imperial Coll London, Dept Comp, London, England
[2] Nuance Commun, Ulm, Germany
[3] Univ Passau, Chair Complex & Intelligent Syst, Passau, Germany
[4] FAU Erlangen Nuremberg, Pattern Recognit Lab, Erlangen, Germany
来源
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION | 2016年
关键词
Non-Native Prosody; Ll Identification; Feature Evaluation;
D O I
10.1145/2993148.2993155
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present an in-depth analysis of the interdependency between the non-native prosody and the native language (L1) of English L2 speakers, as separately investigated in the Degree of Nativeness Task and the Native Language Task of the INTERSPEECH 2015 and 2016 Computational Paralinguistics ChallengE (ComParE). To this end, we propose a multi-task learning scheme based on auxiliary attributes for jointly learning the tasks of L1 classification and prosody score regression. The effectiveness of this approach is demonstrated in extensive experimental runs, comparing various standardised feature sets of prosodic, cepstral, spectral, and voice quality descriptors, as well as automatic feature selection. In the result, we show that the prediction of both prosody score and L1 can be improved by considering both tasks in a holistic way. In particular, we achieve an 11 % relative gain in regression performance (Spearman's correlation coefficient) on prosody scores, when comparing the best multi-and single-task learning results.
引用
收藏
页码:274 / 278
页数:5
相关论文
共 31 条
  • [1] Abcrcrombic D., 1967, ELEMENTS GEN PHONETI, V203
  • [2] [Anonymous], 2011, THESIS
  • [3] [Anonymous], P SPEECH PROS CHIC I
  • [4] [Anonymous], 2005, DATA MINING
  • [5] [Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI DOI 10.1145/2502081.2502224
  • [6] [Anonymous], 2015, P INTERSPEECH 2015 1
  • [7] [Anonymous], 2012, INT S AUTOMATIC DETE
  • [8] Language accent classification in American English
    Arslan, LM
    Hansen, JHL
    [J]. SPEECH COMMUNICATION, 1996, 18 (04) : 353 - 367
  • [9] Coutinho E, 2016, LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1328
  • [10] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES
    DAVIS, SB
    MERMELSTEIN, P
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04): : 357 - 366