An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech

Times cited: 0
Authors
Liu, Changsong [1]
Thi Nga Ho [1]
Chng, Eng Siong [1]
Affiliations
[1] Nanyang Technological University, Singapore
Source
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT II | 2023 / Vol. 13996
Funding
National Research Foundation, Singapore;
Keywords
Punctuation Restoration; Multilingual; Codeswitching; Automatic Speech Recognition; Singaporean Speech;
DOI
10.1007/978-981-99-5837-5_24
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Punctuation restoration is a crucial task for enriching the transcripts produced by Automatic Speech Recognition (ASR) systems. This paper presents an empirical study of how different data acquisition and training strategies affect the performance of punctuation restoration models for multilingual and code-switching speech. The study focuses on two of the most widely spoken languages in Singapore, English and Mandarin, in both monolingual and code-switching forms. Specifically, we experiment with in-domain and out-of-domain evaluation for multilingual and code-switching speech. We then enlarge the training data by resampling the code-switching corpus through reordering of the conversational transcripts. We also propose ensembling the prediction models by averaging saved model checkpoints, instead of using only the last checkpoint, to improve model performance. The model employs a slot-filling approach to predict the punctuation at each word boundary. By utilizing and enlarging the available datasets and ensembling different model checkpoints, we reach F1 scores of 76.5% and 79.5% on the monolingual and code-switching test sets, respectively, which exceeds state-of-the-art performance. This investigation contributes to the existing literature on punctuation restoration for multilingual and code-switching speech, and it offers insights into the importance of averaging model checkpoints for improving the final model's performance. Source code and trained models are published in our GitHub repository for future replication and use (https://github.com/charlieliu331/Punctuation_Restoration).
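The two mechanisms named in the abstract, checkpoint averaging and slot-filling punctuation prediction, can be illustrated with a minimal sketch. This is not the authors' implementation (that is in the repository linked above). It assumes that "averaging saved model checkpoints" means an element-wise mean of the stored parameters, which the abstract does not spell out, and it represents a checkpoint as a plain dictionary of float lists rather than framework tensors. The names `average_checkpoints`, `apply_punctuation`, and the label set in `LABEL_TO_MARK` are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence

# Hypothetical checkpoint format: parameter name -> flat list of weights.
Checkpoint = Dict[str, List[float]]


def average_checkpoints(checkpoints: Sequence[Checkpoint]) -> Checkpoint:
    """Return the element-wise mean of the parameters in several checkpoints."""
    if not checkpoints:
        raise ValueError("need at least one checkpoint")
    names = checkpoints[0].keys()
    for ckpt in checkpoints[1:]:
        if ckpt.keys() != names:
            raise ValueError("checkpoints must share the same parameter names")
    averaged: Checkpoint = {}
    for name in names:
        # Group the i-th weight of every checkpoint together, then average.
        columns = zip(*(ckpt[name] for ckpt in checkpoints))
        averaged[name] = [sum(col) / len(checkpoints) for col in columns]
    return averaged


# Hypothetical label set: one label per word, naming the mark that follows it.
LABEL_TO_MARK = {"O": "", "COMMA": ",", "PERIOD": ".", "QUESTION": "?"}


def apply_punctuation(words: Sequence[str], labels: Sequence[str]) -> str:
    """Fill the slot after each word with the punctuation its label names."""
    if len(words) != len(labels):
        raise ValueError("expected exactly one label per word")
    # Joining on spaces is a simplification that suits English text only.
    return " ".join(w + LABEL_TO_MARK[lab] for w, lab in zip(words, labels))


if __name__ == "__main__":
    merged = average_checkpoints([{"w": [1.0, 3.0]}, {"w": [3.0, 5.0]}])
    print(merged)
    print(apply_punctuation(["how", "are", "you"], ["O", "O", "QUESTION"]))
```

In this framing, the restoration model only has to emit one label per word boundary, and averaging replaces the single last checkpoint with a mean over several saved ones before inference.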
Pages: 286-296
Number of pages: 11