An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech

被引:0
|
作者
Liu, Changsong [1 ]
Thi Nga Ho [1 ]
Chng, Eng Siong [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
来源
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT II | 2023年 / 13996卷
基金
新加坡国家研究基金会;
关键词
Punctuation Restoration; Multilingual; Codeswitching; Automatic Speech Recognition; Singaporean Speech;
D O I
10.1007/978-981-99-5837-5_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Punctuation restoration is a crucial task in enriching automated transcripts produced by Automatic Speech Recognition (ASR) systems. This paper presents an empirical study on the impact of employing different data acquisition and training strategies on the performance of punctuation restoration models for multilingual and codeswitching speech. The study focuses on two of the most popular Singaporean spoken languages, namely English and Mandarin in both monolingual and codeswitching forms. Specifically, we experimented with in-domain and out-of-domain evaluation for multilingual and codeswitching speech. Subsequently, we enlarge the training data by sampling the codeswitching corpus by reordering the conversational transcripts. We also proposed to ensemble the predicting models by averaging saved model checkpoints instead of using the last checkpoint to improve the model performance. The model employs a slot-filling approach to predict the punctuation at each word boundary. Through utilizing and enlarging the available datasets as well as ensemble different model checkpoints, the result reaches an F1 score of 76.5% and 79.5% respectively for monolingual and codeswitch test sets, which exceeds the state-of-art performance. This investigation contributes to the existing literature on punctuation restoration for multilingual and code-switch speech. It offers insights into the importance of averaging model checkpoints in improving the final model's performance. Source codes and trained models are published on our Github's repo for future replications and usage.(https://github.com/charlieliu331/Punctuation_Restoration)
引用
收藏
页码:286 / 296
页数:11
相关论文
共 50 条
  • [31] Direct Speech in the context of discussion on code-switching
    Barciela, Lois Xacobe Atanes
    ESTUDOS DE LINGUISTICA GALEGA, 2023, 15
  • [32] A Study of Code-switching between Mandarin and Yantai Dialect from Social Perspectives
    Lyu, Cui-Cui
    2016 INTERNATIONAL CONFERENCE ON EDUCATION SCIENCE AND EDUCATION MANAGEMENT (ESEM 2016), 2016, : 62 - 66
  • [33] The PF Disjunction Theorem to Southern Min/Mandarin code-switching
    Wang, Sung-Lan
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2017, 21 (05) : 541 - 558
  • [34] Recognition and Translation of Code-switching Speech Utterances
    Nakayama, Sahoko
    Kano, Takatomo
    Tjandra, Andros
    Sakti, Sakriani
    Nakamura, Satoshi
    2019 22ND CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (O-COCOSDA), 2019, : 34 - 39
  • [35] Gender in Russian-English code-switching
    Chirsheva, Galina
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2009, 13 (01) : 63 - 90
  • [36] Open Domain Continuous Filipino Speech Recognition with Code-Switching
    Ang, Federico
    Miyanaga, Yoshikazu
    Guevara, Rowena Cristina
    Cajote, Rhandley
    Bayona, Michael Gringo Angelo
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 2301 - 2304
  • [37] Code-switching in South Asian English CMC
    Shakir, Muhammad
    Deuber, Dagmar
    ENGLISH WORLD-WIDE, 2024, 45 (03) : 311 - 341
  • [38] A corpus investigation of the typology of code-switching between closely related languages: Data from Mandarin-Taiwanese code-switching
    Hsiao, Chien-Han
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2024,
  • [39] A Tentative Study on Code-switching
    高迎梅
    科技信息, 2007, (12) : 381 - 382
  • [40] The Use of OK by Native and Non-Native Teachers in Bilingual Classrooms: Mandarin, English and Code-Switching
    Chang, Sophie Hsiu-Hui
    Huang, Lan-fen
    CONCENTRIC-STUDIES IN LINGUISTICS, 2018, 44 (02) : 111 - 135