NON-AUTOREGRESSIVE MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION

Cited by: 7
Authors
Chuang, Shun-Po [1]
Chang, Heng-Jui [1]
Huang, Sung-Feng [1]
Lee, Hung-yi [1]
Affiliation
[1] National Taiwan University, College of Electrical Engineering and Computer Science, Taipei, Taiwan
Source
2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU) | 2021
Keywords
non-autoregressive; code-switching; end-to-end speech recognition
DOI
10.1109/ASRU51503.2021.9688174
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Mandarin-English code-switching (CS) is frequently used among East and Southeast Asian people. However, intra-sentence switching between two very different languages makes recognizing CS speech challenging. Meanwhile, recent non-autoregressive (NAR) ASR models remove the need for the left-to-right beam decoding used in autoregressive (AR) models and achieve outstanding performance and fast inference, but they have not been applied to Mandarin-English CS speech recognition. This paper takes advantage of the Mask-CTC NAR ASR framework to tackle CS speech recognition. We further propose changing the Mandarin output target of the encoder to Pinyin for faster encoder training, and we introduce a Pinyin-to-Mandarin decoder to learn contextualized information. Moreover, we use word embedding label smoothing to regularize the decoder with contextualized information, and projection matrix regularization to bridge the gap between the encoder and decoder. We evaluate these methods on the SEAME corpus and achieve exciting results.
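The Mask-CTC idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a generic Mask-CTC-style first pass in which per-frame CTC posteriors are greedily decoded, repeated labels and blanks are collapsed, and any token whose confidence falls below a threshold is replaced by a mask for a decoder to fill in later. The function name `ctc_greedy_with_mask`, the toy vocabulary, the threshold value, and the use of the maximum frame probability as token confidence are all assumptions made for illustration.

```python
from itertools import groupby

BLANK = 0          # assumed index of the CTC blank symbol
MASK = "<mask>"    # assumed placeholder for low-confidence tokens


def ctc_greedy_with_mask(frame_probs, vocab, threshold=0.9):
    """Greedy CTC decoding that masks low-confidence tokens.

    frame_probs: list of per-frame probability lists over the vocabulary.
    vocab: list mapping label index to token string (index 0 is blank).
    threshold: tokens with confidence below this value become MASK.
    """
    # Per-frame argmax label and its probability.
    best = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    conf = [p[i] for p, i in zip(frame_probs, best)]

    tokens = []
    pos = 0
    # Collapse runs of identical labels, then drop blanks.
    for label, run in groupby(best):
        length = len(list(run))
        if label != BLANK:
            # Assumption: token confidence is the highest frame
            # probability inside the collapsed run.
            token_conf = max(conf[pos:pos + length])
            tokens.append(vocab[label] if token_conf >= threshold else MASK)
        pos += length
    return tokens


# Toy example with Pinyin-like and English targets (hypothetical values).
vocab = ["<blank>", "ni3", "hao3", "world"]
frame_probs = [
    [0.10, 0.80, 0.05, 0.05],
    [0.05, 0.95, 0.00, 0.00],
    [0.90, 0.05, 0.05, 0.00],
    [0.10, 0.20, 0.60, 0.10],
    [0.02, 0.00, 0.00, 0.98],
]
print(ctc_greedy_with_mask(frame_probs, vocab))
```

In this toy input the second token is predicted with low confidence and is therefore masked; in a Mask-CTC-style system a decoder would then predict the masked positions conditioned on the confident tokens around them.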
Pages: 465-472
Page count: 8
Related papers
50 in total
  • [31] Code-Switching in Automatic Speech Recognition: The Issues and Future Directions
    Mustafa, Mumtaz Begum
    Yusoof, Mansoor Ali
    Khalaf, Hasan Kahtan
    Abushariah, Ahmad Abdel Rahman Mahmoud
    Kiah, Miss Laiha Mat
    Ting, Hua Nong
    Muthaiyah, Saravanan
    APPLIED SCIENCES-BASEL, 2022, 12 (19):
  • [32] BENCHMARKING EVALUATION METRICS FOR CODE-SWITCHING AUTOMATIC SPEECH RECOGNITION
    Hamed, Injy
    Hussein, Amir
    Chellah, Oumnia
    Chowdhury, Shammur
    Mubarak, Hamdy
    Sitaram, Sunayana
    Habash, Nizar
    Ali, Ahmed
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 999 - 1005
  • [33] Lexical tonal effects in code-switching: A comparative study of Cantonese, Mandarin, and Vietnamese switching with English
    Li, Katrina Kechun
    Nguyen, Li
    Bryant, Christopher
    Yoo, Kayeon
    INTERNATIONAL JOURNAL OF BILINGUALISM, 2024, 28 (05) : 799 - 827
  • [34] The Use of OK by Native and Non-Native Teachers in Bilingual Classrooms: Mandarin, English and Code-Switching
    Chang, Sophie Hsiu-Hui
    Huang, Lan-fen
    CONCENTRIC-STUDIES IN LINGUISTICS, 2018, 44 (02) : 111 - 135
  • [35] Open Domain Continuous Filipino Speech Recognition with Code-Switching
    Ang, Federico
    Miyanaga, Yoshikazu
    Guevara, Rowena Cristina
    Cajote, Rhandley
    Bayona, Michael Gringo Angelo
    2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 2301 - 2304
  • [36] Language-specific Characteristic Assistance for Code-switching Speech Recognition
    Song, Tongtong
    Xu, Qiang
    Ge, Meng
    Wang, Longbiao
    Shi, Hao
    Lv, Yongjie
    Lin, Yuqin
    Dang, Jianwu
    INTERSPEECH 2022, 2022, : 3924 - 3928
  • [37] Code-switching in Indic Speech Synthesisers
    Thomas, Anju Leela
    Prakash, Anusha
    Baby, Arun
    Murthy, Hema A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1948 - 1952
  • [38] Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition
    Zhou, Xinyuan
    Yilmaz, Emre
    Long, Yanhua
    Li, Yijie
    Li, Haizhou
    INTERSPEECH 2020, 2020, : 1042 - 1046
  • [39] Code-Switching and College English Teaching
    Bo, Li
    PROCEEDINGS OF THE SIXTH NORTHEAST ASIA INTERNATIONAL SYMPOSIUM ON LANGUAGE, LITERATURE AND TRANSLATION, 2017, : 724 - 729
  • [40] Code-switching in early English literature
    Schendl, Herbert
    LANGUAGE AND LITERATURE, 2015, 24 (03) : 233 - 248