Addressing Code-Switching in French/Algerian Arabic Speech

被引:20
作者
Amazota, Djegdjiga [1 ]
Adda-Decker, Martine [1 ,2 ]
Lamel, Lori [2 ]
机构
[1] Univ Sorbonne Nouvelle Paris III, CNRS, LPP, Paris, France
[2] Paris Saclay Univ, CNRS, LIMSI, Orsay, France
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
Code-switching; Language Identification; Algerian Arabic; French;
D O I
10.21437/Interspeech.2017-1373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies. such as automatic data partitioning, language identification and automatic speech recognition (ASR) can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS Algerian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore this study focuses on code switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches, the speech duration in each language. We report on some initial studies to locate French, Arabic and the code-switched stretches, using ASR system word posteriors for this pair of languages.
引用
收藏
页码:62 / 66
页数:5
相关论文
共 50 条
  • [41] Code-switching in early English literature
    Schendl, Herbert
    LANGUAGE AND LITERATURE, 2015, 24 (03) : 233 - 248
  • [42] Semi-supervised acoustic model training for speech with code-switching
    Yilmaz, Emre
    McLaren, Mitchell
    van den Heuvel, Henk
    van Leeuwen, David A.
    SPEECH COMMUNICATION, 2018, 105 : 12 - 22
  • [43] Language-specific Characteristic Assistance for Code-switching Speech Recognition
    Song, Tongtong
    Xu, Qiang
    Ge, Meng
    Wang, Longbiao
    Shi, Hao
    Lv, Yongjie
    Lin, Yuqin
    Dang, Jianwu
    INTERSPEECH 2022, 2022, : 3924 - 3928
  • [44] DATA AUGMENTATION FOR END-TO-END CODE-SWITCHING SPEECH RECOGNITION
    Du, Chenpeng
    Li, Hao
    Lu, Yizhou
    Wang, Lan
    Qian, Yanmin
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 194 - 200
  • [45] Pronunciation augmentation for Mandarin-English code-switching speech recognition
    Long, Yanhua
    Wei, Shuang
    Lian, Jie
    Li, Yijie
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [46] An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech
    Liu, Changsong
    Thi Nga Ho
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT II, 2023, 13996 : 286 - 296
  • [47] Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech
    Yilmaz, Emre
    van den Heuvel, Henk
    van Leeuwen, David A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1933 - 1937
  • [48] Code-Switching in Speech of Tundra Yukaghir: Bi- and Multilingual Repetition
    Kurilova, Samona N.
    NAUCHNYI DIALOG, 2023, 12 (05): : 72 - 92
  • [49] Code-switching Speech Detection Method by Combination of Language and Acoustic Information
    Zhang, Hongji
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 372 - 375
  • [50] Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition
    Zhou, Xinyuan
    Yilmaz, Emre
    Long, Yanhua
    Li, Yijie
    Li, Haizhou
    INTERSPEECH 2020, 2020, : 1042 - 1046