Addressing Code-Switching in French/Algerian Arabic Speech

Cited by: 20
Authors
Amazouz, Djegdjiga [1]
Adda-Decker, Martine [1, 2]
Lamel, Lori [2]
Affiliations
[1] Univ Sorbonne Nouvelle Paris III, CNRS, LPP, Paris, France
[2] Paris Saclay Univ, CNRS, LIMSI, Orsay, France
Source
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017
Keywords
Code-switching; Language Identification; Algerian Arabic; French
DOI
10.21437/Interspeech.2017-1373
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study focuses on code-switching (CS) in French/Algerian Arabic bilingual communities and investigates how speech technologies, such as automatic data partitioning, language identification and automatic speech recognition (ASR), can serve to analyze and classify this type of bilingual speech. A preliminary study carried out using a corpus of Maghrebian broadcast data revealed a relatively high presence of CS in Algerian Arabic as compared to the neighboring countries Morocco and Tunisia. Therefore this study focuses on code-switching produced by bilingual Algerian speakers who can be considered native speakers of both Algerian Arabic and French. A specific corpus of four hours of speech from 8 bilingual French-Algerian speakers was collected. This corpus contains read speech and conversational speech in both languages and includes stretches of code-switching. We provide a linguistic description of the code-switching stretches in terms of intra-sentential and inter-sentential switches and the speech duration in each language. We report on some initial studies to locate French, Arabic and the code-switched stretches, using ASR system word posteriors for this pair of languages.
Pages: 62-66
Number of pages: 5
Related papers
50 in total
  • [1] The French-Algerian Code-Switching Triggered audio corpus (FACST)
    Amazouz, Djegdjiga
    Adda-Decker, Martine
    Lamel, Lori
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1468 - 1473
  • [2] Arabic Code-Switching Speech Recognition using Monolingual Data
    Ali, Ahmed
    Chowdhury, Shammur
    Hussein, Amir
    Hifny, Yasser
    INTERSPEECH 2021, 2021, : 3475 - 3479
  • [3] Code-Switching in Algerian and Tunisian Rap
    Wiedemann, Felix
    ANNEE DU MAGHREB, 2016, 14
  • [4] An Algerian Arabic-French Code-Switched Corpus
    Cotterell, Ryan
    Renduchintala, Adithya
    Saphra, Naomi
    Callison-Burch, Chris
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [5] A Conversational Analysis of Arabic-French Code Switching in Algerian TV Talk Shows
    Rousan, Rafat A., I
    Merghmi, Kenza
    JORDAN JOURNAL OF MODERN LANGUAGES & LITERATURE, 2019, 11 (03) : 247 - 271
  • [6] Studying vowel variation in French-Algerian Arabic code-switched speech
    Wottawa, Jane
    Amazouz, Djegdjiga
    Adda-Decker, Martine
    Lamel, Lori
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2753 - 2757
  • [7] Effects of Dialectal Code-Switching on Speech Modules: A Study using Egyptian Arabic Broadcast Speech
    Chowdhury, Shammur A.
    Samih, Younes
    Eldesouki, Mohamed
    Ali, Ahmed
    INTERSPEECH 2020, 2020, : 2382 - 2386
  • [8] TEXTUAL DATA AUGMENTATION FOR ARABIC-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Hussein, Amir
    Chowdhury, Shammur Absar
    Abdelali, Ahmed
    Dehak, Najim
    Ali, Ahmed
    Khudanpur, Sanjeev
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 777 - 784
  • [9] ADDRESSING ACCENT MISMATCH IN MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
    Tan, Zhili
    Fan, Xinghua
    Zhu, Hui
    Lin, Ed
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8259 - 8263
  • [10] Code-switching in Indic Speech Synthesisers
    Thomas, Anju Leela
    Prakash, Anusha
    Baby, Arun
    Murthy, Hema A.
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1948 - 1952