Using closely-related language to build an ASR for a very under-resourced language: Iban

被引：0

作者：

Juan, Sarah Samson ^{[1
]}

Besacier, Laurent ^{[1
]}

Lecouteux, Benjamin ^{[1
]}

Tan, Tien-Ping ^{[2
]}

机构：

[1] Grenoble Alpes Univ, Grenoble Informat Lab LIG, Grenoble, France

[2] Univ Sains Malaysia, Sch Comp Sci, George Town, Malaysia

来源：

2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA) | 2014年

关键词：

automatic speech recognition; acoustic modelling; subspace Gaussian mixture model; bootstrapping grapheme-to-phoneme;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes our work on automatic speech recognition system (ASR) for an under-resourced language, namely the Iban language, which is spoken in Sarawak, a Malaysian Borneo state. To begin this study, we collected 8 hours of speech data due to no resources yet for ASR concerning this language. Following the lack of resources, we employed bootstrapping techniques on a closely-related language to build the Iban system. For this case, we utilized Malay data to bootstrap the grapheme-to-phoneme system (G2P) for the target language. We also developed several G2Ps to acquire Iban pronunciation dictionaries, which were later evaluated on the Iban ASR for obtaining the best version. Subsequently, we conducted experiments on cross-lingual ASR by using subspace Gaussian Mixture Models (SGMM) where the shared parameters obtained in either monolingual or multilingual fashion. From our observations, using out-of-language data as source language provided lower WER when Iban data is very imited.

引用

页数：5

共 22 条

[1]

[Anonymous], 2013, PERFECTIVE IMPERFECT

[2]

[Anonymous], 2011, WORKSH AUT SPEECH RE

[3]

Barras C., 2000, P SPEECH COMM SPEC I, V33

[4] Automatic speech recognition for under-resourced languages: A survey [J].

Besacier, Laurent ;

Barnard, Etienne ;

Karpov, Alexey ;

Schultz, Tanja .

SPEECH COMMUNICATION, 2014, 56 :85-100

[5] Maximum likelihood linear transformations for HMM-based speech recognition [J].

Gales, MJF .

COMPUTER SPEECH AND LANGUAGE, 1998, 12 (02) :75-98

[6]

Gopinath RA, 1998, INT CONF ACOUST SPEE, P661, DOI 10.1109/ICASSP.1998.675351

[7] Using out-of-language data to improve an under-resourced speech recognizer [J].

Imseng, David ;

Motlicek, Petr ;

Bourlard, Herve ;

Garner, Philip N. .

SPEECH COMMUNICATION, 2014, 56 :142-151

[8]

Juan S. S., 2013, P 4 WORKSH S SE AS N

[9]

Juan S. S., 2014, WORKSH SPOK LANG TEC

[10] Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition [J].

Lu, Liang ;

Ghoshal, Arnab ;

Renals, Steve .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (01) :17-27

← 1 2 3 →