Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance

被引:0
作者
Das, Sourya Dipta [1 ]
Vadi, Yash [1 ]
Unnam, Abhishek [1 ]
Yadav, Kuldeep [1 ]
机构
[1] SHL Labs, Bangalore, Karnataka, India
来源
INTERSPEECH 2023 | 2023年
关键词
Out of Distribution Detection; Open Set Classification; Outlier Detection; Dialect Identification; Wav2vec; 2.0; Automatic Speech Recognition;
D O I
10.21437/Interspeech.2023-1974
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Dialect classification is used in a variety of applications, such as machine translation and speech recognition, to improve the overall performance of the system. In a real-world scenario, a deployed dialect classification model can encounter anomalous inputs that differ from the training data distribution, also called out-of-distribution (OOD) samples. Those OOD samples can lead to unexpected outputs, as dialects of those samples are unseen during model training. Out-of-distribution detection is a new research area that has received little attention in the context of dialect classification. Towards this, we proposed a simple yet effective unsupervised Mahalanobis distance feature-based method to detect out-of-distribution samples. We utilize the latent embeddings from all intermediate layers of a wav2vec 2.0 transformer-based dialect classifier model for multi-task learning. Our proposed approach outperforms other state-of-the-art OOD detection methods significantly.
引用
收藏
页码:1978 / 1982
页数:5
相关论文
共 31 条
[31]   Language/Dialect Recognition Based on Unsupervised Deep Learning [J].
Zhang, Qian ;
Hansen, John H. L. .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (05) :873-882