共 31 条
Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance
被引:0
作者:
Das, Sourya Dipta
[1
]
Vadi, Yash
[1
]
Unnam, Abhishek
[1
]
Yadav, Kuldeep
[1
]
机构:
[1] SHL Labs, Bangalore, Karnataka, India
来源:
INTERSPEECH 2023
|
2023年
关键词:
Out of Distribution Detection;
Open Set Classification;
Outlier Detection;
Dialect Identification;
Wav2vec;
2.0;
Automatic Speech Recognition;
D O I:
10.21437/Interspeech.2023-1974
中图分类号:
O42 [声学];
学科分类号:
070206 ;
082403 ;
摘要:
Dialect classification is used in a variety of applications, such as machine translation and speech recognition, to improve the overall performance of the system. In a real-world scenario, a deployed dialect classification model can encounter anomalous inputs that differ from the training data distribution, also called out-of-distribution (OOD) samples. Those OOD samples can lead to unexpected outputs, as dialects of those samples are unseen during model training. Out-of-distribution detection is a new research area that has received little attention in the context of dialect classification. Towards this, we proposed a simple yet effective unsupervised Mahalanobis distance feature-based method to detect out-of-distribution samples. We utilize the latent embeddings from all intermediate layers of a wav2vec 2.0 transformer-based dialect classifier model for multi-task learning. Our proposed approach outperforms other state-of-the-art OOD detection methods significantly.
引用
收藏
页码:1978 / 1982
页数:5
相关论文