[1]
Dehak N (2010) Front-end factor analysis for speaker verification IEEE Trans Audio, Speech, Lang Process 19 788-798
[2]
Kenny PJ (2021) Mel-frequency cepstral coefficient features based on standard deviation and principal component analysis for language identification systems Cogn Comput 13 1136-1153
[3]
Dehak R (2018) Generalized variability model for speaker verification IEEE Sig Process Lett 25 1775-1779
[4]
Albadr MAA (2022) A review into deep learning techniques for spoken language identification Multimed Tool Appl 81 32593-32624
[5]
Tiun S (2022) Multi-level self-attentive TDNN: A general and efficient approach to summarize speech into discriminative utterance-level representations Speech Commun 140 42-49
[6]
Ayob M (2022) Efficient self-supervised learning representations for spoken language identification IEEE J Sel Top Sig Process 16 1296-1307
[7]
Ma J (2017) LID-senones and their statistics for language identification IEEE/ACM Trans Aud, Speech, Lang Process 26 171-183
[8]
Sethu V (2020) A new time-frequency attention tensor network for language identification Circuits, Systems, and Signal Processing 39 2744-2758
[9]
Ambikairajah E (1993) Automatic language identification using Gaussian mixture and hidden Markov models. IEEE Int Conf Acoust, Speech Sig Process. IEEE 2 399-402
[10]
Thukroo IA (2022) Multi-level self-attentive TDNN: A general and efficient approach to summarize speech into discriminative utterance-level representations Speech Commun 140 42-49