Modeling long-range dependencies in speech data for text-independent speaker recognition

被引：0

作者：

Ming, Ji ^{[1
]}

Lin, Jie ^{[2
]}

机构：

[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland

[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China

来源：

2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008年

关键词：

time dependence; segment modeling; speaker modeling; speaker recognition;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In the paper, a new approach for modeling and matching long-range dependencies in free-text speech data is proposed for speaker recognition. The new approach consists of a sentence model to detail up to sentence-level dependencies in the training data, and a search algorithm that is capable of locating the matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability for the match of long, continuous segments as opposed to short, separated segments, assuming that long, continuous segments contain more specific information about the speaker. The new approach has been evaluated on the NIST 1998 Speaker Recognition Evaluation database, and has shown improved performance.

引用

页码：4825 / +

页数：2

共 50 条

[21] Multigrained modeling with pattern specific maximum likelihood transformations for text-independent speaker recognition
Chaudhari, UV
Navrátil, J
Maes, SH
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01): : 61 - 69
[22] Robust features for text-independent speaker recognition with short utterances
Rania Chakroun
Mondher Frikha
Neural Computing and Applications, 2020, 32 : 13863 - 13883
[23] Spin-Image Descriptors for Text-Independent Speaker Recognition
Mohammed, Suhaila N.
Jabir, Adnan J.
Abbas, Zaid Ali
EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 216 - 226
[24] FREQUENCY AND TEMPORAL CONVOLUTIONAL ATTENTION FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
Yadav, Sarthak
Rai, Atul
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6794 - 6798
[25] Ensemble of Support Vector Machine for Text-Independent Speaker Recognition
Lei, Zhenchun
Yang, Yingchun
Wu, Zhaohui
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (5A): : 163 - 167
[26] An overview of text-independent speaker recognition: From features to supervectors
Kinnunen, Tomi
Li, Haizhou
SPEECH COMMUNICATION, 2010, 52 (01) : 12 - 40
[27] Adaptive Convolutional Neural Network for Text-Independent Speaker Recognition
Kim, Seong-Hu
Park, Yong-Hwa
INTERSPEECH 2021, 2021, : 66 - 70
[28] TEACHER-STUDENT TRAINING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
Ng, Raymond W. M.
Liu, Xuechen
Swietojanski, Pawel
2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 1044 - 1051
[29] Angular Margin Centroid Loss for Text-independent Speaker Recognition
Wei, Yuheng
Du, Junzhao
Liu, Hui
INTERSPEECH 2020, 2020, : 3820 - 3824
[30] A Multiscale Feature Extraction Method for Text-independent Speaker Recognition
Chen Zhigao
Li Peng
Xiao Runqiu
Li Ta
Wang Wenchao
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (11) : 3266 - 3271

← 1 2 3 4 5 →