Modeling long-range dependencies in speech data for text-independent speaker recognition

Cited by: 0
Authors
Ming, Ji [1]
Lin, Jie [2]
Affiliations
[1] Queens Univ Belfast, Inst ECIT, Belfast BT7 1NN, Antrim, North Ireland
[2] Univ Elect Sci & Technol China, Sch Comp Sci, Chengdu, Peoples R China
Source
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12 | 2008
Keywords
time dependence; segment modeling; speaker modeling; speaker recognition;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper proposes a new approach to modeling and matching long-range dependencies in free-text speech data for speaker recognition. The approach consists of a sentence model that captures dependencies up to the sentence level in the training data, and a search algorithm capable of locating matches of arbitrary-length segments between the training and testing sentences. The search algorithm is optimized to increase the probability of matching long, continuous segments rather than short, separated segments, on the assumption that long, continuous segments contain more speaker-specific information. The approach was evaluated on the NIST 1998 Speaker Recognition Evaluation database and showed improved performance.
Pages: 4825 / +
Number of pages: 2