EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE

Cited by: 0
Authors
Zhao, Chen [1]
Wang, Hongcui [1]
Hyon, Songgun [1]
Wei, Jianguo [1]
Dang, Jianwu [1]
Affiliation
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
Source
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | 2012
Keywords
speaker identification; feature extraction; phoneme mean F-ratio
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Features used for speaker recognition should carry as much speaker-specific information as possible while attenuating linguistic information. To discard linguistic information effectively, this paper employs the phoneme mean F-ratio method to investigate how different frequency regions contribute to speaker discrimination from the point of view of Chinese phonemes, and applies the method to speaker identification. The speaker-specific information is found to depend on the phoneme and to be distributed across different frequency regions of the speech sound. Based on the contribution rate of each region, new features were extracted and combined with a Gaussian mixture model (GMM). The speaker identification experiment was conducted on a King-ASR Chinese database. Compared with the MFCC feature, the proposed feature reduced the identification error rate by 32.94%. The results confirm the effectiveness of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.
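The abstract does not give the formula, so the following is only a minimal sketch of the idea under stated assumptions. It uses the textbook F-ratio (variance of the per-speaker means divided by the average within-speaker variance), computed per frequency band for each phoneme and then averaged over phonemes. The function names, the array layout, and the small stabilizing constant are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def f_ratio(band_energy, speaker_ids):
    """Per-band F-ratio for frames of a single phoneme (textbook definition).

    band_energy: array of shape (n_frames, n_bands), e.g. log filterbank energies.
    speaker_ids: array of shape (n_frames,), speaker label of each frame.
    """
    band_energy = np.asarray(band_energy, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    speakers = np.unique(speaker_ids)
    # Mean and variance of each band, computed separately for every speaker.
    means = np.stack([band_energy[speaker_ids == s].mean(axis=0) for s in speakers])
    variances = np.stack([band_energy[speaker_ids == s].var(axis=0) for s in speakers])
    between = means.var(axis=0)      # spread of the speaker means
    within = variances.mean(axis=0)  # average spread inside one speaker
    return between / (within + 1e-12)  # assumed constant to avoid division by zero


def phoneme_mean_f_ratio(band_energy, speaker_ids, phoneme_ids):
    """Average the per-phoneme F-ratio curves over all phonemes."""
    band_energy = np.asarray(band_energy, dtype=float)
    speaker_ids = np.asarray(speaker_ids)
    phoneme_ids = np.asarray(phoneme_ids)
    curves = []
    for p in np.unique(phoneme_ids):
        mask = phoneme_ids == p
        curves.append(f_ratio(band_energy[mask], speaker_ids[mask]))
    return np.mean(curves, axis=0)


if __name__ == "__main__":
    # Toy data: band 0 separates the two speakers, band 1 does not.
    frames = [[1, 0], [3, 2], [5, 2], [7, 0]] * 2
    speakers = ["A", "A", "B", "B"] * 2
    phonemes = ["a"] * 4 + ["i"] * 4
    print(phoneme_mean_f_ratio(frames, speakers, phonemes))
```

Bands with a high averaged F-ratio vary more between speakers than within a speaker, so under this reading they are the regions a feature extractor would emphasize, for example by allocating finer filterbank resolution there.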
Pages: 345-348
Number of pages: 4