EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE

Cited by: 0

Authors:

Zhao, Chen [1]

Wang, Hongcui [1]

Hyon, Songgun [1]

Wei, Jianguo [1]

Dang, Jianwu [1]

Affiliations:

[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China

Source:

2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | 2012

Keywords:

speaker identification; feature extraction; phoneme mean F-ratio;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Features used for speaker recognition should carry as much speaker-specific information as possible while attenuating linguistic information. To discard linguistic information effectively, this paper employs the phoneme mean F-ratio method to investigate how different frequency regions contribute to speaker discrimination from the viewpoint of Chinese phonemes, and applies the method to speaker identification. The speaker-specific information is found to depend on the phoneme and to be distributed across different frequency regions of the speech sound. Based on the contribution rate of each region, new features are extracted and combined with a Gaussian mixture model (GMM). A speaker identification experiment is conducted on a King-ASR Chinese database. Compared with the MFCC feature, the proposed feature reduces the identification error rate by 32.94%. The results confirm the effectiveness of the phoneme mean F-ratio method for improving speaker recognition performance for Chinese.
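The abstract does not give the formula for the phoneme mean F-ratio. As a rough illustration only, the sketch below assumes the common textbook definition of the F-ratio for one frequency band (variance of the speaker means divided by the average within-speaker variance) and assumes that the "phoneme mean" is a plain average of that ratio over phonemes. The function names, the data layout, and the toy values are hypothetical and are not taken from the paper.

```python
from statistics import fmean


def f_ratio(frames_by_speaker):
    """F-ratio of one frequency band (assumed textbook definition).

    frames_by_speaker: one list of per-frame band energies per speaker.
    Returns between-speaker variance / mean within-speaker variance.
    """
    speaker_means = [fmean(frames) for frames in frames_by_speaker]
    grand_mean = fmean(speaker_means)
    # Spread of the speaker means around the overall mean.
    between = fmean([(m - grand_mean) ** 2 for m in speaker_means])
    # Average spread of each speaker's frames around that speaker's mean.
    within = fmean([
        fmean([(x - m) ** 2 for x in frames])
        for frames, m in zip(frames_by_speaker, speaker_means)
    ])
    return between / within


def phoneme_mean_f_ratio(data):
    """Average the per-phoneme F-ratio over all phonemes (assumption).

    data: {phoneme: {speaker: [band energy per frame, ...]}}
    """
    return fmean([f_ratio(list(speakers.values())) for speakers in data.values()])


# Hypothetical toy values for a single frequency band.
toy = {
    "a": {"spk1": [1.0, 3.0], "spk2": [5.0, 7.0]},  # speakers well separated
    "i": {"spk1": [2.0, 4.0], "spk2": [2.0, 4.0]},  # speakers indistinguishable
}
print(phoneme_mean_f_ratio(toy))  # 2.0
```

Under these assumptions, a band with a high averaged ratio separates speakers well across phonemes, which is the kind of per-band contribution the abstract says is used to shape the new features. The sketch does not guard against a zero within-speaker variance.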


Pages: 345-348

Number of pages: 4
