An Analysis of the Influence of Acoustical Adverse Conditions on Speaker Gender Identification

Cited by: 0
Authors
Maka, Tomasz [1]
Dziurzanski, Piotr [1]
Affiliations
[1] West Pomeranian Univ Technol, Fac Comp Sci & Informat Technol, PL-71210 Szczecin, Poland
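Source
2014 XXII ANNUAL PACIFIC VOICE CONFERENCE (PVC) | 2014
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract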
Speaker gender as a biometric feature plays an important role in numerous voice-based services. In this work we perform an accuracy analysis of a gender recognition system in different acoustical environments (indoor and outdoor auditory scenes). At the evaluation stage, each sentence has been mixed with several types of background noise at various signal-to-noise ratio levels. Then the voiced parts of speech have been extracted and parametrized using features based on filter banks and vocal-tract properties. The obtained feature trajectories have been non-linearly smoothed in order to minimize the influence of adverse conditions on the spoken sentences. The observed accuracy is acceptable for voice-based tasks where the gender information can improve their performance.
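Two steps named in the abstract, mixing a sentence with background noise at a chosen signal-to-noise ratio and non-linearly smoothing a feature trajectory, can be illustrated with a minimal Python sketch. The abstract does not say how the mixing was implemented or which smoother was used, so everything here is an assumption: the function names `mix_at_snr` and `median_smooth` are hypothetical, and the sliding-window median filter is only one common non-linear smoother, not necessarily the authors' choice.

```python
import math
from statistics import median


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`, then add it.

    Hypothetical helper: the record does not describe the authors' mixing code.
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    p_speech = sum(s * s for s in speech) / len(speech)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("signals must have non-zero power")
    # SNR_dB = 10 * log10(P_speech / (gain**2 * P_noise)), solved for gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


def median_smooth(trajectory, width=5):
    """Sliding-window median filter; the window shrinks at the edges.

    Assumed stand-in for the unspecified non-linear smoothing step.
    """
    if width < 1 or width % 2 == 0:
        raise ValueError("width must be a positive odd integer")
    half = width // 2
    return [
        median(trajectory[max(0, i - half): i + half + 1])
        for i in range(len(trajectory))
    ]
```

For instance, `mix_at_snr(speech, noise, 10)` returns a signal whose added noise component lies 10 dB below the speech power, and `median_smooth(f0_track, 5)` suppresses isolated outliers in a per-frame feature track (`f0_track` being a hypothetical list of frame values) while leaving step-like changes largely intact.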
Pages: 4
Related papers
50 in total
  • [31] Speaker's Gender Identification for Human-Robot Interaction
    Bae, Kyung-Sook
    Kwak, Keun-Chang
    Chi, Soo-Young
    SIGMAP 2006: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2006, : 339 - +
  • [32] Speaker Gender Identification Based on Combining Linear and Nonlinear Features
    Fan Yingle
    Yi Li
    Tong Qinye
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 6745 - +
  • [33] Attention based gender and nationality information exploration for speaker identification
    Tang, Yong
    Liu, Chuang
    Leng, Yan
    Zhao, Weiwei
    Sun, Jiande
    Sun, Chengli
    Wang, Rongyan
    Yuan, Qi
    Li, Dengwang
    Xu, Huaqiang
    DIGITAL SIGNAL PROCESSING, 2022, 123
  • [34] Speaker identification using cepstral analysis
    Nazar, MN
    ISCON 2002: IEEE STUDENTS CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2002, : 139 - 143
  • [35] ANALYSIS OF DNN APPROACHES TO SPEAKER IDENTIFICATION
    Matejka, Pavel
    Glembek, Ondrej
    Novotny, Ondrej
    Plchot, Oldrich
    Grezl, Frantisek
    Burget, Lukas
    Cernocky, Jan "Honza"
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5100 - 5104
  • [36] SPEAKER IDENTIFICATION BY ANALYSIS OF SOUND ISLANDS
    WOOD, CA
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S183 - S183
  • [37] Acoustical Keystroke Analysis for User Identification and Authentication
    Pleva, Matus
    Kiktova, Eva
    Viszlay, Peter
    Bours, Patrick
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA (RADIOELEKTRONIKA 2016), 2016, : 386 - 389
  • [38] Speaker Identification in Noise Mismatch Conditions Based on Jump Function Kolmogorov Analysis in Wavelet Domain
    Dat, Tran Huy
    Li Haizhou
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1469 - 1472
  • [39] Influence of speaker de-identification in depression detection
    Lopez-Otero, Paula
    Magarinos, Carmen
    Docio-Fernandez, Laura
    Rodriguez-Banga, Eduardo
    Erro, Daniel
    Garcia-Mateo, Carmen
    IET SIGNAL PROCESSING, 2017, 11 (09) : 1023 - 1030
  • [40] Speech Coding Influence on Features Dedicated to Speaker Identification
    Maka, Tomasz
    Bonikowski, Lukasz
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 489 - 492