ESTIMATION OF THE INVARIANT AND VARIANT CHARACTERISTICS IN SPEECH ARTICULATION AND ITS APPLICATION TO SPEAKER IDENTIFICATION

Cited by: 0
Authors
Prasad, Abhay [1]
Periyasamy, Vijitha [2]
Ghosh, Prasanta Kumar [2]
Affiliations
[1] Manipal Inst Technol, Manipal 576104, Karnataka, India
[2] Indian Inst Sci IISc, Dept Elect Engn, Bangalore 560012, Karnataka, India
Source
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015
Keywords
speech articulation; invariant gestures; speaker identification; FEATURES; PURSUIT;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Speech articulation varies across speakers for producing a speech sound due to the differences in their vocal tract morphologies, though the speech motor actions are executed in terms of relatively invariant gestures [1]. While the invariant articulatory gestures are driven by the linguistic content of the spoken utterance, the component of speech articulation that varies across speakers reflects speaker-specific and other paralinguistic information. In this work, we present a formulation to decompose the speech articulation from multiple speakers into the variant and invariant aspects when they speak the same sentence. The variant component is found to be a better representation for discriminating speakers compared to the speech articulation which includes the invariant part. Experiments with real-time magnetic resonance imaging (rtMRI) videos of speech production from multiple speakers reveal that the variant component of speech articulation yields a better frame-level speaker identification accuracy compared to the speech articulation as well as acoustic features by 29.9% and 9.4% (absolute) respectively.
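The abstract does not spell out the decomposition itself, so the following is only a minimal sketch of the general idea and not the authors' formulation. It assumes the articulatory features of all speakers are already time-aligned for the same sentence, and it uses the across-speaker mean as a stand-in for the invariant part and the per-speaker residual as the variant part. The function name `decompose`, the array shapes, and the synthetic data are all hypothetical.

```python
import numpy as np

def decompose(articulation):
    """Split time-aligned articulatory features into shared and speaker-specific parts.

    articulation: array of shape (n_speakers, n_frames, n_features).
    Time alignment across speakers is assumed, not performed here.
    """
    articulation = np.asarray(articulation, dtype=float)
    # Illustrative stand-in: the across-speaker mean plays the role of the invariant part.
    invariant = articulation.mean(axis=0)
    # The per-speaker residual is treated as the variant part.
    variant = articulation - invariant
    return invariant, variant

# Synthetic data: one shared trajectory plus a constant offset per speaker.
rng = np.random.default_rng(0)
shared = rng.normal(size=(50, 4))       # hypothetical common gesture trajectory
offsets = rng.normal(size=(3, 1, 4))    # hypothetical per-speaker offsets
data = shared + offsets                 # 3 speakers, 50 frames, 4 features

invariant, variant = decompose(data)
```

In this toy setting the variant part of each speaker reduces to that speaker's offset minus the mean offset, which is why a classifier trained on it would see only speaker-dependent information. A real system would replace the mean-and-residual step with the paper's own formulation.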
Pages: 4265 - 4269
Number of pages: 5
Related papers
11 items in total
  • [1] Application of formant instantaneous characteristics to speech recognition and speaker identification
    侯丽敏
    胡晓宁
    谢娟敏
    Advances in Manufacturing, 2011, (02) : 123 - 127
  • [2] A new frequency scale of Chinese whispered speech in the application of speaker identification
    Lin Wei
    Yang Lili
    Xu Boling
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2006, 16 (10) : 1072 - 1078
  • [4] Support Vector Machines Approaches and its Application to Speaker Identification
    Boujelbene, S. Zribi
    Mezghani, D. Ben Ayed
    Ellouze, N.
    2009 3RD IEEE INTERNATIONAL CONFERENCE ON DIGITAL ECOSYSTEMS AND TECHNOLOGIES, 2009, : 236 - +
  • [5] Adaptation of ANN for FPGA implementation and its application for speaker identification
    Elmisery, FA
    Khalil, AH
    Salama, AE
    Algeldawy, F
    ICEEC'04: 2004 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONIC AND COMPUTER ENGINEERING, PROCEEDINGS, 2004, : 317 - 320
  • [6] Speaker Identification and Its Application to Social Network Construction for Chinese Novels
    Jia, Yuxiang
    Dou, Huayi
    Cao, Shuai
    Zan, Hongying
    2020 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2020), 2020, : 13 - 18
  • [7] Automatic speaker verification from affective speech using Gaussian mixture model based estimation of neutral speech characteristics
    Avila, Anderson R.
    O'Shaughnessy, Douglas
    Falk, Tiago H.
    SPEECH COMMUNICATION, 2021, 132 : 21 - 31
  • [8] HIERARCHICAL MIXTURE CLUSTERING AND ITS APPLICATION TO GMM BASED TEXT INDEPENDENT SPEAKER IDENTIFICATION
    Saeidi, R.
    Mohammadi, H. R. Sadegh
    Ganchev, T.
    Rodman, R. D.
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 770 - +
  • [9] Modeling time series signal patterns by statistical distribution of prediction errors and its application to speaker identification
    Gu, QR
    Shibata, T
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IX, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING II, 2002, : 329 - 332
  • [10] EM Algorithm with Initialization Based on Incremental k-means for GMM and Its Application to Speaker Identification
    Lee, Younjeong
    Seo, Changwoo
    Hahn, Hernsoo
    Lee, Kiyong
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (03) : 141 - 149