Speaker Characterization Using Long-Term and Temporal Information

Authors
Huang, Chien-Lin [1]
Sun, Hanwu [1]
Ma, Bin [1]
Li, Haizhou [1]
Affiliations
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore
Source
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010
Keywords
speaker recognition; long-term feature; temporal information
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents new techniques for front-end analysis using long-term and temporal information for speaker recognition. We propose a long-term feature analysis strategy that averages short-time spectral features over a period of time in an effort to capture the speaker traits that are manifested over a speech segment longer than a spectral frame. We found that the moving averages of temporal information are effective in speaker recognition as well. Experiments on the 2008 NIST Speaker Recognition Evaluation dataset show that the long-term and temporal information contribute to substantial EER reductions.
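The averaging idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `long_term_average` and `deltas`, the window length, and the use of first-order frame differences as the "temporal information" are all assumptions made for illustration only.

```python
from typing import List, Sequence


def long_term_average(frames: Sequence[Sequence[float]], window: int) -> List[List[float]]:
    """Moving average of per-frame feature vectors over `window` frames.

    Hypothetical helper: the paper's actual window length and feature
    type are not stated in the abstract.
    """
    if window < 1:
        raise ValueError("window must be >= 1")
    if len(frames) < window:
        return []
    dim = len(frames[0])
    # Sum the first window, then slide it forward one frame at a time.
    sums = [0.0] * dim
    for frame in frames[:window]:
        for d in range(dim):
            sums[d] += frame[d]
    averaged = [[s / window for s in sums]]
    for t in range(window, len(frames)):
        for d in range(dim):
            sums[d] += frames[t][d] - frames[t - window][d]
        averaged.append([s / window for s in sums])
    return averaged


def deltas(frames: Sequence[Sequence[float]]) -> List[List[float]]:
    """First-order differences between consecutive frames.

    Assumed stand-in for "temporal information"; the abstract does not
    define how the temporal features are computed.
    """
    return [[b - a for a, b in zip(prev, cur)]
            for prev, cur in zip(frames, frames[1:])]


# Toy two-dimensional "spectral" frames, purely illustrative values.
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
long_term = long_term_average(frames, window=2)
temporal = long_term_average(deltas(frames), window=2)
```

Under these assumptions, `long_term` holds one smoothed vector per window position, and `temporal` applies the same smoothing to the frame-to-frame differences, so each output vector summarizes a span longer than a single spectral frame.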
Pages: 370-373
Page count: 4
Related papers
50 in total
  • [1] Using Long-Term Information to Improve Robustness in Speaker Identification
    Lyons, James G.
    O'Connell, James G.
    Paliwal, Kuldip K.
    2010 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2010
  • [2] LONG-TERM RETENTION OF TEMPORAL INFORMATION
    GUAY, M
    PERCEPTUAL AND MOTOR SKILLS, 1982, 54 (03) : 843 - 849
  • [3] Speaker Discrimination Using Long-Term Spectrum of Speech
    Sigmund, Milan
    INFORMATION TECHNOLOGY AND CONTROL, 2019, 48 (03) : 446 - 453
  • [4] Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection
    Min, Kyle
    Roy, Sourya
    Tripathi, Subarna
    Guha, Tanaya
    Majumdar, Somdeb
    COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 371 - 387
  • [5] The acquisition and long-term retention of temporal, spatial, and item information
    Sinclair, GP
    Healy, AF
    Bourne, LE
    JOURNAL OF MEMORY AND LANGUAGE, 1997, 36 (04) : 530 - 549
  • [6] LONG-TERM FEATURE AVERAGING FOR SPEAKER RECOGNITION
    MARKEL, JD
    OSHIKA, BT
    GRAY, AH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1977, 25 (04) : 330 - 337
  • [7] LONG-TERM AUDITORY MEMORY - SPEAKER IDENTIFICATION
    SASLOVE, H
    YARMEY, AD
    JOURNAL OF APPLIED PSYCHOLOGY, 1980, 65 (01) : 111 - 116
  • [8] Effects of Long-Term Ageing on Speaker Verification
    Kelly, Finnian
    Harte, Naomi
    BIOMETRICS AND ID MANAGEMENT, 2011, 6583 : 113 - 124
  • [9] Improving speaker verification performance against long-term speaker variability
    Wang, Linlin
    Wang, Jun
    Li, Lantian
    Zheng, Thomas Fang
    Soong, Frank K.
    SPEECH COMMUNICATION, 2016, 79 : 14 - 29
  • [10] MMN reveals the retrieving of temporal information from long-term memory
    Atienza, M
    Cantero, JL
    Dominguez-Marin, E
    INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2001, 41 (03) : 230 - 230