Automatic Smoker Detection from Telephone Speech Signals

Cited by: 4
Authors
Poorjam, Amir Hossein [1]
Hesaraki, Soheila [2]
Safavi, Saeid [3]
van Hamme, Hugo [4]
Bahari, Mohamad Hasan [4]
Affiliations
[1] Aalborg Univ, Audio Anal Lab, AD MT, Aalborg, Denmark
[2] Ferdowsi Univ Mashhad, Dept Elect Engn, Fac Engn, Mashhad, Razavi Khorasan, Iran
[3] Univ Hertfordshire, Sch Engn & Technol, ECE Div, Informat Engn & Proc Architectures Grp, Hatfield, Herts, England
[4] Katholieke Univ Leuven, Ctr Proc Speech & Images PSI, Leuven, Belgium
Source
SPEECH AND COMPUTER, SPECOM 2017 | 2017 / Vol. 10458
Keywords
Smoker detection; i-Vector; Non-negative factor analysis; Score fusion; Logistic regression; SPEAKER; ADAPTATION; LANGUAGE;
DOI
10.1007/978-3-319-66429-3_19
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
This paper proposes a method for automatically detecting smoking habits from spontaneous telephone speech signals. In this method, each utterance is modeled using the i-vector and non-negative factor analysis (NFA) frameworks, which yield low-dimensional representations of utterances by applying factor analysis to Gaussian mixture model means and weights, respectively. Each framework is evaluated with different classification algorithms to detect smokers. Finally, score-level fusion of the i-vector-based and NFA-based recognizers is considered to improve classification accuracy. The proposed method is evaluated on telephone speech signals from speakers whose smoking habits are known, drawn from the National Institute of Standards and Technology (NIST) 2008 and 2010 Speaker Recognition Evaluation databases. Experimental results over 1194 utterances show the effectiveness of the proposed approach for automatic smoking habit detection.
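The score-level fusion step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes logistic regression as the fusion model (the record lists it only as a keyword), the two score columns are synthetic stand-ins for the outputs of an i-vector-based and an NFA-based classifier, and the names `train_fusion` and `fuse` are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_fusion(scores, labels, lr=0.1, epochs=2000):
    """Fit logistic-regression fusion weights by batch gradient descent.

    scores: (n_utterances, n_systems) array of per-system scores.
    labels: (n_utterances,) array, 1 = smoker, 0 = non-smoker.
    Returns (weights, bias).
    """
    n, d = scores.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(epochs):
        p = sigmoid(scores @ w + b)   # predicted smoker probability
        grad = p - labels             # gradient of the log-loss w.r.t. the logit
        w -= lr * (scores.T @ grad) / n
        b -= lr * grad.mean()
    return w, b

def fuse(scores, w, b):
    """Combine per-system scores into one posterior-like fused score."""
    return sigmoid(scores @ w + b)

# Synthetic data (assumption): each system emits a noisy score whose mean
# depends on the true label. Real scores would come from trained classifiers.
rng = np.random.default_rng(0)
n = 400
labels = rng.integers(0, 2, size=n)
ivec_scores = (2 * labels - 1) * 1.0 + rng.normal(0.0, 1.5, n)
nfa_scores = (2 * labels - 1) * 0.8 + rng.normal(0.0, 1.5, n)
scores = np.column_stack([ivec_scores, nfa_scores])

# Learn fusion weights on the first 300 utterances, evaluate on the rest.
w, b = train_fusion(scores[:300], labels[:300])
fused = fuse(scores[300:], w, b)
accuracy = ((fused > 0.5) == labels[300:]).mean()
print("fusion weights:", w, "bias:", b)
print("held-out accuracy of fused scores:", accuracy)
```

The learned weights indicate how much each subsystem contributes to the fused decision; with real data, the fusion parameters would be trained on a development set separate from the test utterances.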
Pages: 200-210
Page count: 11
Related papers
50 in total
  • [21] Towards Automatic Emotional State Categorization from Speech Signals
    Shaukat, Arslan
    Chen, Ke
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008: 2771 - 2774
  • [22] Automatic Speech Recognition from Neural Signals: A Focused Review
    Herff, Christian
    Schultz, Tanja
    FRONTIERS IN NEUROSCIENCE, 2016, 10
  • [23] DETECTION OF NOISELIKE SOUNDS IN TELEPHONE SPEECH
    DRUCKER, H
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01): 9 - &
  • [24] Using Automatic Speech Recognition to Measure the Intelligibility of Speech Synthesized from Brain Signals
    Varshney, Suvi
    Farias, Dana
    Brandman, David M.
    Stavisky, Sergey D.
    Miller, Lee M.
    2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023
  • [25] Evaluation of Wavelet Measures on Automatic Detection of Emotion in Noisy and Telephony Speech Signals
    Vasquez-Correa, J. C.
    Garcia, N.
    Vargas-Bonilla, J. F.
    Orozco-Arroyave, J. R.
    Arias-Londono, J. D.
    Lucia Quintero M, O.
    2014 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2014
  • [26] Automatic speech recognition services in common telephone network
    Karpov, A
    Ronzhin, A
    Proceedings of the Second IASTED International Multi-Conference on Automation, Control, and Information Technology - Signal and Image Processing, 2005: 220 - 225
  • [27] Towards robust automatic evaluation of pathologic telephone speech
    Riedhammer, K.
    Stemmer, G.
    Haderlein, T.
    Schuster, M.
    Rosanowski, F.
    Noeth, E.
    Maier, A.
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007: 717 - +
  • [28] Automatic intelligibility assessment of pathologic speech over the telephone
    Haderlein, Tino
    Noeth, Elmar
    Batliner, Anton
    Eysholdt, Ulrich
    Rosanowski, Frank
    LOGOPEDICS PHONIATRICS VOCOLOGY, 2011, 36 (04): 175 - 181
  • [29] Two-stage speech/non-speech classification of telephone signals
    Li Jian-Bin
    Yan Ji-Kun
    Zheng Hui
    Niu Zhong-Xia
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006: 490 - +
  • [30] Evolution of the performance of automatic speech recognition algorithms in transcribing conversational telephone speech
    Padmanabhan, M
    Saon, G
    Zweig, G
    Huang, J
    Kingsbury, B
    Mangu, L
    IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001: 1926 - 1931