Program Guardian: screening system with a novel speaker recognition approach for smart TV

Cited by: 1
Authors
Chin, Yu-Hao [1]
Tai, Tzu-Chiang [2]
Zhao, Jia-Hao [1]
Wang, Kuang-Yao [1]
Hong, Chao-Tse [3]
Wang, Jia-Ching [1]
Affiliations
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan, Taiwan
[2] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[3] Natl Chung Shan Inst Sci & Technol, Taoyuan, Taiwan
Keywords
Robust principal component analysis; Sparse representation classifier; Speaker recognition; Supervector; FACE RECOGNITION; VARIABILITY; FRAMEWORK
DOI
10.1007/s11042-016-3764-9
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents Program Guardian, a speaker recognition-based screening system for smart TVs. The system identifies a specific person by his or her voice so that the smart TV can provide programs suitable for that person. It is built on a robust speaker recognition method that uses robust principal component analysis (RPCA) and a sparse representation classifier (SRC). First, i-vectors generated from supervectors of Gaussian mixture models (GMMs) are used to form the basic atoms of an over-complete dictionary. The i-vectors are then transformed using RPCA, and the SRC is built from the RPCA-transformed i-vectors. Finally, the target speaker is identified as the one whose sparse representation gives the least reconstruction error. The NIST Speaker Recognition Evaluation database is used in the experiments. The results show that the proposed speaker recognition system is feasible and offers advantages in accuracy.
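The classification step described in the abstract, choosing the speaker whose dictionary atoms reconstruct a test vector with the least error, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes synthetic random vectors in place of RPCA-transformed i-vectors, and it uses orthogonal matching pursuit as a simple stand-in for whatever sparse solver the paper uses. The names `omp`, `src_classify`, `centers`, and `test_vec` are hypothetical.

```python
import numpy as np


def omp(D, y, n_nonzero):
    """Orthogonal matching pursuit: sparse code of y over the columns of D."""
    residual = y.astype(float).copy()
    support = []
    sol = np.zeros(0)
    for _ in range(n_nonzero):
        # Score every atom by its correlation with the current residual.
        scores = np.abs(D.T @ residual)
        if support:
            scores[support] = -1.0  # never reselect an atom
        support.append(int(np.argmax(scores)))
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
    coef = np.zeros(D.shape[1])
    coef[support] = sol
    return coef


def src_classify(D, labels, y, n_nonzero=5):
    """Return the label with the least class-wise reconstruction error."""
    D = D / np.linalg.norm(D, axis=0, keepdims=True)  # unit-norm atoms
    coef = omp(D, y, n_nonzero)
    errors = {}
    for speaker in np.unique(labels):
        mask = labels == speaker
        # Reconstruct y using only this speaker's atoms and coefficients.
        errors[int(speaker)] = float(np.linalg.norm(y - D[:, mask] @ coef[mask]))
    return min(errors, key=errors.get), errors


# Synthetic stand-in data (assumption): 3 speakers, 10 atoms each, 40 dimensions.
rng = np.random.default_rng(0)
dim, atoms_per_speaker = 40, 10
centers = rng.normal(size=(3, dim))
labels = np.repeat(np.arange(3), atoms_per_speaker)
D = (centers[labels] + 0.1 * rng.normal(size=(labels.size, dim))).T
test_vec = centers[2] + 0.1 * rng.normal(size=dim)

predicted, errors = src_classify(D, labels, test_vec)
print(predicted, errors)
```

In this toy setting the dictionary columns play the role of training vectors grouped by speaker, and the decision rule is simply the minimum over per-speaker residual norms. The RPCA transformation that the abstract places before classification is omitted here.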
Pages: 13881-13896
Page count: 16