Unsupervised Speaker Segmentation Framework Based on Sparse Correlation Feature

Cited by: 0
Authors
Sun, Yi Xin [1 ]
Ma, Yong [1 ]
Shi, Kai Bo [2 ]
Hu, Jiang Ping [1 ]
Zhao, Yi Yi [3 ]
Zhang, Yu Ping [1 ]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu, Sichuan, Peoples R China
[2] Chengdu Univ, Sch Informat Sci & Engn, Chengdu, Sichuan, Peoples R China
[3] Southwestern Univ Finance & Econ, Sch Business Adm, Chengdu, Sichuan, Peoples R China
Source
2017 CHINESE AUTOMATION CONGRESS (CAC) | 2017
Funding
National Natural Science Foundation of China;
Keywords
Hidden Markov Model; Kullback-Leibler Divergence; Sparse Correlation Feature; PERSONALITY; METHODOLOGY; TRACKING; SIGNAL;
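DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract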
With increasing stress at work and in study, mental health has become a major problem in current social research. Generally, researchers can analyze psychological health states by using social perception behavior. The speech signal is an important research direction in this domain: through the extraction and fusion of speech features, it allows the mental health of social groups to be assessed objectively. This requires an efficient speech segmentation algorithm. In this paper, we present a new speech segmentation framework based on a hybrid of the sparse correlation feature with the Hidden Markov Model (HMM) and the Kullback-Leibler Divergence (KLD), which has been proven to gain higher accuracy. Specifically, the HMM method can be used to obtain the initial voice data of the wearer. Experimental tests and comparisons with different segmentation methods have been conducted to verify the efficacy of the proposed unsupervised method, and very promising results have been obtained.
Pages: 3058 / 3063
Page count: 6