Unsupervised Speaker Segmentation Framework Based on Sparse Correlation Feature

被引：0

作者：

Sun, Yi Xin ^{[1
]}

Ma, Yong ^{[1
]}

Shi, Kai Bo ^{[2
]}

Hu, Jiang Ping ^{[1
]}

Zhao, Yi Yi ^{[3
]}

Zhang, Yu Ping ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu, Sichuan, Peoples R China

[2] Chengdu Univ, Sch Informat Sci & Engn, Chengdu, Sichuan, Peoples R China

[3] Southwestern Univ Finance & Econ, Sch Business Adm, Chengdu, Sichuan, Peoples R China

来源：

2017 CHINESE AUTOMATION CONGRESS (CAC) | 2017年

基金：

中国国家自然科学基金;

关键词：

Hidden Markov Model; ilback-Leibler Divergence; Sparse Correlation Feature; PERSONALITY; METHODOLOGY; TRACKING; SIGNAL;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

With the increasing stress in working and studying, mental health becomes a major problem in the current social research. Generally, researchers can analyze psychological health states by using social perception behavior. The speech signal is an important research direction in this domain. It objectively assesses the mental health of social groups through the extraction and fusion of speech features. Thus, this requires an efficient speech segmentation algorithm. In this paper, we present a new framework of speech segmentation algorithm based on the hybrid of sparse correlation feature with Hidden Markov Model (HMM) as well as Kullback-Leibler Divergence (KLD)while it has been proven to gain higher accuracy. Specifically, HMM method can be used to gain the initial wearer's voice data. Experimental tests and comparisons with different segmentation methods have been conducted to verify the efficacy of the proposed unsupervised method. Very promising results have been obtained.

引用

页码：3058 / 3063

页数：6

共 50 条

[11] An Iterative Framework for Unsupervised Learning in the PLDA based Speaker Verification
Liu, Wenbo
Yu, Zhiding
Li, Ming
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 78 - +
[12] A speaker based unsupervised speech segmentation algorithm used in conversational speech
Chen, Yanxiang
Wang, Qiong
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, 2007, 4798 : 396 - +
[13] An Adaptive Threshold Computation for Unsupervised Speaker Segmentation
Docio-Fernandez, Laura
Lopez-Otero, Paula
Garcia-Mateo, Carmen
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 860 - 863
[14] SPARSE REPRESENTATION-BASED APPROACH FOR UNSUPERVISED FEATURE SELECTION
Su, Ya-Ru
Li, Chuan-Xi
Wang, Ru-Jing
Chen, Peng
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2014, 28 (03)
[15] Unsupervised feature engineering algorithm BioSAE based on sparse autoencoder
Zhou F.-F.
Zhang Y.-C.
Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (07): : 1645 - 1656
[16] Similarity Preserving Unsupervised Feature Selection based on Sparse Learning
Zare, Hadi
Parsa, Mohsen Ghasemi
Ghatee, Mehdi
Alizadeh, Sasan H.
2020 10TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2020, : 50 - 55
[17] Image segmentation by unsupervised sparse clustering
Jeon, BK
Jung, YB
Hong, KS
WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 2 - 7
[18] Image segmentation by unsupervised sparse clustering
Jeon, Byoung-Ki
Jung, Yun-Beom
Hong, Ki-Sang
PATTERN RECOGNITION LETTERS, 2006, 27 (14) : 1650 - 1664
[19] Unsupervised deep feature embeddings for speaker diarization
Ahmad, Rehan
Zubair, Syed
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (04) : 3138 - 3149
[20] Sparse DNN-based speaker segmentation using side information
Ma, Yong
Bao, Chang-Chun
ELECTRONICS LETTERS, 2015, 51 (08) : 651 - 653

← 1 2 3 4 5 →