Online Unsupervised Pattern Discovery in Speech using Parallelization

被引:0
作者
Gajjar, Mrugesh R. [1 ]
Govindarajan, R. [1 ]
Sreenivas, T. V. [2 ]
机构
[1] Indian Inst Sci, Supercomp Educ & Res Ctr, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
来源
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年
关键词
Unsupervised pattern discovery; Dynamic time warping; Parallelization; Spoken language systems;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Segmental dynamic time warping (DTW) has been demonstrated to be a useful technique for finding acoustic similarity scores between segments of two speech utterances. Due to its high computational requirements, it had to be computed in an offline manner, limiting the applications of the technique. In this paper, we present results of parallelization of this task by distributing the workload in either a static or dynamic way on an 8-processor cluster and discuss the trade-offs among different distribution schemes. We show that online unsupervised pattern discovery using segmental DTW is plausible with as low as 8 processors. This brings the task within reach of today's general purpose multi-core servers. We also show results on a 32-processor system, and discuss factors affecting scalability of our methods.
引用
收藏
页码:2458 / +
页数:2
相关论文
共 13 条
  • [1] [Anonymous], HIDDEN MARKOV MODEL
  • [2] [Anonymous], 1997, COMPUTER
  • [3] An overview of audio information retrieval
    Foote, J
    [J]. MULTIMEDIA SYSTEMS, 1999, 7 (01) : 2 - 10
  • [4] Grama Ananth, 2003, Introduction to Parallel Computing
  • [5] Speech and language technologies for audio indexing and retrieval
    Makhoul, J
    Kubala, F
    Leek, T
    Liu, DB
    Nguyen, L
    Schwartz, R
    Srivastava, A
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1338 - 1353
  • [6] Park A, 2005, 2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), P53
  • [7] PARK A, 2006, P SLT 2006 DEC
  • [8] PARK A, 2006, P IEEE ICASSP 2006 M
  • [9] Rabiner L., 1993, Fundamentals of Speech Recognition
  • [10] Ravishankar M. K., 1993, PARALLEL IMPLEMENTAT