A matching algorithm between arbitrary sections of two speech data sets for speech retrieval

被引：0

作者：

Itoh, Y ^{[1
]}

机构：

[1] Iwate Prefectural Univ, Morioka, Iwate, Japan

来源：

2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING | 2001年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a new matching algorithm to retrieve speech information from a speech database by speech query that allows continuous input. The algorithm is called Shift Continuous DP (CDP). Shift CDP extracts similar sections between two speech data sets. Two speech data sets are considered as reference patterns that are regarded as a speech database and input speech respectively. Shift CDP applies CDP to a constant length of unit reference patterns and provides a fast match between arbitrary sections in the reference pattern and the input speech. The algorithm allows endless input and real-time responses for the input speech query. Experiments were conducted for conversational speech and the results showed Shift CDP was successful in detecting similar sections between arbitrary sections of the reference speech and arbitrary sections of the input speech. This method can be applied to all kinds of time sequence data such as moving images.

引用

页码：593 / 596

页数：4

共 50 条

[21] An SNR-incremental stochastic matching algorithm for noisy speech recognition
Huang, CS
Wang, HC
Lee, CH
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): : 866 - 873
[22] Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech
Tang Weikang
Shao Yubin
Long Hua
Du Qingzhi
Peng Yi
Chen Liang
LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (07)
[23] A mismatch-aware stochastic matching algorithm for robust speech recognition
Liao, YF
Lin, JS
Chen, JH
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 101 - 104
[24] Matching algorithm for high speed linguistic processing in continuous speech recognition
Hamaguchi, Shigetatsu
Suzuki, Yoshitake
Denki Tsushin Kenkyujo kenkyu jitsuyoka hokoku, 1988, 37 (11): : 705 - 715
[25] A research on improving the algorithm for DTW feature-matching in speech recognition
Liu, Chang-Ming
Ren, Yi-Feng
Zhongbei Daxue Xuebao (Ziran Kexue Ban)/Journal of North University of China (Natural Science Edition), 2006, 27 (01): : 37 - 40
[26] A simplified viterbi matching algorithm for word partition in visual speech processing
Foo, SW
Lian, Y
Dong, L
Digital Media: Processing Multimedia Interactive Services, 2003, : 355 - 358
[27] A two-stage algorithm for enhancement of reverberant speech
Wu, MY
Wang, D
2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1085 - 1088
[28] DNN-based Speech Synthesis for Small Data Sets Considering Bidirectional Speech-Text Conversion
Sone, Kentaro
Nakashika, Toru
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2519 - 2523
[29] Implemetation of RSA Algorithm for Speech Data Encryption and Decryption
Rahman, Md. Mijanur
Saha, Tushar Kanti
Bhuiyan, Md. Al-Amin
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2012, 12 (03): : 74 - 82
[30] Speech data retrieval system constructed on a universal phonetic code domain
Tanaka, K
Itoh, Y
Kojima, H
Fujimura, N
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 323 - 326

← 1 2 3 4 5 →