A matching algorithm between arbitrary sections of two speech data sets for speech retrieval

Times cited: 0
Authors
Itoh, Y [1]
Affiliation
[1] Iwate Prefectural Univ, Morioka, Iwate, Japan
Source
2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURAL NETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING | 2001
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This paper proposes a new matching algorithm, Shift Continuous DP (Shift CDP), for retrieving speech information from a speech database using a spoken query given as continuous input. Shift CDP extracts similar sections between two speech data sets, which are treated as the reference pattern (the speech database) and the input speech, respectively. It applies Continuous DP (CDP) to unit reference patterns of constant length, which gives a fast match between arbitrary sections of the reference pattern and arbitrary sections of the input speech. The algorithm accepts endless input and responds to the spoken query in real time. In experiments on conversational speech, Shift CDP successfully detected similar sections between arbitrary sections of the reference speech and arbitrary sections of the input speech. The method can be applied to any kind of time-sequence data, such as moving images.
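The abstract gives no pseudocode, so the following is only a rough sketch of the idea it describes, not the author's implementation. It assumes that feature frames are tuples of floats, that the unit reference patterns are cut at a fixed frame shift, and that a plain start-free dynamic-programming recurrence can stand in for CDP. The names `unit_patterns`, `spot_unit`, `shift_cdp_sketch`, and the parameters `unit_len`, `shift`, and `threshold` are hypothetical. The real Shift CDP is described as fast; this sketch simply runs each unit pattern independently and makes no attempt at that speed-up.

```python
import math


def unit_patterns(reference, unit_len, shift):
    """Cut the reference into constant-length unit patterns, `shift` frames apart.

    Returns (start_frame, frames) pairs. Both parameters are assumptions;
    the abstract only says the unit patterns have constant length.
    """
    starts = range(0, len(reference) - unit_len + 1, shift)
    return [(s, reference[s:s + unit_len]) for s in starts]


def spot_unit(unit, stream):
    """Start-free DP of one unit pattern against the input, frame by frame.

    For every input frame t, returns the cost of the best alignment of the
    whole unit that ends at t, divided by the unit length (a crude
    normalisation, not the path-length normalisation a real system might use).
    """
    n = len(unit)
    prev = [math.inf] * n  # costs at the previous input frame
    ending_cost = []
    for frame in stream:  # one pass, so input can be consumed as it arrives
        local = [math.dist(u, frame) for u in unit]
        cur = [local[0]]  # a match may start at any input frame
        for i in range(1, n):
            cur.append(local[i] + min(prev[i], prev[i - 1], cur[i - 1]))
        ending_cost.append(cur[-1] / n)
        prev = cur
    return ending_cost


def shift_cdp_sketch(reference, stream, unit_len=4, shift=2, threshold=0.05):
    """Report (reference_start, input_end_frame, cost) for every unit match."""
    hits = []
    for start, unit in unit_patterns(reference, unit_len, shift):
        for t, cost in enumerate(spot_unit(unit, stream)):
            if cost <= threshold:
                hits.append((start, t, cost))
    return hits
```

Calling `shift_cdp_sketch(reference, stream)` on two lists of feature frames returns one tuple per unit pattern and input frame whose normalised cost falls under the threshold. Consecutive hits whose reference starts and input end frames both advance could then be merged into longer similar sections, although how the paper itself joins unit matches is not stated in the abstract.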
Pages: 593-596
Page count: 4
Related papers
50 in total
  • [21] An SNR-incremental stochastic matching algorithm for noisy speech recognition
    Huang, CS
    Wang, HC
    Lee, CH
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (08): 866-873
  • [22] Syllable Matching Algorithm with Spectral Peak Point Feature for Chinese Speech
    Tang Weikang
    Shao Yubin
    Long Hua
    Du Qingzhi
    Peng Yi
    Chen Liang
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (07)
  • [23] A mismatch-aware stochastic matching algorithm for robust speech recognition
    Liao, YF
    Lin, JS
    Chen, JH
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003: 101-104
  • [24] Matching algorithm for high speed linguistic processing in continuous speech recognition
    Hamaguchi, Shigetatsu
    Suzuki, Yoshitake
    Denki Tsushin Kenkyujo kenkyu jitsuyoka hokoku, 1988, 37 (11): 705-715
  • [25] A research on improving the algorithm for DTW feature-matching in speech recognition
    Liu, Chang-Ming
    Ren, Yi-Feng
    Zhongbei Daxue Xuebao (Ziran Kexue Ban)/Journal of North University of China (Natural Science Edition), 2006, 27 (01): 37-40
  • [26] A simplified viterbi matching algorithm for word partition in visual speech processing
    Foo, SW
    Lian, Y
    Dong, L
    Digital Media: Processing Multimedia Interactive Services, 2003: 355-358
  • [27] A two-stage algorithm for enhancement of reverberant speech
    Wu, MY
    Wang, D
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005: 1085-1088
  • [28] DNN-based Speech Synthesis for Small Data Sets Considering Bidirectional Speech-Text Conversion
    Sone, Kentaro
    Nakashika, Toru
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018: 2519-2523
  • [29] Implementation of RSA Algorithm for Speech Data Encryption and Decryption
    Rahman, Md. Mijanur
    Saha, Tushar Kanti
    Bhuiyan, Md. Al-Amin
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2012, 12 (03): 74-82
  • [30] Speech data retrieval system constructed on a universal phonetic code domain
    Tanaka, K
    Itoh, Y
    Kojima, H
    Fujimura, N
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001: 323-326