New word detection in audio-indexing

Authors
Dharanipragada, S [1]
Roukos, S [1]
Affiliations
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
Source
1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS | 1997
DOI
10.1109/ASRU.1997.659135
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
For an audio-indexing system that uses a speech recognizer with a fixed vocabulary to be practical, one needs the ability to detect out-of-vocabulary (new) words at query time. In this paper we present a fast, vocabulary-independent algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word or phone sequence in speech. The preprocessing stage produces a phone-level representation of the speech that can be searched efficiently. The coarse search, which consists of phone n-gram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified by the coarse search, which gives the desired speed in word spotting.
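The coarse-to-detailed strategy described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the n-gram order, the index structure, or the form of the detailed acoustic match, so the trigram order, the voting scheme, and every function name here are assumptions. In particular, a phone-level edit distance stands in for the detailed acoustic match, which in a real system would score the audio against acoustic models.

```python
from collections import defaultdict

def phone_ngrams(phones, n=3):
    """Return (start_index, ngram) pairs for a phone sequence."""
    return [(i, tuple(phones[i:i + n])) for i in range(len(phones) - n + 1)]

def build_index(phones, n=3):
    """Preprocessing (assumed form): map each phone n-gram to its start positions."""
    index = defaultdict(list)
    for i, gram in phone_ngrams(phones, n):
        index[gram].append(i)
    return index

def coarse_search(index, query, n=3, min_votes=1):
    """Coarse match: each query n-gram found in the index votes for the
    position where the whole query would have to start."""
    votes = defaultdict(int)
    for offset, gram in phone_ngrams(query, n):
        for pos in index.get(gram, []):
            votes[pos - offset] += 1
    return sorted(s for s, v in votes.items() if v >= min_votes and s >= 0)

def edit_distance(a, b):
    """Levenshtein distance between two phone sequences.
    Stand-in for the detailed acoustic match (an assumption of this sketch)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]

def spot_word(phones, index, query, n=3, max_dist=1):
    """Run the detailed match only at the putative hits from the coarse search."""
    hits = []
    for start in coarse_search(index, query, n):
        window = phones[start:start + len(query)]
        dist = edit_distance(window, query)
        if dist <= max_dist:
            hits.append((start, dist))
    return hits

# Hypothetical phone stream and query, for illustration only.
phones = ["dh", "ax", "n", "uw", "w", "er", "d", "ih", "z",
          "n", "uw", "w", "er", "k"]
query = ["n", "uw", "w", "er", "d"]
index = build_index(phones)
print(coarse_search(index, query))
print(spot_word(phones, index, query, max_dist=0))
```

The index is built once per recording, so a query costs only a few dictionary lookups plus one detailed comparison per putative hit, rather than a scan of the whole phone stream.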
Pages: 551-557
Page count: 7