Spoken document retrieval using multilevel knowledge and semantic verification

被引：13

作者：

Huang, Chien-Lin ^{[1
]}

Wu, Chung-Hsien ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan

来源：

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 08期

关键词：

multilevel knowledge; semantic verification; spoken document retrieval (SDR); spoken keyword extraction;

D O I：

10.1109/TASL.2007.907429

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This study presents a novel approach to spoken document retrieval based on multilevel knowledge indexing and semantic verification. Multilevel knowledge indexing considers three information sources, namely transcription data, keywords extracted from spoken documents, and hypernyms of the extracted keywords. A semantic network with forward-backward propagation is presented for semantic verification of the retrieved documents. In the forward step for semantic verification, a bag of keywords is chosen based on word significance measures. Semantic relations are estimated and adopted for verification in the backward procedure. The verification score is then utilized to weight and rerank the retrieved documents to obtain the final results. Experiments are performed on 40 h of anchor speech extracted from 198 It of collected broadcast news. Experimental results indicate that multilevel knowledge indexing and semantic verification achieve better retrieval results than other indexing schemes.

引用

页码：2551 / 2560

页数：10

共 32 条

[1] Buckley C., 2000, P 23 ANN INT ACM SIG, P33, DOI DOI 10.1145/345508.345543
[2] Automatic recognition of spontaneous speech for access to multilingual oral history archives
Byrne, W
Doermann, D
Franz, MT
Gustman, S
Hajic, J
Oard, D
Picheny, M
Psutka, J
Ramabhadran, B
Soergel, D
Ward, T
Zhu, WJ
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04): : 420 - 435
[3] Chen K. J., 2003, BUILDING USING PARSE, P231
[4] Relaxation motion and possible memory of domain structures in barium titanate ceramics studied by mechanical and dielectric losses
Cheng, BL
Gabbay, M
Maglione, M
Fantozzi, G
[J]. JOURNAL OF ELECTROCERAMICS, 2003, 10 (01) : 5 - 18
[5] CRESTANI F, 2001, P ACM SIGIR C RES DE, P420
[6] Cutler R., 2002, ACM MULTIMEDIA, P503
[7] A multistage algorithm for spotting new words in speech
Dharanipragada, S
Roukos, S
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (08): : 542 - 550
[8] A TRANSLATION APPROACH TO PORTABLE ONTOLOGY SPECIFICATIONS
GRUBER, TR
[J]. KNOWLEDGE ACQUISITION, 1993, 5 (02): : 199 - 220
[9] SpeechFind: Advances in spoken document retrieval for a National Gallery of the Spoken Word
Hansen, JH
Huang, RQ
Zhou, B
Seadle, M
Deller, JR
Gurijala, AR
Kurimo, M
Angkititrakul, P
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 712 - 730
[10] HAUPTMANN AG, 1997, P IEEE INT C AC SPEE, V1, P195

← 1 2 3 4 →