Spoken document retrieval using multilevel knowledge and semantic verification

Cited by: 13
Authors
Huang, Chien-Lin [1]
Wu, Chung-Hsien [1]
Affiliations
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007 / Vol. 15 / No. 08
Keywords
multilevel knowledge; semantic verification; spoken document retrieval (SDR); spoken keyword extraction;
DOI
10.1109/TASL.2007.907429
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This study presents a novel approach to spoken document retrieval based on multilevel knowledge indexing and semantic verification. Multilevel knowledge indexing considers three information sources, namely transcription data, keywords extracted from spoken documents, and hypernyms of the extracted keywords. A semantic network with forward-backward propagation is presented for semantic verification of the retrieved documents. In the forward step for semantic verification, a bag of keywords is chosen based on word significance measures. Semantic relations are estimated and adopted for verification in the backward procedure. The verification score is then utilized to weight and rerank the retrieved documents to obtain the final results. Experiments are performed on 40 h of anchor speech extracted from 198 h of collected broadcast news. Experimental results indicate that multilevel knowledge indexing and semantic verification achieve better retrieval results than other indexing schemes.
Pages: 2551-2560
Page count: 10