A framework for effective annotation of information from closed captions using ontologies

被引:0
作者
Khan, L [1 ]
McLeod, D
Hovy, E
机构
[1] Univ Texas, Dept Comp Sci, Richardson, TX 75083 USA
[2] Univ So Calif, Dept Comp Sci, Los Angeles, CA 90088 USA
[3] Univ So Calif, Inst Informat Sci, Marina Del Rey, CA 90292 USA
基金
美国国家科学基金会;
关键词
metadata; ontology; audio; SQL;
D O I
10.1007/s10844-005-0188-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve the accuracy in terms of precision and recall of an audio information retrieval system we have created a domain-specific ontology (a collection of key concepts and their interrelationships), as well as a novel, pruning algorithm. Given the shortcomings of keyword-based techniques, we have opted to employ a concept-based technique utilizing this ontology. Achieving high precision and high recall is the key problem in the retrieval of audio information. In traditional approaches, high recall is typically achieved at the expense of low precision, and vice versa. Through the use of a domain-specific ontology appropriate concepts can be identified during metadata generation (description of audio) or query generation, thus improving precision. When irrelevant concepts are associated with queries or documents there is a loss of precision. On the other side of the coin, if relevant concepts are discarded, a loss of recall will ensue. In conjunction with the use of a domain specific ontology we have thus proposed a novel, automatic pruning algorithm which prunes as many irrelevant concepts as possible during any case of description and identification of documents, and query generation. To improve recall, A controlled and correct query expansion mechanism is proposed for the improvement of recall, thus guaranteeing that precision will not be lost. We have constructed a demonstration prototype, and experimentally and analytically we have shown that our model, compared to keyword search, achieves a significantly higher degree of precision and recall.
引用
收藏
页码:181 / 205
页数:25
相关论文
共 24 条
  • [21] Voorhees E. M., 1994, SIGIR '94. Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, P61
  • [22] WILCOX LD, 1992, P INT C AC SPEECH SI, V2, P97
  • [23] WOODS W, 1999, CONCEPTUAL INDEXING
  • [24] *XML, 1999, US XML ONT CONC KNOW