Social media retrieval using image features and structured text

被引:0
作者
Iskandar, D. N. F. Awang [1 ]
Pehcevski, Jovan [1 ]
Thom, James A. [1 ]
Tahaghoghi, S. M. M. [1 ]
机构
[1] RMIT Univ, Sch Comp Sci & Informat Technol, Melbourne, Vic, Australia
来源
COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS | 2007年 / 4518卷
基金
澳大利亚研究理事会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Use of XML offers a structured approach for representing information while maintaining separation of form and content. XML information retrieval is different from standard text retrieval in two aspects: the XML structure may be of interest as part of the query; and the information does not have to be text. In this paper, we describe an investigation of approaches to retrieve text and images from a large collection of XML documents, performed in the course of our participation in the INEX 2006 Ad Hoc and Multimedia tracks. We evaluate three information retrieval similarity measures: Pivoted Cosine, Okapi BM25 and Dirichlet. We show that on the INEX 2006 Ad Hoc queries Okapi BM25 is the most effective among the three similarity measures used for retrieving text only, while Dirichlet is more suitable when retrieving heterogeneous (text and image) data.
引用
收藏
页码:358 / 372
页数:15
相关论文
共 16 条
[1]  
ASLANDOGAN YA, 2000, MULTIMEDIA 2000 P 8, P313
[2]  
FUHR N, 2006, ADV XML INFORM RETRI, V3977
[3]  
ISKANDAR A, COMBINING IMAGE STRU, P525
[4]  
KAZAI G, 2005, INEX, P16
[5]  
LARSEN B, 2006, 29 P 1 INT C INF IN, P88
[6]  
Pehcevski J., 2005, Advances in XML Information Retrieval and Evaluation. 4th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2005. Revised Selected Papers (Lecture Notes in Computer Science Vol. 3977), P306
[7]   Hybrid XML retrieval: Combining information retrieval and a native XML database [J].
Pehcevski, J ;
Thom, JA ;
Vercoustre, AM .
INFORMATION RETRIEVAL, 2005, 8 (04) :571-600
[8]  
Singhal Amit., 1996, SIGIR 96, P21
[9]  
Snoek C.G.M., 2006, 14 ANN ACM INT C MUL, P421, DOI [10.1145/1180639.1180727, DOI 10.1145/1180639.1180727]
[10]   A probabilistic model of information retrieval: development and comparative experiments Part 1 [J].
Sparck-Jones, K ;
Walker, S ;
Robertson, SE .
INFORMATION PROCESSING & MANAGEMENT, 2000, 36 (06) :779-808