Web log analysis: a review of a decade of studies about information acquisition, inspection and interpretation of user interaction

被引:51
作者
Agosti, Maristella [1 ]
Crivellari, Franco [1 ]
Di Nunzio, Giorgio Maria [1 ]
机构
[1] Univ Padua, Dept Informat Engn, I-35131 Padua, Italy
关键词
Web log; Query log; Search log; User study; ENGINE QUERY LOGS; SUBJECT CATEGORIZATION; CLASSIFICATION; FEEDBACK; TERMS;
D O I
10.1007/s10618-011-0228-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last decade, the importance of analyzing information management systems logs has grown, because log data constitute a relevant aspect in evaluating the quality of such systems. A review of 10 years of research on log analysis is presented in this paper. About 50 papers and posters from five major conferences and about 30 related journal papers have been selected to trace the history of the state-of-the-art in this field. The paper presents an overview of two main themes: Web search engine log analysis and Digital Library System log analysis. The problem of the analysis of different sources of log data and the distribution of data are investigated.
引用
收藏
页码:663 / 696
页数:34
相关论文
共 91 条
  • [1] Agosti M., 2010, LECT NOTES COMPUTER
  • [2] Agosti M, 2009, P WORKSH CONT INF AC, P13
  • [3] Agosti M, 2007, LECT NOTES COMPUT SC, V4877, P104
  • [4] Agosti M, 2008, INFORM RETRIEVAL SER, V22, P1
  • [5] [Anonymous], 2003, Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval (SIGIR '03), DOI DOI 10.1145/860435.860453
  • [6] [Anonymous], 2009, P 18 INT C WORLD WID, DOI DOI 10.1145/1526709.1526716
  • [7] [Anonymous], 2002, WWW
  • [8] [Anonymous], 2007, P 16 INT C WORLD WID
  • [9] Space-Time Tradeoffs for Approximate Nearest Neighbor Searching
    Arya, Sunil
    Malamatos, Theocharis
    Mount, David M.
    [J]. JOURNAL OF THE ACM, 2009, 57 (01)
  • [10] Assadi H, 2003, LECT NOTES COMPUT SC, V2769, P1