Research-paper recommender systems: a literature survey

被引:416
作者
Beel, Joeran [1 ]
Gipp, Bela [2 ]
Langer, Stefan [3 ]
Breitinger, Corinna [4 ]
机构
[1] Docear, Magdeburg, Germany
[2] Univ Konstanz, Constance, Germany
[3] Otto von Guericke Univ, Magdeburg, Germany
[4] Linnaeus Univ, Kalmar, Sweden
关键词
Recommender system; User modeling; Research paper recommender systems; Content based filtering; Review; Survey;
D O I
10.1007/s00799-015-0156-0
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
In the last 16 years, more than 200 research articles were published about research-paper recommender systems. We reviewed these articles and present some descriptive statistics in this paper, as well as a discussion about the major advancements and shortcomings and an overview of the most common recommendation concepts and approaches. We found that more than half of the recommendation approaches applied content-based filtering (55 %). Collaborative filtering was applied by only 18% of the reviewed approaches, and graph-based recommendations by 16%. Other recommendation concepts included stereotyping, item-centric recommendations, and hybrid recommendations. The content-based filtering approaches mainly utilized papers that the users had authored, tagged, browsed, or downloaded. TF-IDF was the most frequently applied weighting scheme. In addition to simple terms, n-grams, topics, and citations were utilized to model users' information needs. Our review revealed some shortcomings of the current research. First, it remains unclear which recommendation concepts and approaches are the most promising. For instance, researchers reported different results on the performance of content-based and collaborative filtering. Sometimes content-based filtering performed better than collaborative filtering and sometimes it performed worse. We identified three potential reasons for the ambiguity of the results. (A) Several evaluations had limitations. They were based on strongly pruned datasets, few participants in user studies, or did not use appropriate baselines. (B) Some authors provided little information about their algorithms, which makes it difficult to re-implement the approaches. Consequently, researchers use different implementations of the same recommendations approaches, which might lead to variations in the results. (C) We speculated that minor variations in datasets, algorithms, or user populations inevitably lead to strong variations in the performance of the approaches. Hence, finding the most promising approaches is a challenge. As a second limitation, we noted that many authors neglected to take into account factors other than accuracy, for example overall user satisfaction. In addition, most approaches (81%) neglected the user-modeling process and did not infer information automatically but let users provide keywords, text snippets, or a single paper as input. Information on runtime was provided for 10% of the approaches. Finally, few research papers had an impact on research-paper recommender systems in practice. We also identified a lack of authority and long-term research interest in the field: 73% of the authors published no more than one paper on research-paper recommender systems, and there was little cooperation among different co-author groups. We concluded that several actions could improve the research landscape: developing a common evaluation framework, agreement on the information to include in research papers, a stronger focus on non-accuracy aspects and user modeling, a platform for researchers to exchange information, and an open-source framework that bundles the available recommendation approaches.
引用
收藏
页码:305 / 338
页数:34
相关论文
共 331 条
  • [1] Abu-Jbara A., 2011, P 49 ANN M ASS COMP, V1, P500
  • [2] Agarwal N, 2005, LECT NOTES COMPUT SC, V3739, P475
  • [3] A Subspace Clustering Framework for Research Group Collaboration
    Agarwal, Nitin
    Haque, Ehtesham
    Liu, Huan
    Parsons, Lance
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY AND WEB ENGINEERING, 2006, 1 (01) : 35 - 58
  • [4] Document-document similarity approaches and science mapping: Experimental comparison of five approaches
    Ahlgren, Per
    Colliander, Cristian
    [J]. JOURNAL OF INFORMETRICS, 2009, 3 (01) : 49 - 63
  • [5] Airoldi E.M., 2006, P INT BIOM SOC ANN M, P1
  • [6] Al-Maskari Azzah, 2007, 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P773, DOI 10.1145/1277741.1277902
  • [7] Alotaibi Shaikhah, 2013, Artificial Intelligence in Education. Proceedings of 16th International Conference (AIED 2013): LNCS 7926, P717, DOI 10.1007/978-3-642-39112-5_96
  • [8] Alvares LOC., 2005, REV TECNOLOGIA INFOR, V4
  • [9] [Anonymous], 2011, P 2011 WORKSHOP CONT
  • [10] Arnold A, 2009, LECT NOTES COMPUT SC, V5682, P541, DOI 10.1007/978-3-642-03417-6_53