Exploring the effects of different Clustering Methods on a News Recommender System

被引:6
作者
Ulian, Douglas Zanatta [1 ]
Becker, Joao Luiz [2 ]
Marcolin, Carla Bonato [3 ]
Scornavacca, Eusebio [4 ]
机构
[1] Univ Fed Rio Grande do Sul, Escola Adm, Rua Washington Luiz 855, BR-90010460 Porto Alegre, RS, Brazil
[2] EAESP FGV, Dept Tecnol & Ciencia Dados, Av 9 Julho, BR-01313902 Sao Paulo, SP, Brazil
[3] Univ Fed Uberlandia, Dept Operacoes Sistemas, Fac Gestao & Negocios, Av Joao Naves Avila 2121,Bloco 5M,Sala 100, BR-38400902 Uberlandia, MG, Brazil
[4] Univ Baltimore, Merrick Sch Business, William H Thumel Sr Business Ctr 473, 11 W Mt Royal Ave Baltimore, Baltimore, MD 21201 USA
关键词
Recommender systems; News recommender systems; Clustering; Collaborative filtering; Content filtering; Data mining;
D O I
10.1016/j.eswa.2021.115341
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
News recommendations distinguishes from general content recommendations as it takes in consideration news freshness, sparsity, monotony and time. Recent works approach these features using hybrid Collaborative-Content-based Filtering methods, adapting clustering techniques to handle sparsity and monotony without considering the effects that different clustering methods may have over recommendation results. Such studies often evaluate the results of varying different parameters individually, ignoring possible interaction effects between them. They also base their results on metrics such as accuracy and recall that are sensitive to bias. To investigate the importance of clustering method selection to News Recommender System results we evaluated the effects of different traditional techniques in recommending news articles. We implemented an algorithm that used a hybrid Collaborative-Content-based Filtering method to incorporate user behavior, user interest, article popularity and time effect. The system uses an article selection method that built the recommendation set based on content features. With this algorithm, we examined the existence of interaction effects between the input parameters. We used a Gaussian regression process to explore the response surface while sequentially optimizing parameters. To avoid being misled by underlying biases we used Informedness, an accuracy metric that captures both positive and negative information from prediction results. Our results demonstrated that different clustering methods had a significant influence on the recommendation results. It was also found that a traditional hierarchical method outperformed optimization methods with important performance improvement. In addition, we demonstrated that parameters may interact with each other and that analyzing them separately may mislead interpretation.
引用
收藏
页数:14
相关论文
共 53 条
  • [1] [Anonymous], 2012, INT J COMPUT TECHNOL
  • [2] [Anonymous], 2009, Proceedings of the 18th international conference on World wide web, DOI [10.1145/1526709.1526802, 10.1145/1526709, DOI 10.1145/1526709, DOI 10.1145/1526709.1526802]
  • [3] [Anonymous], 2011, UCERSTI2 WORKSHOP 5
  • [4] Web news mining in an evolving framework
    Antonio Iglesias, Jose
    Tiemblo, Alexandra
    Ledezma, Agapito
    Sanchis, Araceli
    [J]. INFORMATION FUSION, 2016, 28 : 90 - 98
  • [5] Analysis of K-Means and K-Medoids Algorithm For Big Data
    Arora, Preeti
    Deepali
    Varshney, Shipra
    [J]. 1ST INTERNATIONAL CONFERENCE ON INFORMATION SECURITY & PRIVACY 2015, 2016, 78 : 507 - 512
  • [6] Exploiting search history of users for news personalization
    Bai, Xiao
    Barla Cambazoglu, B.
    Gullo, Francesco
    Mantrach, Amin
    Silvestri, Fabrizio
    [J]. INFORMATION SCIENCES, 2017, 385 : 125 - 137
  • [7] Bartz-Beielstein T, 2010, ARXIV PREPRINT ARXIV
  • [8] Model-based methods for continuous and discrete global optimization
    Bartz-Beielstein, Thomas
    Zaefferer, Martin
    [J]. APPLIED SOFT COMPUTING, 2017, 55 : 154 - 167
  • [9] Latent Cross: Making Use of Context in Recurrent Recommender Systems
    Beutel, Alex
    Covington, Paul
    Jain, Sagar
    Xu, Can
    Li, Jia
    Gatto, Vince
    Chi, Ed H.
    [J]. WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 46 - 54
  • [10] Improving news articles recommendations via user clustering
    Bouras, Christos
    Tsogkas, Vassilis
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (01) : 223 - 237