Use of conditional generative adversarial networks to create demographic collaborative filtering datasets

被引:0
作者
Bobadilla, Jesus [1 ]
Gutierrez, Abraham [1 ]
机构
[1] Univ Politecn Madrid, Ctra Valencia Km 7, Madrid 28031, Spain
关键词
CGANRS; Conditional Generative Adversarial Networks; Fairness; Collaborative Filtering; Recommender Systems; Synthetic Datasets; RECOMMENDER SYSTEMS;
D O I
10.1016/j.asoc.2024.112608
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a method to create synthetic collaborative filtering datasets that can be used to test both current and new fair recommender systems models. The proposed "Conditional Generative Adversarial Network for Recommender Systems (CGANRS)" method generalizes the existing generative adversarial network for recommender systems one, and it makes use of a conditional generative adversarial network to artificially generate synthetic profiles from a source dataset such as MovieLens. The created datasets can be parameterized to have different sizes and to include different number of users and items. Additionally, the provided parameters include the proportion of multi-categorical demographic information such as the number of male vs. female users, or the proportions of very young, young, adult, and senior users. To test the proposed method, three sets of synthetic databases have been created, containing different a) numbers of users, b) numbers of items, and c) proportions of male users versus female users. Results show an adequate behavior of the generated datasets, testing their a) profiles separability, b) main statistical distributions, and c) recommendation accuracies. Synthetic data sets created using the proposed conditional generative adversarial network for recommender systems method are particularly useful to improve research in the fairness field of the recommender systems area. To extend its use and to facilitate reproducibility, the source code is provided to generate as many demographic datasets as desired, as well as the artificially generated datasets in this research. Some promising future works are proposed, including a) the variation of the stochastic Gaussian distribution used to create the random noise vectors that feed the adversarial network generator model, and b) testing the fairness of the most relevant collaborative filtering models on different synthetic scenarios.
引用
收藏
页数:21
相关论文
共 34 条
  • [1] A deep learning based trust- and tag-aware recommender system
    Ahmadian, Sajad
    Ahmadian, Milad
    Jalili, Mahdi
    [J]. NEUROCOMPUTING, 2022, 488 : 557 - 571
  • [2] [Anonymous], About us
  • [3] RecGAN: Recurrent Generative Adversarial Networks for Recommendation Systems
    Bharadhwaj, Homanga
    Park, Homin
    Lim, Brian Y.
    [J]. 12TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS), 2018, : 372 - 376
  • [4] Creating synthetic datasets for collaborative filtering recommender systems using generative adversarial networks
    Bobadilla, Jesus
    Gutierrez, Abraham
    Yera, Raciel
    Martinez, Luis
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 280
  • [5] Neural Collaborative Filtering Classification Model to Obtain Prediction Reliabilities
    Bobadilla, Jesus
    Gutierrez, Abraham
    Alonso, Santiago
    Gonzalez-Prieto, Angel
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (04): : 18 - 26
  • [6] DeepFair: Deep Learning for Improving Fairness in Recommender Systems
    Bobadilla, Jesus
    Lara-Cabrera, Raul
    Gonzalez-Prieto, Angel
    Ortega, Fernando
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 6 (06): : 86 - 94
  • [7] Hybrid recommender systems: A systematic literature review
    Cano, Erion
    Morisio, Maurizio
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (06) : 1487 - 1524
  • [8] CFGAN: A Generic Collaborative Filtering Framework based on Generative Adversarial Networks
    Chae, Dong-Kyu
    Kang, Jin-Soo
    Kim, Sang-Wook
    Lee, Jung-Tae
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 137 - 146
  • [9] Analyzing and improving stability of matrix factorization for recommender systems
    D'Amico, Edoardo
    Gabbolini, Giovanni
    Bernardis, Cesare
    Cremonesi, Paolo
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 58 (02) : 255 - 285
  • [10] DataGenCARS: A generator of synthetic data for the evaluation of context-aware recommendation systems
    del Carmen Rodriguez-Hernandez, Maria
    Ilarri, Sergio
    Hermoso, Ramon
    Trillo-Lado, Raquel
    [J]. PERVASIVE AND MOBILE COMPUTING, 2017, 38 : 516 - 541