Use of conditional generative adversarial networks to create demographic collaborative filtering datasets

被引:0
作者
Bobadilla, Jesus [1 ]
Gutierrez, Abraham [1 ]
机构
[1] Univ Politecn Madrid, Ctra Valencia Km 7, Madrid 28031, Spain
关键词
CGANRS; Conditional Generative Adversarial Networks; Fairness; Collaborative Filtering; Recommender Systems; Synthetic Datasets; RECOMMENDER SYSTEMS;
D O I
10.1016/j.asoc.2024.112608
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a method to create synthetic collaborative filtering datasets that can be used to test both current and new fair recommender systems models. The proposed "Conditional Generative Adversarial Network for Recommender Systems (CGANRS)" method generalizes the existing generative adversarial network for recommender systems one, and it makes use of a conditional generative adversarial network to artificially generate synthetic profiles from a source dataset such as MovieLens. The created datasets can be parameterized to have different sizes and to include different number of users and items. Additionally, the provided parameters include the proportion of multi-categorical demographic information such as the number of male vs. female users, or the proportions of very young, young, adult, and senior users. To test the proposed method, three sets of synthetic databases have been created, containing different a) numbers of users, b) numbers of items, and c) proportions of male users versus female users. Results show an adequate behavior of the generated datasets, testing their a) profiles separability, b) main statistical distributions, and c) recommendation accuracies. Synthetic data sets created using the proposed conditional generative adversarial network for recommender systems method are particularly useful to improve research in the fairness field of the recommender systems area. To extend its use and to facilitate reproducibility, the source code is provided to generate as many demographic datasets as desired, as well as the artificially generated datasets in this research. Some promising future works are proposed, including a) the variation of the stochastic Gaussian distribution used to create the random noise vectors that feed the adversarial network generator model, and b) testing the fairness of the most relevant collaborative filtering models on different synthetic scenarios.
引用
收藏
页数:21
相关论文
共 34 条
[31]   Deep Learning Based Recommender System: A Survey and New Perspectives [J].
Zhang, Shuai ;
Yao, Lina ;
Sun, Aixin ;
Tay, Yi .
ACM COMPUTING SURVEYS, 2019, 52 (01)
[32]   DCFGAN: An adversarial deep reinforcement learning framework with improved negative sampling for session-based recommender systems [J].
Zhao, Jianli ;
Li, Hao ;
Qu, Lijun ;
Zhang, Qinzhi ;
Sun, Qiuxia ;
Huo, Huan ;
Gong, Maoguo .
INFORMATION SCIENCES, 2022, 596 :222-235
[33]   Assigning reliability values to recommendations using matrix factorization [J].
Zhu, B. ;
Ortega, F. ;
Bobadilla, J. ;
Gutierrez, A. .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 26 :165-177
[34]   An Efficient Recommender System Method Based on the Numerical Relevances and the Non-Numerical Structures of the Ratings [J].
Zhu, Bo ;
Hurtado, Remigio ;
Bobadilla, Jesus ;
Ortega, Fernando .
IEEE ACCESS, 2018, 6 :49935-49954