Generating and Testing Synthetic Datasets for Recommender Systems to Improve Fairness in Collaborative Filtering Research

被引:0
作者
Bobadilla, J. [1 ]
Gutierrez, A. [1 ]
机构
[1] Tech Univ Madrid, Dept Comp Sci, Ctra Valencia Km 7, Madrid 28031, Spain
来源
2023 20TH ACS/IEEE INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, AICCSA | 2023年
关键词
Fairness; Recommender Systems; Synthetic Dataset; Collaborative Filtering; Demographic Features;
D O I
10.1109/AICCSA59173.2023.10479255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fairness is an important field in the Recommender Systems area since collaborative filtering datasets tend to be demographically biased. This paper proposes a parameterized model and provides an open repository of code to generate synthetic datasets containing demographic information. The model parameters can be set to hold different numbers of minority and nonminority users, distributions, and their overlapping. A neural network model has also been used to test the accuracy obtained in different scenarios, by setting the number of minority users and the overlapping between minority and nonminority distributions of votes. The results show how minority users receive unfair recommendations, particularly when their number decreases and when the distributions of minority versus nonminority users partially overlap. This research can be easily extended by designing more sophisticated models and modifying the provided framework; in this sense, a specific future work has been proposed.
引用
收藏
页数:6
相关论文
共 24 条
[1]  
[Anonymous], About us
[2]   Fairness in Recommendation Ranking through Pairwise Comparisons [J].
Beutel, Alex ;
Chen, Jilin ;
Doshi, Tulsee ;
Qian, Hai ;
Wei, Li ;
Wu, Yi ;
Heldt, Lukasz ;
Zhao, Zhe ;
Hong, Lichan ;
Chi, Ed H. ;
Goodrow, Cristos .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2212-2220
[3]   DeepFair: Deep Learning for Improving Fairness in Recommender Systems [J].
Bobadilla, Jesus ;
Lara-Cabrera, Raul ;
Gonzalez-Prieto, Angel ;
Ortega, Fernando .
INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2021, 6 (06) :86-94
[4]   Recommender Systems Clustering Using Bayesian Non Negative Matrix Factorization [J].
Bobadilla, Jesus ;
Bojorque, Rodolfo ;
Hernando Esteban, Antonio ;
Hurtado, Remigio .
IEEE ACCESS, 2018, 6 :3549-3564
[5]  
Bobadilla Jesus, 2023, arXiv
[6]  
Boli F., 2020, INT JOINT C ART JUL
[7]   Enhancing Long Term Fairness in Recommendations with Variational Autoencoders [J].
Borges, Rodrigo ;
Stefanidis, Kostas .
11TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS (MEDES), 2019, :95-102
[8]  
Castillo Carlos, 2018, ACM SIGIR Forum, V52, P64, DOI 10.1145/3308774.3308783
[9]   CFGAN: A Generic Collaborative Filtering Framework based on Generative Adversarial Networks [J].
Chae, Dong-Kyu ;
Kang, Jin-Soo ;
Kim, Sang-Wook ;
Lee, Jung-Tae .
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, :137-146
[10]   DataGenCARS: A generator of synthetic data for the evaluation of context-aware recommendation systems [J].
del Carmen Rodriguez-Hernandez, Maria ;
Ilarri, Sergio ;
Hermoso, Ramon ;
Trillo-Lado, Raquel .
PERVASIVE AND MOBILE COMPUTING, 2017, 38 :516-541