Tailoring Data Source Distributions for Fairness-aware Data Integration

被引:16
|
作者
Nargesian, Fatemeh [1 ]
Asudeh, Abolfazl [2 ]
Jagadish, H., V [3 ]
机构
[1] Univ Rochester, Rochester, MN 55905 USA
[2] Univ Illinois, Chicago, IL USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 11期
基金
美国国家科学基金会;
关键词
D O I
10.14778/3476249.3476299
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data scientists often develop data sets for analysis by drawing upon sources of data available to them. A major challenge is to ensure that the data set used for analysis has an appropriate representation of relevant (demographic) groups: it meets desired distribution requirements. Whether data is collected through some experiment or obtained from some data provider, the data from any single source may not meet the desired distribution requirements. Therefore, a union of data from multiple sources is often required. In this paper, we study how to acquire such data in the most cost effective manner, for typical cost functions observed in practice. We present an optimal solution for binary groups when the underlying distributions of data sources are known and all data sources have equal costs. For the generic case with unequal costs, we design an approximation algorithm that performs well in practice. When the underlying distributions are unknown, we develop an exploration-exploitation based strategy with a reward function that captures the cost and approximations of group distributions in each data source. Besides theoretical analysis, we conduct comprehensive experiments that confirm the effectiveness of our algorithms.
引用
收藏
页码:2519 / 2532
页数:14
相关论文
共 50 条
  • [21] Efficiency-aware and fairness-aware joint-layer optimization for downlink data scheduling in OFDM
    KunQi Guo
    LiXin Sun
    Ping Wang
    ShiLou Jia
    Science in China Series F: Information Sciences, 2008, 51 : 171 - 182
  • [22] Fairness-Aware UAV-Assisted Data Collection in Mobile Wireless Sensor Networks
    Ma, Xiaoyan
    Kacimi, Rahim
    Dhaou, Riadh
    2016 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2016, : 995 - 1001
  • [23] Fairness-Aware Process Mining
    Qafari, Mahnaz Sadat
    van der Aalst, Wil
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2019 CONFERENCES, 2019, 11877 : 182 - 192
  • [24] Towards Fairness-Aware Federated Learning
    Shi, Yuxin
    Yu, Han
    Leung, Cyril
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 11922 - 11938
  • [25] FairCF: fairness-aware collaborative filtering
    Pengyang Shao
    Le Wu
    Lei Chen
    Kun Zhang
    Meng Wang
    Science China Information Sciences, 2022, 65
  • [26] Fairness-aware Class Imbalanced Learning
    Subramanian, Shivashankar
    Rahimi, Afshin
    Baldwin, Timothy
    Cohn, Trevor
    Frermann, Lea
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2045 - 2051
  • [27] FairCF: fairness-aware collaborative filtering
    Pengyang SHAO
    Le WU
    Lei CHEN
    Kun ZHANG
    Meng WANG
    Science China(Information Sciences), 2022, 65 (12) : 127 - 141
  • [28] A survey on fairness-aware recommender systems
    Jin, Di
    Wang, Luzhi
    Zhang, He
    Zheng, Yizhen
    Ding, Weiping
    Xia, Feng
    Pan, Shirui
    INFORMATION FUSION, 2023, 100
  • [29] Towards Robust Fairness-aware Recommendation
    Yang, Hao
    Liu, Zhining
    Zhang, Zeyu
    Zhuang, Chenyi
    Chen, Xu
    PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023, : 211 - 222
  • [30] FairCF: fairness-aware collaborative filtering
    Shao, Pengyang
    Wu, Le
    Chen, Lei
    Zhang, Kun
    Wang, Meng
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (12)