PROCEEDINGS OF THE 2016 ACM WEB SCIENCE CONFERENCE (WEBSCI'16)
|
2016年
关键词:
data bias;
algorithmic bias;
privacy;
novelty;
diversity;
redundancy;
noise;
spam;
D O I:
10.1145/2908131.2908135
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
The Web is the largest public big data repository that humankind has created. In this overwhelming data ocean we need to be aware of the quality of data extracted from it. One important quality issue is data bias, which appears in different forms. These biases affect the (machine learning) algorithms that we design to improve the user experience. This problem is further exacerbated by biases that are added by these algorithms, especially in the context of recommendation and personalization systems. We give several examples, stressing the importance of the user context to avoid these biases.