Analysis for Data Preprocessing To Prevent Direct Discrimination in Data Mining

被引:0
作者
Aneyrao, Trupti A. [1 ]
Fadnavis, R. A. [1 ]
机构
[1] Yeshwantrao Chavhan Coll Engn, Dept Informat Technol, Nagpur, Maharashtra, India
来源
2016 WORLD CONFERENCE ON FUTURISTIC TRENDS IN RESEARCH AND INNOVATION FOR SOCIAL WELFARE (STARTUP CONCLAVE) | 2016年
关键词
Data Mining; Direct Discrimination; Data Preprocessing; Discrimination Prevention; Rule Protection; Rule Generalization;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data mining is a technology using which we can extract useful information from data. There are two major issues in data mining first is privacy violation and and second is discrimination. Discrimination is the unfair treatment with respect to the features that should not be considered while decision making. With respect to human, it is when people are given unfair treatment on the basis of their sensitive features like gender, race, religion etc. Discrimination can be of two types direct discrimination and indirect discrimination. Direct discrimination consists of training rules based on sensitive attributes like religion, race, community etc. Indirect discrimination is a discrimination which occurs when the decisions are taken on non-sensitive attributes but these attributes are closely related to direct discriminatory attributes. Automated decision making systems uses data mining techniques to train the system for decision making. Data form the previous work is used for the rule generation to train the system. At first sight, we can say that automating decisions systems are fair in decision making, but if the training data sets are itself discriminatory then the the system cannot be free from discrimination. To remove such discrimination we have discrimination discovery and prevention techniques in data mining. This paper mainly focuses direct discrimination removal from the data.
引用
收藏
页数:4
相关论文
共 8 条
[1]  
[Anonymous], 1963, Magnetism
[2]   Three naive Bayes approaches for discrimination-free classification [J].
Calders, Toon ;
Verwer, Sicco .
DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 21 (02) :277-292
[3]  
Hajian S, 2013, Discrimination and Privacy in the Information Society, P241
[4]   A Methodology for Direct and Indirect Discrimination Prevention in Data Mining [J].
Hajian, Sara ;
Domingo-Ferrer, Josep .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (07) :1445-1459
[5]  
Pedreshi D., 2008, KDD
[6]  
Rakesh Agrawal., 1994, P 20 INT C VER LARG, V1215
[7]   Data Mining for Discrimination Discovery [J].
Ruggieri, Salvatore ;
Pedreschi, Dino ;
Turini, Franco .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (02)
[8]  
Sara Hajian., 2011, COMP INT CYB SEC CIC