Data Quality and Explainable AI

被引:19
作者
Bertossi, Leopoldo [1 ,2 ]
Geerts, Floris [3 ]
机构
[1] Univ Adolfo Ibanez, Fac Engn & Sci, Santiago, Chile
[2] RelationalAI Inc, Toronto, ON, Canada
[3] Univ Antwerp, Dept Comp Sci, Middelheimlaan 1, B-2020 Antwerp, Belgium
来源
ACM JOURNAL OF DATA AND INFORMATION QUALITY | 2020年 / 12卷 / 02期
关键词
Machine learning; causes; fairness; bias; QUERY ANSWERS;
D O I
10.1145/3386687
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we provide some insights and develop some ideas, with few technical details, about the role of explanations in Data Quality in the context of data-based machine learning models (ML). In this direction, there are, as expected, roles for causality, and explainable artificial intelligence. The latter area not only sheds light on the models, but also on the data that support model construction. There is also room for defining, identifying, and explaining errors in data, in particular, in ML, and also for suggesting repair actions. More generally, explanations can be used as a basis for defining dirty data in the context of ML, and measuring or quantifying them. We think dirtiness as relative to the ML task at hand, e.g., classification.
引用
收藏
页数:9
相关论文
共 40 条
[1]  
[Anonymous], 2012, SYNTH LECT DATA MANA
[2]  
[Anonymous], 2017, P NIPS
[3]  
[Anonymous], 2011, PROBABILISTIC DATABA, DOI DOI 10.2200/S00362ED1V01Y201105DTM016
[4]   ERBlox: Combining matching dependencies with machine learning for entity resolution [J].
Bahmani, Zeinab ;
Bertossi, Leopoldo ;
Vasiloglou, Nikolaos .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 83 :118-141
[5]  
Batini C., 2016, DATA QUALITY CONCEPT
[6]  
Bertossi L., EXPT SCORE BASED EXP
[7]   Ontological Multidimensional Data Models and Contextual Data Quality [J].
Bertossi, Leopoldo ;
Milani, Mostafa .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2018, 9 (03)
[8]   Causes for query answers from databases: Datalog abduction, view-updates, and integrity constraints [J].
Bertossi, Leopoldo ;
Salimi, Babak .
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2017, 90 :226-252
[9]   From Causes for Database Queries to Repairs and Model-Based Diagnosis and Back [J].
Bertossi, Leopoldo ;
Salimi, Babak .
THEORY OF COMPUTING SYSTEMS, 2017, 61 (01) :191-232
[10]   Data Cleaning and Query Answering with Matching Dependencies and Matching Functions [J].
Bertossi, Leopoldo ;
Kolahi, Solmaz ;
Lakshmanan, Laks V. S. .
THEORY OF COMPUTING SYSTEMS, 2013, 52 (03) :441-482