An AI Planning System for Data Cleaning

Cited by: 4
Authors
Boselli, Roberto [1, 2]
Cesarini, Mirko [1, 2]
Mercorio, Fabio [1, 2]
Mezzanzanica, Mario [1, 2]
Affiliations
[1] Univ Milano Bicocca, Dept Stat & Quantitat Methods, Milan, Italy
[2] Univ Milano Bicocca, CRISP Res Ctr, Milan, Italy
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT III | 2017 / Vol. 10536
Keywords
AI planning; Data quality; Data cleaning; ETL; CHECKING;
DOI
10.1007/978-3-319-71273-4_29
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data Cleaning represents a crucial and error-prone activity in KDD that might have unpredictable effects on data analytics, affecting the believability of the whole KDD process. In this paper we describe how a bridge between the AI Planning and Data Quality communities has been made by expressing both the data quality and cleaning tasks in terms of AI planning. We also report a real-life application of our approach.
Pages: 349-353
Page count: 5