Research on, and Development of, Data Extraction and Data Cleaning Technology based on the Internet of Things

Cited by: 2
Authors
Li, Zhaochan [1]
Sun, Lili [1]
Higgs, Russell [2]
Affiliations
[1] Beijing Wuzi Univ, Grad Sch, Beijing, Peoples R China
[2] Univ Coll Dublin, Sch Math & Stat, Dublin, Ireland
Source
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 2 | 2017
Keywords
data pump; current research; future development; BIG;
DOI
10.1109/CSE-EUC.2017.248
Chinese Library Classification (CLC) number
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
This paper introduces techniques for data cleaning and data extraction. It reviews the current state of domestic and international research in these two areas and considers their future development. For data cleaning, it explains the basic principle, the framework models, the need for and objectives of cleaning, the testing method, and the cleaning tool. It also introduces data extraction techniques: static data capture, log file capture, database generator capture, date and time capture, file comparison capture, and source application capture. Finally, it considers the advantages and disadvantages of these extraction technologies and which to use in real-life situations.
Pages: 332-341
Page count: 10