Big Data Provenance Research Directions

Authors
Chacko, Anu [1]
Kumar, S. D. Madhu [1]
Affiliation
[1] Natl Inst Technol Calicut, Kozhikode, Kerala, India
Source
TENCON 2017 - 2017 IEEE Region 10 Conference | 2017
Keywords
Big Data; Provenance; MapReduce; NoSQL
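Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract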
Provenance is the "chronology of the ownership, custody or location of a historical object". The term was initially used in the context of art and archaeology but is now used in other fields of science and in information technology. Provenance is valuable in authenticating the value of an object. In information technology, data provenance records the history of a data item and how it reached its present state, i.e., information about all the processes that transformed or influenced it. This information is useful for checking the correctness of data and for debugging errors. In the era of big data, where decisions rest on insights derived by analyzing a large variety of automatically gathered data, provenance becomes very important: the value and correctness of an insight depend on the correctness of the input data. Hence we need mechanisms, i.e., provenance, that vouch for the correctness of the data. This paper introduces the concepts of provenance and reviews work on Big Data provenance, with the aim of giving readers an understanding of the research directions in this area.
Pages: 651-656
Page count: 6