Quarry: A User-centered Big Data Integration Platform

被引:0
作者
Petar Jovanovic
Sergi Nadal
Oscar Romero
Alberto Abelló
Besim Bilalli
机构
[1] Universitat Politècnica de Catalunya (BarcelonaTech),
来源
Information Systems Frontiers | 2021年 / 23卷
关键词
Data Integration; Big Data; Data-Intensive Flows; Metadata;
D O I
暂无
中图分类号
学科分类号
摘要
Obtaining valuable insights and actionable knowledge from data requires cross-analysis of domain data typically coming from various sources. Doing so, inevitably imposes burdensome processes of unifying different data formats, discovering integration paths, and all this given specific analytical needs of a data analyst. Along with large volumes of data, the variety of formats, data models, and semantics drastically contribute to the complexity of such processes. Although there have been many attempts to automate various processes along the Big Data pipeline, no unified platforms accessible by users without technical skills (like statisticians or business analysts) have been proposed. In this paper, we present a Big Data integration platform (Quarry) that uses hypergraph-based metadata to facilitate (and largely automate) the integration of domain data coming from a variety of sources, and provides an intuitive interface to assist end users both in: (1) data exploration with the goal of discovering potentially relevant analysis facets, and (2) consolidation and deployment of data flows which integrate the data, and prepare them for further analysis (descriptive or predictive), visualization, and/or publishing. We validate Quarry’s functionalities with the use case of World Health Organization (WHO) epidemiologists and data analysts in their fight against Neglected Tropical Diseases (NTDs).
引用
收藏
页码:9 / 33
页数:24
相关论文
共 50 条
  • [41] Big data integration - an evolutionary perspective
    Dinu, Simona
    [J]. ADVANCED TOPICS IN OPTOELECTRONICS, MICROELECTRONICS AND NANOTECHNOLOGIES X, 2020, 11718
  • [42] In Search of Big Medical Data Integration Solutions - A Comprehensive Survey
    Dhayne, Houssein
    Haque, Rafiqul
    Kilany, Rima
    Taher, Yehia
    [J]. IEEE ACCESS, 2019, 7 : 91265 - 91290
  • [43] The Design and Implementation of the Enterprise Level Data Platform and Big Data Driven Applications and Analytics
    Liu, Hesen
    Guo, Jiahui
    Yu, Wenpeng
    Zhu, Lin
    Liu, Yilu
    Xia, Tao
    Sun, Rui
    Gardner, R. Matthew
    [J]. 2016 IEEE/PES TRANSMISSION AND DISTRIBUTION CONFERENCE AND EXPOSITION (T&D), 2016,
  • [44] User Information Extraction in Big Data Environment
    Wang, Kaiqiang
    Shi, Yijie
    [J]. PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 2315 - 2318
  • [45] User Experience (UX) of a Big Data Infrastructure
    Chunpir, Hashim Iqbal
    Williams, Dean
    Ludwig, Thomas
    [J]. HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: SUPPORTING LEARNING, DECISION-MAKING AND COLLABORATION, HCI INTERNATIONAL 2017, PT II, 2017, 10274 : 467 - 474
  • [46] The Internet-of-Things: Reflections on the past, present and future from a user-centered and smart environment perspective
    Chin, Jeannette
    Callaghan, Vic
    Ben Allouch, Somaya
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2019, 11 (01) : 45 - 69
  • [47] Data integration and mining based on web big data
    Zhang, Su-Zhi
    Qu, Xu-Kai
    Sun, Jia-Bin
    [J]. International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (06): : 123 - 130
  • [48] Web Data Integration and Mining Based on Big Data
    Zhang, Su-Zhi
    Qu, Xu-Kai
    Sun, Jia-Bin
    [J]. INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015, : 80 - 84
  • [49] Booly: a new data integration platform
    Long H Do
    Francisco F Esteves
    Harvey J Karten
    Ethan Bier
    [J]. BMC Bioinformatics, 11
  • [50] Data integration platform for village emergency
    Ying W.
    Daoping W.
    Guangli L.
    Di L.
    [J]. International Journal of Digital Content Technology and its Applications, 2010, 4 (04) : 215 - 217