EXPLORING COMPLEX AND BIG DATA

Cited by: 29
Authors
Stefanowski, Jerzy [1]
Krawiec, Krzysztof [1]
Wrembel, Robert [1]
Affiliation
[1] Poznan Univ Tech, Inst Comp Sci, Ul Piotrowo 2, PL-60965 Poznan, Poland
Keywords
big data; complex data; data integration; data provenance; data streams; deep learning; provenance; challenges
DOI
10.1515/amcs-2017-0046
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper shows how big data analysis opens a range of research and technological problems and calls for new approaches. We start by defining the essential properties of big data and discussing the main types of data involved. We then survey dedicated solutions for storing and processing big data, including a data lake, virtual integration, and a polystore architecture. Difficulties in managing data quality and provenance are also highlighted. The characteristics of big data also impose specific requirements and challenges on data mining algorithms, which we address as well. Links with related areas, including data streams and deep learning, are discussed. The common theme that naturally emerges from this characterization is complexity. All in all, we consider complexity to be the truly defining feature of big data (posing particular research and technological challenges), one that ultimately seems to be of greater importance than the sheer data volume.
Pages: 669-679
Number of pages: 11