EXPLORING COMPLEX AND BIG DATA

被引:29
作者
Stefanowski, Jerzy [1 ]
Krawiec, Krzysztof [1 ]
Wrembel, Robert [1 ]
机构
[1] Poznan Univ Tech, Inst Comp Sci, Ul Piotrowo 2, PL-60965 Poznan, Poland
关键词
big data; complex data; data integration; data provenance; data streams; deep learning; PROVENANCE; CHALLENGES;
D O I
10.1515/amcs-2017-0046
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper shows how big data analysis opens a range of research and technological problems and calls for new approaches. We start with defining the essential properties of big data and discussing the main types of data involved. We then survey the dedicated solutions for storing and processing big data, including a data lake, virtual integration, and a polystore architecture. Difficulties in managing data quality and provenance are also highlighted. The characteristics of big data imply also specific requirements and challenges for data mining algorithms, which we address as well. The links with related areas, including data streams and deep learning, are discussed. The common theme that naturally emerges from this characterization is complexity. All in all, we consider it to be the truly defining feature of big data (posing particular research and technological challenges), which ultimately seems to be of greater importance than the sheer data volume.
引用
收藏
页码:669 / 679
页数:11
相关论文
共 50 条
  • [31] Digital Transformation: Exploring big data Governance in Public Administration
    Yukhno, Alexander
    PUBLIC ORGANIZATION REVIEW, 2024, 24 (01) : 335 - 349
  • [32] The role of big data analytics in Internet of Things
    Ahmed, Ejaz
    Yaqoob, Ibrar
    Hashem, Ibrahim Abaker Targio
    Khan, Imran
    Ahmed, Abdelmuttlib Ibrahim Abdalla
    Imran, Muhammad
    Vasilakos, Athanasios V.
    COMPUTER NETWORKS, 2017, 129 : 459 - 471
  • [33] A Survey on Classifying Big Data with Label Noise
    Johnson, Justin M.
    Khoshgoftaar, Taghi M.
    ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2022, 14 (04):
  • [34] Big Data application on signal processing systems
    Tikhonyuk, A. I.
    Erokhin, S. D.
    Chadov, T. A.
    2018 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SYNCHROINFO), 2018,
  • [35] Seeing the "Big" Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
    Carpenter, Jordan
    Crutchley, Patrick
    Zilca, Ran D.
    Schwartz, H. Andrew
    Smith, Laura K.
    Cobb, Angela M.
    Parks, Acacia C.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2016, 18 (08)
  • [36] Complex event recognition in the Big Data era: a survey
    Giatrakos, Nikos
    Alevizos, Elias
    Artikis, Alexander
    Deligiannakis, Antonios
    Garofalakis, Minos
    VLDB JOURNAL, 2020, 29 (01) : 313 - 352
  • [37] Complex event recognition in the Big Data era: a survey
    Nikos Giatrakos
    Elias Alevizos
    Alexander Artikis
    Antonios Deligiannakis
    Minos Garofalakis
    The VLDB Journal, 2020, 29 : 313 - 352
  • [38] Big Data and government: Evidence of the role of Big Data for smart cities
    Hong, Sounman
    Kim, Sun Hyoung
    Kim, Youngrok
    Park, Jeongin
    BIG DATA & SOCIETY, 2019, 6 (01):
  • [39] Exploring New Vista of Secured and Optimized Data Slicing for Big Data: An IOT Paradigm
    Sarkar, Manash
    Hassanien, Aboul Ella
    WIRELESS PERSONAL COMMUNICATIONS, 2021, 116 (01) : 601 - 628
  • [40] Exploring New Vista of Secured and Optimized Data Slicing for Big Data: An IOT Paradigm
    Manash Sarkar
    Aboul Ella Hassanien
    Wireless Personal Communications, 2021, 116 : 601 - 628