Efficient Data Management Tools for the Heterogeneous Big Data Warehouse

被引:3
|
作者
Alekseev, A. A. [1 ]
Osipova, V. V. [1 ]
Ivanov, M. A. [1 ]
Klimentov, A. [2 ]
Grigorieva, N. V. [1 ]
Nalamwar, H. S. [1 ]
机构
[1] Natl Res Tomsk Polytech Univ, Tomsk, Russia
[2] Brookhaven Natl Lab, Upton, NY 11973 USA
关键词
Relational Database Management System (RDBMS); Non-relational Structure Query Language (NoSQL); Structure Query Language (SQL); Big Data; Heterogeneous Data Warehouse; Apache Hadoop; Hive; MongoDB; Data Manipulation Language (DML) Operations;
D O I
10.1134/S1547477116050022
中图分类号
O412 [相对论、场论]; O572.2 [粒子物理学];
学科分类号
摘要
The traditional RDBMS has been consistent for the normalized data structures. RDBMS served well for decades, but the technology is not optimal for data processing and analysis in data intensive fields like social networks, oil-gas industry, experiments at the Large Hadron Collider, etc. Several challenges have been raised recently on the scalability of data warehouse like workload against the transactional schema, in particular for the analysis of archived data or the aggregation of data for summary and accounting purposes. The paper evaluates new database technologies like HBase, Cassandra, and MongoDB commonly referred as NoSQL databases for handling messy, varied and large amount of data. The evaluation depends upon the performance, throughput and scalability of the above technologies for several scientific and industrial use-cases. This paper outlines the technologies and architectures needed for processing Big Data, as well as the description of the back-end application that implements data migration from RDBMS to NoSQL data warehouse, NoSQL database organization and how it could be useful for further data analytics.
引用
收藏
页码:689 / 692
页数:4
相关论文
共 50 条
  • [41] Automated integration of heterogeneous data warehouse schemas
    University of Zagreb, Faculty of Electrical Engineering and Computing, Zagreb, Croatia
    不详
    Int. J. Data Warehouse. Min., 2008, 4 (1-21):
  • [42] Access Control Analysis in Heterogeneous Big Data Management Systems
    Poltavtseva, M. A.
    Kalinin, M. O.
    PROGRAMMING AND COMPUTER SOFTWARE, 2024, 50 (07) : 549 - 558
  • [43] Towards an efficient big data management schema for IoT
    Sawalha, Samer
    Al-Naymat, Ghazi
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (09) : 7803 - 7818
  • [44] Managing data quality: Robust and resistant tools for a data warehouse environment
    Schwarzkopf, AB
    ASSOCIATION FOR INFORMATION SYSTEMS PROCEEDINGS OF THE AMERICAS CONFERENCE ON INFORMATION SYSTEMS, 1998, : 954 - 956
  • [45] Medical Big Data Analysis Using Big Data Tools and Methods
    Alhussain, Thamer
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2018, 8 (04) : 793 - 795
  • [46] Benefits of a clinical data warehouse with data mining tools to collect data for a radiotherapy trial
    Roelofs, Erik
    Persoon, Lucas
    Nijsten, Sebastiaan
    Wiessler, Wolfgang
    Dekker, Andre
    Lambin, Philippe
    RADIOTHERAPY AND ONCOLOGY, 2013, 108 (01) : 174 - 179
  • [47] Efficient Data Streams Processing in the Real Time Data Warehouse
    Majeed, Fiaz
    Mahmood, Muhammad Sohaib
    Iqbal, Mujahid
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 5, 2010, : 57 - 61
  • [48] An Overview of Data Warehouse and Data Lake in Modern Enterprise Data Management
    Nambiar, Athira
    Mundra, Divyansh
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [49] A dynamic data classification techniques and tools for big data
    Rani, T. Usha
    Priyanka, C. H. Sindhu
    Monica, B. S. S.
    INTERNATIONAL CONFERENCE ON COMPUTER VISION AND MACHINE LEARNING, 2019, 1228
  • [50] Data Warehouse for Quality Management Systems
    慕春棣
    戴剑彬
    TsinghuaScienceandTechnology, 1998, (03) : 83 - 86