The Performance Analysis of Distributed Storage Systems Used in Scalable Web Systems

被引:0
|
作者
Oles, Dominik [1 ]
Nowak, Ziemowit [2 ]
机构
[1] Tieto Czech Sro, 28 Rijna 3346-91, Ostrava 70200, Czech Republic
[2] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wybrzeze Wyspianskiego 27, PL-50370 Wroclaw, Poland
来源
INFORMATION SYSTEMS ARCHITECTURE AND TECHNOLOGY, ISAT 2018, PT I | 2019年 / 852卷
关键词
Big Data; Hadoop; HBase; Kudu;
D O I
10.1007/978-3-319-99981-4_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scalable web systems are directly related to distributed storage systems used to process large amounts of data (big data). An example of such a system is Hadoop with its many extensions supporting data storage such as SQL-on-Hadoop systems and the "Parquet" file format. Another kind of systems for storing and processing big data are NoSQL databases, such as HBase, which are used in applications requiring fast random access. The Kudu system was created to combine the advantages of Hadoop and HBase and enable both effective data set analysis and fast random access. As subject of the research, performance analysis of the mentioned systems was performed. The experiment was conducted in the Amazon Web Services public cloud environment, where the cluster of nine virtual machines was configured. For research purpose, containing about billion rows fragment of "Wikipedia Page Traffic Statistics" public dataset was used. The results of the measurements confirm that the Kudu system is a promising alternative to the commonly used technologies.
引用
收藏
页码:287 / 298
页数:12
相关论文
共 50 条
  • [1] Locational Performance Analysis of Distributed Photovoltaic Systems
    Christoforidis, Georgios C.
    Panapakidis, Ioannis P.
    2017 7TH INTERNATIONAL CONFERENCE ON MODERN POWER SYSTEMS (MPS), 2017,
  • [2] Distributed Systems Performance for Big Data
    Ramos, Marcelo Paiva
    Tasinaffo, Paulo Marcelo
    de Almeida, Eugenio Sper
    Achite, Luis Marcelo
    da Cunha, Adilson Marques
    Vieira Dias, Luiz Alberto
    INFORMATION TECHNOLOGY: NEW GENERATIONS, 2016, 448 : 733 - 744
  • [3] Secured and High Performance Distributed Big Data Storage in Cloud Systems
    Hossain, Md Delwar
    Adnan, Muhammad Abdullah
    PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS), 2018, : 72 - 79
  • [4] Evolution and Analysis of Distributed File Systems in Cloud Storage: Analytical Survey
    Ramesh, Dharavath
    Patidar, Neeraj
    Kumar, Gaurav
    Vunnam, Teja
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 753 - 758
  • [5] Design and Implementation of a Scalable Distributed Web Crawler Based on Hadoop
    Shi, YuLiang
    Zhang, Ti
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 537 - 541
  • [6] A High-Performance and Scalable Distributed Storage and Computing System for IMS Services
    Seraoui, Youssef
    Bellafkih, Mostafa
    Raouyane, Brahim
    2016 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGIES AND APPLICATIONS (CLOUDTECH), 2016, : 335 - 342
  • [7] On the performance of SQL scalable systems on Kubernetes: a comparative study
    Cardas, Cristian
    Aldana-Martin, Jose F.
    Burgueno-Romero, Antonio M.
    Nebro, Antonio J.
    Mateos, Jose M.
    Sanchez, Juan J.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2023, 26 (03): : 1935 - 1947
  • [8] On the performance of SQL scalable systems on Kubernetes: a comparative study
    Cristian Cardas
    José F. Aldana-Martín
    Antonio M. Burgueño-Romero
    Antonio J. Nebro
    Jose M. Mateos
    Juan J. Sánchez
    Cluster Computing, 2023, 26 : 1935 - 1947
  • [9] Performance Analysis of Structured, Un-Structured, and Cloud Storage Systems
    Mondal, Anindita Sarkar
    Sanyal, Madhupa
    Chattapadhyay, Samiran
    Mondal, Kartick Chandra
    INTERNATIONAL JOURNAL OF AMBIENT COMPUTING AND INTELLIGENCE, 2019, 10 (01) : 1 - 29
  • [10] An Extended IMS Framework With a High-Performance and Scalable Distributed Storage and Computing System
    Seraoui, Youssef
    Raouyane, Brahim
    Bellafkih, Mostafa
    2017 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC), 2017,