Performance Evaluation of a MapReduce Hadoop-based Implementation for Processing Large Virtual Campus Log Files

被引:4
作者
Xhafa, Fatos [1 ]
Garcia, Daniel [1 ]
Ramirez, Daniel [1 ]
Caballe, Santi [2 ]
机构
[1] Univ Politecn Cataluna, Barcelona, Spain
[2] Open Univ Catalonia, Barcelona, Spain
来源
2015 10TH INTERNATIONAL CONFERENCE ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC) | 2015年
关键词
Big Data; Massive Processing; Learning Analytics; Mining; Performance; Virtual Campus; Log Files; MapReduce; Hadoop; Cloud Computing; ANALYTICS;
D O I
10.1109/3PGCIC.2015.42
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Cloud computing technologies are bringing new scales of computational processing power and storage capacity to meet very demanding requirements of today's applications. One such family of applications is the one of analytics based on processing big data. More specifically, there is a large family of analytics applications from processing log data files. Indeed, log data files are commonplace in many Internet-based systems and applications, comprising system logs, server logs, application logs, databases logs, user activity logs, etc. These applications are analytics oriented applications based on processing the various types of log files. While log data file processing has been recently an issue of investigation by many researchers and developers, the new feature is that of scale: Cloud based systems can enable processing unlimited amount of data either off-line or online in streaming mode. In this work we evaluate the performance of a MapReduce Hadoop-based implementation for processing large log data files of a Virtual Campus. The study aims to reveal the potential of using such implementations as a basis for learning analytics for use by a variety of users in a Virtual Campus.
引用
收藏
页码:200 / 206
页数:7
相关论文
共 18 条
  • [1] Al-Ashmoery Y., 2015, INTELLIGENT SYSTEMS, P1
  • [2] What groupware functionality do users really use? Analysis of the usage of the BSCW system
    Appelt, W
    [J]. NINTH EUROMICRO WORKSHOP ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 2001, : 337 - 341
  • [3] Caballe S., 2008, LEARNING GRID HDB, V2
  • [4] Distributed-based massive processing of activity logs for efficient user modeling in a Virtual Campus
    Caballe, Santi
    Xhafa, Fatos
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (04): : 829 - 844
  • [5] Linking Business Analytics to Decision Making Effectiveness: A Path Model Analysis
    Cao, Guangming
    Duan, Yanqing
    Li, Gendao
    [J]. IEEE TRANSACTIONS ON ENGINEERING MANAGEMENT, 2015, 62 (03) : 384 - 395
  • [6] Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
  • [7] Dorogov AY, 2015, 2015 XVIII International Conference on Soft Computing and Measurements (SCM), P182, DOI 10.1109/SCM.2015.7190449
  • [8] Mahmood T, 2013, 2013 2ND NATIONAL CONFERENCE ON INFORMATION ASSURANCE (NCIA), P129, DOI 10.1109/NCIA.2013.6725337
  • [9] Miguel J., 2015, CONCURR COMP-PRACT E, V27, P1988
  • [10] Osman Amr, 2013, 2013 IEEE Ninth World Congress on Services (SERVICES), P428, DOI 10.1109/SERVICES.2013.36