Big Data Open Source Platforms

Cited by: 11
Authors
Coimbra de Almeida, Pedro Daniel [1]
Bernardino, Jorge [2]
Affiliations
[1] ISEC Coimbra Inst Engn, Dept Comp Engn & Syst, Coimbra, Portugal
[2] ISEC CISUC, Polytech Coimbra, Coimbra, Portugal
Source
2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015 | 2015
Keywords
Big Data; Open Source; Data Mining; Data Analysis;
DOI
10.1109/BigDataCongress.2015.45
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In a global market, the capacity to mine and analyze user data is one way for companies to track the needs of their users as closely as possible, in both time and accuracy. Big Data Platforms are one way for companies to meet the challenges involved in building this capacity. Unfortunately, the number of challenges to be addressed, together with the large number of proposed solutions, has led to a large number of different platforms, making it hard to name one definitive and adequate platform for companies. In this paper we compare six of the most important Big Data Open Source Platforms to help companies or organizations choose the one most adequate for their needs. We analyze the following open source platforms: Apache Mahout, MOA, R Project, Vowpal Wabbit, PEGASUS and GraphLab Create (TM).
Pages: 268-275
Number of pages: 8