Survey of Real-time Processing Systems for Big Data

被引:42
|
作者
Liu, Xiufeng [1 ]
Iftikhar, Nadeem [2 ]
Xie, Xike [3 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Univ Coll Northern, Hjorring, Denmark
[3] Aalborg Univ, Aalborg, Denmark
来源
PROCEEDINGS OF THE 18TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM (IDEAS14) | 2014年
关键词
Survey; Real-time; Big data; Architectures; Systems; MAPREDUCE;
D O I
10.1145/2628194.2628251
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, real-time processing and analytics systems for big data-in the context of Business Intelligence (BI)-have received a growing attention. The traditional BI platforms that perform regular updates on daily, weekly or monthly basis are no longer adequate to satisfy the fast-changing business environments. However, due to the nature of big data, it has become a challenge to achieve the real-time capability using the traditional technologies. The recent distributed computing technology, MapReduce, provides off-the-shelf high scalability that can significantly shorten the processing time for big data; Its open-source implementation such as Hadoop has become the de-facto standard for processing big data, however, Hadoop has the limitation of supporting real-time updates. The improvements in Hadoop for the real-time capability, and the other alternative real-time frameworks have been emerging in recent years. This paper presents a survey of the open source technologies that support big data processing in a real-time/near real-time fashion, including their system architectures and platforms.
引用
收藏
页码:356 / 361
页数:6
相关论文
共 50 条
  • [1] Real-time big data processing for anomaly detection: A Survey
    Habeeb, Riyaz Ahamed Ariyaluran
    Nasaruddin, Fariza
    Gani, Abdullah
    Hashem, Ibrahim Abaker Targio
    Ahmed, Ejaz
    Imran, Muhammad
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2019, 45 : 289 - 307
  • [2] A survey on data stream, big data and real-time
    Gomes E.H.A.
    Plentz P.D.M.
    De Rolt C.R.
    Dantas M.A.R.
    International Journal of Networking and Virtual Organisations, 2019, 20 (02) : 143 - 167
  • [3] Parallel Processing Systems for Big Data: A Survey
    Zhang, Yunquan
    Cao, Ting
    Li, Shigang
    Tian, Xinhui
    Yuan, Liang
    Jia, Haipeng
    Vasilakos, Athanasios V.
    PROCEEDINGS OF THE IEEE, 2016, 104 (11) : 2114 - 2136
  • [4] Beyond Batch Processing: Towards Real-Time and Streaming Big Data
    Shahrivari, Saeed
    COMPUTERS, 2014, 3 (04) : 117 - 129
  • [5] Big Data Real-time Processing Based on Storm
    Yang, Wenjie
    Liu, Xingang
    Zhang, Lan
    Yang, Laurence T.
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1784 - 1787
  • [6] Real-Time Big Data Stream Processing Using GPU with Spark Over Hadoop Ecosystem
    Rathore, M. Mazhar
    Son, Hojae
    Ahmad, Awais
    Paul, Anand
    Jeon, Gwanggil
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (03) : 630 - 646
  • [7] Railway Big Data Real-time Processing Based on Storm
    Guo, Shihang
    Zhang, Lichen
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 536 - 539
  • [8] A Survey on Real-time Big Data Analytics: Applications and Tools
    Yadranjiaghdam, Babak
    Pool, Nathan
    Tabrizi, Nasseh
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 404 - 409
  • [9] Near real-time big-data processing for data driven applications
    Kampars, Janis
    Grabis, Janis
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA INNOVATIONS AND APPLICATIONS (INNOVATE-DATA), 2017, : 35 - 42
  • [10] InfoFrame table access method for real-time processing of big data
    Oosawa, Hideki
    Miyata, Tsuyoshi
    NEC Technical Journal, 2012, 7 (02): : 23 - 27