Big Data Analytics

被引:45
作者
Rajaraman, V. [1 ]
机构
[1] Indian Inst Sci, Bengaluru, India
来源
RESONANCE-JOURNAL OF SCIENCE EDUCATION | 2016年 / 21卷 / 08期
关键词
Big data; data science; fourth paradigm; MapReduce; Hadoop;
D O I
10.1007/s12045-016-0376-7
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
The volume and variety of data being generated using computers is doubling every two years. It is estimated that in 2015, 8 Zettabytes ( Zetta=10(21)) were generated which consisted mostly of unstructured data such as emails, blogs, Twitter, Facebook posts, images, and videos. This is called big data. It is possible to analyse such huge data collections with clusters of thousands of inexpensive computers to discover patterns in the data that have many applications. But analysing massive amounts of data available in the Internet has the potential of impinging on our privacy. Inappropriate analysis of big data can lead to misleading conclusions. In this article, we explain what is big data, how it is analysed, and give some case studies illustrating the potentials and pitfalls of big data analytics.
引用
收藏
页码:695 / 716
页数:22
相关论文
共 9 条
[1]  
HASSOUN M, 1998, FUNDAMENTALS ARTIFIC
[2]  
Hey T., 2009, 4 PARADIGM DATA INTE
[3]   The Parable of Google Flu: Traps in Big Data Analysis [J].
Lazer, David ;
Kennedy, Ryan ;
King, Gary ;
Vespignani, Alessandro .
SCIENCE, 2014, 343 (6176) :1203-1205
[4]  
Metz C, HUGE BREAKTHROUGH GO
[5]  
Rajaraman V, 2016, PARALLEL COMPUTERS A
[6]  
Smith S, 2014, SIGNIFICANCE, V11, P10
[7]  
Srikant R P, 2016, DATAQUEST
[8]  
Ward J S, ARXIVORGABS13095821
[9]  
Watson H J, 2014, COMMUNICATION ASS IN, V34, P124