机构:
Univ Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
Univ Carlos III Madrid, Inst Financial Big Data, Madrid 28903, SpainUniv Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
Galeano, Pedro
[1
,2
]
Pena, Daniel
论文数: 0引用数: 0
h-index: 0
机构:
Univ Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
Univ Carlos III Madrid, Inst Financial Big Data, Madrid 28903, SpainUniv Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
Pena, Daniel
[1
,2
]
机构:
[1] Univ Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
[2] Univ Carlos III Madrid, Inst Financial Big Data, Madrid 28903, Spain
Machine learning;
Sparse model selection;
Statistical learning;
Network analysis;
Multivariate data;
Time series;
HIGH-DIMENSIONAL DATA;
OUTLIER DETECTION;
TIME-SERIES;
PRINCIPAL COMPONENTS;
VARIABLE SELECTION;
MODEL SELECTION;
ALGORITHMS;
CHALLENGES;
VISUALIZATION;
VALIDATION;
D O I:
10.1007/s11749-019-00651-9
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
This article analyzes how Big Data is changing the way we learn from observations. We describe the changes in statistical methods in seven areas that have been shaped by the Big Data-rich environment: the emergence of new sources of information; visualization in high dimensions; multiple testing problems; analysis of heterogeneity; automatic model selection; estimation methods for sparse models; and merging network information with statistical models. Next, we compare the statistical approach with those in computer science and machine learning and argue that the convergence of different methodologies for data analysis will be the core of the new field of data science. Then, we present two examples of Big Data analysis in which several new tools discussed previously are applied, as using network information or combining different sources of data. Finally, the article concludes with some final remarks.
机构:
Univ A Coruna, Fac Informat, Dept Matemat, CITIC,ITMATI,Grp MODES, La Coruna 15071, SpainUniv A Coruna, Fac Informat, Dept Matemat, CITIC,ITMATI,Grp MODES, La Coruna 15071, Spain