Rejoinder on: Data science, big data and statistics

被引:0
作者
Galeano, Pedro [1 ,2 ]
Pena, Daniel [1 ,2 ]
机构
[1] Univ Carlos III Madrid, Dept Estadist, Madrid 28903, Spain
[2] Univ Carlos III Madrid, Inst Financial Big Data, Madrid 28903, Spain
关键词
Machine learning; Multivariate data; Network analysis; Sparse model selection; Statistical learning; Time series;
D O I
10.1007/s11749-019-00652-8
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This article analyzes how Big Data is changing the way we learn from observations. We describe the changes in statistical methods in seven areas that have been shaped by the Big Data-rich environment: the emergence of new sources of information; visualization in high dimensions; multiple testing problems; analysis of heterogeneity; automatic model selection; estimation methods for sparse models; and merging network information with statistical models. Next, we compare the statistical approach with those in computer science and machine learning and argue that the convergence of different methodologies for data analysis will be the core of the new field of data science. Then, we present two examples of Big Data analysis in which several new tools discussed previously are applied, as using network information or combining different sources of data. Finally, the article concludes with some final remarks. © 2019, Sociedad de Estadística e Investigación Operativa.
引用
收藏
页码:363 / 368
页数:6
相关论文
共 50 条
[11]   Data Science and Big Data Technologies Role in the Digital Economy [J].
Novikov, Sergey V. .
TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2020, 9 (02) :756-762
[12]   Integrating Systems Modelling and Data Science: The Joint Future of Simulation and 'Big Data' Science [J].
Pruyt, Erik .
INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2016, 5 (01) :1-16
[13]   Data Science: Relationship with big data, data driven predictions and machine learning. [J].
Singh, Akansha ;
Saxena, Nidhi .
2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL PERFORMANCE EVALUATION (COMPE-2021), 2021, :67-+
[14]   Gerontologic Biostatistics and Data Science: Aging Research in the Era of Big Data [J].
Chen, Chixiang ;
Murphy, Terrence E. ;
Speiser, Jaime Lynn ;
Bandeen-Roche, Karen ;
Allore, Heather ;
Travison, Thomas G. ;
Griswold, Michael ;
Shardell, Michelle .
JOURNALS OF GERONTOLOGY SERIES A-BIOLOGICAL SCIENCES AND MEDICAL SCIENCES, 2024, 80 (01)
[15]   Big data and machine learning for materials science [J].
Rodrigues Jr, Jose F. ;
Florea, Larisa ;
de Oliveira, Maria C. F. ;
Diamond, Dermot ;
Oliveira Jr, Osvaldo N. .
DISCOVER MATERIALS, 2021, 1 (01)
[16]   Evolution of symposia on the interface of computing and statistics defines data science to be the interface [J].
Goodman, Arnold .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2014, 6 (05) :367-377
[17]   Learning from data: Unifying statistics and computer science [J].
Cleveland, WS .
INTERNATIONAL STATISTICAL REVIEW, 2005, 73 (02) :217-221
[18]   Data science vs. statistics: two cultures? [J].
Iain Carmichael ;
J. S. Marron .
Japanese Journal of Statistics and Data Science, 2018, 1 (1) :117-138
[19]   Artificial Intelligence and Big Data Science in Neurocritical Care [J].
Mainali, Shraddha ;
Park, Soojin .
CRITICAL CARE CLINICS, 2023, 39 (01) :235-242
[20]   Big Data Research for Social Science and Social Impact [J].
Lytras, Miltiadis D. ;
Visvizi, Anna .
SUSTAINABILITY, 2020, 12 (01)