Some Thoughts on Official Statistics and its Future (with discussion)

被引:1
作者
Tille, Yves [1 ]
Debusschere, Marc [2 ]
Luomaranta, Henri [3 ]
Axelson, Martin [4 ]
Elvers, Eva [5 ]
Holmberg, Anders [6 ]
Valliant, Richard [7 ]
机构
[1] Univ Neuchatel, Inst Stat, Pierrea Mazel 7, CH-2000 Neuchatel, Switzerland
[2] Stat Belgium, Koning Albert II Laan 16, B-1000 Brussels, Belgium
[3] Stat Finland, Tyopajankatu 13, Helsinki 00580, Finland
[4] Stat Sweden, Klostergatan 23, SE-70189 Orebro, Sweden
[5] Stat Sweden, Solna Strandvag 86, SE-17154 Solna, Sweden
[6] Australian Bur Stat, Methodol Div, Locked Bag 10, Belconnen, ACT 2617, Australia
[7] Univ Michigan, Inst Social Res, 4620 North Pk Ave,Apt 1406W, Chevy Chase, MD 20815 USA
关键词
Deduction; foundations; induction; Lasso; p-value; registers; sampling; statistical learning; REGRESSION SHRINKAGE; MISSING DATA; MODEL; SELECTION; HISTORY;
D O I
10.2478/jos-2022-0026
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
In this article, we share some reflections on the state of statistical science and its evolution in the production systems of official statistics. We first try to make a synthesis of the evolution of statistical thinking. We then examine the evolution of practices in official statistics, which had to face very early on a diversification of sou rces: first with the use of censuses, then sample surveys and finally administrative files. At each stage, a profound revision of methods was necessary. We show that since the middle of the 20th century, one of the major challenges of statistics has been to produce estimates from a variety of sources. To do this, a large number of methods have been proposed which are based on very different f oundations. The term "big data" encompasses a set of sources and new statistical methods. We first examine the potential of valorization of big data in official statistics. Some applications such as image analysis for agricultural prediction are very old and will be further developed. However, we report our skepticism towards web-scrapping methods. Then we examine the use of new deep learning methods. With access to more and more sources, the great challenge will remain the valorization and harmonization of these sources.
引用
收藏
页码:557 / 598
页数:42
相关论文
共 92 条
  • [1] [Anonymous], 2016, Journal of Statistical Research
  • [2] Bastin C., 1980, PRATIQUE LANALYSE DO
  • [3] Beaumont J.-F., 2008, Metron, VLXVI, P5
  • [4] Bellhouse D., 1988, Sampling, V6, P1
  • [5] Benzcri JP., 1973, LANALYSE DONNES
  • [6] Bethlehem J., 2009, RISE SURVEY SAMPLING
  • [7] Borges JorgeLuis., 1960, OTRAS INQUISICIONES
  • [8] Box G. E. P., 2007, WILEY SERIES PROBABI, V2nd
  • [9] Model-Assisted Survey Estimation with Modern Prediction Techniques
    Breidt, F. Jay
    Opsomer, Jean D.
    [J]. STATISTICAL SCIENCE, 2017, 32 (02) : 190 - 205
  • [10] Breidt FJ, 2000, ANN STAT, V28, P1026