A review of spline function procedures in R

被引:309
作者
Perperoglou, Aris [1 ]
Sauerbrei, Willi [2 ,3 ]
Abrahamowicz, Michal [4 ]
Schmid, Matthias [5 ]
机构
[1] Univ Essex, Dept Math Sci, Colchester, Essex, England
[2] Univ Freiburg, Inst Med Biometry & Stat, Fac Med, Freiburg, Germany
[3] Univ Freiburg, Med Ctr, Freiburg, Germany
[4] McGill Univ, Ctr Hlth, Montreal, PQ, Canada
[5] Univ Bonn, Fac Med, Med Biometry Informat & Epidemiol, Bonn, Germany
关键词
Multivariable modelling; Functional form of continuous covariates; GENERALIZED ADDITIVE-MODELS;
D O I
10.1186/s12874-019-0666-3
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
BackgroundWith progress on both the theoretical and the computational fronts the use of spline modelling has become an established tool in statistical regression analysis. An important issue in spline modelling is the availability of user friendly, well documented software packages. Following the idea of the STRengthening Analytical Thinking for Observational Studies initiative to provide users with guidance documents on the application of statistical methods in observational research, the aim of this article is to provide an overview of the most widely used spline-based techniques and their implementation in R.MethodsIn this work, we focus on the R Language for Statistical Computing which has become a hugely popular statistics software. We identified a set of packages that include functions for spline modelling within a regression framework. Using simulated and real data we provide an introduction to spline modelling and an overview of the most popular spline functions.ResultsWe present a series of simple scenarios of univariate data, where different basis functions are used to identify the correct functional form of an independent variable. Even in simple data, using routines from different packages would lead to different results.ConclusionsThis work illustrate challenges that an analyst faces when working with data. Most differences can be attributed to the choice of hyper-parameters rather than the basis used. In fact an experienced user will know how to obtain a reasonable outcome, regardless of the type of spline used. However, many analysts do not have sufficient knowledge to use these powerful tools adequately and will need more guidance.
引用
收藏
页数:16
相关论文
共 43 条
  • [1] [Anonymous], 1990, SPLINE MODELS OBSERV
  • [2] [Anonymous], 2001, A Practical Guide to Splines
  • [3] [Anonymous], QUANTREG QUANTILE RE
  • [4] [Anonymous], 1993, NONPARAMETRIC REGRES
  • [5] [Anonymous], 16 YEARS R PROJECT H
  • [6] [Anonymous], GROWTH CRAN PACKAGES
  • [7] [Anonymous], 1990, GEN ADDITIVE MODELS
  • [8] [Anonymous], R PROGRAMMING LANGUA
  • [9] [Anonymous], R YOU READY R YOU NE
  • [10] [Anonymous], NEW SURVEYS SHOW CON