Distributional regression modeling via generalized additive models for location, scale, and shape: An overview through a data set from learning analytics

被引:11
|
作者
Marmolejo-Ramos, Fernando [1 ]
Tejo, Mauricio [2 ]
Brabec, Marek [3 ]
Kuzilek, Jakub [4 ,5 ]
Joksimovic, Srecko [1 ]
Kovanovic, Vitomir [1 ]
Gonzalez, Jorge [6 ]
Kneib, Thomas [7 ,8 ]
Buehlmann, Peter [9 ]
Kook, Lucas [10 ,11 ]
Briseno-Sanchez, Guillermo [12 ]
Ospina, Raydonal [13 ]
机构
[1] Univ South Australia, Ctr Change & Complex Learning, Adelaide, SA, Australia
[2] Univ Valparaiso, Inst Estadist, Valparaiso, Chile
[3] Czech Acad Sci, Inst Comp Sci, Dept Stat Modelling, Prague, Czech Republic
[4] CTU, Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
[5] Humboldt Univ, Comp Sci & Soc Res Grp, Comp Sci Educ, Berlin, Germany
[6] Pontificia Univ Catolica Chile, Dept Estadist, Santiago, Chile
[7] Georg August Univ Gottingen, Campus Inst Data Sci CIDAS, Gottingen, Germany
[8] Georg August Univ Gottingen, Chair Stat, Gottingen, Germany
[9] Swiss Fed Inst Technol, Seminar Stat, Zurich, Switzerland
[10] Univ Zurich, Epidemiol Biostat & Prevent Inst, Zurich, Switzerland
[11] Zurich Univ Appl Sci, Inst Data Anal & Proc Design, Winterthur, Switzerland
[12] TU Dortmund Univ, Dept Stat, Dortmund, Germany
[13] Univ Fed Pernambuco, CASTLab, Dept Stat, Recife, PE, Brazil
基金
欧盟地平线“2020”;
关键词
causal regularization; causality; educational data mining; generalized additive models for location; scale; and shape; learning analytics; machine learning; statistical learning; statistical modeling; supervised learning; VARIABLE SELECTION; CAUSAL INFERENCE; GAMLSS; KURTOSIS; IDENTIFICATION; DEFINITION; PREDICTION; SKEWNESS; PACKAGE; RULES;
D O I
10.1002/widm.1479
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advent of technological developments is allowing to gather large amounts of data in several research fields. Learning analytics (LA)/educational data mining has access to big observational unstructured data captured from educational settings and relies mostly on unsupervised machine learning (ML) algorithms to make sense of such type of data. Generalized additive models for location, scale, and shape (GAMLSS) are a supervised statistical learning framework that allows modeling all the parameters of the distribution of the response variable with respect to the explanatory variables. This article overviews the power and flexibility of GAMLSS in relation to some ML techniques. Also, GAMLSS' capability to be tailored toward causality via causal regularization is briefly commented. This overview is illustrated via a data set from the field of LA. This article is categorized under: Application Areas > Education and Learning Algorithmic Development > Statistics Technologies > Machine Learning
引用
收藏
页数:22
相关论文
共 12 条
  • [1] Are generalized additive models for location, scale, and shape an improvement on existing models for estimating skewed and heteroskedastic cost data?
    Bohl A.A.
    Blough D.K.
    Fishman P.A.
    Harris J.R.
    Phelan E.A.
    Health Services and Outcomes Research Methodology, 2013, 13 (1) : 18 - 38
  • [2] Robust gradient boosting for generalized additive models for location, scale and shape
    Speller, Jan
    Staerk, Christian
    Gude, Francisco
    Mayr, Andreas
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2023,
  • [3] Generalized additive models for location, scale and shape for high dimensional dataua flexible approach based on boosting
    Mayr, Andreas
    Fenske, Nora
    Hofner, Benjamin
    Kneib, Thomas
    Schmid, Matthias
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2012, 61 : 403 - 427
  • [4] LASSO-type penalization in the framework of generalized additive models for location, scale and shape
    Groll, Andreas
    Hambuckers, Julien
    Kneib, Thomas
    Umlauf, Nikolaus
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 140 : 59 - 73
  • [5] Small Area Estimation of Household Economic Indicators under Unit-Level Generalized Additive Models for Location, Scale and Shape
    Mori, Lorenzo
    Ferrante, Maria Rosaria
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2024, 13 (01) : 160 - 196
  • [6] Timing leaf senescence: A generalized additive models for location, scale and shape approach
    Marien, Bertold
    Papadimitriou, Dimitri
    Kotilainen, Titta
    Zuccarini, Paolo
    Dox, Inge
    Verlinden, Melanie
    Heinecke, Thilo
    Marien, Joachim
    Willems, Patrick
    Decoster, Mieke
    Gasco, Aina
    Lange, Holger
    Penuelas, Josep
    Campioli, Matteo
    AGRICULTURAL AND FOREST METEOROLOGY, 2022, 315
  • [7] Gradient boosting in Markov-switching generalized additive models for location, scale, and shape
    Adam, Timo
    Mayr, Andreas
    Kneib, Thomas
    ECONOMETRICS AND STATISTICS, 2022, 22 : 3 - 16
  • [8] Modeling Linguistic Variables With Regression Models: Addressing Non-Gaussian Distributions, Non-independent Observations, and Non-linear Predictors With Random Effects and Generalized Additive Models for Location, Scale, and Shape
    Coupe, Christophe
    FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [9] Evaluating diel, ontogenetic and environmental effects on larval fish vertical distribution using Generalized Additive Models for Location, Scale and Shape
    Hernandez, Frank J.
    Hare, Jonathan A.
    Fey, Dariusz P.
    FISHERIES OCEANOGRAPHY, 2009, 18 (04) : 224 - 236
  • [10] Modelling and Clustering Sea Conditions: Bivariate FiniteMixtures of Generalized Additive Models for Location, Shape, and Scale Applied to the Analysis of Meteorological Tides and Wave Heights
    Ricciotti, Lorena
    Picone, Marco
    Pollice, Alessio
    Maruotti, Antonello
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (05)