Introduction to statistical modelling 2: categorical variables and interactions in linear regression

被引:19
作者
Lunt, Mark [1 ]
机构
[1] Univ Manchester, Manchester Acad Hlth Sci Ctr, Inst Inflammat & Repair, Arthrit Res UK Epidemiol Unit,Ctr Musculoskeletal, Manchester, Lancs, England
关键词
linear regression; categorical variable; indicator variable; dummy variable; interaction; RA; POPULATION;
D O I
10.1093/rheumatology/ket172
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
In the first article in this series we explored the use of linear regression to predict an outcome variable from a number of predictive factors. It assumed that the predictive factors were measured on an interval scale. However, this article shows how categorical variables can also be included in a linear regression model, enabling predictions to be made separately for different groups and allowing for testing the hypothesis that the outcome differs between groups. The use of interaction terms to measure whether the effect of a particular predictor variable differs between groups is also explained. An alternative approach to testing the difference between groups of the effect of a given predictor, which consists of measuring the effect in each group separately and seeing whether the statistical significance differs between the groups, is shown to be misleading.
引用
收藏
页码:1141 / 1144
页数:4
相关论文
共 50 条
[41]   CLEverReg: A CNN-LSTM based Linear Regression Technique for Temporal Fire Event Modelling [J].
Yusuf, Syed Adnan ;
Samad, Abdul ;
Garrity, David James .
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[42]   Freight production of agricultural commodities in India using multiple linear regression and generalized additive modelling [J].
Dhulipala, Sowjanya ;
Patil, Gopal R. .
TRANSPORT POLICY, 2020, 97 :245-258
[43]   A hybrid modelling method for time series forecasting based on a linear regression model and deep learning [J].
Xu, Wenquan ;
Peng, Hui ;
Zeng, Xiaoyong ;
Zhou, Feng ;
Tian, Xiaoying ;
Peng, Xiaoyan .
APPLIED INTELLIGENCE, 2019, 49 (08) :3002-3015
[44]   Comparisons of linear regression and survival analysis using single and mixture distributions approaches in modelling LGD [J].
Zhang, Jie ;
Thomas, Lyn C. .
INTERNATIONAL JOURNAL OF FORECASTING, 2012, 28 (01) :204-215
[45]   A NEW APPROACH TO SELECT THE BEST SUBSET OF PREDICTORS IN LINEAR REGRESSION MODELLING: BI-OBJECTIVE MIXED INTEGER LINEAR PROGRAMMING [J].
Charkhgard, Hadi ;
Eshragh, Ali .
ANZIAM JOURNAL, 2019, 61 (01) :64-75
[46]   Statistical modeling in the laser cladding process of Inconel 625 via linear regression and response surface method [J].
Borhani, Mohammad Reza ;
Rajabi, Mohammad ;
Shojarazavi, Reza ;
Jamaati, Roohollah .
JOURNAL OF LASER APPLICATIONS, 2023, 35 (02)
[47]   TWO-SAMPLE TESTS FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH AN APPLICATION TO DETECTING INTERACTIONS [J].
Xia, Yin ;
Cai, Tianxi ;
Cai, T. Tony .
STATISTICA SINICA, 2018, 28 (01) :63-92
[48]   Computational tools for probing interactions in multiple linear regression, multilevel modeling, and latent curve analysis [J].
Preacher, Kristopher J. ;
Curran, Patrick J. ;
Bauer, Daniel J. .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2006, 31 (04) :437-448
[49]   Lack of fit tests for linear regression models with many predictor variables using minimal weighted maximal matchings [J].
Miller, Forrest R. ;
Neill, James W. .
JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 150 :14-26
[50]   Standardizing effect size from linear regression models with log-transformed variables for meta-analysis [J].
Miguel Rodríguez-Barranco ;
Aurelio Tobías ;
Daniel Redondo ;
Elena Molina-Portillo ;
María José Sánchez .
BMC Medical Research Methodology, 17