Introduction to statistical modelling 2: categorical variables and interactions in linear regression

被引:19
作者
Lunt, Mark [1 ]
机构
[1] Univ Manchester, Manchester Acad Hlth Sci Ctr, Inst Inflammat & Repair, Arthrit Res UK Epidemiol Unit,Ctr Musculoskeletal, Manchester, Lancs, England
关键词
linear regression; categorical variable; indicator variable; dummy variable; interaction; RA; POPULATION;
D O I
10.1093/rheumatology/ket172
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
In the first article in this series we explored the use of linear regression to predict an outcome variable from a number of predictive factors. It assumed that the predictive factors were measured on an interval scale. However, this article shows how categorical variables can also be included in a linear regression model, enabling predictions to be made separately for different groups and allowing for testing the hypothesis that the outcome differs between groups. The use of interaction terms to measure whether the effect of a particular predictor variable differs between groups is also explained. An alternative approach to testing the difference between groups of the effect of a given predictor, which consists of measuring the effect in each group separately and seeing whether the statistical significance differs between the groups, is shown to be misleading.
引用
收藏
页码:1141 / 1144
页数:4
相关论文
共 50 条
[31]   Optimal Simultaneous Confidence Bands in Multiple Linear Regression with Predictor Variables Constrained in an Ellipsoidal Region [J].
Ah-Kine, P. ;
Liu, W. .
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2015, 44 (03) :441-452
[32]   Constant width simultaneous confidence bands in multiple linear regression with predictor variables constrained in intervals [J].
Liu, W ;
Jamshidian, M ;
Zhang, Y ;
Bretz, F .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2005, 75 (06) :425-436
[33]   CONDITIONS FOR THE CONSISTENCY OF THE TOTAL LEAST SQUARES ESTIMATOR IN AN ERRORS-IN-VARIABLES LINEAR REGRESSION MODEL [J].
Shklyar, S. V. .
THEORY OF PROBABILITY AND MATHEMATICAL STATISTICS, 2010, 83 :148-162
[34]   Comparison of various texture classification methods using multiresolution analysis and linear regression modelling [J].
Dhanya, S. ;
Roshni, V. S. Kumari .
SPRINGERPLUS, 2016, 5
[35]   Linear Regression Modelling on Epigallocatechin-3-gallate Sensor Data for Green Tea [J].
Modak, Angiras ;
Chatterjee, Trisita Nandy ;
Nag, Sangita ;
Roy, Runu Banerjee ;
Tudu, Bipan ;
Bandyopadhyay, Rajib .
2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2018, :112-117
[36]   Empirical modelling of the growth of Ruditapes philippinarum by means of non linear regression on factorial coordinates [J].
Goulletquer, Philippe ;
Bacher, Cedric .
AQUATIC LIVING RESOURCES, 1988, 1 (03) :141-154
[37]   Machine learning versus linear regression modelling approach for accurate ozone concentrations prediction [J].
Jumin, Ellysia ;
Zaini, Nuratiah ;
Ahmed, Ali Najah ;
Abdullah, Samsuri ;
Ismail, Marzuki ;
Sherif, Mohsen ;
Sefelnasr, Ahmed ;
EI-Shafie, Ahmed .
ENGINEERING APPLICATIONS OF COMPUTATIONAL FLUID MECHANICS, 2020, 14 (01) :713-725
[38]   CONSTRUCTION OF EXACT SIMULTANEOUS CONFIDENCE BANDS IN MULTIPLE LINEAR REGRESSION WITH PREDICTOR VARIABLES CONSTRAINED IN AN ELLIPSOIDAL REGION [J].
Liu, Wei ;
Lin, Shan .
STATISTICA SINICA, 2009, 19 (01) :213-232
[39]   Simulation-based simultaneous confidence bands in multiple linear regression with predictor variables constrained in intervals [J].
Liu, W ;
Jamshidian, M ;
Zhang, Y ;
Donnelly, J .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2005, 14 (02) :459-484
[40]   A hybrid modelling method for time series forecasting based on a linear regression model and deep learning [J].
Wenquan Xu ;
Hui Peng ;
Xiaoyong Zeng ;
Feng Zhou ;
Xiaoying Tian ;
Xiaoyan Peng .
Applied Intelligence, 2019, 49 :3002-3015