Univariate description and bivariate statistical inference: the first step delving into data

被引:142
作者
Zhang, Zhongheng [1 ]
机构
[1] Zhejiang Univ, Jinhua Hosp, Jinhua Municipal Cent Hosp, Dept Crit Care Med, Jinhua 321000, Peoples R China
关键词
Univariate description; bivariate statistical inference; R; table; baseline characteristics; automation;
D O I
10.21037/atm.2016.02.11
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
In observational studies, the first step is usually to explore data distribution and the baseline differences between groups. Data description includes their central tendency (e.g., mean, median, and mode) and dispersion (e.g., standard deviation, range, interquartile range). There are varieties of bivariate statistical inference methods such as Student's t-test, Mann-Whitney U test and Chi-square test, for normal, skews and categorical data, respectively. The article shows how to perform these analyses with R codes. Furthermore, I believe that the automation of the whole workflow is of paramount importance in that (I) it allows for others to repeat your results; (II) you can easily find out how you performed analysis during revision; (III) it spares data input by hand and is less error-prone; and (IV) when you correct your original dataset, the final result can be automatically corrected by executing the codes. Therefore, the process of making a publication quality table incorporating all abovementioned statistics and P values is provided, allowing readers to customize these codes to their own needs.
引用
收藏
页数:7
相关论文
共 8 条
[1]  
Breiman L., 2001, Machine Learning, V45, P5
[2]  
Corder G.W., 2014, Nonparametric Statistics: A Step-By-Step Approach, DOI DOI 10.1002/9781118165881
[3]   On the meaning and use of kurtosis [J].
DeCarlo, LT .
PSYCHOLOGICAL METHODS, 1997, 2 (03) :292-307
[4]   Wilcoxon-Mann-Whitney or t-test? On assumptions for hypothesis tests and multiple interpretations of decision rules [J].
Fay, Michael P. ;
Proschan, Michael A. .
STATISTICS SURVEYS, 2010, 4 :1-39
[5]   Normality Tests for Statistical Analysis: A Guide for Non-Statisticians [J].
Ghasemi, Asghar ;
Zahediasl, Saleh .
INTERNATIONAL JOURNAL OF ENDOCRINOLOGY AND METABOLISM, 2012, 10 (02) :486-489
[6]  
Komsta L., 2012, moments: Moments, cumulants, skewness, kurtosis and related tests)
[7]   Missing values in big data research: some basic skills [J].
Zhang, Zhongheng .
ANNALS OF TRANSLATIONAL MEDICINE, 2015, 3 (21)
[8]   Data management by using R: big data clinical research series [J].
Zhang, Zhongheng .
ANNALS OF TRANSLATIONAL MEDICINE, 2015, 3 (20)