Principal components analysis of diet and alternatives for identifying the combination of foods that are associated with the risk of disease: a simulation study

被引:5
作者
Bakolis, Ioannis [1 ]
Burney, Peter [2 ]
Hooper, Richard [3 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Sch Publ Hlth, Dept Epidemiol & Biostat, London W2 1PG, England
[2] Univ London Imperial Coll Sci Technol & Med, Natl Heart & Lung Inst, Resp Epidemiol & Publ Hlth Grp, London SW3 6LR, England
[3] Barts & London Queen Marys Sch Med & Dent, Blizard Inst, Ctr Primary Care & Publ Hlth, London E1 2AT, England
关键词
Principal components analysis; Dietary patterns; Nutritional epidemiology; Logistic regression; Monte Carlo simulation; CORONARY-HEART-DISEASE; OBSTRUCTIVE PULMONARY-DISEASE; BREAST-CANCER RISK; CARDIOVASCULAR-DISEASE; COLORECTAL-CANCER; CLUSTER-ANALYSIS; EATING PATTERNS; ADULT; WOMEN; MEN;
D O I
10.1017/S0007114514000221
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
Dietary patterns derived empirically using principal components analysis (PCA) are widely employed for investigating diet-disease relationships. In the present study, we investigated whether PCA performed better at identifying such associations than an analysis of each food on a FFQ separately, referred to here as an exhaustive single food analysis (ESFA). Data on diet and disease were simulated using real FFQ data and by assuming a number of food intakes in combination that were associated with the risk of disease. In each simulation, ESFA and PCA were employed to identify the combinations of foods that are associated with the risk of disease using logistic regression, allowing for multiple testing and adjusting for energy intake. ESFA was also separately adjusted for principal components of diet, foods that were significant in the unadjusted ESFA and propensity scores. For each method, we investigated the power with which an association between diet and disease could be identified, and the power and false discovery rate (FDR) for identifying the specific combination of food intakes. In some scenarios, ESFA had greater power to detect a diet-disease association than PCA. ESFA also typically had a greater power and a lower FDR for identifying the combinations of food intakes that are associated with the risk of disease. The FDR of both methods increased with increasing sample size, but when ESFA was adjusted for foods that were significant in the unadjusted ESFA, FDR were controlled at the desired level. These results question the widespread use of PCA in nutritional epidemiology. The adjusted ESFA identifies the combinations of foods that are causally linked to the risk of disease with low FDR and surprisingly good power.
引用
收藏
页码:61 / 69
页数:9
相关论文
共 63 条
[1]   Dietary patterns and breast cancer risk in women participating in the Black Women's Health Study [J].
Agurs-Collins, Tanya ;
Rosenberg, Lynn ;
Makambi, Kepher ;
Palmer, Julie R. ;
Adams-Campbell, Lucile .
AMERICAN JOURNAL OF CLINICAL NUTRITION, 2009, 90 (03) :621-628
[2]  
[Anonymous], 2002, The Analysis and Interpretation of Multivariate Data for Social Scientists
[3]   Quantitative trait loci analysis using the false discovery rate [J].
Benjamini, Y ;
Yekutieli, D .
GENETICS, 2005, 171 (02) :783-789
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Dietary patterns and gastric cancer risk: a systematic review and meta-analysis [J].
Bertuccio, P. ;
Rosato, V. ;
Andreano, A. ;
Ferraroni, M. ;
Decarli, A. ;
Edefonti, V. ;
La Vecchia, C. .
ANNALS OF ONCOLOGY, 2013, 24 (06) :1450-1458
[6]   Postmenopausal Breast Cancer Risk and Dietary Patterns in the E3N-EPIC Prospective Cohort Study [J].
Cottet, Vanessa ;
Touvier, Mathilde ;
Fournier, Agnes ;
Touillaud, Marina S. ;
Lafay, Lionel ;
Clavel-Chapelon, Francoise ;
Boutron-Ruault, Marie-Christine .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2009, 170 (10) :1257-1267
[7]   Dietary patterns and breast cancer: a review with focus on methodological issues [J].
Edefonti, Valeria ;
Randi, Giorgia ;
La Vecchia, Carlo ;
Ferraroni, Monica ;
Decarli, Adriano .
NUTRITION REVIEWS, 2009, 67 (06) :297-314
[8]  
Ender PB, 2002, POWERLOG COMMAND PER
[9]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[10]  
Fan JQ, 2010, STAT SINICA, V20, P101