Breast cancer-associated high-order SNP-SNP interaction of CXCL12/CXCR4-related genes by an improved multifactor dimensionality reduction (MDR-ER)

被引:28
|
作者
Fu, Ou-Yang [1 ,2 ,3 ,4 ]
Chang, Hsueh-Wei [2 ,8 ,9 ]
Lin, Yu-Da [5 ]
Chuang, Li-Yeh [6 ,7 ]
Hou, Ming-Feng [2 ,4 ,10 ,11 ]
Yang, Cheng-Hong [5 ]
机构
[1] Kaohsiung Med Univ, Grad Inst Med, Coll Med, Kaohsiung 80708, Taiwan
[2] Kaohsiung Med Univ, Kaohsiung Med Univ Hosp, Ctr Canc, 100 Shih Chuan 1st Rd, Kaohsiung 80708, Taiwan
[3] Kaohsiung Municipal Tatung Hosp, Dept Surg, Kaohsiung 80145, Taiwan
[4] Kaohsiung Med Univ Hosp, Dept Surg, Kaohsiung 80708, Taiwan
[5] Natl Kaohsiung Univ Appl Sci, Dept Elect Engn, 415 Chien Kung Rd, Kaohsiung 80778, Taiwan
[6] I Shou Univ, Dept Chem Engn, Kaohsiung 84001, Taiwan
[7] I Shou Univ, Inst Biotechnol & Chem Engn, Kaohsiung 84001, Taiwan
[8] Kaohsiung Med Univ, Dept Biomed Sci & Environm Biol, Kaohsiung 80708, Taiwan
[9] Natl Sun Yat Sen Univ, Inst Med Sci & Technol, Kaohsiung 80424, Taiwan
[10] Kaohsiung Med Univ, Inst Clin Med, Kaohsiung, Taiwan
[11] Kaohsiung Municipal Hsiaokang Hosp, Kaohsiung 812, Taiwan
关键词
breast cancer; single nucleotide polymorphism; gene-gene interactions; imbalanced data set; multifactor dimensionality reduction; D-LOOP; POLYMORPHISMS; RISK; CXCR4; SUSCEPTIBILITY; METASTASIS; BIOMARKERS; CELLS;
D O I
10.3892/or.2016.4956
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
In association studies, the combined effects of single nucleotide polymorphism (SNP)-SNP interactions and the problem of imbalanced data between cases and controls are frequently ignored. In the present study, we used an improved multifactor dimensionality reduction (MDR) approach namely MDR-ER to detect the high order SNP-SNP interaction in an imbalanced breast cancer data set containing seven SNPs of chemokine CXCL12/CXCR4 pathway genes. Most individual SNPs were not significantly associated with breast cancer. After MDR-ER analysis, six significant SNP-SNP interaction models with seven genes (highest cross-validation consistency, 10; classification error rates, 41.3-21.0; and prediction error rates, 47.4-55.3) were identified. CD4 and VEGFA genes were associated in a 2-loci interaction model (classification error rate, 41.3; prediction error rate, 47.5; odds ratio (OR), 2.069; 95% bootstrap CI, 1.40-2.90; P=1.71E-04) and it also appeared in all the best 2-7-loci models. When the loci number increased, the classification error rates and P-values decreased. The powers in 2-7-loci in all models were >0.9. The minimum classification error rate of the MDR-ER-generated model was shown with the 7-loci interaction model (classification error rate, 21.0; OR=15.282; 95% bootstrap CI, 9.54-23.87; P=4.03E-31). In the epistasis network analysis, the overall effect with breast cancer susceptibility was identified and the SNP order of impact on breast cancer was identified as follows: CD4 = VEGFA > KITLG > CXCL12 > CCR7 = MMP2 > CXCR4. In conclusion, the MDR-ER can effectively and correctly identify the best SNP-SNP interaction models in an imbalanced data set for breast cancer cases.
引用
收藏
页码:1739 / 1747
页数:9
相关论文
empty
未找到相关数据