UGM: a more stable procedure for large-scale multiple testing problems, new solutions to identify oncogene

被引:0
作者
Liu, Chengyou [1 ]
Zhou, Leilei [1 ]
Wang, Yuhe [1 ]
Tian, Shuchang [1 ]
Zhu, Junlin [2 ]
Qin, Hang [1 ]
Ding, Yong [3 ,4 ]
Jiang, Hongbing [1 ,5 ]
机构
[1] Nanjing Med Univ, Dept Med Engn, Nanjing Hosp 1, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Med Univ, Dept Crit Care Med, Nanjing Hosp 1, Nanjing 210016, Jiangsu, Peoples R China
[3] Nanjing Med Univ, Dept Math & Comp, Nanjing, Jiangsu, Peoples R China
[4] Nanjing Med Univ, Dept Biomed Engn, Nanjing, Jiangsu, Peoples R China
[5] Nanjing Hlth Informat Ctr, Nanjing 210016, Jiangsu, Peoples R China
关键词
Differentially expressed genes; False discovery rate; Standard deviation; RNA-Seq data; Root mean square error; Cancer-associated genes; GENOME-WIDE ASSOCIATION; BREAST-CANCER; EXPRESSION; ACTIVATION; GENES;
D O I
10.1186/s12976-019-0117-1
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Variations of gene expression levels play an important role in tumors. There are numerous methods to identify differentially expressed genes in high-throughput sequencing. Several algorithms endeavor to identify distinctive genetic patterns susceptable to particular diseases. Although these processes have been proved successful, the probability that the number of non-differentially expressed genes measured by false discovery rate (FDR) has a large standard deviation, and the misidentification rate (type I error) grows rapidly when the number of genes to be detected become larger. In this study we developed a new method, Unit Gamma Measurement (UGM), accounting for multiple hypotheses test statistics distribution, which could reduce the dependency problem. Simulated expression profile data and breast cancer RNA-Seq data were utilized to testify the accuracy of UGM. The results show that the number of non-differentially expressed genes identified by the UGM is very close to the real-evidence data, and the UGM also has a smaller standard error, range, quartile range and RMS error. In addition, the UGM can be used to screen many breast cancer-associated genes, such as BRCA1, BRCA2, PTEN, BRIP1, etc., provides better accuracy, robustness and efficiency, the method of identification differentially expressed genes in high-throughput sequencing.
引用
收藏
页数:9
相关论文
共 26 条
[1]   AdvISER-PYRO: Amplicon Identification using SparsE Representation of PYROsequencing signal [J].
Ambroise, Jerome ;
Piette, Anne-Sophie ;
Delcorps, Cathy ;
Rigouts, Leen ;
de Jong, Bouke C. ;
Irenge, Leonid ;
Robert, Annie ;
Gala, Jean-Luc .
BIOINFORMATICS, 2013, 29 (16) :1963-1969
[2]  
Barrett T, 2005, NUCLEIC ACIDS RES, V33, pD562
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Adaptive linear step-up procedures that control the false discovery rate [J].
Benjamini, Yoav ;
Krieger, Abba M. ;
Yekutieli, Daniel .
BIOMETRIKA, 2006, 93 (03) :491-507
[5]   Genome-wide association studies in Sjogren's syndrome: What do the genes tell us about disease pathogenesis? [J].
Burbelo, Peter D. ;
Arnbatipudi, Kiran ;
Alevizos, Ilias .
AUTOIMMUNITY REVIEWS, 2014, 13 (07) :756-761
[6]   Loss of the BRCA1-Interacting Helicase BRIP1 Results in Abnormal Mammary Acinar Morphogenesis [J].
Daino, Kazuhiro ;
Imaoka, Tatsuhiko ;
Morioka, Takamitsu ;
Tani, Shusuke ;
Iizuka, Daisuke ;
Nishimura, Mayumi ;
Shimada, Yoshiya .
PLOS ONE, 2013, 8 (09)
[7]   Breast Cancer Scenario in a Regional Cancer Centre in Eastern India over Eight Years - Still a Major Public Health Problem [J].
Datta, Karabi ;
Choudhuri, Maitrayee ;
Guha, Subhas ;
Biswas, Jaydip .
ASIAN PACIFIC JOURNAL OF CANCER PREVENTION, 2012, 13 (03) :809-813
[8]   Targeting the DNA repair defect in BRCA mutant cells as a therapeutic strategy [J].
Farmer, H ;
McCabe, N ;
Lord, CJ ;
Tutt, ANJ ;
Johnson, DA ;
Richardson, TB ;
Santarosa, M ;
Dillon, KJ ;
Hickson, I ;
Knights, C ;
Martin, NMB ;
Jackson, SP ;
Smith, GCM ;
Ashworth, A .
NATURE, 2005, 434 (7035) :917-921
[9]   The expression and function of hsp30-like small heat shock protein genes in amphibians, birds, fish, and reptiles [J].
Heikkila, John J. .
COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY A-MOLECULAR & INTEGRATIVE PHYSIOLOGY, 2017, 203 :179-192
[10]   dmGWAS: dense module searching for genome-wide association studies in protein-protein interaction networks [J].
Jia, Peilin ;
Zheng, Siyuan ;
Long, Jirong ;
Zheng, Wei ;
Zhao, Zhongming .
BIOINFORMATICS, 2011, 27 (01) :95-102