Differential gene expression in disease: a comparison between high-throughput studies and the literature

被引:62
作者
Rodriguez-Esteban, Raul [1 ]
Jiang, Xiaoyu [2 ]
机构
[1] Roche Innovat Ctr Basel, Roche Pharmaceut Res & Early Dev, Grenzacherstr 124, CH-4070 Basel, Switzerland
[2] Biogen, Cambridge, MA USA
关键词
CROHNS-DISEASE; IDENTIFICATION; MICROARRAY; KERATINOCYTES; PLATFORMS; EVOLUTION; DISCOVERY; GENOMICS; COLITIS; ARRAY;
D O I
10.1186/s12920-017-0293-y
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Differential gene expression is important to understand the biological differences between healthy and diseased states. Two common sources of differential gene expression data are microarray studies and the biomedical literature. Methods: With the aid of text mining and gene expression analysis we have examined the comparative properties of these two sources of differential gene expression data. Results: The literature shows a preference for reporting genes associated to higher fold changes in microarray data, rather than genes that are simply significantly differentially expressed. Thus, the resemblance between the literature and microarray data increases when the fold-change threshold for microarray data is increased. Moreover, the literature has a reporting preference for differentially expressed genes that (1) are overexpressed rather than underexpressed; (2) are overexpressed in multiple diseases; and (3) are popular in the biomedical literature at large. Additionally, the degree to which diseases are similar depends on whether microarray data or the literature is used to compare them. Finally, vaguely-qualified reports of differential expression magnitudes in the literature have only small correlation with microarray fold-change data. Conclusions: Reporting biases of differential gene expression in the literature can be affecting our appreciation of disease biology and of the degree of similarity that actually exists between different diseases.
引用
收藏
页数:10
相关论文
共 38 条
[1]  
[Anonymous], 2011, Proceedings of the BioNLP Shared Task 2011 Workshop
[2]  
[Anonymous], P WORKSH CURR TRENDS
[3]   Standardizing global gene expression analysis between laboratories and across platforms [J].
Bammler, T ;
Beyer, RP ;
Bhattacharya, S ;
Boorman, GA ;
Boyles, A ;
Bradford, BU ;
Bumgarner, RE ;
Bushel, PR ;
Chaturvedi, K ;
Choi, D ;
Cunningham, ML ;
Dengs, S ;
Dressman, HK ;
Fannin, RD ;
Farun, FM ;
Freedman, JH ;
Fry, RC ;
Harper, A ;
Humble, MC ;
Hurban, P ;
Kavanagh, TJ ;
Kaufmann, WK ;
Kerr, KF ;
Jing, L ;
Lapidus, JA ;
Lasarev, MR ;
Li, J ;
Li, YJ ;
Lobenhofer, EK ;
Lu, X ;
Malek, RL ;
Milton, S ;
Nagalla, SR ;
O'Malley, JP ;
Palmer, VS ;
Pattee, P ;
Paules, RS ;
Perou, CM ;
Phillips, K ;
Qin, LX ;
Qiu, Y ;
Quigley, SD ;
Rodland, M ;
Rusyn, I ;
Samson, LD ;
Schwartz, DA ;
Shi, Y ;
Shin, JL ;
Sieber, SO ;
Slifer, S .
NATURE METHODS, 2005, 2 (05) :351-356
[4]  
Barker C, 2006, ENCY LANGUAGES LINGU
[5]  
Beisswanger E, 2008, STUD HEALTH TECHNOL, V136, P9
[6]   Visualizing the drug target landscape [J].
Campbell, Stephen J. ;
Gaulton, Anna ;
Marshall, Jason ;
Bichko, Dmitri ;
Martin, Sid ;
Brouwer, Cory ;
Harland, Lee .
DRUG DISCOVERY TODAY, 2010, 15 (1-2) :3-15
[7]   IDENTIFICATION OF A GROUP OF PROTEINS THAT ARE STRONGLY UP-REGULATED IN TOTAL EPIDERMAL-KERATINOCYTES FROM PSORIATIC SKIN [J].
CELIS, JE ;
CRUGER, D ;
KIIL, J ;
LAURIDSEN, JB ;
RATZ, G ;
BASSE, B ;
CELIS, A .
FEBS LETTERS, 1990, 262 (02) :159-164
[8]  
Chang Jia-Fu, 2013, J Pathol Inform, V4, P20, DOI 10.4103/2153-3539.115880
[9]   A recipe for high impact [J].
Cokol, Murat ;
Rodriguez-Esteban, Raul ;
Rzhetsky, Andrey .
GENOME BIOLOGY, 2007, 8 (05)
[10]   Visualizing evolution and impact of biomedical fields [J].
Cokol, Murat ;
Rodriguez-Esteban, Raul .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (06) :1050-1052