Inferring disease-associated long non-coding RNAs using genome-wide tissue expression profiles

被引:20
作者
Pan, Xiaoyong [1 ,2 ]
Jensen, Lars Juhl [2 ]
Gorodkin, Jan [1 ]
机构
[1] Univ Copenhagen, Ctr Noncoding RNA Technol & Hlth, Dept Vet & Anim Sci, DK-1870 Frederiksberg C, Denmark
[2] Univ Copenhagen, Novo Nordisk Fdn, Ctr Prot Res, Dis Syst Biol Program, DK-2200 Copenhagen N, Denmark
关键词
GENE-EXPRESSION; HUMAN LNCRNA; EVOLUTION; PRIORITIZATION; INFERENCE; PATTERNS; DATABASE;
D O I
10.1093/bioinformatics/bty859
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Long non-coding RNAs (lncRNAs) are important regulators in wide variety of biological processes, which are linked to many diseases. Compared to protein-coding genes (PCGs), the association between diseases and lncRNAs is still not well studied. Thus, inferring disease-associated lncRNAs on a genome-wide scale has become imperative. Results In this study, we propose a machine learning-based method, DislncRF, which infers disease-associated lncRNAs on a genome-wide scale based on tissue expression profiles. DislncRF uses random forest models trained on expression profiles of known disease-associated PCGs across human tissues to extract general patterns between expression profiles and diseases. These models are then applied to score associations between lncRNAs and diseases. DislncRF was benchmarked against a gold standard dataset and compared to other methods. The results show that DislncRF yields promising performance and outperforms the existing methods. The utility of DislncRF is further substantiated on two diseases in which we find that top scoring candidates are supported by literature or independent datasets. Availability and implementation https://github.com/xypan1232/DislncRF Supplementary information Supplementary data are available at Bioinformatics online.
引用
收藏
页码:1494 / 1502
页数:9
相关论文
共 51 条
[1]   GeneTIER: prioritization of candidate disease genes using tissue-specific gene expression profiles [J].
Antanaviciute, Agne ;
Daly, Catherine ;
Crinnion, Laura A. ;
Markham, Alexander F. ;
Watson, Christopher M. ;
Bonthron, David T. ;
Carr, Ian M. .
BIOINFORMATICS, 2015, 31 (16) :2728-2735
[2]   The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans [J].
Ardlie, Kristin G. ;
DeLuca, David S. ;
Segre, Ayellet V. ;
Sullivan, Timothy J. ;
Young, Taylor R. ;
Gelfand, Ellen T. ;
Trowbridge, Casandra A. ;
Maller, Julian B. ;
Tukiainen, Taru ;
Lek, Monkol ;
Ward, Lucas D. ;
Kheradpour, Pouya ;
Iriarte, Benjamin ;
Meng, Yan ;
Palmer, Cameron D. ;
Esko, Tonu ;
Winckler, Wendy ;
Hirschhorn, Joel N. ;
Kellis, Manolis ;
MacArthur, Daniel G. ;
Getz, Gad ;
Shabalin, Andrey A. ;
Li, Gen ;
Zhou, Yi-Hui ;
Nobel, Andrew B. ;
Rusyn, Ivan ;
Wright, Fred A. ;
Lappalainen, Tuuli ;
Ferreira, Pedro G. ;
Ongen, Halit ;
Rivas, Manuel A. ;
Battle, Alexis ;
Mostafavi, Sara ;
Monlong, Jean ;
Sammeth, Michael ;
Mele, Marta ;
Reverter, Ferran ;
Goldmann, Jakob M. ;
Koller, Daphne ;
Guigo, Roderic ;
McCarthy, Mark I. ;
Dermitzakis, Emmanouil T. ;
Gamazon, Eric R. ;
Im, Hae Kyung ;
Konkashbaev, Anuar ;
Nicolae, Dan L. ;
Cox, Nancy J. ;
Flutre, Timothee ;
Wen, Xiaoquan ;
Stephens, Matthew .
SCIENCE, 2015, 348 (6235) :648-660
[3]   COMPARTMENTS: unification and visualization of protein subcellular localization evidence [J].
Binder, Janos X. ;
Pletscher-Frankild, Sune ;
Tsafou, Kalliopi ;
Stolte, Christian ;
O'Donoghue, Sean I. ;
Schneider, Reinhard ;
Jensen, Lars Juhl .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2014,
[4]   Tissue-specific mutation accumulation in human adult stem cells during life [J].
Blokzijl, Francis ;
de Ligt, Joep ;
Jager, Myrthe ;
Sasselli, Valentina ;
Roerink, Sophie ;
Sasaki, Nobuo ;
Huch, Meritxell ;
Boymans, Sander ;
Kuijk, Ewart ;
Prins, Pjotr ;
Nijman, Isaac J. ;
Martincorena, Inigo ;
Mokry, Michal ;
Wiegerinck, Caroline L. ;
Middendorp, Sabine ;
Sato, Toshiro ;
Schwank, Gerald ;
Nieuwenhuis, Edward E. S. ;
Verstegen, Monique M. A. ;
van der Laan, Luc J. W. ;
de Jonge, Jeroen ;
IJzermans, Jan N. M. ;
Vries, Robert G. ;
van de Wetering, Marc ;
Stratton, Michael R. ;
Clevers, Hans ;
Cuppen, Edwin ;
van Boxtel, Ruben .
NATURE, 2016, 538 (7624) :260-+
[5]   Concordance of gene expression in human protein complexes reveals tissue specificity and pathology [J].
Bornigen, Daniela ;
Pers, Tune H. ;
Thorrez, Lieven ;
Huttenhower, Curtis ;
Moreau, Yves ;
Brunak, Soren .
NUCLEIC ACIDS RESEARCH, 2013, 41 (18) :e171
[6]   The evolution of gene expression levels in mammalian organs [J].
Brawand, David ;
Soumillon, Magali ;
Necsulea, Anamaria ;
Julien, Philippe ;
Csardi, Gabor ;
Harrigan, Patrick ;
Weier, Manuela ;
Liechti, Angelica ;
Aximu-Petri, Ayinuer ;
Kircher, Martin ;
Albert, Frank W. ;
Zeller, Ulrich ;
Khaitovich, Philipp ;
Gruetzner, Frank ;
Bergmann, Sven ;
Nielsen, Rasmus ;
Paeaebo, Svante ;
Kaessmann, Henrik .
NATURE, 2011, 478 (7369) :343-+
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   LncRNADisease: a database for long-non-coding RNA-associated diseases [J].
Chen, Geng ;
Wang, Ziyun ;
Wang, Dongqing ;
Qiu, Chengxiang ;
Liu, Mingxi ;
Chen, Xing ;
Zhang, Qipeng ;
Yan, Guiying ;
Cui, Qinghua .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D983-D986
[9]   Long non-coding RNAs and complex diseases: from experimental results to computational models [J].
Chen, Xing ;
Yan, Chenggang Clarence ;
Zhang, Xu ;
You, Zhu-Hong .
BRIEFINGS IN BIOINFORMATICS, 2017, 18 (04) :558-576
[10]   FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model [J].
Chen, Xing ;
Huang, Yu-An ;
Wang, Xue-Song ;
You, Zhu-Hong ;
Chan, Keith C. C. .
ONCOTARGET, 2016, 7 (29) :45948-45958