OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from 'Omics Data Using Literature Retrieval and Data Mining

被引:6
作者
Steffen, Pascal [1 ,2 ,3 ]
Wu, Jemma [2 ]
Hariharan, Shubhang [1 ]
Voss, Hannah [3 ]
Raghunath, Vijay [4 ]
Molloy, Mark P. [1 ,2 ]
Schlueter, Hartrnut [3 ]
机构
[1] Univ Sydney, Kolling Inst, Bowel Canc & Biomarker Res, Sydney, NSW 2065, Australia
[2] Macquarie Univ, Dept Mol Sci, Sydney, NSW 2109, Australia
[3] Univ Med Ctr Hamburg Eppendorf UKE, Dept Clin Chem & Lab Med, Mass Spectrometr Prote Grp, D-20246 Hamburg, Germany
[4] Univ Sydney, Sydney Informat Hub, Sydney, NSW 2008, Australia
关键词
proteomics; genomics; data mining; literature retrieval; bioinformatics; mass spectrometry; EXPRESSION; IDENTIFICATION; METASTASIS; CARCINOMA;
D O I
10.3390/ijms21041374
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Proteomics and genomics discovery experiments generate increasingly large result tables, necessitating more researcher time to convert the biological data into new knowledge. Literature review is an important step in this process and can be tedious for large scale experiments. An informed and strategic decision about which biomolecule targets should be pursued for follow-up experiments thus remains a considerable challenge. To streamline and formalise this process of literature retrieval and analysis of discovery based 'omics data and as a decision-facilitating support tool for follow-up experiments we present OmixLitMiner, a package written in the computational language R. The tool automates the retrieval of literature from PubMed based on UniProt protein identifiers, gene names and their synonyms, combined with user defined contextual keyword search (i.e., gene ontology based). The search strategy is programmed to allow either strict or more lenient literature retrieval and the outputs are assigned to three categories describing how well characterized a regulated gene or protein is. The category helps to meet a decision, regarding which gene/protein follow-up experiments may be performed for gaining new knowledge and to exclude following already known biomarkers. We demonstrate the tool's usefulness in this retrospective study assessing three cancer proteomics and one cancer genomics publication. Using the tool, we were able to corroborate most of the decisions in these papers as well as detect additional biomolecule leads that may be valuable for future research.
引用
收藏
页数:11
相关论文
共 17 条
[1]  
Acland A, 2013, NUCLEIC ACIDS RES, V41, pD8, DOI [10.1093/nar/gkx1095, 10.1093/nar/gks1189, 10.1093/nar/gkq1172]
[2]  
Apweiler R, 2004, NUCLEIC ACIDS RES, V32, pD115, DOI [10.1093/nar/gkw1099, 10.1093/nar/gkh131]
[3]   MineBlast:: a literature presentation service supporting protein annotation by data mining of BLAST results [J].
Dieterich, G ;
Kärst, U ;
Wehland, J ;
Jänsch, L .
BIOINFORMATICS, 2005, 21 (16) :3450-3451
[4]   Differential Proteome Analysis of Human Neuroblastoma Xenograft Primary Tumors and Matched Spontaneous Distant Metastases [J].
Haenel, Lorena ;
Gosau, Tobias ;
Maar, Hanna ;
Valentiner, Ursula ;
Schumacher, Udo ;
Riecken, Kristoffer ;
Windhorst, Sabine ;
Hansen, Nils-Owe ;
Heikaus, Laura ;
Wurlitzer, Marcus ;
Nolte, Ingo ;
Schlueter, Hartmut ;
Lange, Tobias .
SCIENTIFIC REPORTS, 2018, 8
[5]  
Hoffman R., 2007, Fundamentals of Data Mining in Genomics and Proteomics, P251
[6]   Expression of Bax, p53, and p27/kip in patients with papillary thyroid carcinoma with or without cervical nodal metastasis [J].
Karlidag, Turgut ;
Cobanoglu, Bengu ;
Keles, Erol ;
Alpay, Hayrettin Cengiz ;
Ozercan, Ibrahim ;
Kaygusuz, Irfan ;
Yalcin, Sinasi ;
Sakallioglu, Oner .
AMERICAN JOURNAL OF OTOLARYNGOLOGY, 2007, 28 (01) :31-36
[7]   Identifying High-Priority Proteins Across the Human Diseasome Using Semantic Similarity [J].
Lau, Edward ;
Venkatraman, Vidya ;
Thomas, Cody T. ;
Wu, Joseph C. ;
Van Eyk, Jennifer E. ;
Lam, Maggie P. Y. .
JOURNAL OF PROTEOME RESEARCH, 2018, 17 (12) :4267-4278
[8]   Proteomics of thyroid tumours provides new insights into their molecular composition and changes associated with malignancy [J].
Martinez-Aguilar, Juan ;
Clifton-Bligh, Roderick ;
Molloy, Mark P. .
SCIENTIFIC REPORTS, 2016, 6
[9]   Successful identification of a predictive biomarker for lymph node metastasis in colorectal cancer using a proteomic approach [J].
Mori, Koichiro ;
Toiyama, Yuji ;
Otake, Kohei ;
Ide, Shozo ;
Imaoka, Hiroki ;
Okigami, Masato ;
Okugawa, Yoshinaga ;
Fujikawa, Hiroyuki ;
Saigusa, Susumu ;
Hiro, Junichiro ;
Kobayashi, Minako ;
Ohi, Masaki ;
Tanaka, Koji ;
Inoue, Yasuhiro ;
Kobayashi, Yuhko ;
Mohri, Yasuhiko ;
Kobayashi, Issei ;
Goel, Ajay ;
Kusunoki, Masato .
ONCOTARGET, 2017, 8 (63) :106935-106947
[10]   A Review of Recent Advancement in Integrating Omics Data with Literature Mining towards Biomedical Discoveries [J].
Raja, Kalpana ;
Patrick, Matthew ;
Gao, Yilin ;
Madu, Desmond ;
Yang, Yuyang ;
Tsoi, Lam C. .
INTERNATIONAL JOURNAL OF GENOMICS, 2017, 2017