A collaborative filtering-based approach to biomedical knowledge discovery

被引:18
作者
Lever, Jake [1 ,2 ]
Gakkhar, Sitanshu [1 ]
Gottlieb, Michael [1 ]
Rashnavadi, Tahereh [1 ]
Lin, Santina [1 ]
Siu, Celia [1 ]
Smith, Maia [1 ]
Jones, Martin R. [1 ]
Krzywinski, Martin [1 ]
Jones, Steven J. M. [1 ,2 ,3 ]
机构
[1] Canadas Michael Smith Genome Sci Ctr, Vancouver, BC V5Z 4S6, Canada
[2] Univ British Columbia, Vancouver, BC V6T 1Z1, Canada
[3] Simon Fraser Univ, Burnaby, BC V5A 1S6, Canada
关键词
FISH-OIL;
D O I
10.1093/bioinformatics/btx613
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The increase in publication rates makes it challenging for an individual researcher to stay abreast of all relevant research in order to find novel research hypotheses. Literature-based discovery methods make use of knowledge graphs built using text mining and can infer future associations between biomedical concepts that will likely occur in new publications. These predictions are a valuable resource for researchers to explore a research topic. Current methods for prediction are based on the local structure of the knowledge graph. A method that uses global knowledge from across the knowledge graph needs to be developed in order to make knowledge discovery a frequently used tool by researchers. Results: We propose an approach based on the singular value decomposition (SVD) that is able to combine data from across the knowledge graph through a reduced representation. Using cooccurrence data extracted from published literature, we show that SVD performs better than the leading methods for scoring discoveries. We also show the diminishing predictive power of knowledge discovery as we compare our predictions with real associations that appear further into the future. Finally, we examine the strengths and weaknesses of the SVD approach against another well-performing system using several predicted associations.
引用
收藏
页码:652 / 659
页数:8
相关论文
共 32 条
[1]   Text mining and its potential applications in systems biology [J].
Ananiadou, Sophia ;
Kell, Douglas B. ;
Tsujii, Jun-ichi .
TRENDS IN BIOTECHNOLOGY, 2006, 24 (12) :571-579
[2]  
[Anonymous], 2007, Numerical Recipes: The Art of Scientific Computing
[3]  
[Anonymous], 2007, P KDD CUP WORKSH NEW
[4]  
Bruskiewich R, 2016, BIORXIV
[5]   FLAP pharmacological blockade modulates metabolism of endogenous tau in vivo [J].
Chu, J. ;
Lauretti, E. ;
Di Meco, A. ;
Pratico, D. .
TRANSLATIONAL PSYCHIATRY, 2013, 3 :e333-e333
[6]   The 385+million word Corpus of Contemporary American English (1990-2008+) Design, architecture, and linguistic insights [J].
Davies, Mark .
INTERNATIONAL JOURNAL OF CORPUS LINGUISTICS, 2009, 14 (02) :159-190
[7]   FISH-OIL DIETARY SUPPLEMENTATION IN PATIENTS WITH RAYNAUD PHENOMENON - A DOUBLE-BLIND, CONTROLLED, PROSPECTIVE-STUDY [J].
DIGIACOMO, RA ;
KREMER, JM ;
SHAH, DM .
AMERICAN JOURNAL OF MEDICINE, 1989, 86 (02) :158-164
[8]   THE APPROXIMATION OF ONE MATRIX BY ANOTHER OF LOWER RANK [J].
Eckart, Carl ;
Young, Gale .
PSYCHOMETRIKA, 1936, 1 (03) :211-218
[9]   CoPub: a literature-based keyword enrichment tool for microarray data analysis [J].
Frijters, Raoul ;
Heupers, Bart ;
van Beek, Pieter ;
Bouwhuis, Maurice ;
van Schaik, Rene ;
de Vlieg, Jacob ;
Polman, Jan ;
Alkema, Wynand .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W406-W410
[10]   Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters [J].
Funk, Christopher ;
Baumgartner, William, Jr. ;
Garcia, Benjamin ;
Roeder, Christophe ;
Bada, Michael ;
Cohen, K. Bretonnel ;
Hunter, Lawrence E. ;
Verspoor, Karin .
BMC BIOINFORMATICS, 2014, 15