Automated cognome construction and semi-automated hypothesis generation

Cited by: 21
Authors
Voytek, Jessica B. [2]
Voytek, Bradley [1,3]
Affiliations
[1] Univ Calif San Francisco, Dept Neurol, San Francisco, CA 94158 USA
[2] Univ Calif Berkeley, Sch Informat, Berkeley, CA 94720 USA
[3] Univ Calif Berkeley, Helen Wills Neurosci Inst, Berkeley, CA 94720 USA
Funding
National Institutes of Health (USA)
Keywords
Literature mining; Hypothesis generation; Semi-automated science; Cognome; Connectome; PUBLICATION BIAS; PREFRONTAL CORTEX; NEUROSCIENCE;
DOI
10.1016/j.jneumeth.2012.04.019
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Modern neuroscientific research stands on the shoulders of countless giants. PubMed alone contains more than 21 million peer-reviewed articles, with 40,000-50,000 more published every month. Understanding the human brain, cognition, and disease will require integrating facts from dozens of scientific fields spread amongst millions of studies locked away in static documents, making any such integration daunting at best. The future of scientific progress will be aided by bridging the gap between the millions of published research articles and modern databases such as the Allen Brain Atlas (ABA). To that end, we have analyzed the text of over 3.5 million scientific abstracts to find associations between neuroscientific concepts. From the literature alone, we show that we can blindly and algorithmically extract a "cognome": relationships between brain structure, function, and disease. We demonstrate the potential of data-mining and cross-platform data-integration with the ABA by introducing two methods for semi-automated hypothesis generation. By analyzing statistical "holes" and discrepancies in the literature, we can find understudied or overlooked research paths. That is, we have added a layer of semi-automation to a part of the scientific process itself. This is an important step toward fundamentally incorporating data-mining algorithms into the scientific method in a manner that is generalizable to any scientific or medical field. © 2012 Elsevier B.V. All rights reserved.
Pages: 92-100
Page count: 9