Viziometrics: Analyzing Visual Information in the Scientific Literature

被引:36
作者
Lee, Po-Shen [1 ]
West, Jevin D. [2 ]
Howe, Bill [2 ]
机构
[1] Univ Washington, Dept Elect Engn, Seattle, WA 98105 USA
[2] Univ Washington, Informat Sch, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Viziometrics; scholarly communication; meta research; figure retrieval; information retrieval; bibliometrics; scientometrics; GRAPHS; JOURNALS; IMPACT; TABLES; BRAIN; TEXT;
D O I
10.1109/TBDATA.2017.2689038
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scientific results are communicated visually in the literature through diagrams, visualizations, and photographs. These information-dense objects have been largely ignored in bibliometrics and scientometrics studies when compared to citations and text. In this paper, we use techniques from computer vision and machine learning to classify more than 8 million figures from PubMed into five figure types and study the resulting patterns of visual information as they relate to scholarly impact. We find that the distribution of figures and figure types in the literature has remained relatively constant over time, but can vary widely across field and topic. Remarkably, we find a significant correlation between scientific impact and the use of visual information, where higher impact papers tend to include more diagrams, and to a lesser extent more plots. To explore these results and other ways of extracting this visual information, we have built a visual browser to illustrate the concept and explore design alternatives for supporting viziometric analysis and organizing visual information. We use these results to articulate a new research agenda-viziometrics-to study the organization and presentation of visual information in the scientific literature.
引用
收藏
页码:117 / 129
页数:13
相关论文
共 33 条
[1]   Structured literature image finder: Parsing text and figures in biomedical literature [J].
Ahmed, Amr ;
Arnold, Andrew ;
Coelho, Luis Pedro ;
Kangas, Joshua ;
Sheikh, Abdul-Saboor ;
Xing, Eric ;
Cohen, William ;
Murphy, Robert F. .
JOURNAL OF WEB SEMANTICS, 2010, 8 (2-3) :151-154
[2]  
Ahmed A, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P39
[3]   The effects of self-explaining when learning with text or diagrams [J].
Ainsworth, S ;
Loizou, AT .
COGNITIVE SCIENCE, 2003, 27 (04) :669-681
[4]  
[Anonymous], 199966 STANDF INFOLA
[5]  
[Anonymous], 2011, INT C ART INT STAT
[6]  
Bhatia S., 2010, P 19 INT C WORLD WID, P1061, DOI [10.1145/1772690.1772804, DOI 10.1145/1772690.1772804]
[7]  
Browuer William, 2008, Joint Conference on Digital Libraries (JCDL 2008), P276, DOI 10.1145/1378889.1378936
[8]   Solo Versus Collaborative Writing: Discrepancies in the Use of Tables and Graphs in Academic Articles [J].
Cabanac, Guillaume ;
Hubert, Gilles ;
Hartley, James .
JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2014, 65 (04) :812-820
[9]   DiagramFlyer: A Search Engine for Data-Driven Diagrams [J].
Chen, Zhe ;
Cafarella, Michael ;
Adar, Eytan .
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, :183-186
[10]  
Choudhury S. R., 2013, AUTOMATED DATA EXTRA