Homophily and missing links in citation networks

被引:44
作者
Ciotti, Valerio [1 ,2 ]
Bonaventura, Moreno [1 ,2 ]
Nicosia, Vincenzo [2 ]
Panzarasa, Pietro [1 ]
Latora, Vito [2 ,3 ,4 ]
机构
[1] Queen Mary Univ London, Sch Business & Management, Mile End Rd, London E1 4NS, England
[2] Queen Mary Univ London, Sch Math Sci, Mile End Rd, London E1 4NS, England
[3] Univ Catania, Dipartimento Fis & Astron, Via S Sofia, I-95123 Catania, Italy
[4] Ist Nazl Fis Nucl, Sez Catania, Via S Sofia, I-95123 Catania, Italy
基金
英国工程与自然科学研究理事会;
关键词
citation networks; homophily; link prediction; bibliometric techniques; SUPREME-COURT; SCIENCE;
D O I
10.1140/epjds/s13688-016-0068-2
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Citation networks have been widely used to study the evolution of science through the lenses of the underlying patterns of knowledge flows among academic papers, authors, research sub-fields, and scientific journals. Here we focus on citation networks to cast light on the salience of homophily, namely the principle that similarity breeds connection, for knowledge transfer between papers. To this end, we assess the degree to which citations tend to occur between papers that are concerned with seemingly related topics or research problems. Drawing on a large data set of articles published in the journals of the American Physical Society between 1893 and 2009, we propose a novel method for measuring the similarity between articles through the statistical validation of the overlap between their bibliographies. Results suggest that the probability of a citation made by one article to another is indeed an increasing function of the similarity between the two articles. Our study also enables us to uncover missing citations between pairs of highly related articles, and may thus help identify barriers to effective knowledge flows. By quantifying the proportion of missing citations, we conduct a comparative assessment of distinct journals and research sub-fields in terms of their ability to facilitate or impede the dissemination of knowledge. Findings indicate that Electromagnetism and Interdisciplinary Physics are the two sub-fields in physics with the smallest percentage of missing citations. Moreover, knowledge transfer seems to be more effectively facilitated by journals of wide visibility, such as Physical Review Letters, than by lower-impact ones. Our study has important implications for authors, editors and reviewers of scientific journals, as well as public preprint repositories, as it provides a procedure for recommending relevant yet missing references and properly integrating bibliographies of papers.
引用
收藏
页数:14
相关论文
共 27 条
[1]  
[Anonymous], FREEDOM CONTROL MODE
[2]  
[Anonymous], 2010, NETWORKS INTRO, DOI DOI 10.1093/ACPROF:OSO/9780199206650.001.0001
[3]  
[Anonymous], 1 C EM ANT CEAS MOUN
[4]   Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks [J].
Aral, Sinan ;
Muchnik, Lev ;
Sundararajan, Arun .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (51) :21544-21549
[5]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   Combining mapping and citation network analysis for a better understanding of the scientific development: The case of the absorptive capacity field [J].
Calero-Medina, Clara ;
Noyons, Ed C. M. .
JOURNAL OF INFORMETRICS, 2008, 2 (04) :272-279
[8]   The incidence and role of negative citations in science [J].
Catalini, Christian ;
Lacetera, Nicola ;
Oettl, Alexander .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (45) :13823-13826
[9]  
Clough J.R., 2014, ARXIV14081274
[10]   Transitive reduction of citation networks [J].
Clough, James R. ;
Gollings, Jamie ;
Loach, Tamar V. ;
Evans, Tim S. .
JOURNAL OF COMPLEX NETWORKS, 2015, 3 (02) :189-203