Similarity indexes for scientometric research: A comparative analysis

被引:8
作者
Adnani, Hinde [1 ]
Cherraj, Mohammed [1 ]
Bouabid, Hamid [1 ]
机构
[1] Mohammed V Univ Rabat, Fac Sci, Ave Ibn Batouta,BP 1014 RP, Rabat, Morocco
关键词
Similarity index; Co-word analysis; Co-authorship analysis; Co-citation analysis; Bibliographic coupling; Proximity Index; AUTHOR COCITATION ANALYSIS; BIBLIOGRAPHIC DATA; PROXIMITY-MEASURES; SALTONS COSINE; COLLABORATION; RELATEDNESS; REQUIREMENTS; RESEMBLANCE; ASSOCIATION; RETRIEVAL;
D O I
10.22452/mjlis.vol25no3.3
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
A significant number of papers in the field of scientometrics addressed the comparisons of various similarity indexes. However, there is still a debate on the appropriateness of an index compared to others, beacause of the assessment differences reported in the literature. The objective of this paper is to make a comparative analysis of the five most used similarity indexes for the three scientometric analysis types: co-word, co-citation and co-authorship. A total of 388 papers addressing similarity indexes in scientometric analysis over three decades were retrieved from the Web of Science and examined; of which 49 were retained as the most relevant according to selective criteria. The approach consisted of building cross matrices for the five indexes (Jaccard, Dice-Sorensson, Salton, Pearson, and Association Strength) for the three types of scientometric analysis. For each of these analyses, a distinction is made between papers according to their theoretical or empirical results. Furthermore, papers are classified according to the mathematical formula of the similarity index being used (vector vs non vector). In the 49 relevant papers being selected, the comparative analysis showed that there is still no consensus on the appropriateness of an index for co-word and co-authorship analyses, while for co-citation, Salton is the widely preferred one. The Association Strength is the less covered and compared to other indexes for the three analysis types. An open source computer program was developed as a tool to facilitate empirical comparative studies of indexes. It allows generating normalized matrix of any chosen index for the two mathematical variants.
引用
收藏
页码:31 / 48
页数:18
相关论文
共 67 条
[1]  
Abdulhayoglu MA, 2013, PRO INT CONF SCI INF, P1151
[2]   Requirements for a cocitation similarity measure, with special reference to Pearson's correlation coefficient [J].
Ahlgren, P ;
Jarneving, B ;
Rousseau, R .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (06) :550-560
[3]   Rejoinder: In defense of formal methods [J].
Ahlgren, P ;
Jarneving, B ;
Rousseau, R .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (10) :936-936
[4]  
ALKHARASHI IA, 1994, J AM SOC INFORM SCI, V45, P548, DOI 10.1002/(SICI)1097-4571(199409)45:8<548::AID-ASI3>3.0.CO
[5]  
2-X
[6]  
[Anonymous], 1901, B SOC VAUDOISE SCI N
[7]  
[Anonymous], 1957, IBM internal report
[8]  
[Anonymous], 1983, Introduction to Modern Information Retrieval
[9]   Pearson's r and author cocitation analysis:: A commentary on the controversy [J].
Bensman, SJ .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (10) :935-935
[10]   Mapping the backbone of science [J].
Boyack, KW ;
Klavans, R ;
Börner, K .
SCIENTOMETRICS, 2005, 64 (03) :351-374