Using mutual information as a cocitation similarity measure

被引:0
|
作者
Lukun Zheng
机构
[1] Western Kentucky University,Department of Mathematics
来源
Scientometrics | 2019年 / 119卷
关键词
Author co-citation analysis; Similarity measures; Mutual information;
D O I
暂无
中图分类号
学科分类号
摘要
The debate regarding to which similarity measure can be used in co-citation analysis lasted for many years. The mostly debated measure is Pearson’s correlation coefficient r. It has been used as similarity measure in literature since the beginning of the technique in the 1980s. However, some researchers criticized using Pearson’s r as a similarity measure because it does not fully satisfy the mathematical conditions of a good similarity metric and (or) because it doesn’t meet some natural requirements a similarity measure should satisfy. Alternative similarity measures like cosine measure and chi square measure were also proposed and studied, which resulted in more controversies and debates about which similarity measure to use in co-citation analysis. In this article, we put forth the hypothesis that the researchers with high mutual information are closely related to each other and that the mutual information can be used as a similarity measure in author co-citation analysis. Given two researchers, the mutual information between them can be calculated based on their publications and their co-citation frequencies. A mutual information proximity matrix is then constructed. This proximity matrix meet the two requirements formulated by Ahlgren et al. (J Am Soc Inf Sci Technol 54(6):550–560, 2003). We conduct several experimental studies for the validation of our hypothesis and the results using mutual information are compared to the results using other similarity measures.
引用
收藏
页码:1695 / 1713
页数:18
相关论文
共 50 条
  • [41] Mutual information as a measure of multivariate association: analytical properties and statistical estimation
    Blumentritt, Thomas
    Schmid, Friedrich
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2012, 82 (09) : 1257 - 1274
  • [42] Visualization of intellectual structure in information retrieval: Author cocitation analysis
    Ding, Y
    INTERNATIONAL FORUM ON INFORMATION AND DOCUMENTATION, 1998, 23 (01): : 25 - 36
  • [43] Image Registration Using a Combination of Mutual Information and Spatial Information
    Anthony, Amankwah
    Lofffeld, Otmar
    2006 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, 2006, : 4012 - 4016
  • [44] Multimodal image registration using IECC as the similarity measure
    Itou, Takeshi
    Shinohara, Hiroyuki
    Sakaguchi, Kazuya
    Hashimoto, Takeyuki
    Yokoi, Takashi
    Souma, Tsutomu
    MEDICAL PHYSICS, 2011, 38 (02) : 1103 - 1115
  • [45] Survey on the Estimation of Mutual Information Methods as a Measure of Dependency Versus Correlation Analysis
    Gencaga, D.
    Malakar, N. K.
    Lary, D. J.
    BAYESIAN INFERENCE AND MAXIMUM ENTROPY METHODS IN SCIENCE AND ENGINEERING, MAXENT 2013, 2014, 1636 : 80 - 87
  • [46] Evaluation of gene-expression clustering via mutual information distance measure
    Ido Priness
    Oded Maimon
    Irad Ben-Gal
    BMC Bioinformatics, 8
  • [47] Multimodality image registration by maximization of quantitative-qualitative measure of mutual information
    Luan, Hongxia
    Qi, Feihu
    Xue, Zhong
    Chen, Liya
    Shen, Dinggang
    PATTERN RECOGNITION, 2008, 41 (01) : 285 - 298
  • [48] UNSUPERVISED FEATURE EXTRACTION BASED ON A MUTUAL INFORMATION MEASURE FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Hossain, Md Ali
    Pickering, Mark
    Jia, Xiuping
    2011 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2011, : 1720 - 1723
  • [49] A multilocus linkage disequilibrium measure based on mutual information theory and its applications
    Zhang, Lei
    Liu, Jianfeng
    Deng, Hong-Wen
    GENETICA, 2009, 137 (03) : 355 - 364
  • [50] A multilocus linkage disequilibrium measure based on mutual information theory and its applications
    Lei Zhang
    Jianfeng Liu
    Hong-Wen Deng
    Genetica, 2009, 137 : 355 - 364