Multi-source data fusion study in scientometrics

被引:4
作者
Hai-Yun Xu
Zeng-Hui Yue
Chao Wang
Kun Dong
Hong-Shen Pang
Zhengbiao Han
机构
[1] Chengdu Library of Chinese Academy of Sciences,School of Medical Information Engineering
[2] Jining Medical University,Guangzhou Institutes of Biomedicine and Health
[3] Chinese Academy of Sciences,undefined
[4] Nanjing Agricultural University,undefined
来源
Scientometrics | 2017年 / 111卷
关键词
Data fusion; Relations fusion; Multi-mode analysis; Multi-source data; Scientometrics;
D O I
暂无
中图分类号
学科分类号
摘要
This paper provides an introduction to multi-source data fusion (MSDF) and comprehensively overviews the ingredients and challenges of MSDF in scientometrics. As compared to the MSDF methods in the sensor and other fields, and considering the features of scientometrics, in this paper an application model and procedure of MSDF in scientometrics are proposed. The model and procedure can be divided into three parts: data type integration, fusion of data relations, and ensemble clustering. Furthermore, the fusion of data relations can be divided into cross-integration of multi-mode data and matrix fusion of multi-relational data. To obtain a clearer and deeper analysis of the MSDF model, this paper further focuses on the application of MSDF in topic identification based on text analysis of scientific literatures. This paper also discusses the application of MSDF for the exploration of scientific literatures. Finally, the most suitable MSDF methods for different situations are discussed.
引用
收藏
页码:773 / 792
页数:19
相关论文
共 128 条
  • [1] Ahlgren P(2009)Document–document similarity approaches and science mapping: Experimental comparison of five approaches Journal of Informetrics 3 49-63
  • [2] Colliander C(2004)Using literature and data to learn Bayesian networks as clinical models of ovarian tumors Artificial Intelligence in Medicine 30 257-281
  • [3] Antal P(2006)Learning causal bayesian networks from literature data Periodica Polytechnica Electrical Engineering 50 201-11
  • [4] Fannes G(2007)Modularity and community detection in bipartite networks Physical Review E 76 1-70
  • [5] Timmerman D(2009)Finding groups in data: Cluster analysis with ants Applied Soft Computing 9 61-221
  • [6] Moreau Y(1991)Mapping of science by combined co-citation and word analysis I. Structural aspects Journal of the American Society for information science 42 233-279
  • [7] De Moor B(2006)Link-based similarity measures for the classification of Web documents Journal of the American Society for Information Science and Technology 57 208-21
  • [8] Antal P(2008)Combining mapping and citation network analysis for a better understanding of the scientific development: The case of the absorptive capacity field Journal of Informetrics 2 272-1104
  • [9] Millinghoffer A(2007)Module identification in bipartite and directed networks Physical Review E 76 036102-45
  • [10] Barber MJ(2005)Research and development of multi-sensor information fusion technology Bulletin of National Science Foundation of China 19 17-19