A novel approach for classification and clustering of biomedical citations

被引:0
作者
Parthasarathy, G. [1 ]
Tomar, D. C. [2 ]
机构
[1] Sathyabama Univ, Dept Comp Sci, Madras, Tamil Nadu, India
[2] Jerusalem Coll Engn, Dept Informat Technol, Madras, Tamil Nadu, India
来源
BIOMEDICAL RESEARCH-INDIA | 2016年 / 27卷
关键词
Citation crawler; Citation mark-up language; Classification; Hierarchical clustering; Citation database;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Citation refers the information of a published paper with its author and publication details. It is used by various authors for referring the research works published in other research articles. Citations play a crucial role in several scientific publications digital libraries (DLs), like Cite Seer, arXiv e-Print, DBLP, and Google Scholar. Users usually use citations to seek out data of interest in DLs, while researchers relay on citations to see the impact of a specific article. Citation mining is the area where in the citation databases are mined for performing various mining tasks such as classification and clustering to retrieve citations efficiently and accurately. Citations have additionally been used as auxiliary support in information retrieval tasks. Citation classification is the process of classifying the citation data by means of topic, author, paper name, and journal category. Clustering involves the categorization of papers based on content similarity or functional similarity. At present the size of databases in the web is massive hence the quantity of records in a dataset will vary from some thousands to thousands of millions. Authors or scholars are spending their precious time in searching the papers especially in bio medical field. So to provide more accurate retrieval of biomedical citations we have proposed a citation mining system with a combined approach of clustering. Our experiments conducted with the citations from the web database shows an effective retrieval of biomedical citations.
引用
收藏
页码:S22 / S30
页数:9
相关论文
共 26 条
  • [1] Calvillo EA, 2013, INT CONF ELECTR COMM, P78, DOI 10.1109/CONIELECOMP.2013.6525763
  • [2] Document clustering of scientific texts using citation contexts
    Aljaber, Bader
    Stokes, Nicola
    Bailey, James
    Pei, Jian
    [J]. INFORMATION RETRIEVAL, 2010, 13 (02): : 101 - 131
  • [3] Ambeth KDV, 2015, BIOMED PHARMACOL J, V8, P435
  • [4] Some measures for comparing citation databases
    Bar-Ilan, Judit
    Levene, Mark
    Lin, Ayelet
    [J]. JOURNAL OF INFORMETRICS, 2007, 1 (01) : 26 - 34
  • [5] Bolelli L, 2006, LECT NOTES ARTIF INT, V4213, P30
  • [6] Distance measures for dynamic citation networks
    Bommarito, Michael J., II
    Katz, Daniel Martin
    Zelner, Jonathan L.
    Fowler, James H.
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2010, 389 (19) : 4201 - 4208
  • [7] Incidence of large for gestational age infants when gestational diabetes mellitus is diagnosed early and late in pregnancy
    Boriboonhirunsarn, Dittakarn
    Kasempipatchai, Vorama
    [J]. JOURNAL OF OBSTETRICS AND GYNAECOLOGY RESEARCH, 2016, 42 (03) : 273 - 278
  • [8] Chen Ding, 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028), P105, DOI 10.1109/ICSMC.1999.825216
  • [9] BRS-compactness in networks: Theoretical considerations related to cohesion in citation graphs, collaboration networks and the Internet
    Egghe, L
    Rousseau, R
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2003, 37 (7-8) : 879 - 899
  • [10] FUJITA K, 2012, INT C TECHN MAN EM T, P267