Identification and classification of research data cited in scholarly papers

被引:0
|
作者
Tsunokake M. [1 ]
Matsubara S. [2 ]
机构
[1] Graduate School of Informatics, Nagoya University Furo-cho, Chikusa-ku, Nagoya
[2] Information and Communications, Nagoya University Furo-cho, Chikusa-ku, Nagoya
关键词
Distributed representations; Information extraction; Open science; Repository; Research data; URL;
D O I
10.1541/ieejeiss.140.1357
中图分类号
学科分类号
摘要
This paper proposes a method for identifying and classifying the research data cited in scholarly papers, aiming at automatic generation of metadata stored in data repository. This study focuses on URL citations in the scholarly papers. That is, the targets are to identify the URLs referring to the research data and to classify them into tool and data. The method is realized as a multi-class classification (tool/data/others). The method acquires the distributed representations of the URLs from the context around them, and uses them as the input feature. There exists an advantage in that the meanings of URLs can be given based on their surrounding words. This study adopts an approach of computing the meaning of the entire URL from those of the components of the URL. In order to evaluate the performance of the proposed method, experiments on URL classification were conducted. The scholarly papers included in the proceedings of the international conference were used as experimental data. Experimental results have shown the effectiveness of the proposed method for identifying and classifying URLs referring to research data. © 2020 The Institute of Electrical Engineers of Japan.
引用
收藏
页码:1357 / 1364
页数:7
相关论文
共 50 条
  • [1] Identification of Scholarly Papers and Authors
    Baba, Kensuke
    Mori, Masao
    Ito, Eisuke
    NETWORKED DIGITAL TECHNOLOGIES, 2011, 136 : 195 - 202
  • [2] Using Citation Contexts in Scholarly Papers for Research Data Search
    Tsunokake, Masaya
    Matsubara, Shigeki
    16TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2021), 2021,
  • [3] Extracting Information about Research Resources from Scholarly Papers
    Saji, Ayahito
    Matsubara, Shigeki
    FROM BORN-PHYSICAL TO BORN-VIRTUAL: AUGMENTING INTELLIGENCE IN DIGITAL LIBRARIES, ICADL 2022, 2022, 13636 : 440 - 448
  • [4] Expanding the data Ark: an attempt to make the data from highly cited social science papers publicly available
    Dulitzki, Coby
    Crane, Steven Michael
    Hardwicke, Tom E.
    Ioannidis, John P. A.
    ROYAL SOCIETY OPEN SCIENCE, 2024, 11 (05):
  • [5] Simultaneous Registration of Scholarly Papers to a Researcher Database and a Repository
    Baba, Kensuke
    Mori, Masao
    2012 INTERNATIONAL CONFERENCE ON E-LEARNING AND E-TECHNOLOGIES IN EDUCATION (ICEEE), 2012, : 200 - 203
  • [6] Research Data in Scholarly Practices: Observations of an Interdisciplinary Horizon2020 Project
    Dutoit, Madeleine
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES (TPDL 2017), 2017, 10450 : 623 - 627
  • [7] Journal research data sharing policies: a study of highly-cited journals in neuroscience, physics, and operations research
    Antti M. Rousi
    Mikael Laakso
    Scientometrics, 2020, 124 : 131 - 152
  • [8] Journal research data sharing policies: a study of highly-cited journals in neuroscience, physics, and operations research
    Rousi, Antti M.
    Laakso, Mikael
    SCIENTOMETRICS, 2020, 124 (01) : 131 - 152
  • [9] How to get great research cited
    Lindgreen, Adam
    Di Benedetto, C. Anthony
    Brodie, Roderick J.
    Fehrer, Julia
    van der Borgh, Michel
    INDUSTRIAL MARKETING MANAGEMENT, 2020, 89 : A1 - A7
  • [10] Challenges in building scholarly knowledge graphs for research assessment in open science
    Manghi, Paolo
    QUANTITATIVE SCIENCE STUDIES, 2024, 5 (04): : 991 - 1021