A multilevel analysis of data quality for formal software citation

Authors
Schindler, David [1]
Hossain, Tazin [1]
Spors, Sascha [1]
Krueger, Frank [1, 2, 3]
Affiliations
[1] Univ Rostock, Inst Commun Engn, Rostock, Germany
[2] Univ Appl Sci Technol Business & Design, Hsch Wismar, Fac Engn, Wismar, Germany
[3] Univ Rostock, Dept Knowledge Culture & Transformat, Rostock, Germany
Source
QUANTITATIVE SCIENCE STUDIES | 2024 / Vol. 5 / Issue 03
Keywords
data quality; scientific software; software citation; SIMULTANEOUS CONFIDENCE-INTERVALS; AGREEMENT; CROSSREF; SCIENCE; IMPACT; TEXT
DOI
10.1162/qss_a_00309
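Chinese Library Classification (CLC) number
G25 [Library science, librarianship]; G35 [Information science, information work]
Discipline classification code
1205; 120501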
Abstract
Software is a central part of modern science, and knowledge of its use is crucial for the scientific community with respect to reproducibility and attribution of its developers. Several studies have investigated in-text mentions of software and its quality, while the quality of formal software citations has only been analyzed superficially. This study performs an in-depth evaluation of formal software citation based on a set of manually annotated software references. It examines which resources are cited for software usage, to what extent they allow proper identification of software and its specific version, how this information is made available by scientific publishers, and how well it is represented in large-scale bibliographic databases. The results show that software articles are the most cited resource for software, while direct software citations are better suited for identification of software versions. Moreover, we found current practices by both publishers and bibliographic databases to be unsuited to represent these direct software citations, hindering large-scale analyses such as assessing software impact. We argue that current practices for representing software citations (the recommended way to cite software by current citation standards) stand in the way of their adoption by the scientific community, and urge providers of bibliographic data to explicitly model scientific software.
Pages: 637-667
Number of pages: 31