Biomedical text mining for research rigor and integrity: tasks, challenges, directions

Cited by: 34
Author
Kilicoglu, Halil [1]
Affiliation
[1] US Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, Bethesda, MD 20894 USA
Funding
US National Institutes of Health;
Keywords
biomedical research waste; biomedical text mining; natural language processing; research rigor; research integrity; reproducibility; AUTOMATIC RECOGNITION; PLAGIARISM; ARTICLES; CITATION; KNOWLEDGE; REPRODUCIBILITY; CLASSIFICATION; EXTRACTION; SENTENCES; MEDICINE;
DOI
10.1093/bib/bbx057
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
An estimated quarter of a trillion US dollars is invested in the biomedical research enterprise annually. There is growing alarm that a significant portion of this investment is wasted because of problems in reproducibility of research findings and in the rigor and integrity of research conduct and reporting. Recent years have seen a flurry of activities focusing on standardization and guideline development to enhance the reproducibility and rigor of biomedical research. Research activity is primarily communicated via textual artifacts, ranging from grant applications to journal publications. These artifacts can be both the source and the manifestation of practices leading to research waste. For example, an article may describe a poorly designed experiment, or the authors may reach conclusions not supported by the evidence presented. In this article, we pose the question of whether biomedical text mining techniques can assist the stakeholders in the biomedical research enterprise in doing their part toward enhancing research integrity and rigor. In particular, we identify four key areas in which text mining techniques can make a significant contribution: plagiarism/fraud detection, ensuring adherence to reporting guidelines, managing information overload and accurate citation/enhanced bibliometrics. We review the existing methods and tools for specific tasks, if they exist, or discuss relevant research that can provide guidance for future work. With the exponential increase in biomedical research output and the ability of text mining approaches to perform automatic tasks at large scale, we propose that such approaches can support tools that promote responsible research practices, providing significant benefits for the biomedical research enterprise.
Pages: 1400-1414
Number of pages: 15