Machine learning-based donor permission extraction from informed consent documents

被引:0
|
作者
Zhang, Meng [1 ]
Sankaranarayanapillai, Madhuri [1 ]
Du, Jingcheng [1 ]
Xiang, Yang [1 ]
Manion, Frank J. [2 ]
Harris, Marcelline R. [2 ]
Stansbury, Cooper [2 ]
Pham, Huy Anh [1 ]
Tao, Cui [1 ,3 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, McWilliam Sch Biomed Informat, Houston, TX 77030 USA
[2] Univ Michigan, Sch Nursing, Ann Arbor, MI USA
[3] Mayo Clin, Dept Artificial Intelligence & Informat, Jacksonville, FL 32224 USA
基金
美国国家卫生研究院;
关键词
Informed consent; Machine learning; Natural language processing; Text classification;
D O I
10.1186/s12859-023-05568-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundWith more clinical trials are offering optional participation in the collection of bio-specimens for biobanking comes the increasing complexity of requirements of informed consent forms. The aim of this study is to develop an automatic natural language processing (NLP) tool to annotate informed consent documents to promote biorepository data regulation, sharing, and decision support. We collected informed consent documents from several publicly available sources, then manually annotated them, covering sentences containing permission information about the sharing of either bio-specimens or donor data, or conducting genetic research or future research using bio-specimens or donor data.ResultsWe evaluated a variety of machine learning algorithms including random forest (RF) and support vector machine (SVM) for the automatic identification of these sentences. 120 informed consent documents containing 29,204 sentences were annotated, of which 1250 sentences (4.28%) provide answers to a permission question. A support vector machine (SVM) model achieved a F-1 score of 0.95 on classifying the sentences when using a gold standard, which is a prefiltered corpus containing all relevant sentences.ConclusionsThis study provides the feasibility of using machine learning tools to classify permission-related sentences in informed consent documents.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Machine learning-based donor permission extraction from informed consent documents
    Meng Zhang
    Madhuri Sankaranarayanapillai
    Jingcheng Du
    Yang Xiang
    Frank J. Manion
    Marcelline R. Harris
    Cooper Stansbury
    Huy Anh Pham
    Cui Tao
    BMC Bioinformatics, 24
  • [2] Analysis of Permission Selection Techniques in Machine Learning-based Malicious App Detection
    Park, Jihyeon
    Kang, Munyeong
    Cho, Seong-je
    Han, Hyoil
    Suh, Kyoungwon
    2020 IEEE THIRD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE 2020), 2020, : 92 - 99
  • [3] A Machine Learning-Based Pipeline for the Extraction of Insights from Customer Reviews
    Lakatos, Robert
    Bogacsovics, Gergo
    Harangi, Balazs
    Lakatos, Istvan
    Tiba, Attila
    Toth, Janos
    Szabo, Marianna
    Hajdu, Andras
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (03)
  • [4] Machine learning-based keywords extraction for scientific literature
    Wu, Chunguo
    Marchese, Maurizio
    Jiang, Jingqing
    Ivanyukovich, Alexander
    Liang, Yanchun
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2007, 13 (10) : 1471 - 1483
  • [5] Machine Learning-Based Algorithms to Knowledge Extraction from Time Series Data: A Review
    Ciaburro, Giuseppe
    Iannace, Gino
    DATA, 2021, 6 (06)
  • [6] Automatic Extraction of Ontological Explanation for Machine Learning-Based Systems
    Chondamrongkul, Nacha
    Temdee, Punnarumol
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2023, 33 (01) : 133 - 156
  • [7] Deep learning-based automatic action extraction from structured chemical synthesis procedures
    Vaskevicius, Mantas
    Kapociute-Dzikiene, Jurgita
    Vaskevicius, Arnas
    Slepikas, Liudas
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [8] Machine learning-based opinion extraction approach from movie reviews for sentiment analysis
    Mustafa Abdalrassual Jassim
    Dhafar Hamed Abd
    Mohamed Nazih Omri
    Multimedia Tools and Applications, 2025, 84 (17) : 18599 - 18624
  • [9] Automatic extraction of titles from general documents using machine learning
    Hu, Yunhua
    Li, Hang
    Cao, Yunbo
    Teng, Li
    Meyerzon, Dmitriy
    Zheng, Qinghua
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (05) : 1276 - 1293
  • [10] Learning-based summarisation of XML documents
    Massih R. Amini
    Anastasios Tombros
    Nicolas Usunier
    Mounia Lalmas
    Information Retrieval, 2007, 10 : 233 - 255