Machine learning-based donor permission extraction from informed consent documents

被引:1
作者
Zhang, Meng [1 ]
Sankaranarayanapillai, Madhuri [1 ]
Du, Jingcheng [1 ]
Xiang, Yang [1 ]
Manion, Frank J. [2 ]
Harris, Marcelline R. [2 ]
Stansbury, Cooper [2 ]
Pham, Huy Anh [1 ]
Tao, Cui [1 ,3 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, McWilliam Sch Biomed Informat, Houston, TX 77030 USA
[2] Univ Michigan, Sch Nursing, Ann Arbor, MI USA
[3] Mayo Clin, Dept Artificial Intelligence & Informat, Jacksonville, FL 32224 USA
基金
美国国家卫生研究院;
关键词
Informed consent; Machine learning; Natural language processing; Text classification;
D O I
10.1186/s12859-023-05568-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundWith more clinical trials are offering optional participation in the collection of bio-specimens for biobanking comes the increasing complexity of requirements of informed consent forms. The aim of this study is to develop an automatic natural language processing (NLP) tool to annotate informed consent documents to promote biorepository data regulation, sharing, and decision support. We collected informed consent documents from several publicly available sources, then manually annotated them, covering sentences containing permission information about the sharing of either bio-specimens or donor data, or conducting genetic research or future research using bio-specimens or donor data.ResultsWe evaluated a variety of machine learning algorithms including random forest (RF) and support vector machine (SVM) for the automatic identification of these sentences. 120 informed consent documents containing 29,204 sentences were annotated, of which 1250 sentences (4.28%) provide answers to a permission question. A support vector machine (SVM) model achieved a F-1 score of 0.95 on classifying the sentences when using a gold standard, which is a prefiltered corpus containing all relevant sentences.ConclusionsThis study provides the feasibility of using machine learning tools to classify permission-related sentences in informed consent documents.
引用
收藏
页数:10
相关论文
共 50 条
[31]   Extraction of mitigation-related text from Endangered Species Act documents using machine learning: a case study [J].
Varghese A. ;
Allen K. ;
Agyeman-Badu G. ;
Haire J. ;
Madsen R. .
Environment Systems and Decisions, 2022, 42 (1) :63-74
[32]   A Machine Learning-Based Evaluation Method for Machine Translation [J].
Kotani, Katsunori ;
Yoshimi, Takehiko .
ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, PROCEEDINGS, 2010, 6040 :351-+
[33]   Evaluation of a Machine Learning-Based Prognostic Model for Unrelated Hematopoietic Cell Transplantation Donor Selection [J].
Buturovic, Ljubomir ;
Shelton, Jason ;
Spellman, Stephen R. ;
Wang, Tao ;
Friedman, Lyssa ;
Loftus, David ;
Hesterberg, Lyndal ;
Woodring, Todd ;
Fleischhauer, Katharina ;
Hsu, Katharine C. ;
Verneris, Michael R. ;
Haagenson, Mike ;
Lee, Stephanie J. .
BIOLOGY OF BLOOD AND MARROW TRANSPLANTATION, 2018, 24 (06) :1299-1306
[34]   A Machine Learning-Based Approach for Demarcating Requirements in Textual Specifications [J].
Abualhaija, Sallam ;
Arora, Chetan ;
Sabetzadeh, Mehrdad ;
Briand, Lionel C. ;
Vaz, Eduardo .
2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, :51-62
[35]   Diagnosis, disclosure, and informed consent: Learning from parents of children with cancer [J].
Levi, RB ;
Marsick, R ;
Drotar, D ;
Kodish, ED .
JOURNAL OF PEDIATRIC HEMATOLOGY ONCOLOGY, 2000, 22 (01) :3-12
[36]   Learning from informed consent litigation to improve practices: A systematic review [J].
Giudici-Wach, Karine ;
Gillois, Pierre ;
Remen, Thomas ;
Claudot, Frederique .
PATIENT EDUCATION AND COUNSELING, 2022, 105 (07) :1714-1721
[37]   Machine Learning Based Detection of Digital Documents Maliciously Recaptured from Displays [J].
Gholam-Zadeh, Saleh ;
Upenik, Evgeniy ;
Hatarsi, Guy ;
Ebrahimi, Touradj .
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIII, 2020, 11510
[38]   From substitution to redefinition: A framework of machine learning-based science assessment [J].
Zhai, Xiaoming ;
C. Haudek, Kevin ;
Shi, Lehong ;
H. Nehm, Ross ;
Urban-Lurain, Mark .
JOURNAL OF RESEARCH IN SCIENCE TEACHING, 2020, 57 (09) :1430-1459
[39]   Machine Learning-Based Prediction of Grain Size from Colored Microstructure [J].
Jung, Jun-Ho ;
Kim, Hee-Soo .
KOREAN JOURNAL OF METALS AND MATERIALS, 2023, 61 (05) :379-387
[40]   Building Machine Learning-based Threat Hunting System from Scratch [J].
Chen, Chung-Kuan ;
Lin, Si-Chen ;
Huang, Szu-Chun ;
Chu, Yung-Tien ;
Lei, Chin-Laung ;
Huang, Chun-Ying .
DIGITAL THREATS: RESEARCH AND PRACTICE, 2022, 3 (03)