Machine learning-based donor permission extraction from informed consent documents

被引:1
作者
Zhang, Meng [1 ]
Sankaranarayanapillai, Madhuri [1 ]
Du, Jingcheng [1 ]
Xiang, Yang [1 ]
Manion, Frank J. [2 ]
Harris, Marcelline R. [2 ]
Stansbury, Cooper [2 ]
Pham, Huy Anh [1 ]
Tao, Cui [1 ,3 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, McWilliam Sch Biomed Informat, Houston, TX 77030 USA
[2] Univ Michigan, Sch Nursing, Ann Arbor, MI USA
[3] Mayo Clin, Dept Artificial Intelligence & Informat, Jacksonville, FL 32224 USA
基金
美国国家卫生研究院;
关键词
Informed consent; Machine learning; Natural language processing; Text classification;
D O I
10.1186/s12859-023-05568-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundWith more clinical trials are offering optional participation in the collection of bio-specimens for biobanking comes the increasing complexity of requirements of informed consent forms. The aim of this study is to develop an automatic natural language processing (NLP) tool to annotate informed consent documents to promote biorepository data regulation, sharing, and decision support. We collected informed consent documents from several publicly available sources, then manually annotated them, covering sentences containing permission information about the sharing of either bio-specimens or donor data, or conducting genetic research or future research using bio-specimens or donor data.ResultsWe evaluated a variety of machine learning algorithms including random forest (RF) and support vector machine (SVM) for the automatic identification of these sentences. 120 informed consent documents containing 29,204 sentences were annotated, of which 1250 sentences (4.28%) provide answers to a permission question. A support vector machine (SVM) model achieved a F-1 score of 0.95 on classifying the sentences when using a gold standard, which is a prefiltered corpus containing all relevant sentences.ConclusionsThis study provides the feasibility of using machine learning tools to classify permission-related sentences in informed consent documents.
引用
收藏
页数:10
相关论文
共 50 条
[41]   Machine Learning-Based Monitoring of Chemical Contamination from Drilling Leaks [J].
Sun, Jian ;
Ai, Yong ;
Tang, Kang ;
Zhang, Zhe .
CHEMISTRY AND TECHNOLOGY OF FUELS AND OILS, 2025,
[42]   Benchmarking physics-informed machine learning-based short term PV-power forecasting tools [J].
Pombo, Daniel Vazquez ;
Bacher, Peder ;
Ziras, Charalampos ;
Bindner, Henrik W. ;
Spataru, Sergiu, V ;
Sorensen, Poul E. .
ENERGY REPORTS, 2022, 8 :6512-6520
[43]   A Machine Learning-Based Topic Extraction and Categorization of State Universities and Colleges (SUC) Customer Feedbacks [J].
Soriano, Lorna T. ;
Palaoag, Thelma D. .
PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND MANAGEMENT (ICICM 2018), 2018, :1-6
[44]   Machine Learning-Based Universal Threshold Voltage Extraction of Transistors Using Convolutional Neural Networks [J].
Kocak, Husnu Murat ;
Davis, Jesse ;
Houssa, Michel ;
Naskali, Ahmet Teoman ;
Mitard, Jerome .
IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, 2024, 37 (04) :615-619
[45]   Machine Learning and Deep Learning based Information Extraction from Bangla Names [J].
Kowsher, Md ;
Sanjid, Md Zahidul Islam ;
Das, Avishek ;
Ahmed, Mahid ;
Sarker, Md Murad Hossain .
9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 :224-233
[46]   Machine Learning-Based Mapping for Mineral Exploration [J].
Zuo, Renguang ;
Carranza, Emmanuel John M. .
MATHEMATICAL GEOSCIENCES, 2023, 55 (07) :891-895
[47]   Machine Learning-Based Mapping for Mineral Exploration [J].
Renguang Zuo ;
Emmanuel John M. Carranza .
Mathematical Geosciences, 2023, 55 :891-895
[48]   Machine Learning-Based Hurricane Wind Reconstruction [J].
Yang, Qidong ;
Lee, Chia-Ying ;
Tippett, Michael K. ;
Chavas, Daniel R. ;
Knutson, Thomas R. .
WEATHER AND FORECASTING, 2022, 37 (04) :477-493
[49]   Machine learning-based wavelength detection system [J].
Kwon, Ik-Hyun ;
Choi, Yong-Joon ;
Ide, Tomoya ;
Noda, Toshihiko ;
Takahashi, Kazuhiro ;
Sawada, Kazuaki .
JAPANESE JOURNAL OF APPLIED PHYSICS, 2025, 64 (01)
[50]   Machine learning-based assessment of diabetes risk [J].
Sun, Qi ;
Cheng, Xin ;
Han, Kuo ;
Sun, Yichao ;
Ren, He ;
Li, Ping .
APPLIED INTELLIGENCE, 2025, 55 (02)