A Natural Language Processing Model to Identify Confidential Content in Adolescent Clinical Notes

被引:4
作者
Rabbani, Naveed [1 ]
Bedgood, Michael [2 ]
Brown, Conner [3 ]
Steinberg, Ethan [4 ,5 ]
Goldstein, Rachel L. L. [6 ]
Carlson, Jennifer L. L. [6 ]
Pageler, Natalie [1 ]
Morse, Keith E. E. [1 ]
机构
[1] Stanford Univ, Dept Pediat, Sch Med, 453 Quarry Rd, Stanford, CA 94304 USA
[2] Calif Dept Publ Hlth, Richmond, CA USA
[3] Lucile Packard Childrens Hosp, Informat Serv Dept, Palo Alto, CA USA
[4] Stanford Univ, Ctr Biomed Informat Res, Sch Med, Stanford, CA 94304 USA
[5] Stanford Univ, Dept Comp Sci, Stanford, CA 94304 USA
[6] Stanford Univ, Dept Pediat, Div Adolescent Med, Sch Med, Stanford, CA 94304 USA
关键词
confidentiality; patient portals; natural language processing; machine learning; health information exchange; 21ST-CENTURY CURES ACT; HEALTH-CARE; PERCEPTIONS; INFORMATION;
D O I
10.1055/a-2051-9764
中图分类号
R-058 [];
学科分类号
摘要
Background The 21st Century Cures Act mandates the immediate, electronic release of health information to patients. However, in the case of adolescents, special consideration is required to ensure that confidentiality is maintained. The detection of confidential content in clinical notes may support operational efforts to preserve adolescent confidentiality while implementing information sharing. Objectives This study aimed to determine if a natural language processing (NLP) algorithm can identify confidential content in adolescent clinical progress notes. Methods A total of 1,200 outpatient adolescent progress notes written between 2016 and 2019 were manually annotated to identify confidential content. Labeled sentences from this corpus were featurized and used to train a two-part logistic regression model, which provides both sentence-level and note-level probability estimates that a given text contains confidential content. This model was prospectively validated on a set of 240 progress notes written in May 2022. It was subsequently deployed in a pilot intervention to augment an ongoing operational effort to identify confidential content in progress notes. Note-level probability estimates were used to triage notes for review and sentence-level probability estimates were used to highlight high-risk portions of those notes to aid the manual reviewer. Results The prevalence of notes containing confidential content was 21% (255/1,200) and 22% (53/240) in the train/test and validation cohorts, respectively. The ensemble logistic regression model achieved an area under the receiver operating characteristic of 90 and 88% in the test and validation cohorts, respectively. Its use in a pilot intervention identified outlier documentation practices and demonstrated efficiency gains over completely manual note review. Conclusion An NLP algorithm can identify confidential content in progress notes with high accuracy. Its human-in-the-loop deployment in clinical operations augmented an ongoing operational effort to identify confidential content in adolescent progress notes. These findings suggest NLP may be used to support efforts to preserve adolescent confidentiality in the wake of the information blocking mandate.
引用
收藏
页码:400 / 407
页数:8
相关论文
共 37 条
[1]  
[Anonymous], 2022, CUR ACT FIN RUL INF
[2]  
[Anonymous], 2020, The FSM Trust Fund: Actions Are Required to Effectively and Efficiently Achieve the Goals of the FSM Trust Fund
[3]  
[Anonymous], HLTH LANGUAGE NLP UN
[4]  
[Anonymous], 3M CODEASSIST SYSTEM
[5]   The 21st Century Cures Act and Multiuser Electronic Health Record Access: Potential Pitfalls of Information Release [J].
Arvisais-Anhalt, Simone ;
Lau, May ;
Lehmann, Christoph U. ;
Holmgren, A. Jay ;
Medford, Richard J. ;
Ramirez, Charina M. ;
Chen, Clifford N. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (02)
[6]  
Bedgood M., 2023, APPL CLIN INFORM
[7]   Organizational Perspectives on Technical Capabilities and Barriers Related to Pediatric Data Sharing and Confidentiality [J].
Bedgood, Michael ;
Kuelbs, Cynthia L. ;
Jones, Veena G. ;
Pageler, Natalie .
JAMA NETWORK OPEN, 2022, 5 (07)
[8]   Computer-assisted clinical coding: A narrative review of the literature on its benefits, limitations, implementation and impact on clinical coding professionals [J].
Campbell, Sharon ;
Giadresco, Katrina .
HEALTH INFORMATION MANAGEMENT JOURNAL, 2020, 49 (01) :5-18
[9]   NASPAG/SAHM Statement: The 21st Century Cures Act and Adolescent Confidentiality [J].
Carlson, Jennifer ;
Goldstein, Rachel ;
Hoover, Kim ;
Tyson, Nichole .
JOURNAL OF ADOLESCENT HEALTH, 2021, 68 (02) :426-428
[10]   Inviting Patients to Read Their Doctors' Notes: A Quasi-experimental Study and a Look Ahead [J].
Delbanco, Tom ;
Walker, Jan ;
Bell, Sigall K. ;
Darer, Jonathan D. ;
Elmore, Joann G. ;
Farag, Nadine ;
Feldman, Henry J. ;
Mejilla, Roanne ;
Ngo, Long ;
Ralston, James D. ;
Ross, Stephen E. ;
Trivedi, Neha ;
Vodicka, Elisabeth ;
Leveille, Suzanne G. .
ANNALS OF INTERNAL MEDICINE, 2012, 157 (07) :461-U36