Normalizing acronyms and abbreviations to aid patient understanding of clinical texts: ShARe/CLEF eHealth Challenge 2013, Task 2

被引:14
作者
Mowery, Danielle L. [1 ]
South, Brett R. [1 ]
Christensen, Lee [1 ]
Leng, Jianwei [1 ]
Peltonen, Laura-Maria [2 ,3 ]
Salantera, Sanna [2 ,3 ]
Suominen, Hanna [4 ,5 ]
Martinez, David [6 ,7 ]
Velupillai, Sumithra [8 ]
Elhadad, Noemie [9 ]
Savova, Guergana [10 ]
Pradhan, Sameer [10 ]
Chapman, Wendy W. [1 ]
机构
[1] Univ Utah, Dept Biomed Informat, Salt Lake City, UT 84112 USA
[2] Univ Turku, Nursing Sci, Turku, Finland
[3] Turku Univ Hosp, Turku, Finland
[4] Univ Canberra, Data61, CSIRO, Australian Natl Univ, Locked Bag 8001, Canberra, ACT 2601, Australia
[5] Univ Turku, Locked Bag 8001, Canberra, ACT 2601, Australia
[6] MedWhat Com, San Francisco, CA USA
[7] Univ Melbourne, Parkville, Vic, Australia
[8] Stockholm Univ, Dept Comp & Syst Sci DSV, Stockholm, Sweden
[9] Columbia Univ, Dept Biomed Informat, New York, NY USA
[10] Harvard Med Sch, Boston Childrens Hosp, Boston, MA USA
来源
JOURNAL OF BIOMEDICAL SEMANTICS | 2016年 / 7卷
基金
澳大利亚研究理事会; 芬兰科学院; 美国国家卫生研究院;
关键词
Natural language processing; Acronyms; Abbreviations; Consumer health information; Unified Medical Language System; MEDICAL-RECORDS; MEDLINE; ACCESS; DICTIONARY; ANNOTATION; DOCTORS; SYSTEM;
D O I
10.1186/s13326-016-0084-y
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term. Methods: In this study, we evaluate 1) accuracy of participating systems' normalizing short forms compared to a majority sense baseline approach, 2) performance of participants' systems for short forms with variable majority sense distributions, and 3) report the accuracy of participating systems' normalizing shared normalized concepts between the test set and the Consumer Health Vocabulary, a vocabulary of lay medical terms. Results: The best systems submitted by the five participating teams performed with accuracies ranging from 43 to 72 %. A majority sense baseline approach achieved the second best performance. The performance of participating systems for normalizing short forms with two or more senses with low ambiguity (majority sense greater than 80 %) ranged from 52 to 78 % accuracy, with two or more senses with moderate ambiguity (majority sense between 50 and 80 %) ranged from 23 to 57 % accuracy, and with two or more senses with high ambiguity (majority sense less than 50 %) ranged from 2 to 45 % accuracy. With respect to the ShARe test set, 69 % of short form annotations contained common concept unique identifiers with the Consumer Health Vocabulary. For these 2594 possible annotations, the performance of participating systems ranged from 50 to 75 % accuracy. Conclusion: Short form normalization continues to be a challenging problem. Short form normalization systems perform with moderate to reasonable accuracies. The Consumer Health Vocabulary could enrich its knowledge base with missed concept unique identifiers from the ShARe test set to further support patient understanding of unfamiliar medical terms.
引用
收藏
页数:13
相关论文
共 64 条
  • [1] [Anonymous], 2007, Biological, translational, and clinical language processing
  • [2] Aronson AR, 2001, J AM MED INFORM ASSN, P17
  • [3] The "Meaningful Use" Regulation for Electronic Health Records
    Blumenthal, David
    Tavenner, Marilyn
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2010, 363 (06) : 501 - 504
  • [4] The Unified Medical Language System: Toward a collaborative approach for solving terminologic problems
    Campbell, KE
    Oliver, DE
    Shortliffe, EH
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1998, 5 (01) : 12 - 16
  • [5] Creating an online dictionary of abbreviations from MEDLINE
    Chang, JT
    Schütze, H
    Altman, RB
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2002, 9 (06) : 612 - 620
  • [6] Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions
    Chapman, Wendy W.
    Nadkarni, Prakash M.
    Hirschman, Lynette
    D'Avolio, Leonard W.
    Savova, Guergana K.
    Uzuner, Ozlem
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) : 540 - 543
  • [7] Inviting Patients to Read Their Doctors' Notes: A Quasi-experimental Study and a Look Ahead
    Delbanco, Tom
    Walker, Jan
    Bell, Sigall K.
    Darer, Jonathan D.
    Elmore, Joann G.
    Farag, Nadine
    Feldman, Henry J.
    Mejilla, Roanne
    Ngo, Long
    Ralston, James D.
    Ross, Stephen E.
    Trivedi, Neha
    Vodicka, Elisabeth
    Leveille, Suzanne G.
    [J]. ANNALS OF INTERNAL MEDICINE, 2012, 157 (07) : 461 - U36
  • [8] Open Notes: Doctors and Patients Signing On
    Delbanco, Tom
    Walker, Jan
    Darer, Jonathan D.
    Elmore, Joann G.
    Feldman, Henry J.
    Leveille, Suzanne G.
    Ralston, James D.
    Ross, Stephen E.
    Vodicka, Elisabeth
    Weber, Valerie D.
    [J]. ANNALS OF INTERNAL MEDICINE, 2010, 153 (02) : 121 - W45
  • [9] Denny JC, 2002, AMIA 2002 SYMPOSIUM, PROCEEDINGS, P1007
  • [10] Department of Health and Human Services, HIPAA ADM SIMPL REG