Automatic Extraction and Decryption of Abbreviations from Domain-Specific Texts

Cited by: 2
Authors
Egorov, Michil [1]
Funkner, Anastasia [1]
Affiliation
[1] ITMO Univ, St Petersburg, Russia
Source
PHEALTH 2021 | 2021 / Vol. 285
Funding
Russian Science Foundation;
Keywords
Clinical text; medical records; natural language processing; abbreviations;
DOI
10.3233/SHTI210615
Chinese Library Classification
R19 [Health care organizations and services (health services administration)];
Abstract
This paper explores the problems of extracting and decrypting abbreviations in domain-specific Russian texts. The main focus is on unstructured electronic medical records, which pose specific preprocessing problems. The major challenge is that there is no uniform way to write medical histories. The aim of the paper is to generalize the decryption of abbreviations so that it works on any variant of text. A dataset of nearly three million medical records was collected, and a classifier model was trained to extract and decrypt abbreviations. After the proposed method was tested on 224,307 records, the model showed an F1 score of 93.7% on a valid dataset.
Pages: 281-284
Number of pages: 4