Automatic Extraction and Decryption of Abbreviations from Domain-Specific Texts

Cited by: 2
Authors
Egorov, Michil [1]
Funkner, Anastasia [1]
Affiliations
[1] ITMO Univ, St Petersburg, Russia
Source
PHEALTH 2021 | 2021 / Vol. 285
Funding
Russian Science Foundation
Keywords
Clinical text; medical records; natural language processing; abbreviations
DOI
10.3233/SHTI210615
Chinese Library Classification (CLC) number
R19 [Health organizations and services (health services administration)]
Abstract
This paper explores the problems of extracting and decrypting abbreviations in domain-specific texts in Russian. The main focus is on unstructured electronic medical records, which pose specific preprocessing problems. The major challenge is that there is no uniform way to write medical histories. The aim of the paper is to generalize the decryption of abbreviations to any variant of text. A dataset of nearly three million medical records was collected, and a classifier model was trained to extract and decrypt abbreviations. After testing the proposed method with 224,307 records, the model showed an F1 score of 93.7% on a valid dataset.
Pages: 281-284
Page count: 4