Construction of cardiovascular information extraction corpus based on electronic medical records

Cited by: 1
Authors
Chang, Hongyang [1]
Zan, Hongying [1, 2]
Zhang, Shuai [1]
Zhao, Bingfei [1]
Zhang, Kunli [1, 2]
Affiliations
[1] Zhengzhou Univ, Sch Comp & Artificial Intelligence, Zhengzhou, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
Keywords
cardiovascular disease; corpus construction; electronic medical record; RECOGNITION
DOI
10.3934/mbe.2023596
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Cardiovascular disease has a significant impact on both society and patients, making it necessary to conduct knowledge-based research, such as research that utilizes knowledge graphs and automated question answering. However, existing research on corpus construction for cardiovascular disease is relatively limited, which has hindered further knowledge-based research on this disease. Electronic medical records contain patient data that span the entire diagnosis and treatment process and include a large amount of reliable medical information. Therefore, we collected electronic medical record data related to cardiovascular disease, combined the data with relevant work experience, and developed a standard for labeling cardiovascular electronic medical record entities and entity relations. By building a sentence-level labeling result dictionary with a rule-based semi-automatic method, we constructed a cardiovascular electronic medical record entity and entity relationship labeling corpus (CVDEMRC). The CVDEMRC contains 7,691 entities and 11,185 entity relation triples, and the consistency examination yielded 93.51% for entity annotations and 84.02% for entity-relationship annotations, indicating good annotation consistency. The CVDEMRC constructed in this study is expected to provide a database for information extraction research related to cardiovascular diseases.
Pages: 13379-13397
Number of pages: 19