Citywide quality of health information system through text mining of electronic health records

被引:0
作者
Anastasia A. Funkner
Michil P. Egorov
Sergey A. Fokin
Gennady M. Orlov
Sergey V. Kovalchuk
机构
[1] ITMO University,
[2] Medical Information and Analytical Center,undefined
[3] Sokolov North-Western District Scientific and Clinical Center,undefined
来源
Applied Network Science | / 6卷
关键词
Health information system; Electronic health record; Unstructured data; Natural language processing; Data completeness; Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
A system of hospitals in large cities can be considered a large and diverse but interconnected system. Widely applied in hospitals, electronic health records (EHR) are crucially different from each other because of the use of different health information systems, internal hospital rules, and individual behavior of physicians. The unstructured (textual) data of EHR is rarely used to assess the citywide quality of healthcare. Within the study, we analyze EHR data, particularly textual unstructured data, as a reflection of the complex multi-agent system of healthcare in the city of Saint Petersburg, Russia. Through analyzing the data collected by the Medical Information and Analytical Center, a method was proposed and evaluated for identifying a common structure, understanding the diversity, and assessing information quality in EHR data through the application of natural language processing techniques.
引用
收藏
相关论文
共 57 条
[11]  
Dugas M(2020)Medical corpora comparison using topic modeling Procedia Comput Sci 178 244-835
[12]  
Burke HB(2017)An exploratory case study to understand primary care users and their data quality tradeoffs J Data Inf Qual 8 1-87
[13]  
Hoang A(2015)Semantic processing of EHR data for clinical research J Biomed Inform 58 247-381
[14]  
Becher D(2013)A hybrid system for temporal information extraction from clinical text J Am Med Inform Assoc 20 828-151
[15]  
Datta S(2017)Improving the quality of EHR recording in primary care: a data quality feedback tool J Am Med Inform Assoc 24 81-488
[16]  
Bernstam EV(2015)Bigartm: open source library for regularized multimodal topic modeling of large collections Commun Comput Inf Sci 542 370-undefined
[17]  
Roberts K(2012)Extracting diagnoses and investigation results from unstructured text in electronic health records by semi-supervised machine learning PLOS ONE 7 e30412-undefined
[18]  
Freedman HG(2013)Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research J Am Med Inform Assoc 20 144-undefined
[19]  
Williams H(2003)Measuring the completeness and currency of codified clinical information Methods Inf Med 42 482-undefined
[20]  
Miller MA(2019)Ontology-based clinical information extraction from physician’s free-text notes J Biomed Inform 98 103276-undefined