Identifying a Medical Department Based on Unstructured Data: A Big Data Application in Healthcare

被引:2
作者
Bansal, Veena [1 ]
Poddar, Abhishek [2 ]
Ghosh-Roy, R. [3 ]
机构
[1] Indian Inst Technol Bhilai, Raipur 492015, Madhya Pradesh, India
[2] Indian Inst Technol Kanpur, Kanpur 208016, Uttar Pradesh, India
[3] IBM United Kingdom Ltd, London SE1 9PZ, England
关键词
healthcare; big data; unstructured data; tertiary healthcare; DECISION-SUPPORT-SYSTEM; EXPERT-SYSTEM; TEXT; CLASSIFIER;
D O I
10.3390/info10010025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Health is an individual's most precious asset and healthcare is one of the vehicles for preserving it. The Indian government's spend on healthcare system is relatively low (1.2% of GDP). Consequently, Secondary and Tertiary government healthcare centers in India (that are presumed to be of above average ratings) are always crowded. In Tertiary healthcare centers, like the All India Institute of Medical Science (AIIMS), patients are often unable to articulate their problems correctly to the healthcare center's reception staff, so that these patients to be directed to the correct healthcare department. In this paper, we propose a system that will scan prescriptions, referral letters and medical diagnostic reports of a patient, process the input using OCR (Optical Character Recognition) engines, coupled with image processing tools, to direct the patient to the most relevant department. We have implemented and tested parts of this system wherein a patient enters his symptoms and/or provisional diagnosis; the system suggests a department based on this user input. Our system suggests the correct department 70.19% of the time. On further investigation, we found that one particular department of the hospital was over-represented. We eliminated the department from the data and performance of the system improved to 92.7%. Our system presently makes its suggestions using random forest algorithm that has been trained using two information repositories-symptoms and disease data, functional description of each medical department. It is our informed assumption that, once we have incorporated medicine information and diagnostics imaging data to train the system; and the complete medical history of the patient, performance of the system will improve further.
引用
收藏
页数:14
相关论文
共 52 条
[1]   AN EXPERT-SYSTEM FOR HOMEOPATHIC GLAUCOMA TREATMENT (SEHO) [J].
ALONSOAMO, F ;
PEREZ, AG ;
GOMEZ, GL ;
MONTES, C .
EXPERT SYSTEMS WITH APPLICATIONS, 1995, 8 (01) :89-99
[2]  
Aly M., 2005, Neural Networks: The Official Journal of the International Neural Network Society, V19, P1
[3]  
[Anonymous], 2001, P 2 INT C MULT DAT M
[4]  
[Anonymous], 1997, WORLD BANK REPORT
[5]   Using machine learning to support healthcare professionals in making preauthorisation decisions [J].
Araujo, Flavio H. D. ;
Santana, Andre M. ;
Santos Neto, Pedro de A. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2016, 94 :1-7
[6]   Integrating knowledge sources in Devanagari text recognition system [J].
Bansal, V ;
Sinha, RMK .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (04) :500-505
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]  
Busemann S, 2000, 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, P158
[9]   A web-based clinical decision support system for gestational diabetes: Automatic diet prescription and detection of insulin needs [J].
Caballero-Ruiz, Estefania ;
Garcia-Saez, Gema ;
Rigla, Mercedes ;
Villaplana, Maria ;
Pons, Belen ;
Elena Hernando, M. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2017, 102 :35-49
[10]  
Candel A., 2015, Deep Learning with H2O