Clinical text retrieval-an overview of basic building blocks and applications

被引:4
作者
Dalianis, Hercules [1 ]
机构
[1] Department of Computer and Systems Sciences, Stockholm University, P.O. Box 7003, Kista
来源
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | 2014年 / 8830卷
关键词
Clinical text; Electronic patient records; ICD-10; Information retrieval; SNOMED-CT; Spell checking; Swedish;
D O I
10.1007/978-3-319-12511-4_8
中图分类号
学科分类号
摘要
This article describes information retrieval, natural language processing and text mining of electronic patient record text, also called clinical text. Clinical text is written by physicians and nurses to document the health care process of the patient. First we describe some characteristics of clinical text, followed by the automatic preprocessing of the text that is necessary for making it usable for some applications. We also describe some applications for clinicians including spelling and grammar checking, ICD-10 diagnosis code assignment, as well as other applications for hospital management such as ICD-10 diagnosis code validation and detection of adverse events such as hospital acquired infections. Part of the preprocessing makes the clinical text useful for faceted search, although clinical text already has some keys for performing faceted search such as gender, age, ICD-10 diagnosis codes, ATC drug codes, etc. Preprocessing makes use of ICD-10 codes and the SNOMED-CT textual descriptions. ICD-10 codes and SNOMED-CT are available in several languages and can be considered the modern Greek or Latin of medical language. The basic research presented here has its roots in the challenges described by the health care sector. These challenges have been partially solved in academia, and we believe the solutions will be adapted to the health care sector in real world applications. © Springer International Publishing Switzerland 2014.
引用
收藏
页码:147 / 165
页数:18
相关论文
共 53 条
[1]  
Allvin H., Carlsson E., Dalianis H., Danielsson-Ojala R., Daudaravicius V., Hassel M., Kokkinakis D., Lundgren-Laine H., Nilsson G.H., Nytro O., Sanna S., Hanna S., Sumithra V., Characteristics of Finnish and Swedish intensive care nursing narratives: A comparative analysis to support the development of clinical language technologies, Journal of Biomedical Semantics, 2, pp. 1-11, (2011)
[2]  
Carlberger J., Dalianis H., Hassel M., Knutsson O., Improving precision in information retrieval for Swedish using stemming, Proceedings of NODALIDA 2001 -13th Nordic Conference on Computational Linguistics, (2001)
[3]  
Chapman W.W., Bridewell W., Hanbury P., Cooper G.F., Buchanan B.G., Evaluation of negation phrases in narrative clinical reports, Proceedings of the AMIA Symposium, (2001)
[4]  
Chapman W.W., Bridewell W., Hanbury P., Cooper G.F., Buchanan B.G., A simple algorithm for identifying negated findings and diseases in discharge summaries, Journal of Biomedical Informatics, 34, 5, pp. 301-310, (2001)
[5]  
Chen A., Gey F.C., Combining query translation and document translation in cross-language retrieval, CLEF 2003. LNCS, 3237, pp. 108-121, (2004)
[6]  
Dalianis H., Evaluating a spelling support in a search engine, NLDB 2002. LNCS, 2253, pp. 183-190, (2002)
[7]  
Dalianis H., Aggregation in natural language generation, Computational Intelligence, 15, 4, pp. 384-414, (1999)
[8]  
Dalianis H., Improving search engine retrieval using a compound splitter for Swedish, Proceedings of the 15th Nordic Conference of Computational Linguistics, pp. 38-42., (2005)
[9]  
Dalianis H., Hassel M., Henriksson A., Skeppstedt M., Stockholm EPR Corpus: A clinical database used to improve health care, Swedish Language Technology Conference, pp. 17-18, (2012)
[10]  
Dalianis H., Hassel M., Velupillai S., The Stockholm EPR Corpus-Characteristics and Some Initial Findings, Proceedings of ISHIMR 2009, Evaluation and Implementation of e-Health and Health Information Initiatives: International Perspectives, 14th International Symposium for Health Information Management Research, pp. 243-249, (2009)