SECNLP: A survey of embeddings in clinical natural language processing

Cited by: 44
Authors
Kalyan, Katikapalli Subramanyam [1]
Sangeetha, S. [1]
Affiliations
[1] NIT Trichy, Text Analyt & NLP Lab, Dept Comp Applicat, Trichy, India
Keywords
Embeddings; Distributed representations; Medical; Natural language processing; Survey; RECURRENT NEURAL-NETWORKS; ELECTRONIC HEALTH RECORDS; VECTOR REPRESENTATIONS; SEMANTIC SIMILARITY; WORD EMBEDDINGS; PHARMACOVIGILANCE; RELATEDNESS;
DOI
10.1016/j.jbi.2019.103323
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Distributed vector representations, or embeddings, map variable-length text to dense fixed-length vectors and capture prior knowledge that can be transferred to downstream tasks. Even though embeddings have become the de facto standard for text representation in deep-learning-based NLP tasks in both general and clinical domains, there is no survey paper that presents a detailed review of embeddings in Clinical Natural Language Processing. In this survey paper, we discuss various medical corpora and their characteristics as well as medical codes, and present a brief overview and comparison of popular embedding models. We classify clinical embeddings and discuss each embedding type in detail. We discuss various evaluation methods, followed by possible solutions to various challenges in clinical embeddings. Finally, we conclude with some future directions that will advance research in clinical embeddings.
Pages: 21