Natural language processing approach for distributed health data management

被引:2
作者
Forestiero, Agostino [1 ]
Papuzzo, Giuseppe [1 ]
机构
[1] CNR, ICAR, Via Pietro Bucci 8-9 C, I-87036 Arcavacata Di Rende, CS, Italy
来源
2020 28TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2020) | 2020年
关键词
Electronic Health Records; Natural language processing; Semantic overlay network; Self-organization; INFORMATION;
D O I
10.1109/PDP50117.2020.00061
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Today's health domain is characterized by heterogeneous, numerous, highly dynamics and geographically distributed information sources. Moreover, the increasing use of digital health data, like electronic health records (EHRs), has led to store an unprecedented amount of information. Managing this large amount of data can, often, introduce issues of information overload, with potential negative consequences on clinical work, such as errors of omission, delays, and overall patient safety. Innovative techniques, approaches and infrastructures are needed to investigate data featured by high velocity, volume and variability. This paper introduces a distributed and self-organizing algorithm for building a management system for big data in highly dynamic environments like healthcare domain. Health data are represented with vectors obtained through the Doc2Vec model, a Natural Language Processing (NLP) approach able to capture the semantic context representing documents in dense vectors namely word embeddings. Doc2Vec is an unsupervised algorithm to generate vectors starting from sentences/documents based on word2vec approach which can generate vectors for words. The servers of a clinical distributed system, by performing autonomous and local operations, organize themselves in a sorted overlay network, so that resource management operations become faster and efficient. The effectiveness of the approach was proved performing a set of preliminary experiments exploiting a tailored implemented simulator.
引用
收藏
页码:360 / 363
页数:4
相关论文
共 14 条
[1]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[2]   Exploiting Social Media to enhance Clinical Decision Support [J].
Comito, Carmela ;
Forestiero, Agostino ;
Papuzzo, Giuseppe .
2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE WORKSHOPS (WI 2019 COMPANION), 2019, :244-249
[3]   Routing indices for peer-to-peer systems [J].
Crespo, A ;
Garcia-Molina, H .
22ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2002, :23-32
[4]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[5]  
Folino G., 2006, Journal of Software, V1, DOI 10.4304/jsw.1.2.12-23
[6]   Reorganization and discovery of grid information with epidemic tuning [J].
Forestiero, Agostino ;
Mastroianni, Carlo ;
Spezzano, Giandomenico .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2008, 24 (08) :788-797
[7]   Multi-agent recommendation system in Internet of Things [J].
Forestiero, Agostino .
2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, :772-775
[8]   Building a Peer-to-peer Information System in Grids via Self-organizing Agents [J].
Forestiero, Agostino ;
Mastroianni, Carlo ;
Spezzano, Giandomenico .
JOURNAL OF GRID COMPUTING, 2008, 6 (02) :125-140
[9]  
Liddy E.D, 2001, Natural language processing
[10]  
Nobre Jeferson Campos, 2017, J NETW SYST MANAG, P1