Prediction Model of User Physical Activity using Data Characteristics-based Long Short-term Memory Recurrent Neural Networks

被引:41
作者
Kim, Joo-Chang [1 ]
Chung, Kyungyong [2 ]
机构
[1] Kyonggi Univ, Dept Comp Sci, Data Min Lab, 154-42 Gwanggyosan Ro, Suwon 16227, Gyeonggi Do, South Korea
[2] Kyonggi Univ, Div Comp Sci & Engn, 154-42 Gwanggyosan Ro, Suwon 16227, Gyeonggi Do, South Korea
基金
新加坡国家研究基金会;
关键词
Data Mining; Neural Networks; LSTM; Prediction; Mobile Healthcare;
D O I
10.3837/tiis.2019.04.018
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, mobile healthcare services have attracted significant attention because of the emerging development and supply of diverse wearable devices. Smartwatches and health bands are the most common type of mobile-based wearable devices and their market size is increasing considerably. However, simple value comparisons based on accumulated data have revealed certain problems, such as the standardized nature of health management and the lack of personalized health management service models. The convergence of information technology (IT) and biotechnology (BT) has shifted the medical paradigm from continuous health management and disease prevention to the development of a system that can be used to provide ground-based medical services regardless of the user's location. Moreover, the IT-BT convergence has necessitated the development of lifestyle improvement models and services that utilize big data analysis and machine learning to provide mobile healthcare-based personal health management and disease prevention information. Users' health data, which are specific as they change over time, are collected by different means according to the users' lifestyle and surrounding circumstances. In this paper, we propose a prediction model of user physical activity that uses data characteristics-based long short-term memory (DC-LSTM) recurrent neural networks (RNNs). To provide personalized services, the characteristics and surrounding circumstances of data collectable from mobile host devices were considered in the selection of variables for the model. The data characteristics considered were ease of collection, which represents whether or not variables are collectable, and frequency of occurrence, which represents whether or not changes made to input values constitute significant variables in terms of activity. The variables selected for providing personalized services were activity, weather, temperature, mean daily temperature, humidity, UV, fine dust, asthma and lung disease probability index, skin disease probability index, cadence, travel distance, mean heart rate, and sleep hours. The selected variables were classified according to the data characteristics. To predict activity, an LSTM RNN was built that uses the classified variables as input data and learns the dynamic characteristics of time series data. LSTM RNNs resolve the vanishing gradient problem that occurs in existing RNNs. They are classified into three different types according to data characteristics and constructed through connections among the LSTMs. The constructed neural network learns training data and predicts user activity. To evaluate the proposed model, the root mean square error (RMSE) was used in the performance evaluation of the user physical activity prediction method for which an autoregressive integrated moving average (ARIMA) model, a convolutional neural network (CNN), and an RNN were used. The results show that the proposed DC-LSTM RNN method yields an excellent mean RMSE value of 0.616. The proposed method is used for predicting significant activity considering the surrounding circumstances and user status utilizing the existing standardized activity prediction services. It can also be used to predict user physical activity and provide personalized healthcare based on the data collectable from mobile host devices.
引用
收藏
页码:2060 / 2077
页数:18
相关论文
共 30 条
[1]  
[Anonymous], CLUSTER COMPUTING
[2]  
[Anonymous], MULTIMEDIA TOOLS APP
[3]  
Bartlett G, 2017, PROGRESS AND CHALLENGES IN PRECISION MEDICINE, P101, DOI 10.1016/B978-0-12-809411-2.00005-2
[4]  
Cadena C, 2016, Robotics: Science and systems, P1
[5]   An CNN-LSTM Attention Approach to Understanding User Query Intent from Online Health Communities [J].
Cai, Ruichu ;
Zhu, Binjun ;
Liu, Wenyin ;
Ji, Lei ;
Yan, Jun ;
Hao, Tianyong .
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, :430-437
[6]   Interactive Design Recommendation Using Sensor Based Smart Wear and Weather WebBot [J].
Chung, Kyung-Yong ;
Na, Young-Joo ;
Lee, Jung-Hyun .
WIRELESS PERSONAL COMMUNICATIONS, 2013, 73 (02) :243-256
[7]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[8]   Deep learning with long short-term memory networks for financial market predictions [J].
Fischer, Thomas ;
Krauss, Christopher .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 270 (02) :654-669
[9]   Learning precise timing with LSTM recurrent networks [J].
Gers, FA ;
Schraudolph, NN ;
Schmidhuber, J .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (01) :115-143
[10]   Forecasting stock markets using wavelet transforms and recurrent neural networks: An integrated system based on artificial bee colony algorithm [J].
Hsieh, Tsung-Jung ;
Hsiao, Hsiao-Fen ;
Yeh, Wei-Chang .
APPLIED SOFT COMPUTING, 2011, 11 (02) :2510-2525