Assessing impacts of data volume and data set balance in using deep learning approach to human activity recognition

被引:0
作者
Chen, Haipeng [1 ]
Xiong, Fuhai [1 ]
Wu, Dihong [1 ]
Zheng, Lingxiang [1 ]
Peng, Ao [1 ]
Hong, Xuemin [1 ]
Tang, Biyu [1 ]
Lu, Hai [1 ]
Shi, Haibin [1 ]
Zheng, Huiru [2 ]
机构
[1] Xiamen Univ, Sch Informat Sci & Engn, Xiamen, Peoples R China
[2] Ulster Univ, Sch Comp, Coleraine, Antrim, North Ireland
来源
2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM) | 2017年
关键词
human activity recognition; deep learning; LSTM; CNN;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Over the past decade, deep learning developed rapidly and had significant impact on a variety of application domains. It has been applied to the field of human activity recognition to substitute for well-established analysis techniques that rely on handcrafted feature extraction and classification methods in recent years. However, less attentions have been paid to the influence of training data on recognition accuracy. In this paper, we assessed the influence factors of data volume and data balance in human activity recognition when using deep learning approaches. We evaluated the relationship between data volumes of training dataset and predict accuracy of deep learning algorithms. Given the impact of the data balance between activity categories on the recognition accuracy, we modified the SMOTE algorithm so that it can be applied to human activity recognition. Results show that when the data volume is small (< 4M), the recognition accuracy increased quickly with the increase of the quantity of training data. However, the growth trend of recognition accuracy slows down when the data quantity reaches 4 million. Further increase the data volume does not significantly improve the activity recognition performance. So we can conclude that 4 million data volume can ensure a sufficient accuracy for human activity recognition. Meanwhile, the data set balance operation can not only improve the recognition accuracy of minority categories, but also helps to increase the overall accuracy.
引用
收藏
页码:1160 / 1165
页数:6
相关论文
共 22 条
  • [1] Anguita D., 2013, ESANN, V3, P3
  • [2] [Anonymous], 2017, ENSEMBLES DEEP LSTM
  • [3] [Anonymous], 1997, Neural Computation
  • [4] [Anonymous], 2016, J SCI COMPUT
  • [5] Bachlin Marc, 2009, ISWC
  • [6] Activity recognition from user-annotated acceleration data
    Bao, L
    Intille, SS
    [J]. PERVASIVE COMPUTING, PROCEEDINGS, 2004, 3001 : 1 - 17
  • [7] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [8] Chen L., 2012, IEEE T SYST MAN CYB, V42
  • [9] Chen Y., 2016, 2016 INT C ART INT T
  • [10] A Deep Learning Approach to Human Activity Recognition Based on Single Accelerometer
    Chen, Yuqing
    Xue, Yang
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1488 - 1492