A Vision-Based System for Monitoring Elderly People at Home

被引:40
作者
Buzzelli, Marco [1 ]
Albe, Alessio [1 ]
Ciocca, Gianluigi [1 ]
机构
[1] Univ Milano Bicocca, Dept Comp Sci Syst & Commun, Viale Sarca 336, I-20126 Milan, Italy
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 01期
关键词
computer vision; action recognition; deep learning; internet of things; assisted living; ACTION RECOGNITION; CARE;
D O I
10.3390/app10010374
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Assisted living technologies can be of great importance for taking care of elderly people and helping them to live independently. In this work, we propose a monitoring system designed to be as unobtrusive as possible, by exploiting computer vision techniques and visual sensors such as RGB cameras. We perform a thorough analysis of existing video datasets for action recognition, and show that no single dataset can be considered adequate in terms of classes or cardinality. We subsequently curate a taxonomy of human actions, derived from different sources in the literature, and provide the scientific community with considerations about the mutual exclusivity and commonalities of said actions. This leads us to collecting and publishing an aggregated dataset, called ALMOND (Assisted Living MONitoring Dataset), which we use as the training set for a vision-based monitoring approach.We rigorously evaluate our solution in terms of recognition accuracy using different state-of-the-art architectures, eventually reaching 97% on inference of basic poses, 83% on alerting situations, and 71% on daily life actions. We also provide a general methodology to estimate the maximum allowed distance between camera and monitored subject. Finally, we integrate the defined actions and the trained model into a computer-vision-based application, specifically designed for the objective of monitoring elderly people at their homes.
引用
收藏
页数:25
相关论文
共 62 条
  • [11] Ciocca G., 2006, J REAL-TIME IMAGE PR, V1, P69, DOI DOI 10.1007/s11554-012-0278-1
  • [12] Elder Tracking and Fall Detection System Using Smart Tiles
    Daher, Mohamad
    Diab, Ahmad
    El Najjar, Maan El Badaoui
    Khalil, Mohamad Ali
    Charpillet, Francois
    [J]. IEEE SENSORS JOURNAL, 2017, 17 (02) : 469 - 479
  • [13] Diba A., 2017, Temporal 3D ConvNets: New architecture and transfer learning for video classification
  • [14] Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
  • [15] Learning Spatiotemporal Features with 3D Convolutional Networks
    Du Tran
    Bourdev, Lubomir
    Fergus, Rob
    Torresani, Lorenzo
    Paluri, Manohar
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
  • [16] European Commission - Economic and Financial Affairs, 2018, 2018 AG REP
  • [17] Convolutional Two-Stream Network Fusion for Video Action Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Zisserman, Andrew
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1933 - 1941
  • [18] García-Herranz M, 2010, J UNIVERS COMPUT SCI, V16, P1633
  • [19] Large-scale weakly-supervised pre-training for video action recognition
    Ghadiyaram, Deepti
    Du Tran
    Mahajan, Dhruv
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12038 - 12047
  • [20] An Elderly Health Care System Using Wireless Sensor Networks at Home
    Huo, Hongwei
    Hu, Youzhi
    Yan, Hairong
    Mubeen, Saad
    Zhang, Hongke
    [J]. 2009 3RD INTERNATIONAL CONFERENCE ON SENSOR TECHNOLOGIES AND APPLICATIONS (SENSORCOMM 2009), 2009, : 158 - +