Social Activity Recognition on Continuous RGB-D Video Sequences

被引:13
作者
Coppola, Claudio [1 ]
Cosar, Serhan [2 ]
Faria, Diego R. [3 ]
Bellotto, Nicola [2 ]
机构
[1] Queen Mary Univ London, Mile End Rd, London E1 4NS, England
[2] Univ Lincoln, Lincoln LN6 7TS, England
[3] Aston Univ, Aston Express Way, Birmingham B4 7ET, W Midlands, England
基金
欧盟地平线“2020”;
关键词
Social activity recognition; Activity recognition; Activity temporal segmentation; Machine learning; ACTIONLET ENSEMBLE; FEATURES;
D O I
10.1007/s12369-019-00541-y
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Modern service robots are provided with one or more sensors, often including RGB-D cameras, to perceive objects and humans in the environment. This paper proposes a new system for the recognition of human social activities from a continuous stream of RGB-D data. Many of the works until now have succeeded in recognising activities from clipped videos in datasets, but for robotic applications it is important to be able to move to more realistic scenarios in which such activities are not manually selected. For this reason, it is useful to detect the time intervals when humans are performing social activities, the recognition of which can contribute to trigger human-robot interactions or to detect situations of potential danger. The main contributions of this research work include a novel system for the recognition of social activities from continuous RGB-D data, combining temporal segmentation and classification, as well as a model for learning the proximity-based priors of the social activities. A new public dataset with RGB-D videos of social and individual activities is also provided and used for evaluating the proposed solutions. The results show the good performance of the system in recognising social activities from continuous RGB-D data.
引用
收藏
页码:201 / 215
页数:15
相关论文
共 40 条
[1]  
[Anonymous], 1990, CONDUCTING INTERACTI
[2]  
[Anonymous], 2011, P BRIT MACH VIS C BM
[3]  
[Anonymous], 2017, P IEEE C COMP VIS PA, DOI [DOI 10.1109/CVPR.2017.143, 10.48550/arxiv.1611.08050, DOI 10.48550/ARXIV.1611.08050]
[4]  
[Anonymous], 2018, DENSEPOSE DENSE HUMA
[5]  
[Anonymous], 2016, IEEE C COMP VIS PATT
[6]  
[Anonymous], 2004, THESIS
[7]  
[Anonymous], 2014, IEEE RO MAN 14
[8]   Active Learning of an Action Detector from Untrimmed Videos [J].
Bandla, Sunil ;
Grauman, Kristen .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1833-1840
[9]   Social interactions by visual focus of attention in a three-dimensional environment [J].
Bazzani, L. ;
Cristani, M. ;
Tosato, D. ;
Farenzena, M. ;
Paggetti, G. ;
Menegaz, G. ;
Murino, V. .
EXPERT SYSTEMS, 2013, 30 (02) :115-127
[10]  
Chakraborty I, 2013, IEEE CVPR