Human activity recognition for efficient human-robot collaboration

被引:7
作者
Zhdanova, M. [1 ]
Voronin, V. [1 ]
Semenishchev, E. [1 ]
Ilyukhin, Yu [1 ]
Zelensky, A. [1 ]
机构
[1] Moscow State Univ Technol STANKIN, Ctr Cognit Technol & Machine Vis, Vadkovsky Line 1, Moscow 127055, Russia
来源
ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS II | 2020年 / 11543卷
关键词
action recognition; human activity; descriptor; machine vision systems; human-robot collaboration;
D O I
10.1117/12.2574133
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A crucial technology in modern smart manufacturing is the human-robot collaboration (HRC) concept. In the HRC, operators, and robots unite and collaborate to perform complex tasks in a variety of scenarios, heterogeneous and dynamic conditions. A unique role in the implementation of the HRC model, as a means of sensation, is assigned to machine vision systems. It provides the receipt and processing of visual information about the environment, the analysis of images of the working area, the transfer of this information to the control system, and decision-making within the framework of the task. Thus, the task of recognizing the actions of a human-operator for the development of a robot control system in order to implement an effective HRC system becomes relevant. The operator commands fed to the robot can have a variety of forms: from simple and concrete to quite abstract. This introduces several difficulties when the implementation of automated recognition systems in real conditions; this is a heterogeneous background, an uncontrolled work environment, irregular lighting, etc. In the article, we present an algorithm for constructing a video descriptor and solve the problem of classifying a set of actions into predefined classes. The proposed algorithm is based on capturing three-dimensional sub-volumes located inside a video sequence patch and calculating the difference in intensities between these sub-volumes. Video patches and central coordinates of sub-volumes are built on the principle of VLBP. Such a representation of three-dimensional blocks (patches) of a video sequence by capturing sub-volumes, inside each patch, in several scales and orientations, leads to an informative description of the scene and the actions taking place in it. Experimental results showed the effectiveness of the proposed algorithm on known data sets.
引用
收藏
页数:11
相关论文
共 21 条
  • [1] Ahonen T, 2004, LECT NOTES COMPUT SC, V3021, P469
  • [2] Face description with local binary patterns:: Application to face recognition
    Ahonen, Timo
    Hadid, Abdenour
    Pietikainen, Matti
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) : 2037 - 2041
  • [3] Chen C, 2015, IEEE IMAGE PROC, P168, DOI 10.1109/ICIP.2015.7350781
  • [4] Orthogonal combination of local binary patterns for dynamic texture recognition
    Chen, Yin
    Guo, Xuejun
    Klein, Dominik A.
    [J]. MIPPR 2015: PATTERN RECOGNITION AND COMPUTER VISION, 2015, 9813
  • [5] Contemporary state and outlook for development of metrological assurance in the machine-building industry
    Grigoriev, S. N.
    Masterenko, D. A.
    Teleshevskii, V. I.
    Emelyanov, P. N.
    [J]. MEASUREMENT TECHNIQUES, 2013, 55 (11) : 1311 - 1315
  • [6] An ARM-based Multi-channel CNC Solution for Multi-tasking Turning and Milling Machines
    Grigoriev, Sergej N.
    Martinov, Georgi M.
    [J]. 7TH HPC 2016 - CIRP CONFERENCE ON HIGH PERFORMANCE CUTTING, 2016, 46 : 525 - 528
  • [7] Scalable open cross-platform kernel of PCNC system for multi-axis machine tool
    Grigoriev, Sergej N.
    Martinov, Georgi M.
    [J]. FIFTH CIRP CONFERENCE ON HIGH PERFORMANCE CUTTING 2012, 2012, 1 : 238 - 243
  • [8] Johnson S., 2010, BMVC, V2
  • [9] Logarithmic Image Processing - The mathematical and physical framework for the representation and processing of transmitted images
    Jourlin, M
    Pinoli, JC
    [J]. ADVANCES IN IMAGING AND ELECTRON PHYSICS, VOL 115, 2001, 115 : 129 - 196
  • [10] Kellokumpu V., 2008, P BMVC, V1