Spatial-temporal human gesture recognition under degraded conditions using three-dimensional integral imaging

被引:17
|
作者
Shen, Xin [1 ]
Kim, Hee-seung [1 ]
Satoru, Komatsu [1 ]
Markman, Adam [1 ]
Javidi, Bahram [1 ]
机构
[1] Univ Connecticut, Dept Elect & Comp Engn, Storrs, CT 06269 USA
来源
OPTICS EXPRESS | 2018年 / 26卷 / 11期
关键词
RESOLUTION LIMITATION; SCATTERING MEDIUM; VISUALIZATION; PHOTOGRAPHY; OBJECTS; DISPLAY; FILTERS; SYSTEM; TARGET; NOISE;
D O I
10.1364/OE.26.013938
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We present spatial-temporal human gesture recognition in degraded conditions including low light levels and occlusions using passive sensing three-dimensional (3D) integral imaging (InIm) system and 3D correlation filters. The 4D (lateral, longitudinal, and temporal) reconstructed data is processed using a variety of algorithms including linear and non-linear distortion-invariant filters; and compared with previously reported space-time interest points (STIP) feature detector, 3D histogram of oriented gradients (3D HOG) feature descriptor, with a standard bag-of-features support vector machine (SVM) framework, etc. The gesture recognition results with different classification algorithms are compared using a variety of performance metrics such as receiver operating characteristic (ROC) curves, area under the curve (AUC), SNR, the probability of classification errors, and confusion matrix. Integral imaging video sequences of human gestures are captured under degraded conditions such as low light illumination and in the presence of partial occlusions. A four-dimensional (4D) reconstructed video sequence is computed that provides lateral and depth information of a scene over time i.e. (x, y, z, t). The total-variation denoising algorithm is applied to the signal to further reduce noise and preserve data in the video frames. We show that the 4D signal consists of decreased scene noise, partial occlusion removal, and improved SNR due to the computational InIm and/or denoising algorithms. Finally, gesture recognition is processed with classification algorithms, such as distortion-invariant correlation filters; and STIP, 3D HOG with SVM, which are applied to the reconstructed 4D gesture signal to classify the human gesture. Experiments are conducted using a synthetic aperture InIm system in ambient light. Our experiments indicate that the proposed approach is promising in detection of human gestures in degraded conditions such as low illumination conditions with partial occlusion. To the best of our knowledge, this is the first report on spatial-temporal human gesture recognition in degraded conditions using passive sensing 4D integral imaging with non-linear correlation filters. (C) 2018 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
引用
收藏
页码:13938 / 13951
页数:14
相关论文
共 48 条
  • [1] An overview of spatial-temporal human gesture recognition under degraded environments using integral imaging
    Shen, Xin
    Kim, Hee-Seung
    Komatsu, Satoru
    Markman, Adam
    Javidi, Bahram
    THREE-DIMENSIONAL IMAGING, VISUALIZATION, AND DISPLAY 2019, 2019, 10997
  • [2] Human gesture recognition using three-dimensional integral imaging
    Javier Traver, V.
    Latorre-Carmona, Pedro
    Salvador-Balaguer, Eva
    Pla, Filiberto
    Javidi, Bahram
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2014, 31 (10) : 2312 - 2320
  • [3] Human gesture recognition under degraded environments using 3D-integral imaging and deep learning
    Krishnan, Gokul
    Joshi, Rakesh
    O'Connor, Timothy
    Pla, Filiberto
    Javidi, Bahram
    OPTICS EXPRESS, 2020, 28 (13) : 19711 - 19725
  • [4] Three-dimensional object-distortion-tolerant recognition for integral imaging using independent component analysis
    Do, Cuong Manh
    Martinez-Cuenca, Raul
    Javidi, Bahram
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2009, 26 (02) : 245 - 251
  • [5] Three-Dimensional Holographic Display Using Dense Ray Sampling and Integral Imaging Capture
    Xiao, Xiao
    Wakunami, Koki
    Chen, Xiaoxi
    Shen, Xin
    Javidi, Bahram
    Kim, Jinwoong
    Nam, Jeho
    JOURNAL OF DISPLAY TECHNOLOGY, 2014, 10 (08): : 688 - 694
  • [6] Fundamentals of automated human gesture recognition using 3D integral imaging: a tutorial
    Javidi, Bahram
    Pla, Filiberto
    Sotoca, Jose M.
    Shen, Xin
    Latorre-Carmona, Pedro
    Martinez-Corral, Manuel
    Fernandez-Beltran, Ruben
    Krishnan, Gokul
    ADVANCES IN OPTICS AND PHOTONICS, 2020, 12 (04) : 1237 - 1299
  • [7] Three-Dimensional Optical Sensing and Visualization Using Integral Imaging
    Cho, Myungjin
    Daneshpanah, Mehdi
    Moon, Inkyu
    Javidi, Bahram
    PROCEEDINGS OF THE IEEE, 2011, 99 (04) : 556 - 575
  • [8] Three-Dimensional Digital Zooming of Integral Imaging under Photon-Starved Conditions
    Yeo, Gilsu
    Cho, Myungjin
    SENSORS, 2023, 23 (05)
  • [9] Experiments With Three-Dimensional Integral Imaging Under Low Light Levels
    Stern, Adrian
    Aloni, Doron
    Javidi, Bahram
    IEEE PHOTONICS JOURNAL, 2012, 4 (04): : 1188 - 1195
  • [10] Deep learning polarimetric three-dimensional integral imaging object recognition in adverse environmental conditions
    Usmani, Kashif
    Krishnan, Gokul
    O'Connor, Timothy
    Javidi, Bahram
    OPTICS EXPRESS, 2021, 29 (08) : 12215 - 12228