SmallTAL: Real-Time Egocentric Online Temporal Action Localization for the Data-Impoverished

被引:0
作者
Joyce, Eric C. [1 ]
Chen, Yao [2 ]
Neeter, Eduardo [2 ]
Mordohai, Philippos [1 ]
机构
[1] Stevens Inst Technol, Dept Comp Sci, Hoboken, NJ 07030 USA
[2] HyperTunnel, Basingstoke, England
来源
PRESENCE-VIRTUAL AND AUGMENTED REALITY | 2023年 / 32卷
关键词
SEQUENCE; NETWORK;
D O I
10.1162/pres_a_00408
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a real-time, online temporal action localization system that requires a small amount of annotated data. The main challenges we address are high intra-class variability and a large and diverse background class. We address these using a flexible frame descriptor, dynamic time warping, and a novel approach to database construction. Our solution receives egocentric RGB-D streams as input and makes predictions at regular temporal intervals. We validate our approach by localizing actions in a digital twin of an electrical substation, in which certain objects have been replaced by functional virtual replicas.
引用
收藏
页码:179 / 203
页数:25
相关论文
共 96 条
  • [1] Abraham M., 2017, Harvard Business Review, V13, P1
  • [2] MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation
    Abu Farha, Yazan
    Gall, Juergen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3570 - 3579
  • [3] Unsupervised Learning from Narrated Instruction Videos
    Alayrac, Jean-Baptiste
    Bojanowski, Piotr
    Agrawal, Nishant
    Sivic, Josef
    Laptev, Ivan
    Lacoste-Julien, Simon
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4575 - 4583
  • [4] Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation
    Behrmann, Nadine
    Golestaneh, S. Alireza
    Kolter, Zico
    Gall, Jurgen
    Noroozi, Mehdi
    [J]. COMPUTER VISION - ECCV 2022, PT XXXV, 2022, 13695 : 52 - 68
  • [5] Berndt D.J., 1994, KDD WORKSH, V10, P359
  • [6] Buch S., 2019, BRIT MACH VIS C
  • [7] Heilbron FC, 2015, PROC CVPR IEEE, P961, DOI 10.1109/CVPR.2015.7298698
  • [8] Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
    Carreira, Joao
    Zisserman, Andrew
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4724 - 4733
  • [9] Rethinking the Faster R-CNN Architecture for Temporal Action Localization
    Chao, Yu-Wei
    Vijayanarasimhan, Sudheendra
    Seybold, Bryan
    Ross, David A.
    Deng, Jia
    Sukthankar, Rahul
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1130 - 1139
  • [10] GateHUB: Gated History Unit with Background Suppression for Online Action Detection
    Chen, Junwen
    Mittal, Gaurav
    Yu, Ye
    Kong, Yu
    Chen, Mei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19893 - 19902