Object-Centric Spatio-Temporal Pyramids for Egocentric Activity Recognition

Cited by: 24
Authors
McCandless, Tomas [1]
Grauman, Kristen [1]
Affiliations
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
Source
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013 | 2013
DOI
10.5244/C.27.30
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Activities in egocentric video are largely defined by the objects with which the camera wearer interacts, making representations that summarize the objects in view quite informative. Beyond simply recording how frequently each object occurs in a single histogram, spatio-temporal binning approaches can capture the objects' relative layout and ordering. However, existing methods use hand-crafted binning schemes (e.g., a uniformly spaced pyramid of partitions), which may fail to capture the relationships that best distinguish certain activities. We propose to learn the spatio-temporal partitions that are discriminative for a set of egocentric activity classes. We devise a boosting approach that automatically selects a small set of useful spatio-temporal pyramid histograms among a randomized pool of candidate partitions. In order to efficiently focus the candidate partitions, we further propose an "object-centric" cutting scheme that prefers sampling bin boundaries near those objects prominently involved in the egocentric activities. In this way, we specialize the randomized pool of partitions to the egocentric setting and improve the training efficiency for boosting. Our approach yields state-of-the-art accuracy for recognition of challenging activities of daily living.
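The two ingredients the abstract names, spatio-temporal binning of object occurrences and an object-biased choice of bin boundaries, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `(object_id, x, y, t)` detection format with coordinates normalized to [0, 1), and the `jitter` and `p_object` parameters are all assumptions made for illustration, and the boosting step that selects among candidate partitions is omitted.

```python
import random
from bisect import bisect_right


def bin_index(value, cuts):
    """Index of the bin containing `value`, given sorted cut points in (0, 1)."""
    return bisect_right(cuts, value)


def binned_histogram(detections, num_objects, x_cuts, y_cuts, t_cuts):
    """Concatenate one object-count histogram per spatio-temporal bin.

    detections: iterable of (object_id, x, y, t), with x, y, t in [0, 1)
    (a hypothetical format chosen for this sketch).
    """
    nx, ny, nt = len(x_cuts) + 1, len(y_cuts) + 1, len(t_cuts) + 1
    hist = [0] * (nx * ny * nt * num_objects)
    for obj, x, y, t in detections:
        # Flatten the (t, y, x) bin coordinates into a single cell index.
        cell = (bin_index(t, t_cuts) * ny + bin_index(y, y_cuts)) * nx + bin_index(x, x_cuts)
        hist[cell * num_objects + obj] += 1
    return hist


def sample_cut(positions, rng, jitter=0.05, p_object=0.8):
    """Sample one cut point, preferring locations near observed objects.

    With probability `p_object` the cut lands within `jitter` of a randomly
    chosen object position; otherwise it is drawn uniformly. Both parameters
    are illustrative assumptions, not values from the paper.
    """
    if positions and rng.random() < p_object:
        cut = rng.choice(positions) + rng.uniform(-jitter, jitter)
    else:
        cut = rng.random()
    return min(max(cut, 0.01), 0.99)  # keep the cut strictly inside (0, 1)


# Three detections of two object classes, binned with one cut in x and one in t.
detections = [(0, 0.2, 0.3, 0.1), (1, 0.7, 0.6, 0.1), (0, 0.25, 0.3, 0.8)]
hist = binned_histogram(detections, num_objects=2, x_cuts=[0.5], y_cuts=[], t_cuts=[0.5])
print(hist)  # [1, 0, 0, 1, 1, 0, 0, 0]
```

A single flat histogram over the same detections would record only two occurrences of object 0 and one of object 1; the binned version also records that object 0 appears on the left both early and late, while object 1 appears on the right early.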
Pages: 11
Related papers
50 in total
  • [1] Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment
    Wang, Xiaohan
    Zhu, Linchao
    Wu, Yu
    Yang, Yi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6605 - 6617
  • [2] Spatio-Temporal Object Recognition
    De Geest, Roeland
    Deboeverie, Francis
    Philips, Wilfried
    Tuytelaars, Tinne
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2015, 2015, 9386 : 681 - 692
  • [3] STFormer: Spatio-temporal former for hand-object interaction recognition from egocentric RGB video
    Liang, Jiao
    Wang, Xihan
    Yang, Jiayi
    Gao, Quanli
    ELECTRONICS LETTERS, 2024, 60 (17)
  • [4] Spatio-Temporal Phrases for Activity Recognition
    Zhang, Yimeng
    Liu, Xiaoming
    Chang, Ming-Ching
    Ge, Weina
    Chen, Tsuhan
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7574 : 707 - 721
  • [5] Spatio-temporal CNN algorithm for object segmentation and object recognition
    Schultz, A
    Rekeczky, C
    Szatmari, I
    Roska, T
    Chua, LO
    CNNA 98 - 1998 FIFTH IEEE INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS - PROCEEDINGS, 1998, : 347 - 352
  • [6] Novel Spatio-temporal Features for Fingertip Writing Recognition in Egocentric Viewpoint
    Hameed, Muhammad Zaid
    Garcia-Hernando, Guillermo
    Kim, Tae-Kyun
    2015 14TH IAPR INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA), 2015, : 484 - 488
  • [7] Spatio-temporal influences at the neural level of object recognition
    Wallis, G
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1998, 9 (02) : 265 - 278
  • [8] GATSBI: Generative Agent-centric Spatio-temporal Object Interaction
    Min, Cheol-Hui
    Bae, Jinseok
    Lee, Junho
    Kim, Young Min
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3073 - 3082
  • [9] Spatio-Temporal Context Kernel for Activity Recognition
    Yuan, Fei
    Sahbi, Hichem
    Prinet, Veronique
    2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 436 - 440
  • [10] ST(OR)²: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room
    Hamoud, Idris
    Jamal, Muhammad Abdullah
    Srivastav, Vinkle
    Mutter, Didier
    Padoy, Nicolas
    Mohareri, Omid
    Proceedings of Machine Learning Research, 2023, 227 : 1254 - 1268