Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment

被引:56
|
作者
Wang, Xiaohan [1 ,2 ]
Zhu, Linchao [2 ]
Wu, Yu [1 ,2 ]
Yang, Yi [2 ]
机构
[1] Baidu Res, Beijing 100193, Peoples R China
[2] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Sydney, NSW 2007, Australia
关键词
Feature extraction; Cognition; Three-dimensional displays; Symbiosis; Task analysis; Two dimensional displays; Solid modeling; Egocentric video analysis; action recognition; deep learning; symbiotic attention;
D O I
10.1109/TPAMI.2020.3015894
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to tackle egocentric action recognition by suppressing background distractors and enhancing action-relevant interactions. The existing approaches usually utilize two independent branches to recognize egocentric actions, i.e., a verb branch and a noun branch. However, the mechanism to suppress distracting objects and exploit local human-object correlations is missing. To this end, we introduce two extra sources of information, i.e., the candidate objects spatial location and their discriminative features, to enable concentration on the occurring interactions. We design a Symbiotic Attention with Object-centric feature Alignment framework (SAOA) to provide meticulous reasoning between the actor and the environment. First, we introduce an object-centric feature alignment method to inject the local object features to the verb branch and noun branch. Second, we propose a symbiotic attention mechanism to encourage the mutual interaction between the two branches and select the most action-relevant candidates for classification. The framework benefits from the communication among the verb branch, the noun branch, and the local object information. Experiments based on different backbones and modalities demonstrate the effectiveness of our method. Notably, our framework achieves the state-of-the-art on the largest egocentric video dataset.
引用
收藏
页码:6605 / 6617
页数:13
相关论文
共 50 条
  • [21] Permission Analysis for Object-Centric Processes
    Breitmayer, Marius
    Arnold, Lisa
    Reichert, Manfred
    INTELLIGENT INFORMATION SYSTEMS, CAISE FORUM 2024, 2024, 520 : 11 - 19
  • [22] Discovery of Object-Centric Declarative Models
    Christfort, Axel K. F.
    Rivkin, Audrey
    Fahland, Dirk
    Hildebrandt, Thomas T.
    Slaats, Tijs
    2024 6TH INTERNATIONAL CONFERENCE ON PROCESS MINING, ICPM, 2024, : 137 - 144
  • [23] Object-centric process predictive analytics
    Galanti, Riccardo
    De Leoni, Massimiliano
    Navarin, Nicola
    Marazzi, Alan
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [24] Discovery of Object-Centric Declarative Models
    Christfort, Axel K. F.
    Rivkin, Andrey
    Fahland, Dirk
    Hildebrandt, Thomas T.
    Slaats, Tijs
    2024 6TH INTERNATIONAL CONFERENCE ON PROCESS MINING, ICPM, 2024, : 121 - 128
  • [25] Provably Learning Object-Centric Representations
    Brady, Jack
    Zimmermann, Roland S.
    Sharma, Yash
    Schoelkopf, Bernhard
    von Kuegelgen, Julius
    Brendel, Wieland
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [26] Object-Centric Conformance Alignments with Synchronization
    Gianola, Alessandro
    Montali, Marco
    Winkler, Sarah
    ADVANCED INFORMATION SYSTEMS ENGINEERING, CAISE 2024, 2024, 14663 : 3 - 19
  • [27] OCVOS: OBJECT-CENTRIC REPRESENTATION FOR VIDEO OBJECT SEGMENTATION
    Jo, Junho
    Wee, Dongyoon
    Cho, Nam Ik
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1655 - 1659
  • [28] Time-traveling object-centric breakpoints
    Bourcier, Valentin
    Costiou, Steven
    Santander, Maximilian Ignacio Willembrinck
    Vanegue, Adrien
    Etien, Anne
    JOURNAL OF COMPUTER LANGUAGES, 2024, 80
  • [29] Deep Object-Centric Policies for Autonomous Driving
    Wang, Dequan
    Devin, Coline
    Cai, Qi-Zhi
    Yu, Fisher
    Darrell, Trevor
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8853 - 8859
  • [30] Manifold geometric invariants and object-centric approach
    Jannson, TP
    APPLICATIONS AND SCIENCE OF NEURAL NETWORKS, FUZZY SYSTEMS, AND EVOLUTIONARY COMPUTATION V, 2002, 4787 : 158 - 173