Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment

被引:56
|
作者
Wang, Xiaohan [1 ,2 ]
Zhu, Linchao [2 ]
Wu, Yu [1 ,2 ]
Yang, Yi [2 ]
机构
[1] Baidu Res, Beijing 100193, Peoples R China
[2] Univ Technol Sydney, Australian Artificial Intelligence Inst, ReLER Lab, Sydney, NSW 2007, Australia
关键词
Feature extraction; Cognition; Three-dimensional displays; Symbiosis; Task analysis; Two dimensional displays; Solid modeling; Egocentric video analysis; action recognition; deep learning; symbiotic attention;
D O I
10.1109/TPAMI.2020.3015894
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose to tackle egocentric action recognition by suppressing background distractors and enhancing action-relevant interactions. The existing approaches usually utilize two independent branches to recognize egocentric actions, i.e., a verb branch and a noun branch. However, the mechanism to suppress distracting objects and exploit local human-object correlations is missing. To this end, we introduce two extra sources of information, i.e., the candidate objects spatial location and their discriminative features, to enable concentration on the occurring interactions. We design a Symbiotic Attention with Object-centric feature Alignment framework (SAOA) to provide meticulous reasoning between the actor and the environment. First, we introduce an object-centric feature alignment method to inject the local object features to the verb branch and noun branch. Second, we propose a symbiotic attention mechanism to encourage the mutual interaction between the two branches and select the most action-relevant candidates for classification. The framework benefits from the communication among the verb branch, the noun branch, and the local object information. Experiments based on different backbones and modalities demonstrate the effectiveness of our method. Notably, our framework achieves the state-of-the-art on the largest egocentric video dataset.
引用
收藏
页码:6605 / 6617
页数:13
相关论文
共 50 条
  • [1] Object-Centric Spatio-Temporal Pyramids for Egocentric Activity Recognition
    McCandless, Tomas
    Grauman, Kristen
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [2] Symbiotic Attention with Privileged Information for Egocentric Action Recognition
    Wang, Xiaohan
    Wu, Yu
    Zhu, Linchao
    Yang, Yi
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12249 - 12256
  • [3] Object-Centric Learning with Slot Attention
    Locatello, Francesco
    Weissenborn, Dirk
    Unterthiner, Thomas
    Mahendran, Aravindh
    Heigold, Georg
    Uszkoreit, Jakob
    Dosovitskiy, Alexey
    Kipf, Thomas
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] Point Cloud Registration With Object-Centric Alignment
    Zagar, Bare Luka
    Yurtsever, Ekim
    Peters, Arne
    Knoll, Alois C.
    IEEE ACCESS, 2022, 10 : 76586 - 76595
  • [5] ALBUM-BASED OBJECT-CENTRIC EVENT RECOGNITION
    Tsai, Shen-Fu
    Huang, Thomas S.
    Tang, Feng
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [6] Object-Centric Debugging
    Ressia, Jorge
    Bergel, Alexandre
    Nierstrasz, Oscar
    2012 34TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2012, : 485 - 495
  • [7] Object-Centric Multiple Object Tracking
    Zhao, Zixu
    Wang, Jiaze
    Horn, Max
    Ding, Yizhuo
    He, Tong
    Bai, Zechen
    Zietlow, Dominik
    Simon-Gabriel, Carl-Johann
    Shuai, Bing
    Tu, Zhuowen
    Brox, Thomas
    Schiele, Bernt
    Fu, Yanwei
    Locatello, Francesco
    Zhang, Zheng
    Xiao, Tianjun
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 16555 - 16565
  • [8] Rearrangement planning using object-centric and robot-centric action spaces
    King, Jennifer E.
    Cognetti, Marco
    Srinivasa, Siddhartha S.
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 3940 - 3947
  • [9] Learning global object-centric representations via disentangled slot attention
    Chen, Tonglin
    Huang, Yinxuan
    Shen, Zhimeng
    Huang, Jinghao
    Li, Bin
    Xue, Xiangyang
    MACHINE LEARNING, 2025, 114 (02)
  • [10] Object-Centric Slot Diffusion
    Jiang, Jindong
    Deng, Fei
    Singh, Gautam
    Ahn, Sungjin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,