Temporal graph convolutional network for multi-agent reinforcement learning of action detection

被引:0
|
作者
Wang, Liangliang [1 ,2 ]
Liu, Jiayao [3 ]
Wang, Ke [4 ]
Ge, Lianzheng [4 ]
Liang, Peidong [5 ,6 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China
[2] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401120, Peoples R China
[3] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
[4] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China
[5] Fujian Quanzhou Inst Adv Mfg Technol, Quanzhou 362008, Peoples R China
[6] Fujian Key Lab Intelligent Operat & Maintenance Ro, Quanzhou 362008, Peoples R China
关键词
Action detection; Action spatio-temporal representation; Deep reinforcement learning; Graph convolutional network; Attention mechanism;
D O I
10.1016/j.asoc.2024.111916
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most action detection techniques process untrimmed action videos using temporal context, without explicitly consider the inherent spatio-temporal context information, leading to limited accuracy in the cases with big spatial complexity and temporal redundancy. To this end, by intuitively dealing with the problem following a two-stage "clip selection + clip classification" scheme, this paper proposes to formulate action detection as a Markov process and builds up a multi-agent reinforcement learning framework capturing global structural relationships of videos to optimize the selection and classification, simultaneously and progressively. In particular, a temporal graph convolutional network is constructed to represent the spatio-temporal correlations of video clips, which are initialized by evenly sampling, and further adjusted via learning the rewards adaptively for multi-agent cooperation. Multi-head dot-product attention mechanism is adopted to integrate the relations of latent CNN features of interacting agents. Our framework is jointly learnt by fusing the objectives of clip selection policy and clip recognition. The proposed method comprises a novel graph convolutional network based spatio-temporal semantic observation module which captures topological features among nearby agents, and a new policy module that segments actions according to the rewards from the objectives of action recognition. Extensive experiments are conducted on ActivityNet v1.3 and THUMOS14, with 30.13% and 55.4% mAP obtained, demonstrate the applicability and superiority of our approach.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
    Malysheva, Aleksandra
    Kudenko, Daniel
    Shpilman, Aleksei
    2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
  • [2] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
    Dai, Anna
    Li, Rongpeng
    Zhaot, Zhifeng
    Zhang, Honggang
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
  • [3] Routing with Graph Convolutional Networks and Multi-Agent Deep Reinforcement Learning
    Bhavanasi, Sai Shreyas
    Pappone, Lorenzo
    Esposito, Flavio
    2022 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS (IEEE NFV-SDN), 2022, : 72 - 77
  • [4] Multi-Agent Graph Convolutional Reinforcement Learning for Intelligent Load Balancing
    Houidi, Omar
    Bakri, Sihem
    Zeghlache, Djamal
    PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
  • [5] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
    Jing, Xuan
    Yao, Xifan
    Liu, Min
    Zhou, Jiajun
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (01) : 75 - 93
  • [6] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
    Xuan Jing
    Xifan Yao
    Min Liu
    Jiajun Zhou
    Journal of Intelligent Manufacturing, 2024, 35 : 75 - 93
  • [7] Multi-hop temporal knowledge graph reasoning with multi-agent reinforcement learning
    Bai, Luyi
    Chen, Mingzhuo
    Xiao, Qianwen
    APPLIED SOFT COMPUTING, 2024, 160
  • [8] Multi-Agent Graph Convolutional Reinforcement Learning for Dynamic Electric Vehicle Charging Pricing
    Zhang, Weijia
    Liu, Hao
    Han, Jindong
    Ge, Yong
    Xiong, Hui
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2471 - 2481
  • [9] GRAPHCOMM: A GRAPH NEURAL NETWORK BASED METHOD FOR MULTI-AGENT REINFORCEMENT LEARNING
    Shen, Siqi
    Fu, Yongquan
    Su, Huayou
    Pan, Hengyue
    Qiao, Peng
    Dou, Yong
    Wang, Cheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3510 - 3514
  • [10] Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
    Ma, Xiaobai
    Isele, David
    Gupta, Jayesh K.
    Fujimura, Kikuo
    Kochenderfer, Mykel J.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7664 - 7671