Temporal graph convolutional network for multi-agent reinforcement learning of action detection

Citations: 0

Authors:

Wang, Liangliang [1,2]

Liu, Jiayao [3]

Wang, Ke [4]

Ge, Lianzheng [4]

Liang, Peidong [5,6]

Affiliations:

[1] Wuhan Univ Technol, Sch Comp Sci & Artificial Intelligence, Wuhan 430070, Peoples R China

[2] Wuhan Univ Technol, Chongqing Res Inst, Chongqing 401120, Peoples R China

[3] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China

[4] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin 150001, Peoples R China

[5] Fujian Quanzhou Inst Adv Mfg Technol, Quanzhou 362008, Peoples R China

[6] Fujian Key Lab Intelligent Operat & Maintenance Ro, Quanzhou 362008, Peoples R China

Source:

APPLIED SOFT COMPUTING | 2024 / Volume 163

Keywords:

Action detection; Action spatio-temporal representation; Deep reinforcement learning; Graph convolutional network; Attention mechanism;

DOI:

10.1016/j.asoc.2024.111916

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most action detection techniques process untrimmed action videos using temporal context without explicitly considering the inherent spatio-temporal context information, which limits accuracy in cases with high spatial complexity and temporal redundancy. To address this, the paper follows a two-stage "clip selection + clip classification" scheme, formulates action detection as a Markov process, and builds a multi-agent reinforcement learning framework that captures the global structural relationships of videos to optimize selection and classification simultaneously and progressively. In particular, a temporal graph convolutional network is constructed to represent the spatio-temporal correlations of video clips, which are initialized by even sampling and further adjusted by adaptively learning the rewards for multi-agent cooperation. A multi-head dot-product attention mechanism is adopted to integrate the relations among the latent CNN features of interacting agents. The framework is learned jointly by fusing the objectives of the clip selection policy and clip recognition. The proposed method comprises a novel graph convolutional network based spatio-temporal semantic observation module, which captures topological features among nearby agents, and a new policy module that segments actions according to the rewards from the action recognition objectives. Extensive experiments on ActivityNet v1.3 and THUMOS14, yielding 30.13% and 55.4% mAP respectively, demonstrate the applicability and superiority of the approach.
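The two mechanical ingredients named in the abstract, evenly sampled initial clips and a graph convolution over temporally adjacent clips, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the chain-shaped adjacency (each clip linked only to its immediate temporal neighbours), the symmetric normalization, and the ReLU activation are all assumptions chosen for illustration, and the paper's reward-driven clip adjustment and attention module are not modelled here.

```python
import math


def evenly_sample_clips(num_frames, num_clips, clip_len):
    """Return (start, end) frame ranges for clips spread evenly over a video."""
    if num_clips < 1 or clip_len < 1 or clip_len > num_frames:
        raise ValueError("need 1 <= clip_len <= num_frames and num_clips >= 1")
    if num_clips == 1:
        starts = [(num_frames - clip_len) // 2]
    else:
        step = (num_frames - clip_len) / (num_clips - 1)
        starts = [round(i * step) for i in range(num_clips)]
    return [(s, s + clip_len) for s in starts]


def temporal_adjacency(n):
    """Assumed graph: each clip connects to itself and its temporal neighbours."""
    return [[1.0 if abs(i - j) <= 1 else 0.0 for j in range(n)] for i in range(n)]


def normalize_adjacency(adj):
    """Symmetric normalization D^{-1/2} A D^{-1/2}, where A includes self-loops."""
    deg = [sum(row) for row in adj]
    n = len(adj)
    return [[adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def gcn_layer(adj_norm, feats, weight):
    """One graph convolution step: ReLU(adj_norm @ feats @ weight)."""
    out = matmul(matmul(adj_norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in out]


if __name__ == "__main__":
    clips = evenly_sample_clips(num_frames=100, num_clips=5, clip_len=20)
    adj = normalize_adjacency(temporal_adjacency(len(clips)))
    # Toy 2-dimensional clip features and an identity weight matrix (hypothetical).
    feats = [[float(i), 1.0] for i in range(len(clips))]
    weight = [[1.0, 0.0], [0.0, 1.0]]
    print(clips)
    print(gcn_layer(adj, feats, weight))
```

In this sketch each clip's output feature is a normalized mix of its own feature and those of its temporal neighbours, which is one simple way to read "spatio-temporal correlations of video clips"; a learned or reward-adjusted graph would replace the fixed chain adjacency.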


Pages: 8

50 items in total

[1] MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning
Malysheva, Aleksandra
Kudenko, Daniel
Shpilman, Aleksei
2019 XVI INTERNATIONAL SYMPOSIUM PROBLEMS OF REDUNDANCY IN INFORMATION AND CONTROL SYSTEMS (REDUNDANCY), 2019, : 171 - 176
[2] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
Dai, Anna
Li, Rongpeng
Zhao, Zhifeng
Zhang, Honggang
2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
[3] Routing with Graph Convolutional Networks and Multi-Agent Deep Reinforcement Learning
Bhavanasi, Sai Shreyas
Pappone, Lorenzo
Esposito, Flavio
2022 IEEE CONFERENCE ON NETWORK FUNCTION VIRTUALIZATION AND SOFTWARE DEFINED NETWORKS (IEEE NFV-SDN), 2022, : 72 - 77
[4] Multi-Agent Graph Convolutional Reinforcement Learning for Intelligent Load Balancing
Houidi, Omar
Bakri, Sihem
Zeghlache, Djamal
PROCEEDINGS OF THE IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM 2022, 2022,
[5] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
Jing, Xuan
Yao, Xifan
Liu, Min
Zhou, Jiajun
JOURNAL OF INTELLIGENT MANUFACTURING, 2024, 35 (01) : 75 - 93
[6] Multi-agent reinforcement learning based on graph convolutional network for flexible job shop scheduling
Xuan Jing
Xifan Yao
Min Liu
Jiajun Zhou
Journal of Intelligent Manufacturing, 2024, 35 : 75 - 93
[7] Multi-hop temporal knowledge graph reasoning with multi-agent reinforcement learning
Bai, Luyi
Chen, Mingzhuo
Xiao, Qianwen
APPLIED SOFT COMPUTING, 2024, 160
[8] Multi-Agent Graph Convolutional Reinforcement Learning for Dynamic Electric Vehicle Charging Pricing
Zhang, Weijia
Liu, Hao
Han, Jindong
Ge, Yong
Xiong, Hui
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 2471 - 2481
[9] GRAPHCOMM: A GRAPH NEURAL NETWORK BASED METHOD FOR MULTI-AGENT REINFORCEMENT LEARNING
Shen, Siqi
Fu, Yongquan
Su, Huayou
Pan, Hengyue
Qiao, Peng
Dou, Yong
Wang, Cheng
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3510 - 3514
[10] Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Ma, Xiaobai
Isele, David
Gupta, Jayesh K.
Fujimura, Kikuo
Kochenderfer, Mykel J.
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7664 - 7671
