DESEM: Depthwise Separable Convolution-Based Multimodal Deep Learning for In-Game Action Anticipation

被引：0

作者：

Kim, Changhyun ^{[1
]}

Bae, Jinsoo ^{[1
]}

Baek, Insung ^{[1
]}

Jeong, Jaeyoon ^{[1
]}

Lee, Young Jae ^{[1
]}

Park, Kiwoong ^{[2
]}

Shim, Sang Heun ^{[2
]}

Kim, Seoung Bum ^{[1
]}

机构：

[1] Korea Univ, Sch Ind & Management Engn, Seoul 02841, South Korea

[2] Agcy Def Dev ADD, Seoul 05771, South Korea

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Games; Deep learning; Feature extraction; Convolutional neural networks; Artificial intelligence; Videos; Forecasting; Action anticipation; depthwise separable convolution; game artificial intelligence; multimodal deep learning; weighted loss function; TIME STRATEGY GAME;

D O I：

10.1109/ACCESS.2023.3271282

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In real-time strategy (RTS) games, to defeat their opponents, players need to choose and implement the correct sequential actions. Because RTS games like StarCraft II are real-time, players have a very limited time to choose how to develop their strategy. In addition, players can only partially observe the parts of the map that they have explored. Therefore, unlike Chess or Go, players do not know what their opponents are doing. For these reasons, applying generally used artificial intelligence models to forecast sequential actions in RTS games is a challenge. To address this, we propose depthwise separable convolution-based multimodal deep learning (DESEM) for forecasting sequential actions in the game StarCraft II. DESEM performs multimodal learning using high-dimensional frames and action labels simultaneously as inputs. We use a depthwise separable convolution as the backbone network for extracting features from high-dimensional frames. In addition, we propose a weighted loss function to resolve class imbalances. We use 1,978 StarCraft II replays where the Terrans win in a Terran vs. Protoss game. The experimental results show that the proposed depthwise separable convolution is superior to the conventional convolution. Furthermore, we demonstrate that multimodal learning and the weighted loss function contribute significantly to improving forecasting performance.

引用

页码：46504 / 46512

页数：9

共 21 条

[1] COMPOSV: compound feature extraction and depthwise separable convolution-based online signature verification
Vorugunti, Chandra Sekhar
Pulabaigari, Viswanath
Mukherjee, Prerana
Gautam, Avinash
NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 10901 - 10928
[2] COMPOSV: compound feature extraction and depthwise separable convolution-based online signature verification
Chandra Sekhar Vorugunti
Viswanath Pulabaigari
Prerana Mukherjee
Avinash Gautam
Neural Computing and Applications, 2022, 34 : 10901 - 10928
[3] A Lightweight Design to Convolution-Based Deep Learning CSI Feedback
Hu, Zhengyang
Zou, Yafei
Xue, Jiang
IEEE COMMUNICATIONS LETTERS, 2024, 28 (09) : 2081 - 2085
[4] Potato leaf disease detection with a novel deep learning model based on depthwise separable convolution and transformer networks
Reis, Hatice Catal
Turk, Veysel
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[5] An unsupervised transfer learning bearing fault diagnosis method based on depthwise separable convolution
Li, Xueyi
Yuan, Peng
Wang, Xiangkai
Li, Daiyou
Xie, Zhijie
Kong, Xiangwei
MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (09)
[6] Ischemic stroke infarct segmentation model based on depthwise separable convolution for multimodal magnetic resonance imaging
Jin Y.
Wang M.
Chen J.
Li Y.
Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (03): : 535 - 543
[7] Joint optic disc and cup segmentation based on densely connected depthwise separable convolution deep network
Bingyan Liu
Daru Pan
Hui Song
BMC Medical Imaging, 21
[8] An efficient and accurate deep learning method for tree species classification that integrates depthwise separable convolution and dilated convolution using hyperspectral data
Fu, Mengni
Lu, Chi
Mao, Yingwu
Zhang, Xiaoli
Wu, Yong
Luo, Hongbin
Liu, Zhi
Li, Wenfang
Ou, Guanglong
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
[9] Joint optic disc and cup segmentation based on densely connected depthwise separable convolution deep network
Liu, Bingyan
Pan, Daru
Song, Hui
BMC MEDICAL IMAGING, 2021, 21 (01)
[10] Multimodal vision-based human action recognition using deep learning: a review
Shafizadegan, Fatemeh
Naghsh-Nilchi, Ahmad R.
Shabaninia, Elham
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)

← 1 2 3 →