DESEM: Depthwise Separable Convolution-Based Multimodal Deep Learning for In-Game Action Anticipation

被引:0
|
作者
Kim, Changhyun [1 ]
Bae, Jinsoo [1 ]
Baek, Insung [1 ]
Jeong, Jaeyoon [1 ]
Lee, Young Jae [1 ]
Park, Kiwoong [2 ]
Shim, Sang Heun [2 ]
Kim, Seoung Bum [1 ]
机构
[1] Korea Univ, Sch Ind & Management Engn, Seoul 02841, South Korea
[2] Agcy Def Dev ADD, Seoul 05771, South Korea
关键词
Games; Deep learning; Feature extraction; Convolutional neural networks; Artificial intelligence; Videos; Forecasting; Action anticipation; depthwise separable convolution; game artificial intelligence; multimodal deep learning; weighted loss function; TIME STRATEGY GAME;
D O I
10.1109/ACCESS.2023.3271282
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In real-time strategy (RTS) games, to defeat their opponents, players need to choose and implement the correct sequential actions. Because RTS games like StarCraft II are real-time, players have a very limited time to choose how to develop their strategy. In addition, players can only partially observe the parts of the map that they have explored. Therefore, unlike Chess or Go, players do not know what their opponents are doing. For these reasons, applying generally used artificial intelligence models to forecast sequential actions in RTS games is a challenge. To address this, we propose depthwise separable convolution-based multimodal deep learning (DESEM) for forecasting sequential actions in the game StarCraft II. DESEM performs multimodal learning using high-dimensional frames and action labels simultaneously as inputs. We use a depthwise separable convolution as the backbone network for extracting features from high-dimensional frames. In addition, we propose a weighted loss function to resolve class imbalances. We use 1,978 StarCraft II replays where the Terrans win in a Terran vs. Protoss game. The experimental results show that the proposed depthwise separable convolution is superior to the conventional convolution. Furthermore, we demonstrate that multimodal learning and the weighted loss function contribute significantly to improving forecasting performance.
引用
收藏
页码:46504 / 46512
页数:9
相关论文
共 21 条
  • [1] COMPOSV: compound feature extraction and depthwise separable convolution-based online signature verification
    Vorugunti, Chandra Sekhar
    Pulabaigari, Viswanath
    Mukherjee, Prerana
    Gautam, Avinash
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13) : 10901 - 10928
  • [2] COMPOSV: compound feature extraction and depthwise separable convolution-based online signature verification
    Chandra Sekhar Vorugunti
    Viswanath Pulabaigari
    Prerana Mukherjee
    Avinash Gautam
    Neural Computing and Applications, 2022, 34 : 10901 - 10928
  • [3] A Lightweight Design to Convolution-Based Deep Learning CSI Feedback
    Hu, Zhengyang
    Zou, Yafei
    Xue, Jiang
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (09) : 2081 - 2085
  • [4] Potato leaf disease detection with a novel deep learning model based on depthwise separable convolution and transformer networks
    Reis, Hatice Catal
    Turk, Veysel
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [5] An unsupervised transfer learning bearing fault diagnosis method based on depthwise separable convolution
    Li, Xueyi
    Yuan, Peng
    Wang, Xiangkai
    Li, Daiyou
    Xie, Zhijie
    Kong, Xiangwei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (09)
  • [6] Ischemic stroke infarct segmentation model based on depthwise separable convolution for multimodal magnetic resonance imaging
    Jin Y.
    Wang M.
    Chen J.
    Li Y.
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (03): : 535 - 543
  • [7] Joint optic disc and cup segmentation based on densely connected depthwise separable convolution deep network
    Bingyan Liu
    Daru Pan
    Hui Song
    BMC Medical Imaging, 21
  • [8] An efficient and accurate deep learning method for tree species classification that integrates depthwise separable convolution and dilated convolution using hyperspectral data
    Fu, Mengni
    Lu, Chi
    Mao, Yingwu
    Zhang, Xiaoli
    Wu, Yong
    Luo, Hongbin
    Liu, Zhi
    Li, Wenfang
    Ou, Guanglong
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [9] Joint optic disc and cup segmentation based on densely connected depthwise separable convolution deep network
    Liu, Bingyan
    Pan, Daru
    Song, Hui
    BMC MEDICAL IMAGING, 2021, 21 (01)
  • [10] Multimodal vision-based human action recognition using deep learning: a review
    Shafizadegan, Fatemeh
    Naghsh-Nilchi, Ahmad R.
    Shabaninia, Elham
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (07)