Training a Minesweeper Agent Using a Convolutional Neural Network

Cited by: 0
Authors
Wang, Wenbo [1]
Lei, Chengyou [2]
Affiliations
[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China
[2] Wuhan Second Ship Design & Res Inst, Wuhan 430205, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2025 / Volume 15 / Issue 05
Keywords
convolutional neural network (CNN); Minesweeper game; deep Q-network (DQN); supervised learning; sequential decision making; deep reinforcement learning; deep neural network; feedback control; artificial general intelligence (AGI);
DOI
10.3390/app15052490
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The Minesweeper game is modeled as a sequential decision-making task, for which a neural network architecture, state encoding, and reward function were designed. Both a deep Q-network (DQN) and a supervised learning method were applied to train the agent. The experiments were conducted on the AutoDL platform using an NVIDIA RTX 3090 GPU. On a 6 × 6 grid with four mines, the DQN model achieved an average win rate of 93.3% (standard deviation: 0.77%), while the supervised learning method achieved 91.2% (standard deviation: 0.9%); both outperformed human players and baseline algorithms. The mechanisms of the two methods in the Minesweeper task were analyzed, and the faster training speed and more stable performance of supervised learning were explained from the perspectives of means-ends analysis and feedback control. Although the DQN model leaves room for improvement in sample efficiency and training stability, its greater generalization ability makes it promising for more complex decision-making tasks.
Pages: 17