Training a Minesweeper Agent Using a Convolutional Neural Network

被引:0
作者
Wang, Wenbo [1 ]
Lei, Chengyou [2 ]
机构
[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China
[2] Wuhan Second Ship Design & Res Inst, Wuhan 430205, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期
关键词
convolutional neural network (CNN); Minesweeper game; deep Q-network (DQN); supervised learning; sequential decision making; deep reinforcement learning; deep neural network; feedback control; artificial general intelligence (AGI);
D O I
10.3390/app15052490
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The Minesweeper game is modeled as a sequential decision-making task, for which a neural network architecture, state encoding, and reward function were herein designed. Both a Deep Q-Network (DQN) and supervised learning methods were successfully applied to optimize the training of the game. The experiments were conducted on the AutoDL platform using an NVIDIA RTX 3090 GPU for efficient computation. The results showed that in a 6 x 6 grid with four mines, the DQN model achieved an average win rate of 93.3% (standard deviation: 0.77%), while the supervised learning method achieved 91.2% (standard deviation: 0.9%), both outperforming human players and baseline algorithms and demonstrating high intelligence. The mechanisms of the two methods in the Minesweeper task were analyzed, with the reasons for the faster training speed and more stable performance of supervised learning explained from the perspectives of means-ends analysis and feedback control. Although there is room for improvement in sample efficiency and training stability in the DQN model, its greater generalization ability makes it highly promising for application in more complex decision-making tasks.
引用
收藏
页数:17
相关论文
共 50 条
[31]   Iterative Random Training Sampling Convolutional Neural Network for Hyperspectral Image Classification [J].
Chang, Chein-I ;
Liang, Chia-Chen ;
Hu, Peter Fuming .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[32]   Segmentation of MR Images using DN Convolutional Neural Network [J].
Bhairnallykar S.T. ;
Narawade V. .
International Journal of Information Technology, 2023, 15 (8) :4565-4576
[33]   Automated Detection of Lunar Rockfalls Using a Convolutional Neural Network [J].
Bickel, Valentin Tertius ;
Lanaras, Charis ;
Manconi, Andrea ;
Loew, Simon ;
Mall, Urs .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (06) :3501-3511
[34]   In-vehicle network intrusion detection using deep convolutional neural network [J].
Song, Hyun Min ;
Woo, Jiyoung ;
Kim, Huy Kang .
VEHICULAR COMMUNICATIONS, 2020, 21
[35]   Fish diseases detection using convolutional neural network (CNN) [J].
Hasan, Noraini ;
Ibrahim, Shafaf ;
Azlan, Anis Aqilah .
INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2022, 13 (01) :1977-1983
[36]   Earthquake events classification using convolutional recurrent neural network [J].
Ku, Bonhwa ;
Kim, Gwantae ;
Jang, Su ;
Ko, Hanseok .
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2020, 39 (06) :592-599
[37]   Smart staff attendance system using Convolutional Neural Network [J].
Natesan, P. ;
Gothai, E. ;
Rajalaxmi, R. R. ;
Karthikeyan, K. V. Mohana ;
Muthukumar, V ;
Naveen, R. M. .
2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
[38]   Lung Nodule Detection and Classification by Using Convolutional Neural Network [J].
Tekade, Ruchita ;
Rajeswari, K. .
HELIX, 2018, 8 (05) :3696-3700
[39]   Methods for image denoising using convolutional neural network: a review [J].
Ademola E. Ilesanmi ;
Taiwo O. Ilesanmi .
Complex & Intelligent Systems, 2021, 7 :2179-2198
[40]   Methods for image denoising using convolutional neural network: a review [J].
Ilesanmi, Ademola E. ;
Ilesanmi, Taiwo O. .
COMPLEX & INTELLIGENT SYSTEMS, 2021, 7 (05) :2179-2198