Training a Minesweeper Agent Using a Convolutional Neural Network

Cited by: 0
Authors
Wang, Wenbo [1]
Lei, Chengyou [2]
Affiliations
[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China
[2] Wuhan Second Ship Design & Res Inst, Wuhan 430205, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 05
Keywords
convolutional neural network (CNN); Minesweeper game; deep Q-network (DQN); supervised learning; sequential decision making; deep reinforcement learning; deep neural network; feedback control; artificial general intelligence (AGI)
DOI
10.3390/app15052490
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
The Minesweeper game is modeled as a sequential decision-making task, for which a neural network architecture, state encoding, and reward function were designed. Both a Deep Q-Network (DQN) and a supervised learning method were applied to train agents to play the game. The experiments were conducted on the AutoDL platform using an NVIDIA RTX 3090 GPU. The results showed that on a 6 × 6 grid with four mines, the DQN model achieved an average win rate of 93.3% (standard deviation: 0.77%), while the supervised learning method achieved 91.2% (standard deviation: 0.9%), both outperforming human players and baseline algorithms. The mechanisms of the two methods in the Minesweeper task were analyzed, and the faster training speed and more stable performance of supervised learning were explained from the perspectives of means-ends analysis and feedback control. Although the DQN model has room for improvement in sample efficiency and training stability, its greater generalization ability makes it promising for more complex decision-making tasks.
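To make the "sequential decision-making" framing concrete, the following is a minimal sketch of a Minesweeper environment of the kind an agent could be trained on. Only the board size (6 × 6) and mine count (four) come from the abstract. The class name `MiniSweeper`, the `HIDDEN = -1` state encoding, and the reward values are illustrative assumptions, not the authors' design, and the usual automatic expansion of zero-count cells is omitted for brevity.

```python
import random


class MiniSweeper:
    """Toy Minesweeper board; defaults follow the 6 x 6, four-mine setup."""

    HIDDEN = -1  # assumed code for an unrevealed cell (not from the paper)

    def __init__(self, size=6, n_mines=4, seed=None):
        self.size = size
        self.n_mines = n_mines
        rng = random.Random(seed)
        cells = [(r, c) for r in range(size) for c in range(size)]
        self.mines = set(rng.sample(cells, n_mines))
        self.revealed = set()
        self.done = False

    def neighbors(self, r, c):
        """Yield the in-bounds cells adjacent to (r, c)."""
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                rr, cc = r + dr, c + dc
                if (dr or dc) and 0 <= rr < self.size and 0 <= cc < self.size:
                    yield rr, cc

    def count(self, r, c):
        """Number of mines adjacent to (r, c)."""
        return sum(n in self.mines for n in self.neighbors(r, c))

    def state(self):
        """Grid of adjacent-mine counts, with HIDDEN for unrevealed cells."""
        return [
            [self.count(r, c) if (r, c) in self.revealed else self.HIDDEN
             for c in range(self.size)]
            for r in range(self.size)
        ]

    def step(self, action):
        """Reveal one cell; action is a flat index in [0, size * size)."""
        if self.done:
            raise RuntimeError("episode is over")
        cell = divmod(action, self.size)
        if cell in self.revealed:
            return self.state(), -0.5, False  # assumed penalty: wasted move
        if cell in self.mines:
            self.done = True
            return self.state(), -1.0, True   # assumed penalty: hit a mine
        self.revealed.add(cell)
        won = len(self.revealed) == self.size ** 2 - self.n_mines
        self.done = won
        # Assumed rewards: small bonus per safe reveal, larger one for a win.
        return self.state(), (1.0 if won else 0.1), won
```

Each call to `step` returns `(state, reward, done)`, the usual interface for a value-based learner such as a DQN: the grid from `state()` could serve as the input to a convolutional network, and the 36 flat cell indices form the action space.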
Pages: 17