Training a Minesweeper Agent Using a Convolutional Neural Network

被引：0

作者：

Wang, Wenbo ^{[1
]}

Lei, Chengyou ^{[2
]}

机构：

[1] South China Agr Univ, Coll Elect Engn, Coll Artificial Intelligence, Guangzhou 510642, Peoples R China

[2] Wuhan Second Ship Design & Res Inst, Wuhan 430205, Peoples R China

来源：

APPLIED SCIENCES-BASEL | 2025年 / 15卷 / 05期

关键词：

convolutional neural network (CNN); Minesweeper game; deep Q-network (DQN); supervised learning; sequential decision making; deep reinforcement learning; deep neural network; feedback control; artificial general intelligence (AGI);

D O I：

10.3390/app15052490

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

The Minesweeper game is modeled as a sequential decision-making task, for which a neural network architecture, state encoding, and reward function were herein designed. Both a Deep Q-Network (DQN) and supervised learning methods were successfully applied to optimize the training of the game. The experiments were conducted on the AutoDL platform using an NVIDIA RTX 3090 GPU for efficient computation. The results showed that in a 6 x 6 grid with four mines, the DQN model achieved an average win rate of 93.3% (standard deviation: 0.77%), while the supervised learning method achieved 91.2% (standard deviation: 0.9%), both outperforming human players and baseline algorithms and demonstrating high intelligence. The mechanisms of the two methods in the Minesweeper task were analyzed, with the reasons for the faster training speed and more stable performance of supervised learning explained from the perspectives of means-ends analysis and feedback control. Although there is room for improvement in sample efficiency and training stability in the DQN model, its greater generalization ability makes it highly promising for application in more complex decision-making tasks.

引用

页数：17

共 50 条

[21] Object Recognition in Images using Convolutional Neural Network [J].

Duth, Sudharshan P. ;

Raj, Swathi .

PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, :718-722

[22] Convolutional Neural Network Pruning Using Filter Attenuation [J].

Mousa-Pasandi, Morteza ;

Hajabdollahi, Mohsen ;

Karimi, Nader ;

Samavi, Shadrokh ;

Shirani, Shahram .

2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, :2905-2909

[23] Cardiac Arrhythmia Classification Using Convolutional Neural Network [J].

Gamgami, Oumaima ;

Korikache, Reda ;

Chaieb, Amine .

ADVANCES IN SMART MEDICAL, IOT & ARTIFICIAL INTELLIGENCE, VOL 1, ICSMAI 2024, 2024, 11 :297-308

[24] Classifying Malware Using Convolutional Gated Neural Network [J].

Kim, Chang Hoon ;

Kabanga, Espoir K. ;

Kang, Sin-Jae .

2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, :40-44

[25] A Survey on Face Recognition Using Convolutional Neural Network [J].

Swapna, M. ;

Sharma, Yogesh Kumar ;

Prasad, B. M. G. .

DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 :649-661

[26] Advertisement Image Classification Using Convolutional Neural Network [J].

An Tien Vo ;

Hai Son Tran ;

Thai Hoang Le .

2017 9TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2017), 2017, :197-202

[27] Neurological Status Classification Using Convolutional Neural Network [J].

Jaloli, Mehrad ;

Choudhary, Divya ;

Cescon, Marzia .

IFAC PAPERSONLINE, 2020, 53 (05) :409-414

[28] Speech Enhancement using Fully Convolutional UNET and Gated Convolutional Neural Network [J].

Baloch, Danish ;

Abdullah, Sidrah ;

Qaiser, Asma ;

Ahmed, Saad ;

Nasim, Faiza ;

Kanwal, Mehreen .

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (11) :831-836

[29] Fast Training of Convolutional Neural Network Classifiers through Extreme Learning machines [J].

Yoo, Youngwoo ;

Oh, Se-Young .

2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, :1702-1708

[30] Deep Convolutional Neural Network Compression based on the Intrinsic Dimension of the Training Data [J].

Hadi, Abir Mohammad ;

Won, Kwanghee .

APPLIED COMPUTING REVIEW, 2024, 24 (01) :14-23

← 1 2 3 4 5 →