Modeling Expert Knowledge in a Heuristic-Based Gin Rummy Agent

Cited by: 0

Authors:

Larkin, Sarah [1]

Collicott, William [1]

Hiebel, Jason [1]

Affiliations:

[1] Michigan Technological University, 1400 Townsend Dr, Houghton, MI 49931, USA

Source:

Thirty-Fifth AAAI Conference on Artificial Intelligence, Thirty-Third Conference on Innovative Applications of Artificial Intelligence and the Eleventh Symposium on Educational Advances in Artificial Intelligence | 2021, Vol. 35

DOI: not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We developed a heuristic-based reflex agent, Tonic, for the EAAI 2021 Undergraduate Research Challenge, which tasks competitors with creating an autonomous player for the card game gin rummy. Tonic's heuristics originate in expert knowledge and inform decision making for the three actions that make up a turn: drawing a card, discarding a card, and deciding when to knock. However, because these strategies are grounded in human intuition, they often lack the specificity needed to model them directly as algorithms. We developed parameterized models describing that intuition based on factors such as the number of turns played and an estimate of the opponent's hand. To hone their performance, we conducted both manual analysis and parameter optimization (grid search) using self-play and play against a simple baseline agent. These heuristic models enable Tonic to win against the baseline agent at least 68% of the time.
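The abstract mentions parameterized heuristic models tuned by grid search. The following is a minimal Python sketch of that general idea, not the authors' implementation: `knock_threshold`, `should_knock`, the `base` and `decay` parameters, and the `toy_win_rate` objective are all hypothetical names invented for illustration. In a real setting, the objective would be a win rate measured over many games of self-play or play against a baseline agent.

```python
from itertools import product


def knock_threshold(turn, base, decay, floor=0):
    """Hypothetical heuristic: the largest deadwood total at which the agent
    is willing to knock, shrinking linearly as more turns are played."""
    return max(floor, base - decay * turn)


def should_knock(deadwood, turn, base, decay):
    # Gin rummy only permits knocking with 10 or fewer deadwood points,
    # so the heuristic threshold is capped at 10.
    return deadwood <= min(10, knock_threshold(turn, base, decay))


def grid_search(evaluate, grid):
    """Score every combination of parameter values in `grid` and return
    (best_params, best_score)."""
    names = sorted(grid)
    best_params, best_score = None, float("-inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(**params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score


def toy_win_rate(base, decay):
    # Placeholder objective (an assumption for this sketch). A real
    # evaluation would play many games and return the observed win rate.
    return 1.0 - 0.01 * (base - 10) ** 2 - 0.1 * (decay - 0.5) ** 2


grid = {"base": [6, 8, 10], "decay": [0.0, 0.5, 1.0]}
best_params, best_score = grid_search(toy_win_rate, grid)
print(best_params, best_score)
```

Grid search is exhaustive, so its cost grows with the product of the grid sizes; with a game-playing objective, each grid point would typically need enough games to keep the win-rate estimate from being dominated by noise.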


Pages: 15577-15582

Page count: 6
