Modeling Expert Knowledge in a Heuristic-Based Gin Rummy Agent

被引:0
|
作者
Larkin, Sarah [1 ]
Collicott, William [1 ]
Hiebel, Jason [1 ]
机构
[1] Michigan Technol Univ, 1400 Townsend Dr, Houghton, MI 49931 USA
来源
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We developed a heuristic-based reflex agent, Tonic, for the EAAI 2021 Undergraduate Research Challenge, which tasks competitors to create an autonomous player to play the card game gin rummy. Tonic's heuristics originate in expert knowledge and inform decision making for the three actions comprising a turn: drawing a card, discarding a card, and deciding when to knock. However, because these strategies are based in human intuition, there is often a lack of specificity to directly model them as algorithms. We developed parameterized models describing that intuition based on factors such as the number of turns played and an estimation of the opponent hand. To hone their performance, we conducted both manual analysis and parameter optimization (grid search) using self-play and play against a simple baseline agent. These heuristic models enable Tonic to win against the baseline agent at least 68% of the time.
引用
收藏
页码:15577 / 15582
页数:6
相关论文
共 50 条
  • [21] Heuristic-based algorithm for active control
    Tang, Y
    JOURNAL OF ENGINEERING MECHANICS-ASCE, 1996, 122 (08): : 801 - 803
  • [22] A heuristic-based approach to conceptual design
    Yih Tng Chong
    Chun-Hsien Chen
    Kah Fai Leong
    Research in Engineering Design, 2009, 20 : 97 - 116
  • [23] A Hierarchy of Heuristic-Based Models of Crowd Dynamics
    P. Degond
    C. Appert-Rolland
    M. Moussaïd
    J. Pettré
    G. Theraulaz
    Journal of Statistical Physics, 2013, 152 : 1033 - 1068
  • [24] Design of an Interactive Scheduling Heuristic-Based Application
    Duay, Edmond
    Gondraneos, Gene Mark
    Indino-Pineda, Karisha Ann
    Seva, Rosemary
    INDUSTRIAL ENGINEERING AND APPLICATIONS-EUROPE, ICIEA-EU 2024, 2024, 507 : 95 - 106
  • [25] Heuristic-based Blockchain Assignment: An Empirical Study
    Chen, Jianyu
    Gai, Keke
    Jiang, Peng
    Zhu, Liehuang
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 916 - 923
  • [26] Heuristic-Based Recommendation for Metamodel - OCL Coevolution
    Batot, Edouard
    Kessentini, Wael
    Sahraoui, Houari
    Famelis, Michalis
    2017 ACM/IEEE 20TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS (MODELS 2017), 2017, : 210 - 220
  • [27] Heuristic-based backtracking relaxation for propositional satisfiability
    Bhalla, Ateet
    Lynce, Ines
    De Sousa, Jose T.
    Marques-Silva, Joao
    JOURNAL OF AUTOMATED REASONING, 2005, 35 (1-3) : 3 - 24
  • [28] Policy Generator (PG): A Heuristic-Based Fuzzer
    Felix, Alejandro
    Tappenden, Andrew F.
    Miller, James
    PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 5535 - 5544
  • [29] Heuristic-Based Backtracking Relaxation for Propositional Satisfiability
    Ateet Bhalla
    Inês Lynce
    José T. de Sousa
    João Marques-Silva
    Journal of Automated Reasoning, 2005, 35 : 3 - 24
  • [30] Heuristic-based backtracking relaxation for propositional satisfiability
    Bhalla, Ateet
    Lynce, Inês
    De Sousa, José T.
    Marques-Silva, João
    Journal of Automated Reasoning, 2005, 35 (1-3): : 3 - 24