Best-First Beam Search

被引:31
|
作者
Meister, Clara [1 ]
Vieira, Tim [2 ]
Cotterell, Ryan [1 ,3 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
[2] Johns Hopkins Univ, Baltimore, MD USA
[3] Univ Cambridge, Cambridge, England
关键词
D O I
10.1162/tacl_a_00346
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Decoding for many NLP tasks requires an effective heuristic algorithm for approximating exact search because the problem of searching the full output space is often intractable, or impractical in many settings. The default algorithm for this job is beam search-a pruned version of breadth-first search. Quite surprisingly, beam search often returns better results than exact inference due to beneficial search bias for NLP tasks. In this work, we show that the standard implementation of beam search can be made up to 10x faster in practice. Our method assumes that the scoring function is monotonic in the sequence length, which allows us to safely prune hypotheses that cannot be in the final set of hypotheses early on. We devise effective monotonic approximations to popular nonmonontic scoring functions, including length normalization and mutual information decoding. Lastly, we propose a memory-reduced variant of best-first beam search, which has a similar beneficial search bias in terms of downstream performance, but runs in a fraction of the time.
引用
收藏
页码:795 / 809
页数:15
相关论文
共 50 条
  • [1] Best-first minimax search
    Univ of California, Los Angeles, United States
    Artif Intell, 1-2 (299-337):
  • [2] Best-first minimax search
    Korf, RE
    Chickering, DM
    ARTIFICIAL INTELLIGENCE, 1996, 84 (1-2) : 299 - 337
  • [3] Recursive Best-First Search with Bounded Overhead
    Hatem, Matthew
    Kiesel, Scott
    Ruml, Wheeler
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 1151 - 1157
  • [4] Best-First Heuristic Search for Multicore Machines
    Burns, Ethan
    Lemons, Sofia
    Ruml, Wheeler
    Zhou, Rong
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2010, 39 : 689 - 743
  • [5] Best-first minimax search: Othello results
    Korf, Richard E.
    Chickering, David Maxwell
    Proceedings of the National Conference on Artificial Intelligence, 1994, 2 : 1365 - 1370
  • [6] A solution to the GHI problem for best-first search
    Breuker, DM
    van den Herik, HJ
    Uiterwijk, JWHM
    Allis, LV
    COMPUTERS AND GAMES, 1999, 1558 : 25 - 49
  • [7] Effective Heuristics for Suboptimal Best-First Search
    Wilt, Christopher
    Ruml, Wheeler
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 57 : 273 - 306
  • [8] A solution to the GHI problem for best-first search
    Breuker, DM
    van den Herik, HJ
    Uiterwijk, JWHM
    Allis, LV
    THEORETICAL COMPUTER SCIENCE, 2001, 252 (1-2) : 121 - 149
  • [9] Parallel randomized Best-First Minimax Search
    Shoham, Y
    Toledo, S
    ARTIFICIAL INTELLIGENCE, 2002, 137 (1-2) : 165 - 196
  • [10] Best-first Utility-guided Search
    Ruml, Wheeler
    Do, Minh B.
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2378 - 2384