Impact of Batch Size on Stopping Active Learning for Text Classification

被引:10
作者
Beatty, Garrett [1 ]
Kochis, Ethan [1 ]
Bloodgood, Michael [1 ]
机构
[1] Coll New Jersey, Dept Comp Sci, Ewing, NJ 08628 USA
来源
2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2018年
关键词
D O I
10.1109/ICSC.2018.00059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When using active learning, smaller batch sizes are typically more efficient from a learning efficiency perspective. However, in practice due to speed and human annotator considerations, the use of larger batch sizes is necessary. While past work has shown that larger batch sizes decrease learning efficiency from a learning curve perspective, it remains an open question how batch size impacts methods for stopping active learning. We find that large batch sizes degrade the performance of a leading stopping method over and above the degradation that results from reduced learning efficiency. We analyze this degradation and find that it can be mitigated by changing the window size parameter of how many past iterations of learning are taken into account when making the stopping decision. We find that when using larger batch sizes, stopping methods are more effective when smaller window sizes are used.
引用
收藏
页码:306 / 307
页数:2
相关论文
共 9 条
  • [1] [Anonymous], P 2018 IEEE 12 INT C
  • [2] [Anonymous], 2008, P 22 INT C COMP LING
  • [3] [Anonymous], 2013, Proceedings of the Seventeenth Conference on Computational Natural Language Learning
  • [4] Bloodgood, 2009, P 13 C COMP NAT LANG, P39
  • [5] Bloodgood M., 2009, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL, (Morristown, NJ, USA), P137
  • [6] Brinker K, 2003, P 20 INT C MACH LEAR, P59
  • [7] Filtering Tweets for Social Unrest
    Mishler, Alan
    Wonus, Kevin
    Chambers, Wendy
    Bloodgood, Michael
    [J]. 2017 11TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2017, : 17 - 23
  • [8] Schohn Greg, 2000, ICML, P839
  • [9] A stopping criterion for active learning
    Vlachos, Andreas
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (03) : 295 - 312