Optima Query Selection Using Multi-Armed Bandits

被引:6
作者
Kocanaogullari, Aziz [1 ]
Marghi, Yeganeh M. [1 ]
Akcakaya, Murat [2 ]
Erdogmus, Deniz [1 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Univ Pittsburgh, Pittsburgh, PA 15260 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Subset selection; query optimization; misleading prior; multi-armed bandit framework; 20; QUESTIONS; FRAMEWORK; BCI;
D O I
10.1109/LSP.2018.2878066
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Query selection for latent variable estimation is conventionally performed by opting for observations with low noise or optimizing information-theoretic objectives related to reducing the level of estimated uncertainty based on the current best estimate. In these approaches, typically, the system makes a decision by leveraging the current available information about the state. However, trusting the current hest estimate results in poor query selection when truth is far from the current estimate, and this negatively impacts the speed and accuracy of the latent variable estimation procedure. We introduce a novel sequential adaptive action value function for query selection using the multi-armed bandit framework, which allows us to find a tractable solution. For this adaptive-sequential query selection method, we analytically show: 1) performance improvement in the query selection for a dynamical system; and 2) the conditions where the model outperforms competitors. We also present favorable empirical assessments of the performance for this method, compared to alternative methods, both using Monte Carlo simulations and human-in-the-loop experiments with a brain-computer interface typing system, where the language model provides the prior information.
引用
收藏
页码:1870 / 1874
页数:5
相关论文
共 36 条
[1]   Structure Learning in Human Sequential Decision-Making [J].
Acuna, Daniel E. ;
Schrater, Paul .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (12)
[2]  
[Anonymous], 2014, International Series of Monographs on Electronics and Instrumentation
[3]  
[Anonymous], 2012, P 10 INT WORKSH QUAL
[4]  
[Anonymous], 2000, Proceedings of the 16th conference on Uncertainty in Artificial Intelligence
[5]  
[Anonymous], P 7 INT BCI M AS CA
[6]  
[Anonymous], 2018, ARXIV180208452
[7]  
Azar M. G., 2013, Advances in Neural Information Processing Systems(NIPS), V26, P2220
[8]  
Baram Y, 2004, J MACH LEARN RES, V5, P255
[9]   Sparsity-Promoting Sensor Selection for Non-Linear Measurement Models [J].
Chepuri, Sundeep Prabhakar ;
Leus, Geert .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (03) :684-698
[10]  
Chu HM, 2016, IEEE DATA MINING, P841, DOI [10.1109/ICDM.2016.135, 10.1109/ICDM.2016.0100]