Exploring the Unknown - Query Synthesis in One-Class Active Learning

Cited by: 1
Authors
Englhardt, Adrian [1]
Boehm, Klemens [1]
Affiliations
[1] Karlsruhe Inst Technol KIT, Karlsruhe, Germany
Source
PROCEEDINGS OF THE 2020 SIAM INTERNATIONAL CONFERENCE ON DATA MINING (SDM) | 2020
Keywords
one-class classification; active learning; query synthesis; domain expansion
DOI
10.1137/1.9781611976236.17
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The quality of a classifier hinges on the availability of training data. In scenarios where data collection is restricted or expensive, e.g., compute-intensive simulations, training data may be small and/or biased. In principle, data synthesis then makes it possible to extend the data set. Yet it is difficult for a user to extend the data without any guidance when the data space is unbounded or of high dimensionality. In this article we target the domain expansion problem, i.e., expanding the classifier knowledge beyond an initial sample that completely falls into one class. We first propose a general framework for query synthesis in the one-class setting. Then we present a new query synthesis strategy to quickly explore the data space beyond the initial sample. For the evaluation we derive three options to simulate an oracle in the one-class setting that can answer arbitrary queries. Experiments on both synthetic and real-world data demonstrate that our new query strategy indeed expands the knowledge of a one-class classifier beyond a small and biased initial sample. Our strategy outperforms realistic baselines on most domain expansion problems.
Pages: 145-153
Number of pages: 9