Data-Fusion Techniques for Open-Set Recognition Problems

Cited by: 16
Authors
Cordova Neira, Manuel Alberto [1]
Mendes Junior, Pedro Ribeiro [1]
Rocha, Anderson [1]
Torres, Ricardo Da Silva [1]
Affiliations
[1] Univ Estadual Campinas, Inst Comp, BR-13083872 Campinas, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil
Keywords
Pattern recognition; open-set recognition; data fusion; optimum-path forest; genetic programming; majority voting; relevance feedback; image retrieval; classification; algorithms
DOI
10.1109/ACCESS.2018.2824240
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Most pattern classification techniques focus on closed-set problems, in which a classifier is trained with samples of all classes that may appear during the testing phase. In many situations, however, samples of unknown classes, i.e., classes with no examples in the training stage, need to be properly handled during testing. This setup is referred to in the literature as open-set recognition. Open-set problems are harder because unknown classes may be ill-sampled, not sampled at all, or even undefined. Unlike the existing literature, here we aim to solve open-set recognition problems by combining different classifiers and features while, at the same time, taking care of unknown classes. Researchers have greatly benefited from combining different methods to achieve more robust and reliable classifiers in challenging recognition conditions, but those solutions have often focused on closed-set setups. In this paper, we propose the integration of a newly designed open-set graph-based optimum-path forest (OSOPF) classifier with genetic programming (GP) and majority-voting fusion techniques. While OSOPF learns decision boundaries that are more resilient to unknown classes and outliers, GP combines different problem features to discover appropriate similarity functions and allows a more robust classification through early fusion. Finally, the majority-voting approach combines classification evidence from different classifier outcomes and features through late-fusion techniques. Experiments show that the proposed data-fusion approaches yield effective results for open-set recognition problems, significantly outperforming existing counterparts in the literature and paving the way for further investigations in this field.
Pages: 21242 / U24
Number of pages: 24