Ensemble-learning based neural networks for novelty detection in multi-class systems

Cited by: 18
Authors
Chan, Felix T. S. [1]
Wang, Z. X. [2]
Patnaik, S. [1, 4]
Tiwari, M. K. [4]
Wang, X. P. [3]
Ruan, J. H. [3, 5]
Affiliations
[1] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hung Hom, Hong Kong, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Business Adm, Dalian, Peoples R China
[3] Dalian Univ Technol, Inst Syst Engn, Dalian, Peoples R China
[4] Indian Inst Technol Kharagpur, Dept Ind & Syst Engn, Kharagpur, W Bengal, India
[5] Northwest A&F Univ, Coll Econ & Management, Yangling, Shaanxi, Peoples R China
Keywords
Novelty detection; Neural networks; Ensemble-learning; Posterior class probability; Confidence intervals; REGRESSION; CLASSIFIER; LASSO; RIDGE;
DOI
10.1016/j.asoc.2020.106396
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In most real-world systems or processes, determining the complete set of classes during the training phase is generally impossible. There is a high chance that novelties or abnormal data will appear in future phases, which might severely affect the performance of the machine learning system. Novelty detection is of great importance in many critical systems and domains, such as business intelligence, process monitoring, information security, and clinical decision support. Most of the available methods for novelty detection use a one-class classification (OCC) criterion, i.e., they treat multiple known classes as a single "Normal" class and aim to distinguish data samples between the "Normal" and "Not Normal" classes. In this paper, the problem of novelty detection in multi-class systems is addressed through ensemble-based learning of neural networks (EBNN), capable of both detecting novelties and classifying the known normal samples in future datasets. Moreover, the model is analogous to a semi-supervised learning system, as it is trained using only the available normal classes. Evaluation of the proposed model (EBNN) on UCI machine learning datasets showed that the model not only outperforms other models in detecting novelties but also has better multi-class classification accuracy for known normal classes. The proposed model implements a novel activation function in its framework and differs from the commonly available novelty detection models in three aspects. First, the model is much simpler to implement and does not need any initial assumptions about the model. Second, the model does not require any novel or abnormal data during the training phase (semi-supervised learning). Third, it can be used as a two-in-one system to detect novelties and, at the same time, classify data based on known classes. © 2020 Elsevier B.V. All rights reserved.
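The abstract does not spell out how the ensemble turns its outputs into a novelty decision, so the following is only a minimal sketch of one plausible rule built from the listed keywords (posterior class probability, confidence intervals). It is not the authors' EBNN and does not include their activation function. The function name `ensemble_decision`, the `threshold` and `z` parameters, and the example probabilities are all assumptions made for illustration: each ensemble member is assumed to output a posterior vector over the known classes, and a sample is flagged as novel when the lower confidence bound of the winning class's mean posterior falls below the threshold.

```python
import math
import statistics


def ensemble_decision(member_probs, threshold=0.5, z=1.96):
    """Classify one sample or flag it as novel (illustrative rule, not the paper's).

    member_probs: one posterior vector per ensemble member, e.g. softmax outputs.
    Returns (label, mean_prob, lower, upper); label is None for a novelty.
    """
    n_members = len(member_probs)
    n_classes = len(member_probs[0])

    # Average the posterior of each known class over the ensemble members.
    means = [
        statistics.fmean([probs[c] for probs in member_probs])
        for c in range(n_classes)
    ]
    best = max(range(n_classes), key=lambda c: means[c])

    # Normal-approximation confidence interval for the winning class posterior.
    votes = [probs[best] for probs in member_probs]
    spread = statistics.stdev(votes) if n_members > 1 else 0.0
    half_width = z * spread / math.sqrt(n_members)
    lower, upper = means[best] - half_width, means[best] + half_width

    # Assumed rule: accept the class only if even the lower bound is convincing.
    label = best if lower >= threshold else None
    return label, means[best], lower, upper


# Hypothetical posteriors from three ensemble members for two samples.
agree = [[0.90, 0.05, 0.05], [0.92, 0.04, 0.04], [0.88, 0.06, 0.06]]
disagree = [[0.90, 0.05, 0.05], [0.10, 0.80, 0.10], [0.20, 0.10, 0.70]]

print(ensemble_decision(agree)[0])     # known class index
print(ensemble_decision(disagree)[0])  # None, i.e. flagged as novel
```

When the members agree, the interval is narrow and the sample is assigned to a known class. When they disagree, the interval widens and the sample is treated as a novelty, which is how a single system of this kind could serve both purposes mentioned in the abstract.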
Pages: 14