From one-class to two-class classification by incorporating expert knowledge: Novelty detection in human behaviour

被引:15
作者
Oosterlinck, Dieter [1 ]
Benoit, Dries F. [1 ]
Baecke, Philippe [2 ]
机构
[1] Univ Ghent, Fac Econ & Business Adm, TWeekerkenstr 2, B-9000 Ghent, Belgium
[2] Vlerick Business Sch, Area Mkt, Reep 1, B-9000 Ghent, Belgium
关键词
Analytics; One-class classification; Novelty detection; Expert knowledge; Decision support systems; FRAUD; FRAMEWORK; SYSTEM; MODEL;
D O I
10.1016/j.ejor.2019.10.015
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
One-class classification is the standard procedure for novelty detection. Novelty detection aims to identify observations that deviate from a determined normal behaviour. Only instances of one class are known, whereas so called novelties are unlabelled. Traditional novelty detection applies methods from the field of outlier detection. These standard one-class classification approaches have limited performance in many real business cases. The traditional techniques are mainly developed for industrial problems such as machine condition monitoring. When applying these to human behaviour, the performance drops significantly. This paper proposes a method that improves existing approaches by creating semi-synthetic novelties in order to have labelled data for the two classes. Expert knowledge is incorporated in the initial phase of this data generation process. The method was deployed on a real-life test case where the goal was to detect fraudulent subscriptions to a telecom family plan. This research demonstrates that the two-class expert model outperforms a one-class model on the semi-synthetic dataset. In a next step the model was validated on a real dataset. A fraud detection team of the company manually checked the top predicted novelties. The results show that incorporating expert knowledge to transform a one-class problem into a two-class problem is a valuable method. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:1011 / 1024
页数:14
相关论文
共 55 条
  • [1] Abe N., 2006, P 12 ACM SIGKDD INT, P504, DOI DOI 10.1145/1150402.1150459
  • [2] An autonomous low-cost infrared system for the on-line monitoring of manufacturing processes using novelty detection
    Al-Habaibeh, A
    Parkin, R
    [J]. INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2003, 22 (3-4) : 249 - 258
  • [3] [Anonymous], 1993, TECHNICAL REPORT
  • [4] [Anonymous], P 1999 ACTS MOB SUMM
  • [5] [Anonymous], J ARTIFICIAL INTELLI
  • [6] [Anonymous], 2007, P ADV NEURAL INF PRO
  • [7] [Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
  • [8] AN EXPERT SYSTEM FOR PREDICTING GAS DEMAND - A CASE-STUDY
    ASHOURI, F
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1993, 21 (03): : 307 - 317
  • [9] Synthesizing test data for fraud detection systems
    Barse, EL
    Kvarnström, H
    Jonsson, E
    [J]. 19TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, PROCEEDINGS, 2003, : 384 - 394
  • [10] Bastian M, 2009, INT AAAI C WEBL SOC, V3, DOI [10.1609/icwsm.v3i1.13937, DOI 10.1609/ICWSM.V3I1.13937]