Concept drift detection based on Fisher's Exact test

被引:67
|
作者
de Lima Cabral, Danilo Rafael [1 ]
Maior de Barros, Roberto Souto [1 ]
机构
[1] Univ Fed Pernambuco, Ctr Informat, BR-50740560 Recife, PE, Brazil
关键词
Concept drift; Data streams; Drift detection; Online learning; Statistical tests; CLASSIFIERS;
D O I
10.1016/j.ins.2018.02.054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Concept drift detectors are software that usually attempt to estimate the positions of concept drifts in large data streams in order to replace the base learner after changes in the data distribution and thus improve accuracy. Statistical Test of Equal Proportions (STEPD) is a simple, efficient, and well-known method which detects concept drifts based on a hypothesis test between two proportions. However, statistically, this test is not recommended when sample sizes are small or data are sparse and/or imbalanced. This article proposes an ingeniously efficient implementation of the statistically preferred but computationally expensive Fisher's Exact test and examines three slightly different applications of this test for concept drift detection, proposing FPDD, FSDD, and FTDD. Experiments run using four artificial dataset generators, with both abrupt and gradual drift versions, as well as three real-world datasets, suggest that the new methods improve the accuracy results and the detections of STEPD and other well-known and/or recent concept drift detectors in many scenarios, with little impact on memory and run-time usage. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:220 / 234
页数:15
相关论文
共 50 条
  • [1] Alternatives to Fisher's exact test
    Lydersen, Stian
    Fagerland, Morten Wang
    Laake, Petter
    TIDSSKRIFT FOR DEN NORSKE LAEGEFORENING, 2019, 139 (14) : 1397 - 1397
  • [2] Statistical test for rough set approximation based on Fisher's exact test
    Tsumoto, S
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2002, 2475 : 381 - 388
  • [3] Concept Drift Detection Method Based on Online Performance Test
    Guo H.-S.
    Zhang A.-J.
    Wang W.-J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 932 - 947
  • [4] Testing association with Fisher's Exact test
    Warner, Pamela
    JOURNAL OF FAMILY PLANNING AND REPRODUCTIVE HEALTH CARE, 2013, 39 (04): : 281 - 284
  • [5] An optimal property of the exact multinomial test and the extended Fisher's exact test
    Kang, SH
    STATISTICS & PROBABILITY LETTERS, 1999, 44 (02) : 201 - 207
  • [6] Fisher's exact test - a clinician's perspective
    Cormack, RS
    BRITISH JOURNAL OF ANAESTHESIA, 2006, 96 (02) : 285P - 285P
  • [7] Fisher's Exact Test for Misclassified Data
    Lee, Tze-San
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2011, 10 (01) : 132 - 138
  • [8] FISHER EXACT TEST
    UPTON, GJG
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1992, 155 : 395 - 402
  • [9] The Comparison of the Unconditional and Conditional Exact Power of Fisher's Exact Test
    Kang, Seung-Ho
    Park, Yoonsoo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (05) : 883 - 890
  • [10] On conditions for validity of the approximations to Fisher's exact test
    Andres, AM
    Tejedor, IH
    BIOMETRICAL JOURNAL, 1997, 39 (08) : 935 - 954