Concept Drift Monitoring and Diagnostics of Supervised Learning Models via Score Vectors

被引:5
|
作者
Zhang, Kungang [1 ]
Bui, Anh T. [2 ]
Apley, Daniel W. [1 ]
机构
[1] Northwestern Univ, Dept Ind Engn & Management Sci, Evanston, IL 60201 USA
[2] Virginia Commonwealth Univ, Dept Stat Sci & Operat Res, Richmond, VA USA
基金
美国国家科学基金会;
关键词
Control chart; Multivariate EWMA; Predictive model; Score function; CONTROL CHART; TESTS;
D O I
10.1080/00401706.2022.2124310
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Supervised learning models are one of the most fundamental classes of models. Viewing supervised learning from a probabilistic perspective, the set of training data to which the model is fitted is usually assumed to follow a stationary distribution. However, this stationarity assumption is often violated in a phenomenon called concept drift, which refers to changes over time in the predictive relationship between covariates X and a response variable Y and can render trained models suboptimal or obsolete. We develop a comprehensive and computationally efficient framework for detecting, monitoring, and diagnosing concept drift. Specifically, we monitor the Fisher score vector, defined as the gradient of the log-likelihood for the fitted model, using a form of multivariate exponentially weighted moving average, which monitors for general changes in the mean of a random vector. In spite of the substantial performance advantages that we demonstrate over popular error-based methods, a score-based approach has not been previously considered for concept drift monitoring. Advantages of the proposed score-based framework include applicability to broad classes of parametric models, more powerful detection of changes as shown in theory and experiments, and inherent diagnostic capabilities for helping to identify the nature of the changes.
引用
收藏
页码:137 / 149
页数:13
相关论文
共 50 条
  • [41] Prosthetic Valve Monitoring viaIn Situ Pressure Sensors: In Silico Concept Evaluation using Supervised Learning
    Bailoor, Shantanu
    Seo, Jung-Hee
    Dasi, Lakshmi
    Schena, Stefano
    Mittal, Rajat
    CARDIOVASCULAR ENGINEERING AND TECHNOLOGY, 2022, 13 (01) : 90 - 103
  • [42] Predicting the band gap of ZnO quantum dots via supervised machine learning models
    Regonia, Paul Rossener
    Pelicano, Christian Mark
    Tani, Ryosuke
    Ishizumi, Atsushi
    Yanagi, Hisao
    Ikeda, Kazushi
    OPTIK, 2020, 207 (207):
  • [43] Transferring Deep Models for Cloud Detection in Multisensor Images via Weakly Supervised Learning
    Zhu, Shaocong
    Li, Zhiwei
    Shen, Huanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 18
  • [44] Weakly Supervised Learning of Part-based Models for Interaction Prediction via LDA
    Chen, Jia-Lin
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 669 - 671
  • [45] Tuberculosis Diagnostics and Localization in Chest X-Rays via Deep Learning Models
    Guo, Ruihua
    Passi, Kalpdrum
    Jain, Chakresh Kumar
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2020, 3
  • [46] Visual Concept Detection and Annotation via Multiple Kernel Learning of Multiple Models
    Zhang, Yu
    Bres, Stephane
    Chen, Liming
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2013), PT II, 2013, 8157 : 581 - 590
  • [47] Towards reduction of expert bias on Gleason score classification via a semi-supervised deep learning strategy
    Plazas, Miguel
    Ramos-Pollan, Raul
    Leon, Fabian
    Martinez, Fabio
    MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032
  • [48] Semi-supervised vibration-based structural health monitoring via deep graph learning and contrastive learning
    Dang, Viet-Hung
    Le-Nguyen, Khuong
    Nguyen, Truong-Thang
    STRUCTURES, 2023, 51 : 158 - 170
  • [49] Incorporating Intratumoral Heterogeneity into Weakly-Supervised Deep Learning Models via Variance Pooling
    Carmichael, Iain
    Song, Andrew H.
    Chen, Richard J.
    Williamson, Drew F. K.
    Chen, Tiffany Y.
    Mahmood, Faisal
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 387 - 397
  • [50] Indoor occupancy monitoring using environmental feature fusion and semi-supervised machine learning models
    Mena-Martinez, Alma
    Alvarado-Uribe, Joanna
    Molino-Minero-Re, Erik
    Ceballos, Hector G.
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2024, 17 (06) : 695 - 717