A new unsupervised predictive-model self-assessment approach that SCALEs

被引:6
作者
Ventura, Francesco [1 ]
Proto, Stefano [1 ]
Apiletti, Daniele [1 ]
Cerquitelli, Tania [1 ]
Panicucci, Simone [2 ]
Baralis, Elena [1 ]
Macii, Enrico [3 ]
Macii, Alberto [1 ]
机构
[1] Politecn Torino, Dept Control & Comp Engn, Turin, Italy
[2] COMAU SpA, Grugliasco, Italy
[3] Politecn Torino, Interuniv Dept Reg & Urban Studies & Planning, Turin, Italy
来源
2019 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS 2019) | 2019年
基金
欧盟地平线“2020”;
关键词
Self-Assessment; Silhouette; Big Data; Industry; 4.0;
D O I
10.1109/BigDataCongress.2019.00033
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Evaluating the degradation of predictive models over time has always been a difficult task, also considering that new unseen data might not fit the training distribution. This is a well-known problem in real-world use cases, where collecting the historical training set for all possible prediction labels may be very hard, too expensive or completely unfeasible. To solve this issue, we present a new unsupervised approach to detect and evaluate the degradation of classification and prediction models, based on a scalable variant of the Silhouette index, named Descriptor Silhouette, specifically designed to advance current Big Data state-of-the-art solutions. The newly proposed strategy has been tested and validated over both synthetic and real-world industrial use cases. To this aim, it has been included in a framework named SCALE and resulted to be efficient and more effective in assessing the degradation of prediction performance than current state-of-the-art best solutions.
引用
收藏
页码:144 / 148
页数:5
相关论文
共 13 条
  • [1] [Anonymous], P VLDB ENDOWMENT
  • [2] iSTEP, an integrated Self-Tuning Engine for Predictive maintenance in Industry 4.0
    Apiletti, Daniele
    Barberis, Claudia
    Cerquitelli, Tania
    Macii, Alberto
    Macii, Enrico
    Poncino, Massimo
    Ventura, Francesco
    [J]. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS, 2018, : 924 - 931
  • [3] Canizo M, 2017, 2017 IEEE INTERNATIONAL CONFERENCE ON PROGNOSTICS AND HEALTH MANAGEMENT (ICPHM), P70, DOI 10.1109/ICPHM.2017.7998308
  • [4] D'silva GM, 2017, 2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), P1804, DOI 10.1109/RTEICT.2017.8256910
  • [5] Mining Sensor Data for Predictive Maintenance in the Automotive Industry
    Giobergia, Flavio
    Baralis, Elena
    Camuglia, Maria
    Cerquitelli, Tania
    Mellia, Marco
    Neri, Alessandra
    Tricarico, Davide
    Tuninetti, Alessia
    [J]. 2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 351 - 360
  • [6] A new metric of absolute percentage error for intermittent demand forecasts
    Kim, Sungil
    Kim, Heeyoung
    [J]. INTERNATIONAL JOURNAL OF FORECASTING, 2016, 32 (03) : 669 - 679
  • [7] Liu Y., 2010, 2010 IEEE INT C DAT, P911
  • [8] Meng XR, 2016, J MACH LEARN RES, V17
  • [9] Pedregosa F, 2011, J MACH LEARN RES, V12, P2825
  • [10] A comprehensive review of big data analytics throughout product lifecycle to support sustainable smart manufacturing: A framework, challenges and future research directions
    Ren, Shan
    Zhang, Yingfeng
    Liu, Yang
    Sakao, Tomohiko
    Huisingh, Donald
    Almeida, Cecilia M. V. B.
    [J]. JOURNAL OF CLEANER PRODUCTION, 2019, 210 (1343-1365) : 1343 - 1365