PAS - A Feature Selection Process Definition for Industrial Settings

被引:0
作者
Luftensteiner, Sabrina [1 ,2 ]
Chasparis, Georgios C. [1 ]
Kueng, Josef [2 ]
机构
[1] Software Competence Ctr Hagenberg, Softwarepk 32a, A-4232 Hagenberg Im Muehlkreis, Austria
[2] Johannes Kepler Univ Linz, Inst Applicat Oriented Knowledge Proc, Altenberger Str 69, A-4040 Linz, Austria
来源
5TH INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING, ISM 2023 | 2024年 / 232卷
基金
欧盟地平线“2020”;
关键词
Feature Selection; Process Definition; Industrial (Sensor) Data;
D O I
10.1016/j.procs.2024.01.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Industry 4.0, and the accompanying digitalization of industrial processes, lead to a continuously increasing amount of data. One reason is the installation of more sensors on machines and production lines in combination with shorter measurement intervals. The rising amount of data requires elaborate methods to remove redundant and uninformative data to reduce time complexity and required computational resources. This could be achieved by feature selection methods in combination with the right preprocessing and enhancement methods to generate and identify important features. Feature selection methods are mostly described in a methodological way without knowing how data is handled prior to their application, assuming a high-quality dataset. To fill this gap, we define a process for feature selection, focusing on time-series industrial data. The definition consists of three phases for the data adaptation: Preprocessing, Amplification and Selection. Each phase fulfills a specific purpose, ranging from handling missing and censored data to the enhancement of the dataset by higher-order data and aggregations and the selection itself. Our experiments show that the application of these phases on three different industrial use-cases lead to better accuracy results in machine learning models. (c) 2023 The Authors. Published by Elsevier B.V.
引用
收藏
页码:308 / 316
页数:9
相关论文
共 50 条
  • [41] An Industrial Internet of Things Feature Selection Method Based on Potential Entropy Evaluation Criteria
    Zhao, Long
    Dong, Xiangjun
    IEEE ACCESS, 2018, 6 : 4608 - 4617
  • [42] A Rigorous Information-Theoretic Definition of Redundancy and Relevancy in Feature Selection Based on (Partial) Information Decomposition
    Wollstadt, Patricia
    Schmitt, Sebastian
    Wibral, Michael
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [43] Feature Interaction for Streaming Feature Selection
    Zhou, Peng
    Li, Peipei
    Zhao, Shu
    Wu, Xindong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4691 - 4702
  • [44] Threshold Feature Selection (TFS), an Improvement over Forward selection in Steganalysis Feature selection
    Hosseini, Morteza Darvish Morshedi
    Mahdavi, Mojtaba
    2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2015, : 187 - 194
  • [45] Reinforcing the Decision-making Process in Chemometrics: Feature Selection and Algorithm Optimization
    Sarsam, Samer Muthana
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 11 - 16
  • [46] Process monitoring for quality-a feature selection method for highly unbalance data
    Diaz, Carlos A. Escobar
    Arinez, Jorge
    Arregoyta, Daniela Macias
    Morales-Menendez, Ruben
    INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2022, 16 (02): : 557 - 572
  • [47] A Gaussian process embedded feature selection method based on automatic relevance determination
    Deng, Yushi
    Eden, Mario
    Cremaschi, Selen
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 191
  • [48] Process monitoring for quality–a feature selection method for highly unbalanced binary data
    Carlos A. Escobar Diaz
    Jorge Arinez
    Daniela Macías Arregoyta
    Ruben Morales-Menendez
    International Journal on Interactive Design and Manufacturing (IJIDeM), 2022, 16 : 557 - 572
  • [49] Incremental feature selection by sample selection and feature-based accelerator
    Yang, Yanyan
    Chen, Degang
    Zhang, Xiao
    Ji, Zhenyan
    Zhang, Yingjun
    APPLIED SOFT COMPUTING, 2022, 121
  • [50] Cascade Feature Selection and ELM for automatic fault diagnosis of the Tennessee Eastman process
    Boldt, Francisco de Assis
    Rauber, Thomas W.
    Varejao, Flavio M.
    NEUROCOMPUTING, 2017, 239 : 238 - 248