A neural network regularization method to address variance inflation in autoencoders

Cited by: 0
Authors
Kim, Boeun [1]
Ryu, Kyung Hwan [2]
Heo, Seongmin [3]
Affiliations
[1] Princeton Univ, Andlinger Ctr Energy & Environm, Princeton, NJ 08544 USA
[2] Sunchon Natl Univ, Dept Chem Engn, 225 Jungang Ro, Sunchon 57922, Jeollanam Do, South Korea
[3] Dankook Univ, Dept Chem Engn, Yongin 16890, South Korea
Source
IFAC PAPERSONLINE | 2022 / Vol. 55 / Issue 07
Keywords
principal component analysis; autoencoder; feature extraction; feature variance; neural network regularization;
DOI
10.1016/j.ifacol.2022.07.533
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Various machine learning techniques can reduce the dimensionality of original data while minimizing information loss. Principal component analysis (PCA) is one of the best-known such techniques; it transforms the original correlated variables into uncorrelated variables called principal components. Although PCA is known to preserve the total variance of the original data during the transformation, variance inflation can occur in some cases, where the total variance of the principal components becomes much larger than that of the original variables. It is important to prevent variance inflation, as it can degrade the performance of downstream application systems (e.g., process monitoring systems) that are designed on the basis of principal components with inflated variances. Variance inflation is also likely to occur during the training of an autoencoder, a special type of neural network that performs a nonlinear version of PCA. Although several neural network regularization methods are available to alleviate variance inflation, none of them is tailored to this task. To this end, this work proposes an alternative neural network regularization method that can strongly regulate the total variance in the feature space. Using the Tennessee Eastman process as an illustrative example, the proposed regularization method is compared with existing ones in terms of neural network overfitting, variance inflation, and training time. Copyright (C) 2022 The Authors.
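The variance-preservation property mentioned in the abstract can be illustrated with a minimal NumPy sketch. The abstract does not give the form of the proposed regularizer, so `variance_penalty` below is a hypothetical stand-in (a squared gap between feature-space and input-space total variance) and should not be read as the authors' method. The synthetic data and the scaling factor of 3 are likewise assumptions made only for illustration.

```python
import numpy as np


def total_variance(data):
    """Sum of per-column sample variances (trace of the covariance matrix)."""
    return float(np.var(data, axis=0, ddof=1).sum())


def pca_scores(data):
    """Project centered data onto all principal directions."""
    centered = data - data.mean(axis=0)
    # Rows of vt are the orthonormal principal directions.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt.T


def variance_penalty(features, reference_total_variance, weight=1.0):
    """Hypothetical penalty: squared gap between the total variance of the
    features and a reference total variance (here, that of the input data)."""
    gap = total_variance(features) - reference_total_variance
    return weight * gap ** 2


rng = np.random.default_rng(0)
# Correlated synthetic data: 200 samples, 5 variables (illustrative only).
data = rng.normal(size=(200, 5)) @ rng.normal(size=(5, 5))

scores = pca_scores(data)
# Stand-in for an encoder whose features have inflated variance.
inflated = 3.0 * scores

tv_data = total_variance(data)
print("total variance preserved by PCA:", np.isclose(total_variance(scores), tv_data))
print("penalty on PCA scores:", variance_penalty(scores, tv_data))
print("penalty on inflated features:", variance_penalty(inflated, tv_data))
```

In an autoencoder setting, a term of this kind would presumably be added to the reconstruction loss and evaluated on the encoder output for each batch; how the paper actually weights or formulates it is not stated in this record.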
Pages: 744-749
Page count: 6