Feature selection based convolutional neural network pruning and its application in calibration modeling for NIR spectroscopy

被引：27

作者：

Chen, Yuan-yuan ^{[1
,2
]}

Wang, Zhi-bin ^{[2
,3
]}

机构：

[1] North Univ China, Sch Informat & Commun Engn, Taiyuan, Shanxi, Peoples R China

[2] Engn Technol Res Ctr Shanxi Prov Optoelect Inform, Taiyuan, Shanxi, Peoples R China

[3] North Univ China, Sch Sci, Taiyuan, Shanxi, Peoples R China

来源：

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS | 2019年 / 191卷

基金：

中国国家自然科学基金;

关键词：

Calibration modeling; NIR spectroscopy; Convolutional neural network pruning; Feature selection;

D O I：

10.1016/j.chemolab.2019.06.004

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In our previous studies, we have found that convolutional neural network (CNN) can be applied to establish calibration model in the field of near infrared (NIR) spectroscopy. However, the values of CNN parameters are carefully chosen based on trial-and-error method, including convolutional kernel width (CKW), number of convolutional kernels (NCK), stride steps etc., otherwise underfitting phenomenon may occur and the generalized performance of calibration model will become worse. The possible reason is that the relationship between these parameters and model's generalized performance is not clear. Hence, to answer this question, this paper firstly investigated the influence of these parameters in detail and found that (1) if the CNN parameters' values are not carefully designed, the number of weights between full-connected and output layer is so large that limited samples in the training set cannot well fit the nonlinear relationship. (2) while convolutional kernels move through different subintervals of NIR spectra, features extracted with varied convolutional kernel width (VCKW) are more representative for calibration modeling than with fixed convolutional kernel width (FCKW). (3) in the subintervals near those absorption peaks, little stride steps (smaller than CKW) is prefer, because it means the extracted features are overlapping, which can capture the information around absorption peaks in detail. Additionally, the experimental results also showed that generalized performance of calibration model based on extracted CNN features outperforms that of based on raw NIR spectra.

引用

页码：103 / 108

页数：6

共 26 条

[1] Convolutional neural networks for vibrational spectroscopic data analysis [J].

Acquarelli, Jacopo ;

van Laarhoven, Twan ;

Gerretzen, Jan ;

Tran, Thanh N. ;

Buydens, Lutgarde M. C. ;

Marchiori, Elena .

ANALYTICA CHIMICA ACTA, 2017, 954 :22-31

[2]

Afara I. O., 2018, DEEP LEARNING CLASSI

[3]

[Anonymous], PRUNING CONVOLUTIONA

[4] Structured Pruning of Deep Convolutional Neural Networks [J].

Anwar, Sajid ;

Hwang, Kyuyeon ;

Sung, Wonyong .

ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2017, 13 (03)

[5]

Ardic E., 2018, 2018 26 SIGN PROC CO, P1

[6]

Bjerrum E. J., 2017, DATA AUGMENTATION SP

[7] End-to-end quantitative analysis modeling of near-infrared spectroscopy based on convolutional neural network [J].

Chen, Yuan-Yuan ;

Wang, Zhi-Bin .

JOURNAL OF CHEMOMETRICS, 2019, 33 (05)

[8] Quantitative analysis modeling of infrared spectroscopy based on ensemble convolutional neural networks [J].

Chen Yuanyuan ;

Wang Zhibin .

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 181 :1-10

[9] Modern practical convolutional neural networks for multivariate regression: Applications to NIR calibration [J].

Cui, Chenhao ;

Fearn, Tom .

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 :9-20

[10]

He Y., 2018, ARXIV E PRINTS

← 1 2 3 →