Heuristic techniques to optimize neural network architecture in manufacturing applications

被引:20
作者
Ciancio, Claudio [1 ]
Ambrogio, Giuseppina [1 ]
Gagliardi, Francesco [1 ]
Musmanno, Roberto [1 ]
机构
[1] Univ Calabria, Dept Mech Energy & Management Engn, I-87036 Arcavacata Di Rende, Italy
关键词
Neural network architecture design; Genetic algorithm; Tabu search; Taguchi; Decision trees; 2D numerical simulations;
D O I
10.1007/s00521-015-1994-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays application of neural networks in the manufacturing field is widely assessed even if this type of problem is typically characterized by an insufficient availability of data for a robust network training. Satisfactory results can be found in the literature, in both forming and machining operations, regarding the use of a neural network as a predictive tool. Nevertheless, the research of the optimal network configuration is still based on trial-and-error approaches, rather than on the application of specific techniques . As a consequence, the best method to determine the optimal neural network configuration is still a lack of knowledge in the literature overview. According to that, a comparative analysis is proposed in this work. More in detail four different approaches have been used to increase the generalization abilities of a neural network. These methods are based, respectively, on the use of genetic algorithms, Taguchi, tabu search and decision trees. The parameters taken into account in this work are the training algorithm, the number of hidden layers, the number of neurons and the activation function of each hidden layer. These techniques have been firstly tested on three different datasets, generated through numerical simulations in the Deform2D environment, in an attempt to map the input-output relationship for an extrusion, a rolling and a shearing process. Subsequently, the same approach has been validated on a fourth dataset derived from the literature review for a complex industrial process to widely generalize and asses the proposed methodology in the whole manufacturing field. Four tests were carried out for each dataset modifying the original data with a random noise with zero mean and standard deviation of one, two and five per cent. The results show that the use of a suitable technique for determining the architecture of a neural network can generate a significant performance improvement compared to a trial-and-error approach.
引用
收藏
页码:2001 / 2015
页数:15
相关论文
共 53 条
[1]  
Al Timemy AH, 2010, BREAST CANC, V4, P6
[2]   Design of an optimized procedure to predict opposite performances in porthole die extrusion [J].
Ambrogio, G. ;
Gagliardi, F. .
NEURAL COMPUTING & APPLICATIONS, 2013, 23 (01) :195-206
[3]  
[Anonymous], 2011, MATL US GUID VERS 7
[4]   A generalized feedforward neural network architecture for classification and regression [J].
Arulampalam, G ;
Bouzerdoum, A .
NEURAL NETWORKS, 2003, 16 (5-6) :561-568
[5]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[6]  
Berger JO., 2013, Statistical decision theory and Bayesian analysis
[7]   Specification of training sets and the number of hidden neurons for multilayer perceptrons [J].
Camargo, LS ;
Yoneyama, T .
NEURAL COMPUTATION, 2001, 13 (12) :2673-2680
[8]   A new approach to study material bonding in extrusion porthole dies [J].
Ceretti, E. ;
Fratini, L. ;
Gagliardi, F. ;
Giardini, C. .
CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2009, 58 (01) :259-262
[9]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[10]  
De Jong K. A., 1992, Annals of Mathematics and Artificial Intelligence, V5, P1, DOI 10.1007/BF01530777