Operational neural networks

被引:63
作者
Kiranyaz, Serkan [1 ]
Ince, Turker [2 ]
Iosifidis, Alexandros [3 ]
Gabbouj, Moncef [4 ]
机构
[1] Qatar Univ, Coll Engn, Elect Engn, Doha, Qatar
[2] Izmir Univ Econ, Elect & Elect Engn Dept, Izmir, Turkey
[3] Aarhus Univ, Dept Engn, Aarhus, Denmark
[4] Tampere Univ, Dept Comp Sci, Tampere, Finland
关键词
Operational neural network; Heterogeneous and nonlinear neural networks; Convolutional neural networks; NEURONAL DIVERSITY;
D O I
10.1007/s00521-020-04780-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feed-forward, fully connected artificial neural networks or the so-called multi-layer perceptrons are well-known universal approximators. However, their learning performance varies significantly depending on the function or the solution space that they attempt to approximate. This is mainly because of their homogenous configuration based solely on the linear neuron model. Therefore, while they learn very well those problems with a monotonous, relatively simple and linearly separable solution space, they may entirely fail to do so when the solution space is highly nonlinear and complex. Sharing the same linear neuron model with two additional constraints (local connections and weight sharing), this is also true for the conventional convolutional neural networks (CNNs) and it is, therefore, not surprising that in many challenging problems only the deep CNNs with a massive complexity and depth can achieve the required diversity and the learning performance. In order to address this drawback and also to accomplish a more generalized model over the convolutional neurons, this study proposes a novel network model, called operational neural networks (ONNs), which can be heterogeneous and encapsulate neurons with any set of operators to boost diversity and to learn highly complex and multi-modal functions or spaces with minimal network complexity and training data. Finally, the training method to back-propagate the error through the operational layers of ONNs is formulated. Experimental results over highly challenging problems demonstrate the superior learning capabilities of ONNs even with few neurons and hidden layers.
引用
收藏
页码:6645 / 6668
页数:24
相关论文
共 45 条
[1]  
[Anonymous], 1974, Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Science
[2]  
[Anonymous], Neural Information Processing: 19th International
[3]  
[Anonymous], 1993, NEURAL NETWORKS OPTI
[4]  
[Anonymous], C GRAPH PATT IM
[5]  
[Anonymous], 2015, PROC CVPR IEEE
[6]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[7]  
[Anonymous], 2010, Tech. rep. UM-CS- 2010-009
[8]  
[Anonymous], 2018, IEEE CVPR
[9]  
[Anonymous], 2016, CORR
[10]  
Chen YJ, 2015, PROC CVPR IEEE, P5261, DOI 10.1109/CVPR.2015.7299163