Structured Matrices and Their Application in Neural Networks: A Survey

被引:0
作者
Matthias Kissel
Klaus Diepold
机构
[1] Technical University of Munich,TUM School of Computation, Information and Technology
来源
New Generation Computing | 2023年 / 41卷
关键词
Matrix structures; Neural network; Efficient propagation; Fast inference;
D O I
暂无
中图分类号
学科分类号
摘要
Modern neural network architectures are becoming larger and deeper, with increasing computational resources needed for training and inference. One approach toward handling this increased resource consumption is to use structured weight matrices. By exploiting structures in weight matrices, the computational complexity for propagating information through the network can be reduced. However, choosing the right structure is not trivial, especially since there are many different matrix structures and structure classes. In this paper, we give an overview over the four main matrix structure classes, namely semiseparable matrices, matrices of low displacement rank, hierarchical matrices and products of sparse matrices. We recapitulate the definitions of each structure class, present special structure subclasses, and provide references to research papers in which the structures are used in the domain of neural networks. We present two benchmarks comparing the classes. First, we benchmark the error for approximating different test matrices. Second, we compare the prediction performance of neural networks in which the weight matrix of the last layer is replaced by structured matrices. After presenting the benchmark results, we discuss open research questions related to the use of structured matrices in neural networks and highlight future research directions.
引用
收藏
页码:697 / 722
页数:25
相关论文
共 50 条
[41]   Single-image deblurring with neural networks: A comparative survey [J].
Koh, Jaihyun ;
Lee, Jangho ;
Yoon, Sungroh .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 203
[42]   Application of neural networks in production system's simulation [J].
Moniaci, W ;
Carmellino, P ;
Pasero, E .
INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, :827-831
[43]   Parameter redundancy in neural networks: An application of Chebyshev polynomials [J].
Curry B. .
Computational Management Science, 2007, 4 (3) :227-242
[44]   APPLICATION OF AN IMPROVED GENETIC ALGORITHM TO THE LEARNING OF NEURAL NETWORKS [J].
IKUNO, Y ;
KAWABATA, H ;
SHIRAO, Y ;
HIRATA, M ;
NAGAHARA, T ;
INAGAKI, Y .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1994, E77A (04) :731-735
[45]   A TRANSPARENT ALTERNATIVE TO NEURAL NETWORKS WITH AN APPLICATION TO PREDICTING VOLATILITY [J].
Czasonis, Megan ;
Kritzman, Mark ;
Turkington, David .
JOURNAL OF INVESTMENT MANAGEMENT, 2025, 23 (03) :4-17
[46]   Application of Neural Networks in Injection Moulding Process Control [J].
S.-J. Huang ;
T.-H. Lee .
The International Journal of Advanced Manufacturing Technology, 2003, 21 :956-964
[47]   SCHEDULING WITH NEURAL NETWORKS - APPLICATION TO TIMETABLE-CONSTRUCTION [J].
PELLERIN, D ;
HERAULT, J .
NEUROCOMPUTING, 1994, 6 (04) :419-442
[48]   Application of neural networks to predict ice jam occurrence [J].
Massie, DD ;
White, KD ;
Daly, SF .
COLD REGIONS SCIENCE AND TECHNOLOGY, 2002, 35 (02) :115-122
[49]   Application of neural networks in injection moulding process control [J].
Huang, SJ ;
Lee, TH .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2003, 21 (12) :956-964
[50]   THE APPLICATION OF NEURAL NETWORKS TO THE PROCESS OF GAINING AND CONSOLIDATING THE KNOWLEDGE [J].
Plichta, Anna .
PROCEEDINGS - 25TH EUROPEAN CONFERENCE ON MODELLING AND SIMULATION, ECMS 2011, 2011, :436-439