MS-Net: A lightweight separable ConvNet for multi-dimensional image processing

被引:2
|
作者
Hou, Zhenning [1 ]
Shi, Yunhui [1 ]
Wang, Jin [1 ]
Cui, Yingxuan [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-dimensional image processing; Separable convolution neural network; Feature extraction and representation; Matricization;
D O I
10.1007/s11042-021-10903-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the core technology of deep learning, convolutional neural networks have been widely applied in a variety of computer vision tasks and have achieved state-of-the-art performance. However, it's difficult and inefficient for them to deal with high dimensional image signals due to the dramatic increase of training parameters. In this paper, we present a lightweight and efficient MS-Net for the multi-dimensional(MD) image processing, which provides a promising way to handle MD images, especially for devices with limited computational capacity. It takes advantage of a series of one dimensional convolution kernels and introduces a separable structure in the ConvNet throughout the learning process to handle MD image signals. Meanwhile, multiple group convolutions with kernel size 1 x 1 are used to extract channel information. Then the information of each dimension and channel is fused by a fusion module to extract the complete image features. Thus the proposed MS-Net significantly reduces the training complexity, parameters and memory cost. The proposed MS-Net is evaluated on both 2D and 3D benchmarks CIFAR-10, CIFAR-100 and KTH. Extensive experimental results show that the MS-Net achieves competitive performance with greatly reduced computational and memory cost compared with the state-of-the-art ConvNet models.
引用
收藏
页码:25673 / 25688
页数:16
相关论文
共 50 条
  • [41] Multi-dimensional histogram-based image segmentation
    Weiler, Daniel
    Eggert, Julian
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 963 - +
  • [42] Multi-Source Skyline Queries Processing in Multi-Dimensional Space
    Li, Cuiping
    He, Wenlin
    Chen, Hong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PROCEEDINGS, 2010, 6118 : 471 - 479
  • [43] A Sparse Multi-Dimensional Fast Fourier Transform with Stability to Noise in the Context of Image Processing and Change Detection
    Letourneau, Pierre-David
    Langston, M. Harper
    Lethin, Richard
    2016 IEEE HIGH PERFORMANCE EXTREME COMPUTING CONFERENCE (HPEC), 2016,
  • [44] Handwritten Chinese Text Recognition Using Separable Multi-Dimensional Recurrent Neural Network
    Wu, Yi-Chao
    Yin, Fei
    Chen, Zhuo
    Liu, Cheng-Lin
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 79 - 84
  • [45] LMDA-Net:A lightweight multi-dimensional attention network for general EEG-based brain-computer interfaces and interpretability
    Miao, Zhengqing
    Zhao, Meirong
    Zhangbc, Xin
    Ming, Dong
    NEUROIMAGE, 2023, 276
  • [46] Multi-dimensional optical code processing in MPLS photonic routers
    Cincotti, G.
    Moreolo, M. Svaluto
    Manzacca, G.
    Wang, X.
    Wada, N.
    Kitayama, K. -, I
    2006 OPTICAL FIBER COMMUNICATION CONFERENCE/NATIONAL FIBER OPTIC ENGINEERS CONFERENCE, VOLS 1-6, 2006, : 112 - +
  • [47] Distributed Neural Processing Predictors of Multi-dimensional Properties of Affect
    Bush, Keith A.
    Inman, Cory S.
    Hamann, Stephan
    Kilts, Clinton D.
    James, G. Andrew
    FRONTIERS IN HUMAN NEUROSCIENCE, 2017, 11
  • [48] BBoxDB streams: scalable processing of multi-dimensional data streams
    Jan Kristof Nidzwetzki
    Ralf Hartmut Güting
    Distributed and Parallel Databases, 2022, 40 : 559 - 625
  • [49] BBoxDB streams: scalable processing of multi-dimensional data streams
    Nidzwetzki, Jan Kristof
    Gueting, Ralf Hartmut
    DISTRIBUTED AND PARALLEL DATABASES, 2022, 40 (2-3) : 559 - 625
  • [50] Computation of the minimum data storage for multi-dimensional signal processing
    Luican, Ilie I.
    Zhu, Hongwei
    Balasa, Florin
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 25 - +