Multi-Modal Convolutional Dictionary Learning

被引:30
作者
Gao, Fangyuan [1 ]
Deng, Xin [1 ]
Xu, Mai [2 ]
Xu, Jingyi [2 ]
Dragotti, Pier Luigi [3 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[2] Beihang Univ, Dept Elect Informat Engn, Beijing 100191, Peoples R China
[3] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
基金
北京市自然科学基金;
关键词
Dictionaries; Training; Memory management; Noise level; Toy manufacturing industry; Standards; Paints; Multi-modal dictionary learning; convolutional sparse coding; image denoising; IMAGE SUPERRESOLUTION; LOW-RANK; SPARSE; TRANSFORM;
D O I
10.1109/TIP.2022.3141251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional dictionary learning has become increasingly popular in signal and image processing for its ability to overcome the limitations of traditional patch-based dictionary learning. Although most studies on convolutional dictionary learning mainly focus on the unimodal case, real-world image processing tasks usually involve images from multiple modalities, e.g., visible and near-infrared (NIR) images. Thus, it is necessary to explore convolutional dictionary learning across different modalities. In this paper, we propose a novel multi-modal convolutional dictionary learning algorithm, which efficiently correlates different image modalities and fully considers neighborhood information at the image level. In this model, each modality is represented by two convolutional dictionaries, in which one dictionary is for common feature representation and the other is for unique feature representation. The model is constrained by the requirement that the convolutional sparse representations (CSRs) for the common features should be the same across different modalities, considering that these images are captured from the same scene. We propose a new training method based on the alternating direction method of multipliers (ADMM) to alternatively learn the common and unique dictionaries in the discrete Fourier transform (DFT) domain. We show that our model converges in less than 20 iterations between the convolutional dictionary updating and the CSRs calculation. The effectiveness of the proposed dictionary learning algorithm is demonstrated on various multimodal image processing tasks, achieves better performance than both dictionary learning methods and deep learning based methods with limited training data.
引用
收藏
页码:1325 / 1339
页数:15
相关论文
共 57 条
  • [1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation
    Aharon, Michal
    Elad, Michael
    Bruckstein, Alfred
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) : 4311 - 4322
  • [2] [Anonymous], 2017, ARXIV170902893
  • [3] [Anonymous], 2016, SPARSE OPTIMIZATION
  • [4] Multimodal Task-Driven Dictionary Learning for Image Classification
    Bahrampour, Soheil
    Nasrabadi, Nasser M.
    Ray, Asok
    Jenkins, William Kenneth
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) : 24 - 38
  • [5] Convolutional Sparse Coding for Compressed Sensing CT Reconstruction
    Bao, Peng
    Xia, Wenjun
    Yang, Kang
    Chen, Weiyan
    Chen, Mianyi
    Xi, Yan
    Niu, Shanzhou
    Zhou, Jiliu
    Zhang, He
    Sun, Huaiqiang
    Wang, Zhangyang
    Zhang, Yi
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2019, 38 (11) : 2607 - 2619
  • [6] Spectral Video Compression Using Convolutional Sparse Coding
    Barajas-Solano, C.
    Ramirez, J. M.
    Arguello, H.
    [J]. 2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 253 - 262
  • [7] Fusion of Infrared and Visible Sensor Images Based on Anisotropic Diffusion and Karhunen-Loeve Transform
    Bavirisetti, Durga Prasad
    Dhuli, Ravindra
    [J]. IEEE SENSORS JOURNAL, 2016, 16 (01) : 203 - 209
  • [8] Distributed optimization and statistical learning via the alternating direction method of multipliers
    Boyd S.
    Parikh N.
    Chu E.
    Peleato B.
    Eckstein J.
    [J]. Foundations and Trends in Machine Learning, 2010, 3 (01): : 1 - 122
  • [9] Fast Convolutional Sparse Coding
    Bristow, Hilton
    Eriksson, Anders
    Lucey, Simon
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 391 - 398
  • [10] Unsupervised Transfer Learning via Multi-Scale Convolutional Sparse Coding for Biomedical Applications
    Chang, Hang
    Han, Ju
    Zhong, Cheng
    Snijders, Antoine M.
    Mao, Jian-Hua
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) : 1182 - 1194