Multi-Category Image Super-Resolution with Convolutional Neural Network and Multi-Task Learning

被引:3
|
作者
Urazoe, Kazuya [1 ,3 ]
Kuroki, Nobutaka [1 ]
Kato, Yu [1 ,4 ]
Ohtani, Shinya [1 ,5 ]
Hirose, Tetsuya [2 ]
Numa, Masahiro [1 ]
机构
[1] Kobe Univ, Grad Sch Engn, Kobe, Hyogo 6578501, Japan
[2] Osaka Univ, Grad Sch Engn, Suita, Osaka 5650871, Japan
[3] Panasonic Corp, Osaka, Japan
[4] EIZO Corp, Haku San, Japan
[5] Toyota Motor Co Ltd, Tokyo, Japan
关键词
super-resolution; resolution enhancement; convolutional neural network; multi-task learning; deep learning;
D O I
10.1587/transinf.2020EDP7054
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an image super-resolution technique using a convolutional neural network (CNN) and multi-task learning for multiple image categories. The image categories include natural, manga, and text images. Their features differ from each other. However, several CNNs for super-resolution are trained with a single category. If the input image category is different from that of the training images, the performance of super-resolution is degraded. There are two possible solutions to manage multi-categories with conventional CNNs. The first involves the preparation of the CNNs for every category. This solution, however, requires a category classifier to select an appropriate CNN. The second is to learn all categories with a single CNN. In this solution, the CNN cannot optimize its internal behavior for each category. Therefore, this paper presents a super-resolution CNN architecture for multiple image categories. The proposed CNN has two parallel outputs for a high-resolution image and a category label. The main CNN for the high-resolution image is a normal three convolutional layer-architecture, and the sub neural network for the category label is branched out from its middle layer and consists of two fully-connected layers. This architecture can simultaneously learn the high-resolution image and its category using multi-task learning. The category information is used for optimizing the super-resolution. In an applied setting, the proposed CNN can automatically estimate the input image category and change the internal behavior. Experimental results of 2x image magnification have shown that the average peak signal-to-noise ratio for the proposed method is approximately 0.22 dB higher than that for the conventional super-resolution with no difference in processing time and parameters. We have ensured that the proposed method is useful when the input image category is varying.
引用
收藏
页码:183 / 193
页数:11
相关论文
共 50 条
  • [1] MMSRNet: Pathological image super-resolution by multi-task and multi-scale learning
    Wu, Xinyue
    Chen, Zhineng
    Peng, Changgen
    Ye, Xiongjun
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
  • [2] Multi-Task Interaction Learning for Spatiospectral Image Super-Resolution
    Ma, Qing
    Jiang, Junjun
    Liu, Xianming
    Ma, Jiayi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2950 - 2961
  • [3] High-Magnification Super-Resolution Reconstruction of Image with Multi-Task Learning
    Li, Yanghui
    Zhu, Hong
    Yu, Shunyuan
    ELECTRONICS, 2022, 11 (09)
  • [4] Multi-Task Convolutional Neural Network for Image Aesthetic Assessment
    Soydaner, Derya
    Wagemans, Johan
    IEEE ACCESS, 2024, 12 : 4716 - 4729
  • [5] Improvement of Text Image Super-Resolution Benefiting Multi-task Learning
    Honda, Kosuke
    Fujita, Hamido
    Kurematsu, Masaki
    ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 275 - 286
  • [6] Multi-Channel Convolutional Neural Networks for Image Super-Resolution
    Ohtani, Shinya
    Kato, Yu
    Kuroki, Nobutaka
    Hirose, Tetsuya
    Numa, Masahiro
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 572 - 580
  • [7] Single Image Super-Resolution Using Multi-scale Convolutional Neural Network
    Jia, Xiaoyi
    Xu, Xiangmin
    Cai, Bolun
    Guo, Kailing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 149 - 157
  • [8] Multi-Task Learning for Scene Text Image Super-Resolution with Multiple Transformers
    Honda, Kosuke
    Kurematsu, Masaki
    Fujita, Hamido
    Selamat, Ali
    ELECTRONICS, 2022, 11 (22)
  • [9] Multi-focus image fusion and super-resolution with convolutional neural network
    Yang, Bin
    Zhong, Jinying
    Li, Yuehua
    Chen, Zhongze
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2017, 15 (04)
  • [10] Image Super-Resolution with Multi-Channel Convolutional Neural Networks
    Kato, Yu
    Ohtani, Shinya
    Kuroki, Nobutaka
    Hirose, Tetsuya
    Numa, Masahiro
    2016 14TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2016,