Multi-Category Image Super-Resolution with Convolutional Neural Network and Multi-Task Learning

被引：3

作者：

Urazoe, Kazuya ^{[1
,3
]}

Kuroki, Nobutaka ^{[1
]}

Kato, Yu ^{[1
,4
]}

Ohtani, Shinya ^{[1
,5
]}

Hirose, Tetsuya ^{[2
]}

Numa, Masahiro ^{[1
]}

机构：

[1] Kobe Univ, Grad Sch Engn, Kobe, Hyogo 6578501, Japan

[2] Osaka Univ, Grad Sch Engn, Suita, Osaka 5650871, Japan

[3] Panasonic Corp, Osaka, Japan

[4] EIZO Corp, Haku San, Japan

[5] Toyota Motor Co Ltd, Tokyo, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2021年 / E104D卷 / 01期

关键词：

super-resolution; resolution enhancement; convolutional neural network; multi-task learning; deep learning;

D O I：

10.1587/transinf.2020EDP7054

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an image super-resolution technique using a convolutional neural network (CNN) and multi-task learning for multiple image categories. The image categories include natural, manga, and text images. Their features differ from each other. However, several CNNs for super-resolution are trained with a single category. If the input image category is different from that of the training images, the performance of super-resolution is degraded. There are two possible solutions to manage multi-categories with conventional CNNs. The first involves the preparation of the CNNs for every category. This solution, however, requires a category classifier to select an appropriate CNN. The second is to learn all categories with a single CNN. In this solution, the CNN cannot optimize its internal behavior for each category. Therefore, this paper presents a super-resolution CNN architecture for multiple image categories. The proposed CNN has two parallel outputs for a high-resolution image and a category label. The main CNN for the high-resolution image is a normal three convolutional layer-architecture, and the sub neural network for the category label is branched out from its middle layer and consists of two fully-connected layers. This architecture can simultaneously learn the high-resolution image and its category using multi-task learning. The category information is used for optimizing the super-resolution. In an applied setting, the proposed CNN can automatically estimate the input image category and change the internal behavior. Experimental results of 2x image magnification have shown that the average peak signal-to-noise ratio for the proposed method is approximately 0.22 dB higher than that for the conventional super-resolution with no difference in processing time and parameters. We have ensured that the proposed method is useful when the input image category is varying.

引用

页码：183 / 193

页数：11

共 50 条

[1] MMSRNet: Pathological image super-resolution by multi-task and multi-scale learning
Wu, Xinyue
Chen, Zhineng
Peng, Changgen
Ye, Xiongjun
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 81
[2] Multi-Task Interaction Learning for Spatiospectral Image Super-Resolution
Ma, Qing
Jiang, Junjun
Liu, Xianming
Ma, Jiayi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2950 - 2961
[3] High-Magnification Super-Resolution Reconstruction of Image with Multi-Task Learning
Li, Yanghui
Zhu, Hong
Yu, Shunyuan
ELECTRONICS, 2022, 11 (09)
[4] Multi-Task Convolutional Neural Network for Image Aesthetic Assessment
Soydaner, Derya
Wagemans, Johan
IEEE ACCESS, 2024, 12 : 4716 - 4729
[5] Improvement of Text Image Super-Resolution Benefiting Multi-task Learning
Honda, Kosuke
Fujita, Hamido
Kurematsu, Masaki
ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND PRACTICES IN ARTIFICIAL INTELLIGENCE, 2022, 13343 : 275 - 286
[6] Multi-Channel Convolutional Neural Networks for Image Super-Resolution
Ohtani, Shinya
Kato, Yu
Kuroki, Nobutaka
Hirose, Tetsuya
Numa, Masahiro
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2017, E100A (02) : 572 - 580
[7] Single Image Super-Resolution Using Multi-scale Convolutional Neural Network
Jia, Xiaoyi
Xu, Xiangmin
Cai, Bolun
Guo, Kailing
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 149 - 157
[8] Multi-Task Learning for Scene Text Image Super-Resolution with Multiple Transformers
Honda, Kosuke
Kurematsu, Masaki
Fujita, Hamido
Selamat, Ali
ELECTRONICS, 2022, 11 (22)
[9] Multi-focus image fusion and super-resolution with convolutional neural network
Yang, Bin
Zhong, Jinying
Li, Yuehua
Chen, Zhongze
INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2017, 15 (04)
[10] Image Super-Resolution with Multi-Channel Convolutional Neural Networks
Kato, Yu
Ohtani, Shinya
Kuroki, Nobutaka
Hirose, Tetsuya
Numa, Masahiro
2016 14TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2016,

← 1 2 3 4 5 →