No-Reference Screen Content Image Quality Assessment Based on Edge Assistance and Multi-Scale Transformer

被引:0
|
作者
Chen, Yu-Zhong [1 ,2 ,3 ]
Chen, You-Kun [1 ,2 ]
Lin, Min-Hu [1 ,2 ]
Niu, Yu-Zhen [1 ,2 ,3 ]
机构
[1] College of Computer and Data Science, Fuzhou University, Fujian, Fuzhou
[2] Fujian Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fujian, Fuzhou
[3] Big Data Intelligence Engineering Research Center, The Ministry of Education, Fujian, Fuzhou
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 07期
基金
中国国家自然科学基金;
关键词
convolutional neural network; laplacian of gaussian; multi-scale features; no-reference screen content image quality assessment; Transformer;
D O I
10.12263/DZXB.20230607
中图分类号
学科分类号
摘要
Different from the natural images captured from real-world scenes, screen content images (SCI) are synthetic images typically composed of various multimedia contents, such as computer-generated text, graphics, and animations. Existing SCI quality assessment methods usually fail to fully consider the impacts of image edge and global context on the perceived quality of screen content images. To address the above issues, this paper proposed a no-reference screen content image quality assessment model based on edge assistance and multi-scale Transformer. Firstly, an edge structure map consisting of the high-frequency information in a distorted SCI is constructed using Gaussian Laplace operators. Then a convolutional neural network (CNN) is used to extract and fuse the multi-scale features from the input distorted SCI and the corresponding edge structure map, thus providing additional edge information gain for model training. In addition, this paper further proposed a multi-scale feature encoding module based on Transformer to better model the global context information of different scale images and edge features on the basis of the local features obtained by CNN. The experimental results show that the model proposed in this paper outperforms the state-of-the-art no-reference and full-reference SCI quality assessment methods, and achieves higher consistency with the subjective visual perception. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:2242 / 2256
页数:14
相关论文
共 38 条
  • [1] PENG W H, WALLS F G, COHEN R A, Et al., Overview of screen content video coding: Technologies, standards, and beyond, IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 6, 4, pp. 393-408, (2016)
  • [2] JIA X, CAO Y D, SUN F M, Et al., Vein image acquisition method based on quality assessment model without reference, Acta Electronica Sinica, 43, 2, pp. 236-241, (2015)
  • [3] CHEN W J, YANG C L., Feature-space optimization-inspired and self-attention enhanced neural network reconstruction algorithm for image compressive sensing, Acta Electronica Sinica, 50, 11, pp. 2629-2637, (2022)
  • [4] LI Q Y, ZHANG X L., Aerial remote sensing image transmission using multiple description coding and unequal error protection, Acta Electronica Sinica, 38, 11, pp. 2655-2659, (2010)
  • [5] NI Z K, ZENG H Q, MA L, Et al., A Gabor feature-based quality assessment model for the screen content images, IEEE Transactions on Image Processing, 27, 9, pp. 4516-4528, (2018)
  • [6] WEI L S, CHEN J H, NIU Y Z., Blind quality assessment for screen content images based on edge and structure, Journal of Beijing University of Aeronautics and Astronautics, 45, 12, pp. 2449-2455, (2019)
  • [7] LIN G M, WEI L S, NIU Y Z., No reference image quality assessment for screen content image based on multi-scale features, Journal of Chinese Computer Systems, 43, 2, pp. 372-380, (2022)
  • [8] ZHANG Y, CHANDLER D M, MOU X Q., Quality assessment of screen content images via convolutional-neural-network-based synthetic/natural segmentation, IEEE Transactions on Image Processing, 27, 10, pp. 5113-5128, (2018)
  • [9] JIANG X H, SHEN L Q, DING Q, Et al., Screen content image quality assessment based on convolutional neural networks, Journal of Visual Communication and Image Representation, 67, (2020)
  • [10] GAO R, HUANG Z Q, LIU S G., Multi-task deep learning for no-reference screen content image quality assessment, International Conference on Multimedia Modeling, pp. 213-226, (2021)