No-Reference Screen Content Image Quality Assessment Based on Edge Assistance and Multi-Scale Transformer

被引:0
|
作者
Chen, Yu-Zhong [1 ,2 ,3 ]
Chen, You-Kun [1 ,2 ]
Lin, Min-Hu [1 ,2 ]
Niu, Yu-Zhen [1 ,2 ,3 ]
机构
[1] College of Computer and Data Science, Fuzhou University, Fujian, Fuzhou
[2] Fujian Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fujian, Fuzhou
[3] Big Data Intelligence Engineering Research Center, The Ministry of Education, Fujian, Fuzhou
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 07期
基金
中国国家自然科学基金;
关键词
convolutional neural network; laplacian of gaussian; multi-scale features; no-reference screen content image quality assessment; Transformer;
D O I
10.12263/DZXB.20230607
中图分类号
学科分类号
摘要
Different from the natural images captured from real-world scenes, screen content images (SCI) are synthetic images typically composed of various multimedia contents, such as computer-generated text, graphics, and animations. Existing SCI quality assessment methods usually fail to fully consider the impacts of image edge and global context on the perceived quality of screen content images. To address the above issues, this paper proposed a no-reference screen content image quality assessment model based on edge assistance and multi-scale Transformer. Firstly, an edge structure map consisting of the high-frequency information in a distorted SCI is constructed using Gaussian Laplace operators. Then a convolutional neural network (CNN) is used to extract and fuse the multi-scale features from the input distorted SCI and the corresponding edge structure map, thus providing additional edge information gain for model training. In addition, this paper further proposed a multi-scale feature encoding module based on Transformer to better model the global context information of different scale images and edge features on the basis of the local features obtained by CNN. The experimental results show that the model proposed in this paper outperforms the state-of-the-art no-reference and full-reference SCI quality assessment methods, and achieves higher consistency with the subjective visual perception. © 2024 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:2242 / 2256
页数:14
相关论文
共 38 条
  • [31] NI Z K, MA L, ZENG H Q, Et al., ESIM: Edge similarity for screen content image quality assessment, IEEE Transactions on Image Processing, 26, 10, pp. 4818-4831, (2017)
  • [32] Final Report From the Video Quality Experts Group on the Validation of Objective Models of Video[R/ OL]
  • [33] WANG Z, BOVIK A C, SHEIKH H R, Et al., Image quality assessment: From error visibility to structural similarity[J], IEEE Transactions on Image Processing, 13, 4, pp. 600-612, (2004)
  • [34] XUE W F, ZHANG L, MOU X Q, Et al., Gradient magnitude similarity deviation: A highly efficient perceptual image quality index, IEEE Transactions on Image Processing, 23, 2, pp. 684-695, (2014)
  • [35] GU K, WANG S Q, YANG H, Et al., Saliency-guided quality assessment of screen content images, IEEE Transactions on Multimedia, 18, 6, pp. 1098-1110, (2016)
  • [36] TOLIE H F, FARAJI M R., Screen content image quality assessment using distortion-based directional edge and gradient similarity maps, Signal Processing, 101, (2022)
  • [37] FANG Y M, DU R G, ZUO Y F, Et al., Perceptual quality assessment for screen content images by spatial continuity, IEEE Transactions on Circuits and Systems for Video Technology, 30, 11, pp. 4050-4063, (2020)
  • [38] YANG J C, ZHAO Y, LIU J C, Et al., No reference quality assessment for screen content images using stacked autoencoders in pictorial and textual regions, IEEE Transactions on Cybernetics, 52, 5, pp. 2798-2810, (2022)