Image Quality Distortion Classification Using Vision Transformer

被引:0
作者
Lynn, Nay Chi [1 ]
Shimamura, Tetsuya [1 ]
机构
[1] Saitama Univ, Saitama, Japan
来源
ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, AINA 2024 | 2024年 / 199卷
关键词
D O I
10.1007/978-3-031-57840-3_32
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a method for classifying image quality distortions to identify common types of distortions typically present in images, utilizing a vision transformer. The method aims to enhance quality-related image processing approaches by identifying specific distortions as the initial step in distortion-based blind image quality assessment (BIQA). This simplifies the quality reconstruction process by tailoring it to the prior knowledge of distortion types, thereby aiding in improving image classification and potentially reducing biases caused by certain distortions. The proposed method is experimented on common benchmark image quality assessment (IQA) databases, including LIVE2008, TID2013, and KADID-10k. To generalize the performance with a larger database, we distorted images using four general distortion types: Gaussian noise, Gaussian blur, JPEG compression, and contrast degradation, applied to the ImageNet-1k database. The experimental results demonstrate that the proposed method outperforms other solutions in terms of accuracy
引用
收藏
页码:353 / 361
页数:9
相关论文
共 11 条
  • [1] Image classification with deep learning in the presence of noisy labels: A survey
    Algan, Gorkem
    Ulusoy, Ilkay
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 215
  • [2] IMAGE-RECONSTRUCTION AND RESTORATION - OVERVIEW OF COMMON ESTIMATION STRUCTURES AND PROBLEMS
    DEMOMENT, G
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 2024 - 2036
  • [3] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [4] imagej, US
  • [5] Lin HH, 2019, INT WORK QUAL MULTIM
  • [6] Lynn N.C., 2024, J. Signal Process., V28, P19
  • [7] No-Reference Image Quality Assessment in the Spatial Domain
    Mittal, Anish
    Moorthy, Anush Krishna
    Bovik, Alan Conrad
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2012, 21 (12) : 4695 - 4708
  • [8] da Costa GBP, 2016, Arxiv, DOI arXiv:1609.02781
  • [9] Ponomarenko N, 2013, LECT NOTES COMPUT SC, V8192, P402, DOI 10.1007/978-3-319-02895-8_36
  • [10] Sheikh H.R., LIVE IMAGE QUALITY A