Quality Assessment of Light Field Images Based on Contrastive Visual-Textual Model

被引：0

作者：

Wang, Han-Ling ^{[1
,2
]}

Ke, Xiao ^{[1
,3
,4
]}

Jiang, Ao-Xin ^{[1
,3
,4
]}

Guo, Wen-Zhong ^{[1
,3
,4
]}

机构：

[1] College of Computer and Data Science, Fuzhou University, Fujian, Fuzhou

[2] Key Laboratory of Earthquake Engineering and Engineering Vibration, Institute of Engineering Mechanics, China Earthquake Administration, Heilongjiang, Harbin

[3] Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing, Fuzhou University, Fujian, Fuzhou

[4] Engineering Research Center of Big Data Intelligence, Ministry of Education, Fujian, Fuzhou

来源：

Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2024年 / 52卷 / 10期

基金：

中国国家自然科学基金;

关键词：

image enhancement; image quality assessment; light field images; multi-task mode; noise prediction; visual-textual model;

D O I：

10.12263/DZXB.20240533

中图分类号：

学科分类号：

摘要：

Light field imaging, as an image type capable of capturing light information from every position in a scene, holds broad application prospects in fields such as electronic imaging, medical imaging, and virtual reality. Light field image quality assessment (LFIQA) aims to measure the quality of such images, yet current methods confront significant challenges arising from the heterogeneity between visual effects and textual modalities. To address these issues, this paper proposes a multi-modal light field image quality assessment model grounded in text-vision integration. Specifically, for the visual modality, we devise a multi-task model that effectively enriches the crucial representational features of light field images by incorporating an edge auto-thresholding algorithm. On the textual side, we accurately identify noise categories in light field images based on the comparison between input noise features and predicted noise features, thereby validating the importance of noise prediction in optimizing visual representations. Building upon these findings, we further introduce an optimized universal noise text configuration approach combined with an edge enhancement strategy, which notably enhances the accuracy and generalization capabilities of the baseline model in LFIQA. Additionally, ablation experiments are conducted to assess the contribution of each component to the overall model performance, thereby verifying the effectiveness and robustness of our proposed method. Experimental results demonstrate that our approach not only excels in tests on public datasets like Win5-LID and NBU-LF1.0 but also shows remarkable outcomes in fused datasets. Compared to the state-of-the-art algorithms, our method achieves performance improvements of 2% and 6% respectively on the two databases. The noise verification strategy and configuration method presented in this paper not only provide valuable insights for light field noise prediction tasks but can also be applied as auxiliary tools for other noise prediction types. © 2024 Chinese Institute of Electronics. All rights reserved.

引用

页码：3562 / 3577

页数：15

共 66 条

[1]

WU G, MASIA B, JARABO A, Et al., Light field image processing: An overview, IEEE Journal of Selected Topics in Signal Processing, 11, 7, pp. 926-954, (2017)

[2]

CAO Y, LI S, LIU Y, Et al., A comprehensive survey of aigenerated content (AIGC): A history of generative ai from gan to chatgpt

[3]

LIN H., Analysis and simulation of UAV terahertz wave synthetic aperture radar imaging, Information and Electronic Engineering, 8, 4, pp. 373-377, (2010)

[4]

LIU H F, ZHOU W, CAI X S, Et al., Three-dimensional particle tracking velocimetry based on light field imaging, Acta Optica Sinica, 40, 1, (2020)

[5]

WANG Y, WANG L, LIANG Z, Et al., NTIRE 2023 challenge on light field image super-resolution: Dataset, methods and results, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1320-1335, (2023)

[6]

WOOD D N, AZUMA D I, ALDINGER K, Et al., Surface light fields for 3D photography, Seminal Graphics Papers: Pushing the Boundaries, 2, pp. 487-496, (2023)

[7]

WANG Z, BOVIK A C., Modern Image Quality Assessment, (2006)

[8]

SHEIKH H R, SABIR M F, BOVIK A C., A statistical evaluation of recent full reference image quality assessment algorithms, IEEE Transactions on Image Processing, 15, 11, pp. 3440-3451, (2006)

[9]

BOSSE S, MANIRY D, MULLER K R, Et al., Deep neural networks for no-reference and full-reference image quality assessment, IEEE Transactions on Image Processing, 27, 1, pp. 206-219, (2017)

[10]

LARSON E C, CHANDLER D M., Most apparent distortion: full-reference image quality assessment and the role of strategy, Journal of Electronic Imaging, 19, 1, (2010)

← 1 2 3 4 5 6 7 →