A Light-Field Video Dataset of Scenes with Moving Objects Captured with a Plenoptic Video Camera

被引：0

作者：

Javidi, Kamran ^{[1
]}

Martini, Maria G. ^{[1
]}

机构：

[1] Kingston Univ, Fac Engn Comp & Environm, Sch Comp Sci & Math, Dept Networks & Digital Media,Wireless & Multimedi, Penrhyn Rd Campus, Kingston Upon Thames KT1 2EE, England

来源：

ELECTRONICS | 2024年 / 13卷 / 11期

关键词：

light field; dataset; content characterisation; objective quality assessment;

D O I：

10.3390/electronics13112223

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Light-field video provides a detailed representation of scenes captured from different perspectives. This results in a visualisation modality that enhances the immersion and engagement of the viewers with the depicted environment. In order to perform research on compression, transmission and signal processing of light field data, datasets with light-field contents of different categories and acquired with different modalities are required. In particular, the development of machine learning models for quality assessment and for light-field processing, including the generation of new views, require large amounts of data. Most existing datasets consist of static scenes and, in many cases, synthetic contents. This paper presents a novel light-field plenoptic video dataset, KULFR8, involving six real-world scenes with moving objects and 336 distorted light-field videos derived from the original contents; in total, the original scenes in the dataset contain 1800 distinctive frames, with angular resolution of 5x5 with and total spatial resolution of 9600x5400 pixels (considering all the views); overall, the dataset consists of 45,000 different views with spatial resolution of 1920x1080 pixels. We analyse the content characteristics based on the dimensions of the captured objects and via the acquired videos using the central views extracted from each quilted frame. Additionally, we encode and decode the contents using various video encoders across different bitrate ranges. For quality assessments, we consider all the views, utilising frames measuring 9600x5400 pixels, and employ two objective quality metrics: PSNR and SSIM.

引用

页数：20

共 43 条

[41] A New Prediction Structure for Efficient MV-HEVC based Light Field Video Compression
Khoury, Joseph
Pourazad, Mahsa T.
Nasiopoulos, Panos
2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 588 - 591
[42] Panoramic Light Field From Hand-Held Video and Its Sampling for Real-Time Rendering
Zhao, Qiang
Dai, Feng
Lv, Jing
Ma, Yike
Zhang, Yongdong
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (04) : 1011 - 1021
[43] Estimation of Signal Distortion Using Effective Sampling Density for Light Field-Based Free Viewpoint Video
Shidanshidi, Hooman
Safaei, Farzad
Li, Wanqing
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (10) : 1677 - 1693

← 1 2 3 4 5 →