KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

Cited by: 33

Authors:

Goetz-Hahn, Franz [1]

Hosu, Vlad [1]

Lin, Hanhe [1]

Saupe, Dietmar [1]

Affiliations:

[1] Univ Konstanz, Dept Comp Sci, D-78464 Constance, Germany

Source:

IEEE Access | 2021 / Vol. 9

Keywords:

Streaming media; Distortion; Feature extraction; Quality assessment; Video recording; Training; Cameras; Datasets; deep transfer learning; multi-level spatially-pooled features; video quality assessment; video quality dataset; PREDICTION;

DOI:

10.1109/ACCESS.2021.3077642

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and limited diversity of existing VQA datasets, whether artificially or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and more diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and hand-crafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k set the new state-of-the-art for cross-test performance on KoNViD-1k and LIVE-Qualcomm, with SRCCs of 0.83 and 0.64, respectively. For KoNViD-1k this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.
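
The abstract reports all results as SRCC between predicted quality and subjective ratings. As a minimal sketch of how such a score can be computed, the snippet below averages per-video ratings into a mean opinion score (MOS) and then takes the Pearson correlation of the rank-transformed values. The function names (`average_ranks`, `pearson`, `srcc`) and all numbers are hypothetical illustrations, not the authors' evaluation code or data.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson linear correlation of two equally long sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def srcc(predicted, target):
    """Spearman rank-order correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(predicted), average_ranks(target))


# Hypothetical example: four videos with five ratings each (made-up numbers).
ratings = [
    [4, 5, 4, 4, 5],
    [2, 1, 2, 3, 2],
    [3, 3, 4, 3, 3],
    [1, 2, 1, 1, 2],
]
mos = [sum(r) / len(r) for r in ratings]   # mean opinion score per video
predicted = [4.1, 2.5, 2.2, 1.4]           # made-up model outputs

print(f"SRCC = {srcc(predicted, mos):.2f}")
```

Because SRCC depends only on rank order, it rewards a model for ordering videos correctly even when its scores sit on a different scale from the ratings.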


Pages: 72139-72160

Page count: 22
