KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

被引：33

作者：

Goetz-Hahn, Franz ^{[1
]}

Hosu, Vlad ^{[1
]}

Lin, Hanhe ^{[1
]}

Saupe, Dietmar ^{[1
]}

机构：

[1] Univ Konstanz, Dept Comp Sci, D-78464 Constance, Germany

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Streaming media; Distortion; Feature extraction; Quality assessment; Video recording; Training; Cameras; Datasets; deep transfer learning; multi-level spatially-pooled features; video quality assessment; video quality dataset; PREDICTION;

D O I：

10.1109/ACCESS.2021.3077642

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep-features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and hand-crafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise, and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k sets the new state-of-the-art for cross-test performance on KoNViD-1k and LIVE-Qualcomm with a 0.83 and 0.64 SRCC, respectively. For KoNViD-1k this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.

引用

页码：72139 / 72160

页数：22

共 50 条

[31] No-reference image and video quality assessment: a classification and review of recent approaches
Shahid, Muhammad
Rossholm, Andreas
Lovstrom, Benny
Zepernick, Hans-Jurgen
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2014,
[32] No-Reference Video Quality Assessment with Heterogeneous Knowledge Ensemble
Wu, Jinjian
Liu, Yongxu
Li, Leida
Dong, Weisheng
Shi, Guangming
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4174 - 4182
[33] Semantic Information Oriented No-Reference Video Quality Assessment
Wu, Wei
Li, Qinyao
Chen, Zhenzhong
Liu, Shan
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) : 204 - 208
[34] Conformer Based No-Reference Quality Assessment for UGC Video
Yang, Zike
Zhang, Yingxue
Si, Zhanjun
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 464 - 472
[35] Quality Feature Learning via Multi-Channel CNN and GRU for No-Reference Video Quality Assessment
Kwong, Ngai-Wing
Chan, Yui-Lam
Tsang, Sik-Ho
Lun, Daniel Pak-Kong
IEEE ACCESS, 2023, 11 : 28060 - 28075
[36] No-reference pixel based video quality assessment for HEVC decoded video
Huang, Xin
Sogaard, Jacob
Forchhammer, Soren
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 43 : 173 - 184
[37] No-reference quality assessment for live broadcasting videos in temporal and spatial domains
Huang, Yipo
Li, Leida
Zhou, Yu
Hu, Bo
IET IMAGE PROCESSING, 2020, 14 (04) : 774 - 781
[38] No-reference Mobile Video Quality Assessment Based on Video Natural Statistics
Shi Wenjuan
Sun Yanjing
Zuo Haiwei
Cao Qi
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (01) : 143 - 150
[39] A Deep Learning based No-reference Quality Assessment Model for UGC Videos
Sun, Wei
Min, Xiongkuo
Lu, Wei
Zhai, Guangtao
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
[40] Spatiotemporal Feature Combination Model for No-Reference Video Quality Assessment
Men, Hui
Lin, Hanhe
Saupe, Dietmar
2018 TENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2018, : 72 - 74

← 1 2 3 4 5 →