KonVid-150k: A Dataset for No-Reference Video Quality Assessment of Videos in-the-Wild

被引:33
|
作者
Goetz-Hahn, Franz [1 ]
Hosu, Vlad [1 ]
Lin, Hanhe [1 ]
Saupe, Dietmar [1 ]
机构
[1] Univ Konstanz, Dept Comp Sci, D-78464 Constance, Germany
关键词
Streaming media; Distortion; Feature extraction; Quality assessment; Video recording; Training; Cameras; Datasets; deep transfer learning; multi-level spatially-pooled features; video quality assessment; video quality dataset; PREDICTION;
D O I
10.1109/ACCESS.2021.3077642
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video quality assessment (VQA) methods focus on particular degradation types, usually artificially induced on a small set of reference videos. Hence, most traditional VQA methods under-perform in-the-wild. Deep learning approaches have had limited success due to the small size and diversity of existing VQA datasets, either artificial or authentically distorted. We introduce a new in-the-wild VQA dataset that is substantially larger and diverse: KonVid-150k. It consists of a coarsely annotated set of 153,841 videos having five quality ratings each, and 1,596 videos with a minimum of 89 ratings each. Additionally, we propose new efficient VQA approaches (MLSP-VQA) relying on multi-level spatially pooled deep-features (MLSP). They are exceptionally well suited for training at scale, compared to deep transfer learning approaches. Our best method, MLSP-VQA-FF, improves the Spearman rank-order correlation coefficient (SRCC) performance metric on the commonly used KoNViD-1k in-the-wild benchmark dataset to 0.82. It surpasses the best existing deep-learning model (0.80 SRCC) and hand-crafted feature-based method (0.78 SRCC). We further investigate how alternative approaches perform under different levels of label noise, and dataset size, showing that MLSP-VQA-FF is the overall best method for videos in-the-wild. Finally, we show that the MLSP-VQA models trained on KonVid-150k sets the new state-of-the-art for cross-test performance on KoNViD-1k and LIVE-Qualcomm with a 0.83 and 0.64 SRCC, respectively. For KoNViD-1k this inter-dataset testing outperforms intra-dataset experiments, showing excellent generalization.
引用
收藏
页码:72139 / 72160
页数:22
相关论文
共 50 条
  • [31] No-reference image and video quality assessment: a classification and review of recent approaches
    Shahid, Muhammad
    Rossholm, Andreas
    Lovstrom, Benny
    Zepernick, Hans-Jurgen
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2014,
  • [32] No-Reference Video Quality Assessment with Heterogeneous Knowledge Ensemble
    Wu, Jinjian
    Liu, Yongxu
    Li, Leida
    Dong, Weisheng
    Shi, Guangming
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4174 - 4182
  • [33] Semantic Information Oriented No-Reference Video Quality Assessment
    Wu, Wei
    Li, Qinyao
    Chen, Zhenzhong
    Liu, Shan
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 (28) : 204 - 208
  • [34] Conformer Based No-Reference Quality Assessment for UGC Video
    Yang, Zike
    Zhang, Yingxue
    Si, Zhanjun
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VI, ICIC 2024, 2024, 14867 : 464 - 472
  • [35] Quality Feature Learning via Multi-Channel CNN and GRU for No-Reference Video Quality Assessment
    Kwong, Ngai-Wing
    Chan, Yui-Lam
    Tsang, Sik-Ho
    Lun, Daniel Pak-Kong
    IEEE ACCESS, 2023, 11 : 28060 - 28075
  • [36] No-reference pixel based video quality assessment for HEVC decoded video
    Huang, Xin
    Sogaard, Jacob
    Forchhammer, Soren
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 43 : 173 - 184
  • [37] No-reference quality assessment for live broadcasting videos in temporal and spatial domains
    Huang, Yipo
    Li, Leida
    Zhou, Yu
    Hu, Bo
    IET IMAGE PROCESSING, 2020, 14 (04) : 774 - 781
  • [38] No-reference Mobile Video Quality Assessment Based on Video Natural Statistics
    Shi Wenjuan
    Sun Yanjing
    Zuo Haiwei
    Cao Qi
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2018, 40 (01) : 143 - 150
  • [39] A Deep Learning based No-reference Quality Assessment Model for UGC Videos
    Sun, Wei
    Min, Xiongkuo
    Lu, Wei
    Zhai, Guangtao
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [40] Spatiotemporal Feature Combination Model for No-Reference Video Quality Assessment
    Men, Hui
    Lin, Hanhe
    Saupe, Dietmar
    2018 TENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2018, : 72 - 74