Omnidirectional Image Quality Assessment by Distortion Discrimination Assisted Multi-Stream Network

被引:40
|
作者
Zhou, Yu [1 ,2 ]
Sun, Yanjing [1 ,2 ]
Li, Leida [3 ,4 ]
Gu, Ke [5 ,6 ]
Fang, Yuming [7 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Xuzhou Engn Res Ctr Intelligent Ind Safety & Emer, Xuzhou 221116, Jiangsu, Peoples R China
[3] Xidian Univ, Guangzhou Inst Technol, Guangzhou 510555, Peoples R China
[4] Pazhou Lab, Guangzhou 510330, Peoples R China
[5] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[6] Beijing Univ Technol, Engn Res Ctr Intelligent Percept & Autonomous Con, Beijing Artificial Intelligence Inst,Beijing Lab, Minist Educ,Beijing Key Lab Computat Intelligence, Beijing 100124, Peoples R China
[7] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang 330013, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Measurement; Distortion; Quality assessment; Sun; Image coding; Visualization; Image quality assessment; virtual reality (VR); omnidirectional image (OI); viewport generation; distortion discrimination; INDEX; DEGRADATION; STATISTICS;
D O I
10.1109/TCSVT.2021.3081162
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Omnidirectional image (OI) quality assessment is crucial to facilitate the development of virtual reality (VR) related technology. In this work, a distortion discrimination assisted multi-stream network is proposed for OI quality assessment. The multi-stream architecture is constructed by generating the viewport images received by the retina at one point to simulate the characteristics of humans perceiving VR contents. Additionally, the strategy of generating several viewport image sets from one OI is proposed for data augmentation. Furthermore, the facts that the human brain has the ability for both quality assessment and distortion type distinguishment, and the process of human brain handling two tasks exists information interaction inspire us to employ an auxiliary distortion discrimination task to facilitate the quality assessment task learning. Extensive experiments conducted on two public OI databases demonstrate the superiority of the proposed method to both traditional 2D quality metrics and existing metrics specific for OIs. Moreover, utilizing the assistant task is proven to be more effective than the single task learning for OI quality evaluation. Better generalization performance is also verified to be another valuable trait of the proposed method.
引用
收藏
页码:1767 / 1777
页数:11
相关论文
共 50 条
  • [21] Multi-Stream Refining Network for Person Re-Identification
    Wang, Xu
    Huang, Yan
    Wang, Qicong
    Chen, Yan
    Shen, Yehu
    IEEE ACCESS, 2021, 9 : 6596 - 6607
  • [22] Multi-stream Point-based model for Blind Geometric Point Cloud Quality Assessment
    Bourbia, Salima
    Karine, Ayoub
    Chetouani, Aladine
    El Hassouni, Mohammed
    Jridi, Maher
    20TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING, CBMI 2023, 2023, : 224 - 228
  • [23] Vision Language Modeling of Content, Distortion and Appearance for Image Quality Assessment
    Zhou, Fei
    Gu, Tianhao
    Huang, Zhicong
    Qiu, Guoping
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2025, 19 (01) : 234 - 247
  • [24] Explicit-implicit dual stream network for image quality assessment
    Yang, Guangyi
    Ding, Xingyu
    Huang, Tian
    Cheng, Kun
    Jin, Weizheng
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2020, 2020 (01)
  • [25] Multifaceted perception-based blind omnidirectional image quality assessment
    Liu, Hongxi
    Wang, Chaoyong
    Jiang, Jingyu
    Liu, Yun
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (04)
  • [26] Projection Invariant Feature and Visual Saliency-Based Stereoscopic Omnidirectional Image Quality Assessment
    Zhou, Xuemei
    Zhang, Yun
    Li, Na
    Wang, Xu
    Zhou, Yang
    Ho, Yo-Sung
    IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (02) : 512 - 523
  • [27] Multi-Stream Attention-Aware Graph Convolution Network for Video Salient Object Detection
    Xu, Mingzhu
    Fu, Ping
    Liu, Bing
    Li, Junbao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4183 - 4197
  • [28] MMMNet: An End-to-End Multi-Task Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment
    Li, Fan
    Zhang, Yangfan
    Cosman, Pamela C.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4798 - 4811
  • [29] FMSNet: A Multi-Stream CNN for Multi-Stereo Image Classification by Feature Map Sharing
    Can, Ferit
    Eyupoglu, Can
    IEEE ACCESS, 2024, 12 : 105566 - 105572
  • [30] MC360IQA: A Multi-channel CNN for Blind 360-Degree Image Quality Assessment
    Sun, Wei
    Min, Xiongkuo
    Zhai, Guangtao
    Gu, Ke
    Duan, Huiyu
    Ma, Siwei
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (01) : 64 - 77