Bidirectional Feature Aggregation Network for Stereo Image Quality Assessment Considering Parallax Attention-Based Binocular Fusion

被引:8
作者
Chang, Yongli [1 ]
Li, Sumei [1 ]
Liu, Anqi [1 ]
Zhang, Wenlin [2 ]
Jin, Jie [1 ]
Xiang, Wei [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Tianjin Int Engn Inst, Tianjin 300072, Peoples R China
[3] La Trobe Univ, Sch Engn & Math Sci, Melbourne, Vic 3086, Australia
基金
中国国家自然科学基金;
关键词
Feature extraction; Visualization; Image quality; Semantics; Information processing; Convolutional neural networks; Task analysis; Stereo image quality assessment; human visual system; bidirectional feature aggregation; hierarchical binocular fusion; TOP-DOWN; PREDICTION; MODEL;
D O I
10.1109/TBC.2023.3278096
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Inspired by the two-path visual information processing mechanism (i.e., a bottom-up path and a top-down path), we propose a bidirectional binocular feature aggregation based stereo image quality assessment (SIQA) network, which considers a two-path visual mechanism and realizes the binocular fusion based on parallax information. To better aggregate binocular features from different levels, a two-path feature aggregation structure, which simulates the bottom-up and top-down mechanism in human visual system (HVS), is proposed. It not only realizes the supplement of low-level detail information to high-level semantic in the bottom-up path, but also realizes the supplement of high-level semantic information to low-level detail in the top-down path. Simultaneously, because feature misalignment exists in binocular features of adjacent levels, a feature alignment module (FAM) based on deformable convolution is designed to integrate the binocular fusion features of adjacent levels. In addition, considering the importance role of parallax in guiding binocular fusion, a binocular fusion module (BFM) based on parallax attention mechanism, which is different with existing binocular fusion methods, is explicitly proposed to achieve the binocular fusion between the left and right view features. Extensive experiments are conducted on LIVE I, LIVE II, WIVC I and WIVC II databases to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:278 / 289
页数:12
相关论文
共 55 条
[51]   Attentional Feature Fusion for End-to-End Blind Image Quality Assessment [J].
Zhou, Mingliang ;
Lang, Shujun ;
Zhang, Taiping ;
Liao, Xingran ;
Shang, Zhaowei ;
Xiang, Tao ;
Fang, Bin .
IEEE TRANSACTIONS ON BROADCASTING, 2023, 69 (01) :144-152
[52]   Dual-Stream Interactive Networks for No-Reference Stereoscopic Image Quality Assessment [J].
Zhou, Wei ;
Chen, Zhibo ;
Li, Weiping .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) :3946-3958
[53]   Multi-layer fusion network for blind stereoscopic 3D visual quality prediction [J].
Zhou, Wujie ;
Lin, Xinyang ;
Zhou, Xi ;
Lei, Jingsheng ;
Yu, Lu ;
Luo, Ting .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 91
[54]   Blind Binocular Visual Quality Predictor Using Deep Fusion Network [J].
Zhou, Wujie ;
Lei, Jingsheng ;
Jiang, Qiuping ;
Yu, Lu ;
Luo, Ting .
IEEE TRANSACTIONS ON COMPUTATIONAL IMAGING, 2020, 6 (06) :883-893
[55]   Utilizing Dictionary Learning and Machine Learning for Blind Quality Assessment of 3-D Images [J].
Zhou, Wujie ;
Qiu, Weiwei ;
Wu, Ming-Wei .
IEEE TRANSACTIONS ON BROADCASTING, 2017, 63 (02) :404-415