Bidirectional Feature Aggregation Network for Stereo Image Quality Assessment Considering Parallax Attention-Based Binocular Fusion

被引:8
作者
Chang, Yongli [1 ]
Li, Sumei [1 ]
Liu, Anqi [1 ]
Zhang, Wenlin [2 ]
Jin, Jie [1 ]
Xiang, Wei [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Tianjin Int Engn Inst, Tianjin 300072, Peoples R China
[3] La Trobe Univ, Sch Engn & Math Sci, Melbourne, Vic 3086, Australia
基金
中国国家自然科学基金;
关键词
Feature extraction; Visualization; Image quality; Semantics; Information processing; Convolutional neural networks; Task analysis; Stereo image quality assessment; human visual system; bidirectional feature aggregation; hierarchical binocular fusion; TOP-DOWN; PREDICTION; MODEL;
D O I
10.1109/TBC.2023.3278096
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Inspired by the two-path visual information processing mechanism (i.e., a bottom-up path and a top-down path), we propose a bidirectional binocular feature aggregation based stereo image quality assessment (SIQA) network, which considers a two-path visual mechanism and realizes the binocular fusion based on parallax information. To better aggregate binocular features from different levels, a two-path feature aggregation structure, which simulates the bottom-up and top-down mechanism in human visual system (HVS), is proposed. It not only realizes the supplement of low-level detail information to high-level semantic in the bottom-up path, but also realizes the supplement of high-level semantic information to low-level detail in the top-down path. Simultaneously, because feature misalignment exists in binocular features of adjacent levels, a feature alignment module (FAM) based on deformable convolution is designed to integrate the binocular fusion features of adjacent levels. In addition, considering the importance role of parallax in guiding binocular fusion, a binocular fusion module (BFM) based on parallax attention mechanism, which is different with existing binocular fusion methods, is explicitly proposed to achieve the binocular fusion between the left and right view features. Extensive experiments are conducted on LIVE I, LIVE II, WIVC I and WIVC II databases to demonstrate the effectiveness of the proposed method.
引用
收藏
页码:278 / 289
页数:12
相关论文
共 55 条
[1]   A MULTI-TASK CONVOLUTIONAL NEURAL NETWORK FOR BLIND STEREOSCOPIC IMAGE QUALITY ASSESSMENT USING NATURALNESS ANALYSIS [J].
Bourbia, Salima ;
Karine, Ayoub ;
Chetouani, Aladine ;
El Hassoun, Mohammed .
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, :1434-1438
[2]   No-Reference Quality Assessment of Natural Stereopairs [J].
Chen, Ming-Jun ;
Cormack, Lawrence K. ;
Bovik, Alan C. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) :3379-3391
[3]   Stereoscopic image quality assessment by analysing visual hierarchical structures and binocular effects [J].
Ding, Yong ;
Zhao, Yang ;
Chen, Xiaodong ;
Zhu, Xiaolei ;
Andrey, Krylov .
IET IMAGE PROCESSING, 2019, 13 (10) :1608-1615
[4]   No-Reference Stereoscopic Image Quality Assessment Using Convolutional Neural Network for Adaptive Feature Extraction [J].
Ding, Yong ;
Deng, Ruizhe ;
Xie, Xin ;
Xu, Xiaogang ;
Zhao, Yang ;
Chen, Xiaodong ;
Krylov, Andrey S. .
IEEE ACCESS, 2018, 6 :37595-37603
[5]   A DEEP FEATURE FUSION METHOD FOR ANDROID MALWARE DETECTION [J].
Ding, Yuxin ;
Hu, Jieke ;
Xu, Wenting ;
Zhang, Xiao .
PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, :547-552
[6]   Task-demands can immediately reverse the effects of sensory-driven saliency in complex visual stimuli [J].
Einhaeuser, Wolfgang ;
Rutishauser, Ueli ;
Koch, Christof .
JOURNAL OF VISION, 2008, 8 (02)
[7]   Learning a No-Reference Quality Predictor of Stereoscopic Images by Visual Binocular Properties [J].
Fang, Yuming ;
Yan, Jiebin ;
Wang, Jiheng ;
Liu, Xuelin ;
Zhai, Guangtao ;
Le Callet, Patrick .
IEEE ACCESS, 2019, 7 :132649-132661
[8]   Stereoscopic image quality assessment by deep convolutional neural network [J].
Fang, Yuming ;
Yan, Jiebin ;
Liu, Xuelin ;
Wang, Jiheng .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 58 :400-406
[9]  
Fang YM, 2011, INT CONF ACOUST SPEE, P1293
[10]   No-reference stereoscopic image quality assessment on both complex contourlet and spatial domain via Kernel ELM [J].
Guan, Tuxin ;
Li, Chaofeng ;
Zheng, Yuhui ;
Zhao, Shenghu ;
Wu, Xiaojun .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 101