Intra Prediction Method for Depth Video Coding by Finding Spatial Correlation Using CNN and Attention

Cited by: 0
Authors
Lee, Jae-young [1 ]
Lee, Dong-seok [2 ]
Kwon, Soon-kak [1 ]
Affiliations
[1] Dong eui Univ, Dept Comp Software Engn, Busan, South Korea
[2] Dong eui Univ, AI Grand ICT Res Ctr, Busan, South Korea
Source
XR AND METAVERSE, XR-METAVERSE CONFERENCE 2024 | 2025
Keywords
Video coding; Intra prediction; Depth video; Attention mechanism; CNN; NETWORK;
DOI
10.1007/978-3-031-77975-6_36
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we propose an intra prediction method that uses a CNN and an attention mechanism for coding high-resolution depth videos used in virtual reality. The proposed method improves intra prediction performance for depth pictures by predicting the spatial correlations between an input block and the reference pixels that are adjacent to the block. The proposed network extracts spatial features through CNN layers and predicts the spatial correlations through an attention mechanism. Spatial features in the vertical and horizontal directions are extracted from the top and left adjacent blocks, respectively, and merged to predict the spatial features of the pixels in the input block. The attention layers predict the correlations between the spatial features of the input block and the reference pixels. Finally, the pixel values are predicted from the predicted correlations. In the simulation results, intra prediction accuracy improves by up to 3.37% compared with the intra modes of VVC.
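The final step described in the abstract, predicting pixel values from correlations between block features and reference-pixel features, can be illustrated with a toy sketch. This is not the authors' network: the CNN feature extractors are replaced by hand-set feature vectors, scaled dot-product attention is assumed because the abstract does not specify the attention form, and the names `softmax`, `attention_predict`, `block_feats`, `ref_feats`, and `ref_values` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_predict(block_feats, ref_feats, ref_values):
    """Predict each block pixel as an attention-weighted sum of reference pixels.

    block_feats: one feature vector per pixel in the block (queries).
    ref_feats:   one feature vector per reference pixel (keys).
    ref_values:  depth value of each reference pixel (values).
    Assumption: scaled dot-product attention stands in for the paper's layers.
    """
    dim = len(ref_feats[0])
    predictions = []
    for query in block_feats:
        # Correlation score between this block pixel and every reference pixel.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in ref_feats
        ]
        weights = softmax(scores)
        predictions.append(sum(w * v for w, v in zip(weights, ref_values)))
    return predictions


# Toy data (hypothetical): two reference pixels and three block pixels.
ref_feats = [[10.0, 0.0], [0.0, 10.0]]
ref_values = [50.0, 200.0]
block_feats = [[10.0, 0.0], [0.0, 10.0], [1.0, 1.0]]
preds = attention_predict(block_feats, ref_feats, ref_values)
```

Because the attention weights are non-negative and sum to one, every predicted value in this sketch lies between the smallest and largest reference depth value. A block pixel whose features align strongly with one reference pixel takes essentially that pixel's value, while a pixel equally correlated with both takes their average.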
Pages: 477-487
Page count: 11