Energy-Efficient Saliency-Guided Video Coding Framework for Real-Time Applications

Cited by: 1

Authors:

Partanen, Tero [1]

Hoang, Minh [1]

Mercat, Alexandre [1]

Sainio, Joose [1]

Vanne, Jarno [1]

Affiliations:

[1] Tampere Univ, Ultra Video Grp, Tampere 33014, Finland

Source:

IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS | 2025, Vol. 15, No. 01

Keywords:

Encoding; Streaming media; Video coding; Image coding; Saliency detection; Energy efficiency; Energy consumption; Visualization; Object tracking; Computational modeling; Saliency-guided encoding; region-of-interest (ROI); ROI tracking; deep learning (DL); motion vector (MV); OBJECT DETECTION; REGION; MODEL; TRACKING; SCHEME;

DOI:

10.1109/JETCAS.2024.3525339

Chinese Library Classification (CLC) codes:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

The significant growth in global video data traffic can be mitigated by saliency-based video coding schemes that seek to increase coding efficiency without any loss of objective visual quality by compressing salient video regions less heavily than non-salient regions. However, conducting salient object detection (SOD) on every video frame before encoding tends to lead to substantial complexity and energy consumption overhead, especially if state-of-the-art deep learning techniques are used in saliency detection. This work introduces a saliency-guided video encoding framework that reduces the energy consumption over frame-by-frame SOD by increasing the detection interval and applying the proposed region-of-interest (ROI) tracking between successive detections. The computational complexity of our ROI tracking technique is kept low by predicting object movements from motion vectors, which are inherently calculated during encoding. Our experimental results demonstrate that the proposed ROI tracking solution saves energy by 86-95% and attains 84-94% accuracy over frame-by-frame SOD. Correspondingly, integrating our proposal into the complete saliency-guided video coding scheme reduces energy consumption on CPU by 79-82% at a cost of weighted PSNR of less than 5%. These findings indicate that our solution has significant potential for low-cost and low-power streaming media applications.

Pages: 44-57

Page count: 14

References: 74 in total

