A new deep learning-based fast transcoding for internet of things applications

被引:0
作者
Yang, Jia [1 ,4 ]
Peng, Yonghong [2 ]
Qing, Linbo [3 ]
Xue, Yajuan [1 ]
Yang, Hong [3 ]
机构
[1] Chengdu Univ Informat Technol, Coll Commun Engn, Chengdu 610225, Peoples R China
[2] Anglia Ruskin Univ, Fac Sci & Engn, Cambridge CB1 1PT, England
[3] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Peoples R China
[4] Chengdu Univ Informat Technol, Adv Cryptog & Syst Secur Key Lab Sichuan Prov, Chengdu 610225, Peoples R China
基金
中国国家自然科学基金;
关键词
Internet of things; Low-power video coding; Transcoding; Distributed video coding; High efficiency video coding; Deep learning; WYNER-ZIV; DVC;
D O I
10.1038/s41598-025-99533-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
To achieve low-power video communication in Internet of Things, this study presents a new deep learning-based fast transcoding algorithm from distributed video coding (DVC) to high efficiency video coding (HEVC). The proposed method accelerates transcoding by minimizing HEVC encoding complexity. Specifically, it models the selections of coding unit (CU) partitions and prediction unit (PU) partition modes as classification tasks. To address these tasks, a novel lightweight deep learning network has been developed acting as the classifier in a top-down transcoding strategy for improved efficiency. The proposed transcoding algorithm operates efficiently at both CU and PU levels. At the CU level, it reduces HEVC encoding complexity by accurately predicting CU partitions. At the PU level, predicting PU partition modes for non-split CUs further streamlines the encoding process. Experimental results demonstrate that the proposed CU-level transcoding reduces complexity overhead by 45.69%, with a 1.33% average Bj & oslash;ntegaard delta bit-rate (BD-BR) increase. At the PU level, the transcoding achieves an even greater complexity reduction, averaging 60.97%, with a 2.16% average BD-BR increase. These results highlight the algorithm's efficiency in balancing computational cost and compression performance. The proposed method provides a promising low-power video coding scheme for resource-constrained terminals in both upstream and downstream video communication scenarios.
引用
收藏
页数:12
相关论文
共 43 条
[1]  
[Anonymous], 2013, ITU-T Recommendation H.265 and ISO/IEC 23008-2 (MPEG-H HEVC), Tech. Rep.
[2]  
Badem M. B., 2007, Picture Coding Sym, P1
[3]   Internet-of-Things Devices and Assistive Technologies for Health Care: Applications, Challenges, and Opportunities [J].
Baucas, Marc Jayson ;
Spachos, Petros ;
Gregori, Stefano .
IEEE SIGNAL PROCESSING MAGAZINE, 2021, 38 (04) :65-77
[4]  
Bjontegaard G., 2001, ITUT SG16 Q.6 Document, VCEG-M33
[5]  
Boudechiche D. E., 2023, P SCC, P1
[6]   Intelligent manufacturing production line data monitoring system for industrial internet of things [J].
Chen, Wei .
COMPUTER COMMUNICATIONS, 2020, 151 :31-41
[7]   Multimedia Communications Using a Fast and Flexible DVC to H.264/AVC/SVC Transcoder [J].
Corrales-Garcia, Alberto ;
Rodriguez-Sanchez, Rafael ;
Luis Martinez, Jose ;
Fernandez-Escribano, Gerardo ;
Jose Quiles, Francisco .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2015, 79 (03) :211-232
[8]   Variable and constant bitrate in a DVC to H.264/AVC transcoder [J].
Corrales-Garcia, Alberto ;
Luis Martinez, Jose ;
Fernandez-Escribano, Gerardo ;
Jose Quiles, Francisco .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2011, 26 (06) :310-323
[9]  
Corrales-García A, 2010, LECT NOTES ARTIF INT, V6077, P233, DOI 10.1007/978-3-642-13803-4_29
[10]   Rate-distortion/complexity analysis of HEVC, VVC and AV1 video codecs [J].
Garcia-Lucas, D. ;
Cebrian-Marquez, G. ;
Cuenca, P. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (39-40) :29621-29638