Crowdsourcing Based Cross Random Access Point Referencing for Video Coding

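Cited by: 1
Authors
Yu, Hualong [1]
Gao, Xiaoding [1]
Yu, Lu [1]
Affiliations
[1] Zhejiang University, Institute of Information and Communication Engineering, Hangzhou 310027, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords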
Decoding; Crowdsourcing; Video coding; Encoding; Streaming media; Signal processing algorithms; Redundancy; Cross random access point referencing; Reference structure; Streaming
DOI
10.1109/LSP.2020.2983306
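Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract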
In video coding, Random Access Points (RAPs) are inserted in a bitstream to support flexible tune-in but divide it into multiple independent Random Access Segments (RASs) that may have similar contents. To reduce redundancy between RASs, this letter proposes a novel Cross Random-access-point Referencing (CRR) structure to provide inter prediction for RAP pictures by using multiple External Reference Pictures (ERPs) across RAPs that are selected from preceding or following RASs other than the current RAS. With ERPs shared by multiple RASs, a crowdsourcing method is proposed to optimize the joint rate distortion costs of RASs and ERPs to generate an optimal set of ERPs. Content preparation and bitstream splicing processes supported by system environments are also designed to ensure random access functionality of CRR coded RASs. Simulation results show that CRR achieves significant coding gain compared to Versatile Video Coding (VVC), i.e., 12.00% on sequences in common test condition and 25.48% on long drama sequences.
Pages: 560-564
Number of pages: 5