期刊文献+

3D-HEVC深度图帧内预测快速算法 被引量:1

Fast intra prediction algorithm for depth maps in 3D-HEVC
原文传递
导出
摘要 目的多视点纹理加深度视频(MVD)格式逐渐成为立体视频的主流表现形式之一。新一代高效率立体视频编码(3D-HEVC)继承了HEVC的编码结构并引入一些新的编码技术,导致深度图帧内编码过程具有较高的计算复杂度。针对这一问题,提出了一种深度图帧内编码快速算法。方法本文算法利用深度图的特征分别对CU分割过程和粗略模式选择(RMD)过程进行优化。首先在四叉树编码结构上,利用基于纹理元的图像分析方法计算编码单元的梯度矩阵,若梯度矩阵中的梯度值之和小于给定的阈值,则终止该CU的分割进程。同时,对大尺寸的PU和小尺寸的PU分别利用纹理特征与粗略模式选择过程中Planar和DC进行低复杂度率失真计算后的最小率失真代价,跳过RMD中角度模式的检查过程。结果实验结果表明,与原始算法相比,本文算法平均节省40.64%的深度图编码时间,而合成视点的平均比特率仅仅增加了0.17%。本文算法不仅能对平坦的CU跳过不必要的深度决策过程,而且有效地减少了RMD中需要遍历的模式数目,提高了编码器的效率。结论该算法对CU分割进程和粗略模式选择过程都进行优化,在合成视点的视频质量几乎不变的前提下,有效降低了深度图的帧内编码复杂度。 Objective The multi-view video plus depth (MVD) format is gradually becoming one of the main representations for 3 D videos. The 3 D high-efficiency video coding (3 D-HEVC) is the latest coding standard for compressing the MVD ibr- mat. The 3 D-HEVC inherits the coding structure of HEVC. Consequently, the splitting process of coding units (CUs) and the intra mode search process in depth map intra coding have great computational complexity. New techniques for depth map intra coding, such as depth modeling mode and simplified depth coding, have been introduced in recent years to pre- serve the sharp edges of depth maps. These techniques play an important role in the coding of depth maps. However, the adoption of these techniques further increases the computational complexity in 3D-HEVC eneoder. A fast algorithm for depth map intra coding is proposed in this study to reduce the computational complexity of depth map intra coding. Method There are a lot of large smooth regions which are separated by sharp edges in depth maps. The CU splitting process and the rough mode decision (RMD) process are improved by the proposed algorithm by using the unique characteristics of depth maps. An algorithm based on the concept of texture primitive is proposed for the hierarchical quad-tree coding structure to early terminate the CU splitting process. First, the gradient matrix of the current CU can be calculated by using the texture analysis algorithm based on texture primitive. On the basis of statistical analysis, a strong correlation between the optimal size and sum of gradient values in gradient matrix is considered for each CU. If the sum of gradient values in gradient matrix is small, then the optimal size of current CU will be large. By contrast, if the sum of gradient values in gradient matrix is large, then the optimal size of current CU will be small. Therefore, if the sum of gradient values in gradient matrix is smal- ler than a given threshold, then the CU splitting process should be terminated. For the RMD process, the texture features and the smallest LCRD,,~ of Planar and DC are used to skip the search of angular modes in RMD for prediction units (PUs) of large size and PUs of small size, respectively. Planar and DC are two intra prediction modes that are highly suited to code smooth PUs. If the texture of current PU is flat, then Planar or DC is likely to be selected as the optimal mode. Hence, for PUs of large size, if the sum of the gradient values in the gradient matrix is zero, then only Planar and DC are added to the full-RD search list, and the RMD process is skipped. When the size of PUs is small, if the smallest LCRDcost of Planar and DC is smaller than a given threshold, then the RMD process is terminated immediately and the search of an- gular modes in RMD is skipped. Result In the proposed approach, the unnecessary depth levels of smooth CUs can be skipped, and the number of intra mode candidates for RMD is effectively reduced. The reference software HTM 13.0 of the 3D-HEVC standard is used to verify the coding performance of the proposed algorithm. Eight JCT-3V specified test se- quences with two resolutions of 1 024 x 768 and 1 920 × 1088 are tested. The quantization parameter (QP) values for tex- ture are 25, 30, 35, and 40, and the QP values for depth maps are 34, 39, 42, and 45. Experimental results show that compared with HTM 13.0, the proposed algorithm achieves an average depth map coding time reduction of 40. 64% with a small bitrate loss of 0. 17% for synthesized views under all intra scenario. For the eight test sequences, the coding time re- duction of depth maps ranges from 34.77% to 51.42% , which indicates that the proposed algorithm can effectively improve encoder efficiency and has a general validity. In particular, the time saving of Poznan_ I-Iall2 is over 50% , which is con- siderably larger than that of other sequences. This result is due to the fact that the depth maps of Poznan_ Hall2 contain lesser edges and have a larger proportion of flat regions. The proposed algorithm also has advantages compared with other existing algorithms. A subjective quality comparison of synthesized views for Balloons ( 1 024 × 768) sequence and Poznan Hall2 ( 1 920 × 1 088) sequence is presented to further evaluate the performance of the proposed algorithm. The results indicate that the quality of decoded synthesized views generated by the proposed algorithm is almost the same as the quality of those generated by the original HTM-13.0. The proposed algorithm can preferably preserve the edge information of depth maps. Conclusion The proposed algorithm not only accelerates quad-tree decision but also optimizes the RMD process. The algorithm periodically updates thresholds on the basis of temporal correlation of video sequences to ensure a good video qual- ity of synthesized views. Subjective and objective evaluations show that the proposed algorithm can significantly reduce the computational complexity of depth map intra coding without decreasing the quality of synthesized views. The proposed algo- rithm also has practical values and can be applied to actual situations. Nonetheless, the proposed algorithm can be further improved. The algorithm optimizes the reeursive splitting process of smooth CUs; however, the splitting process of CUs with a complex texture still has high computational complexity. Therefore, effective and efficient ways for reducing the depth lev- els of CUs with a complex texture will be studied in future research.
出处 《中国图象图形学报》 CSCD 北大核心 2018年第1期18-27,共10页 Journal of Image and Graphics
基金 福建省自然科学基金项目(2016J01306) 华侨大学研究生科研创新能力培育计划资助项目~~
关键词 3维高效率立体视频编码(3D-HEVC) 计算复杂度 深度图 帧内编码 编码单元 梯度矩阵 3D-HEVC computational complexity depth map intra coding coding unit (CU) gradient matrix
  • 相关文献

参考文献3

二级参考文献40

  • 1Mfiller K, Merkle P, and Wiegaad T. 3-D video representation using depth maps[J]. Proceedings of the IEEE, 2011, 99(4): 643-656.
  • 2Fehn C. Depth-Image-Based Rendering (DIBR), compression and transmission for a new approach on 3D-TV [C]. Proceedings in SPIE Stereoscopic Displays and Virtual Reality Systems XI, San Jose, CA, USA, 2004: 93-104.
  • 3Merkle P, Morvan Y, Smolic A, et al.. The effects of multiview depth video compression on multiview rendering [J]. Signal Processing: Image Communication, 2009, 24(1/2): 73-88.
  • 4Maitre M and Do M N. Depth and depth-color coding using shape-adaptive wavelets[J]. Journal of Visual Communication and Image Representation, 2010, 21(5-6): 513-522.
  • 5Kamolrat B, Fernando W, Mrak M, et al.. 3D motion estimation for depth image coding in 3D video coding[J].IEEE Transactions on Consumer Electronics, 2009, 55(2): 824-830.
  • 6Oh K J, Yea S, Vetro A, et al.. Depth reconstruction filter and down/up sampling for depth coding in 3D video[J]. IEEE Signal Processing Letters, 2009, 16(9): 747-750.
  • 7Secker A and Taubman D. Highly scalable video compression with scalable motion coding[J]. IEEE Transactions on Image Processing, 2004, 13(8): 1029-1041.
  • 8Tourapis A M, Leontaris A, Suehring K, et al.. H.264/ MPEG-4 AVC reference software manual. Joint Video Team (JVT) of ISO-IEC MPEG & ITU-T VCEG, JVT-AD010, Jan 2009.
  • 9Zitnick C L, Kang S B, Uyttendaele M, et al.. High-quality video view interpolation using a layered representation[J]. A CM Transactions on Graphics, 2004, 23(3): 600-608.
  • 10Ho Y S, Lee E K, and Lee C. Multiview video test sequence and camera parameters. ISO/IEC JTC1/SC29/WGll, MPEG 2008/M15419, Arehamps, France, April 2008.

共引文献22

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部