The Rate-Distortion Optimization (RDO) algorithm in High Efficiency Video Coding (HEVC) involves many iterations and a large number of calculations. To reduce the calculation time and meet the requirement of fast switching between RDO algorithms of different scales, a dynamically reconfigurable RDO structure is proposed. First, the Quantization Parameter (QP) and bit rate values are loaded through an H-tree Configurable Network (HCN), and the execution status of the array is monitored in real time. When a switching request for the RDO algorithm is detected, the corresponding configuration information is delivered. This self-reconfiguration method improves the flexibility and utilization of the hardware. Experimental results show that, with the control bit width increased by only 31.25%, the designed configuration network increases the number of controllable processing units by 32 times, and the execution cycle is 50% lower than that of designs of the same type. Compared with the previous RDO algorithm, the RDO algorithm implemented on the reconfigurable array based on the configuration network achieves an average operating frequency increase of 12.5% and an area reduction of 56.4%.
We propose a Rate-Distortion (RD) optimized strategy for frame dropping and scheduling of multi-user conversational and streaming videos. We consider a scenario where conversational and streaming videos share the forwarding resources at a network node. Two buffers are set up on the node to temporarily store the packets of these two types of video applications. For streaming video, a large buffer is used because the delay constraint of the application is moderate; for conversational video, a very small buffer is used to ensure that the forwarding delay of every packet is limited. A scheduler located behind these two buffers dynamically assigns transmission slots on the outgoing link to the two buffers. Rate-distortion side information is used to perform RD-optimized frame dropping in case of node overload. The data rate on the outgoing link is shared between the conversational and the streaming videos based either on the fullness of the two associated buffers or on the mean incoming rates of the respective videos. Simulation results show that the proposed RD-optimized frame dropping and scheduling approach provides significant performance improvements over the popular priority-based random dropping (PRD) technique.
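As a rough illustration of RD-optimized frame dropping under node overload, the sketch below drops the frames whose loss costs the least distortion per freed bit until the buffered data fits a capacity. The greedy rule, the function name `drop_frames`, and the per-frame tuples are assumptions made for this example; the abstract does not specify the dropping criterion in this detail.

```python
def drop_frames(frames, capacity_bits):
    """Greedy RD-guided dropping (illustrative assumption, not the paper's rule).

    frames: list of (frame_id, size_bits, distortion_if_dropped), size_bits > 0.
    Returns (kept_ids, dropped_ids), with dropped_ids in dropping order.
    """
    total = sum(size for _, size, _ in frames)
    # Frames that add the least distortion per freed bit are dropped first.
    order = sorted(frames, key=lambda f: f[2] / f[1])
    dropped = []
    for frame_id, size, _ in order:
        if total <= capacity_bits:
            break  # the remaining frames fit; stop dropping
        dropped.append(frame_id)
        total -= size
    kept = [fid for fid, _, _ in frames if fid not in dropped]
    return kept, dropped


# Hypothetical side information: (id, size in bits, distortion if dropped).
frames = [("I0", 8000, 50.0), ("P1", 3000, 6.0), ("B2", 1000, 1.0), ("P3", 3000, 9.0)]
kept, dropped = drop_frames(frames, capacity_bits=12000)
print(kept, dropped)
```

A priority-based random dropper would ignore the distortion column; here it drives the order in which frames are sacrificed.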
This paper proposes an adaptive video pre-processing algorithm for video coding. The algorithm works on the original image before intra- or inter-prediction. It adopts a Gaussian filter to remove noise and insignificant features from the video images. Edge detection and restoration then recover the edges that were excessively smoothed in the filtered images. Rate-Distortion Optimization (RDO) is employed to decide adaptively whether a processed block or an unprocessed block is coded into the bit-stream for more efficient coding. Our experimental results show that the algorithm achieves good coding performance in both subjective and objective terms. In addition, the proposed pre-processing algorithm is transparent to the decoder and is therefore compliant with any video coding standard without modifying the decoder.
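Traditional encoders offer a controllable bit rate but cannot effectively control the quality of the encoded video, which fluctuates as the image content changes. Considering the characteristics of current Internet bandwidth, for example that the fixed-bandwidth constraint in Adaptive Bitrate (ABR) network bandwidth control is looser than in traditional broadcast networks, a new distortion allocation scheme for rate-distortion optimization is proposed under a distortion constraint. Based on a model of the relationship between the Lagrange multiplier of each coding unit and the multiplier at the Group of Pictures (GOP) level, a frame-level distortion allocation strategy is designed. Under the default configuration of the random-access coding structure of the High Efficiency Video Coding (HEVC) model, and on the standard test sequences specified in the common test conditions, experimental results show that the rate-distortion performance of the quality-consistency-constrained encoder, measured as Bjøntegaard Delta Peak Signal-to-Noise Ratio (BD-PSNR), improves by 0.057 dB, and the variance of the GOP distortion after encoding is reduced by 50%. The scheme effectively reduces quality fluctuation in the encoded video and yields smoother coding quality.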
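For R-λ intra-frame rate control, optimal bit allocation and optimal selection of the Lagrange factor λ based on Convolutional Neural Networks (CNN) are proposed. First, the characteristics of the rate-distortion (R-D) relationship and the rate-Lagrange factor (R-λ) relationship of the Coding Tree Unit (CTU) are explored, and a four-output CNN is designed to predict the key parameters of the R-D and R-λ curves. Then, an optimization equation relating the frame-level λ and the target bit rate is established and inverted to obtain the optimal CTU bit allocation. Finally, the optimal CTU-level λ is obtained from the CTU bit allocation and the R-λ curve known in advance. Experiments show that, while maintaining a control accuracy of 4.76%, the algorithm improves coding quality by 0.31 dB over the default rate control algorithm of VTM13.0.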
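The inversion step can be sketched under one explicit assumption: that each CTU follows the hyperbolic model λ = α·R^β with β < 0, a form commonly used in R-λ rate control but not stated in the abstract. Given per-CTU parameters (which the CNN would predict in the described method; here they are hypothetical constants), a bisection on the frame-level λ finds the value at which the CTU rates sum to the target. The names `ctu_rate` and `allocate_bits` are illustrative.

```python
import math


def ctu_rate(lam, alpha, beta):
    """Invert lam = alpha * R**beta to get the rate R of one CTU (beta < 0)."""
    return (lam / alpha) ** (1.0 / beta)


def allocate_bits(params, target_rate, lo=1e-6, hi=1e6, iterations=200):
    """Bisect on the frame-level lambda so that the CTU rates sum to target_rate.

    params: list of (alpha, beta) per CTU, with alpha > 0 and beta < 0.
    Returns (lam, rates).
    """
    for _ in range(iterations):
        mid = math.sqrt(lo * hi)  # geometric midpoint, since lambda spans decades
        total = sum(ctu_rate(mid, a, b) for a, b in params)
        if total > target_rate:
            lo = mid  # too many bits spent: a larger lambda is needed
        else:
            hi = mid
    lam = math.sqrt(lo * hi)
    return lam, [ctu_rate(lam, a, b) for a, b in params]


# Hypothetical (alpha, beta) pairs for two CTUs and a target of 0.5 (e.g. bits per pixel).
lam, rates = allocate_bits([(3.2, -1.367), (5.0, -1.2)], target_rate=0.5)
print(lam, rates)
```

Because every CTU rate decreases monotonically as λ grows, the sum is monotone too, which is what makes a simple bisection sufficient in this sketch.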
An improved rate-distortion optimization (RDO) algorithm for JPEG2000 is proposed. The proposed algorithm is suitable for integrated circuit (IC) implementation and reduces the computational cost by 30%. A hardware architecture comprising a control unit, memory, a divider, and a data converter is also given to implement the algorithm. The circuit based on the improved algorithm is tested on FPGAs and integrated into a JPEG2000 codec chip core.
This paper presents an improved rate control method for H.264. First, scene changes are detected from the average absolute difference of the brightness histograms of adjacent frames. Then, the bit allocation and quantization parameters are adjusted according to a threshold. In addition, the calculation of the mean absolute difference (MAD) is modified, which makes the rate-distortion optimization (RDO) more accurate. Extensive simulation results show that the proposed method, compared with G012, improves the average peak signal-to-noise ratio (PSNR) and moderates variations in image quality.
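The scene-change test described above can be sketched as follows, assuming 8-bit luma frames given as lists of rows. The bin count, the threshold value, and the function names are assumptions for illustration; the abstract does not give the threshold or the histogram resolution.

```python
def brightness_histogram(frame, bins=256):
    """Count the 8-bit luma values (0..255) of a frame given as a list of rows."""
    hist = [0] * bins
    for row in frame:
        for value in row:
            hist[value * bins // 256] += 1
    return hist


def histogram_difference(frame_a, frame_b, bins=256):
    """Average absolute difference between the brightness histograms of two frames."""
    hist_a = brightness_histogram(frame_a, bins)
    hist_b = brightness_histogram(frame_b, bins)
    return sum(abs(a - b) for a, b in zip(hist_a, hist_b)) / bins


def is_scene_change(frame_a, frame_b, threshold, bins=256):
    """Flag a scene change when the histogram difference exceeds the threshold."""
    return histogram_difference(frame_a, frame_b, bins) > threshold


# Two tiny hypothetical 4x4 frames: one uniformly dark, one uniformly bright.
dark = [[10] * 4 for _ in range(4)]
bright = [[200] * 4 for _ in range(4)]
print(histogram_difference(dark, bright))
```

In a rate controller, a positive result would trigger the adjustment of bit allocation and quantization parameters for the new scene.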
Many modern video encoders use the Lagrangian rate-distortion optimization (RDO) algorithm for mode decisions during compression. At each encoding stage, this approach minimizes a cost that is a function of rate, distortion, and a multiplier called Lambda. This paper proposes to improve the RDO process with two modifications. The first is to increase the accuracy of rate estimation by computing a non-integer number of bits for the arithmetic coding of the syntax elements. This leads to a more accurate cost computation and therefore a better mode decision. The second is to search for and adjust the value of Lambda based on the characteristics of each coding stage. For the encoder used, this paper proposes to search multiple values of Lambda for the intra-4x4 mode decision. Moreover, a simple shift in the Lambda value is proposed for motion estimation. Each of these modifications offers a certain gain in RDO performance, and, when all are combined, an average bit-rate saving of up to 7.0% can be achieved for the H.264/AVC codec; the same concept is applicable to the H.265/HEVC codec as well. The added complexity is bounded and can be adjusted according to the available processing resources.
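A minimal sketch of a Lagrangian mode decision with fractional rate estimates is given below. It uses the standard cost J = D + λ·R and estimates the rate of each syntax element as its ideal arithmetic-coding cost, -log2(p). The candidate modes, their distortions, and the symbol probabilities are hypothetical, and the helper names are chosen for this example only.

```python
import math


def fractional_bits(probability):
    """Ideal arithmetic-coding cost, in bits, of a symbol with this probability."""
    return -math.log2(probability)


def rd_cost(distortion, rate_bits, lam):
    """Lagrangian cost J = D + lambda * R."""
    return distortion + lam * rate_bits


def best_mode(candidates, lam):
    """Return the name of the candidate with the smallest Lagrangian cost.

    candidates: list of (name, distortion, symbol_probabilities) tuples.
    """
    def cost(candidate):
        _, distortion, probabilities = candidate
        rate = sum(fractional_bits(p) for p in probabilities)
        return rd_cost(distortion, rate, lam)

    return min(candidates, key=cost)[0]


# Hypothetical candidates: low distortion at 5 bits versus high distortion at 1 bit.
modes = [
    ("intra4x4", 40.0, [0.5, 0.25, 0.25]),
    ("intra16x16", 100.0, [0.5]),
]
print(best_mode(modes, lam=5.0), best_mode(modes, lam=20.0))
```

Raising Lambda shifts the decision toward the cheaper mode, which is why searching or shifting Lambda per coding stage changes the outcome of the mode decision.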
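Video-based point cloud compression (V-PCC) provides an efficient solution for compressing dynamic point clouds, but its projection from 3D to 2D destroys the correlation of 3D inter-frame motion and degrades inter-frame coding performance. To address this problem, an improved V-PCC-based multi-mode inter-frame coding method for video point clouds with adaptive segmentation is proposed, and a new inter-frame coding framework for dynamic point clouds is designed accordingly. First, to achieve more accurate block prediction, a block matching method with region-adaptive segmentation is proposed to find the best matching block. Second, to further improve inter-frame coding performance, a multi-mode inter-frame coding method based on joint-attribute rate-distortion optimization (RDO) is proposed to improve prediction accuracy and reduce bit rate consumption. Experimental results show that the proposed algorithm achieves a BD-BR (Bjontegaard delta bit rate) gain of -22.57% relative to V-PCC. The algorithm is particularly suited to dynamic point cloud scenes with little inter-frame change, such as video surveillance and video conferencing.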
Funding (dynamically reconfigurable RDO structure for HEVC): Sponsored by the National Natural Science Foundation of China (Grant Nos. 61834005, 61772417, 61802304, 61602377, and 61634004), the Shaanxi Province Coordination Innovation Project of Science and Technology (Grant No. 2016KTZDGY02-04-02), the Shaanxi Provincial Key R&D Plan (Grant No. 2017GY-060), and the Shaanxi International Science and Technology Cooperation Program (Grant No. 2018KW-006).
Funding (RD-optimized frame dropping and scheduling): Project (No. STE1093/1-1) supported by the German Research Foundation, Germany.
Funding (improved RDO algorithm in JPEG2000): This project was supported by the National "863" High Technology Program of China (2002AA1Z1420).
Funding (improved rate control method for H.264): Supported by the National Natural Science Foundation of China (60372057).