Scalable video quality enhancement refers to the process of enhancing low quality frames using high quality ones in scalable video bitstreams with time-varying qualities. A key problem in the enhancement is how to sea...Scalable video quality enhancement refers to the process of enhancing low quality frames using high quality ones in scalable video bitstreams with time-varying qualities. A key problem in the enhancement is how to search for correspondence between high quality and low quality frames. Previous algorithms usually use block-based motion estimation to search for correspondences. Such an approach can hardly estimate scale and rotation transforms and always in- troduces outliers to the motion estimation results. In this paper, we propose a pixel-based outlier-free motion estimation algorithm to solve this problem. In our algorithm, the motion vector for each pixel is calculated with respect to estimate translation, scale, and rotation transforms. The motion relationships between neighboring pixels are considered via the Markov random field model to improve the motion estimation accuracy. Outliers are detected and avoided by taking both blocking effects and matching percentage in scale- invariant feature transform field into consideration. Experiments are conducted in two scenarios that exhibit spatial scalability and quality scalability, respectively. Experimental results demonstrate that, in comparison with previous algorithms, the proposed algorithm achieves better correspondence and avoids the simultaneous introduction of outliers, especially for videos with scale and rotation transforms.展开更多
With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract d...With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract developers' interests to transfer video encoding from specialized hardware to more flexible software. In this paper, the encoding structure is set up first to support complexity scalability; then a lot of high performance algorithms are used on the key time-consuming modules in coding process; finally, at programming level, processor characteristics are considered to improve data access efficiency and processing parallelism. Other programming methods such as lookup table are adopted to reduce the computational complexity. Simulation results showed that these ideas could not only improve the global performance of video coding, but also provide great flexibility in complexity regulation.展开更多
首先提出了一种运动估计方法MLBS(Modified Low Band Shift),并在此基础上提出了基于MLBS的小波域视频可分级运动估计方案MLBSSME.利用MLBS方法,提高了运动估计的准确度.借助重叠块运动补偿方法,有效降低了最高分辨率下解码图像...首先提出了一种运动估计方法MLBS(Modified Low Band Shift),并在此基础上提出了基于MLBS的小波域视频可分级运动估计方案MLBSSME.利用MLBS方法,提高了运动估计的准确度.借助重叠块运动补偿方法,有效降低了最高分辨率下解码图像的块效应.根据小波变换的多分辨率特性,利用缩小了的搜索窗口提高了搜索速度.这种方法可有效应用于空间可分级和数率可分级的视频编码器中.实验结果证明,对于多种类型的标准测试视频流,MLBSSME算法始终能保持很高的估计精度.利用该算法补偿得到的预测帧,其PSNR较之基于下层LL子带的分层运动估计方法和子带直接运动估计方法平均要高出1~3dB,而对于空间细节较简单的视频,其PSNR比LBS方法提高了0.5~1dB,并且算法的时空复杂度是LBS方法复杂度的30%~40%.展开更多
基金Acknowledgements This work was supported by the National Science Fund for Distinguished Young Scholars of China (61125102), and the State Key Program of National Natural Science Foundation of China (Grant No. 61133008).
文摘Scalable video quality enhancement refers to the process of enhancing low quality frames using high quality ones in scalable video bitstreams with time-varying qualities. A key problem in the enhancement is how to search for correspondence between high quality and low quality frames. Previous algorithms usually use block-based motion estimation to search for correspondences. Such an approach can hardly estimate scale and rotation transforms and always in- troduces outliers to the motion estimation results. In this paper, we propose a pixel-based outlier-free motion estimation algorithm to solve this problem. In our algorithm, the motion vector for each pixel is calculated with respect to estimate translation, scale, and rotation transforms. The motion relationships between neighboring pixels are considered via the Markov random field model to improve the motion estimation accuracy. Outliers are detected and avoided by taking both blocking effects and matching percentage in scale- invariant feature transform field into consideration. Experiments are conducted in two scenarios that exhibit spatial scalability and quality scalability, respectively. Experimental results demonstrate that, in comparison with previous algorithms, the proposed algorithm achieves better correspondence and avoids the simultaneous introduction of outliers, especially for videos with scale and rotation transforms.
文摘With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract developers' interests to transfer video encoding from specialized hardware to more flexible software. In this paper, the encoding structure is set up first to support complexity scalability; then a lot of high performance algorithms are used on the key time-consuming modules in coding process; finally, at programming level, processor characteristics are considered to improve data access efficiency and processing parallelism. Other programming methods such as lookup table are adopted to reduce the computational complexity. Simulation results showed that these ideas could not only improve the global performance of video coding, but also provide great flexibility in complexity regulation.
文摘首先提出了一种运动估计方法MLBS(Modified Low Band Shift),并在此基础上提出了基于MLBS的小波域视频可分级运动估计方案MLBSSME.利用MLBS方法,提高了运动估计的准确度.借助重叠块运动补偿方法,有效降低了最高分辨率下解码图像的块效应.根据小波变换的多分辨率特性,利用缩小了的搜索窗口提高了搜索速度.这种方法可有效应用于空间可分级和数率可分级的视频编码器中.实验结果证明,对于多种类型的标准测试视频流,MLBSSME算法始终能保持很高的估计精度.利用该算法补偿得到的预测帧,其PSNR较之基于下层LL子带的分层运动估计方法和子带直接运动估计方法平均要高出1~3dB,而对于空间细节较简单的视频,其PSNR比LBS方法提高了0.5~1dB,并且算法的时空复杂度是LBS方法复杂度的30%~40%.