Funding: Project supported by the National Natural Science Foundation of China (No. 60573176) and the Key Technologies R&D Program of Zhejiang Province (Nos. 2005C23047 and 2004C11052), China
Abstract: This paper proposes a fast mode decision algorithm to accelerate the transcoding of videos into H.264 with spatial resolution down-scaling at an arbitrary ratio. The proposed algorithm consists of three steps. First, an early-stop technique identifies the 16×16-mode blocks, which account for about 70% of all macroblocks. Second, a bottom-up merging process determines the modes of the remaining non-early-stopped blocks. Third, half-pixel motion estimation further refines the acquired predictive motion vectors. To obtain the predictive motion vectors for the early-stop and merging processes, we propose a motion vector composition scheme that reuses the information in the pre-encoded input video to handle the spatial resolution down-scaling. Experimental results show that our algorithm is about four times faster than the Cascaded-Decoder-Encoder method, with a negligible PSNR drop and only a small bit-rate increase.
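To illustrate what a motion vector composition step for arbitrary-ratio down-scaling might look like, here is a minimal Python sketch. It assumes a common area-weighted averaging approach, which the abstract does not confirm as the authors' scheme; the names `InputBlock` and `compose_mv` and the parameter `scale` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class InputBlock:
    """A block of the pre-encoded input frame (hypothetical structure)."""
    x: int        # top-left corner, in input-frame pixels
    y: int
    w: int        # block width and height, in pixels
    h: int
    mvx: float    # motion vector of the block, in input-frame pixels
    mvy: float


def compose_mv(blocks, out_x, out_y, out_w, out_h, scale):
    """Predict the motion vector of one output block.

    Assumption: the prediction is the area-weighted average of the input
    motion vectors that overlap the output block's footprint in the input
    frame, divided by the down-scaling ratio. `scale` is the input size
    divided by the output size and may be non-integer (e.g. 1.5).
    """
    # Footprint of the output block, mapped back into the input frame.
    rx0, ry0 = out_x * scale, out_y * scale
    rx1, ry1 = rx0 + out_w * scale, ry0 + out_h * scale

    total = sum_x = sum_y = 0.0
    for b in blocks:
        overlap_w = min(rx1, b.x + b.w) - max(rx0, b.x)
        overlap_h = min(ry1, b.y + b.h) - max(ry0, b.y)
        if overlap_w <= 0 or overlap_h <= 0:
            continue  # this input block does not touch the footprint
        area = overlap_w * overlap_h
        total += area
        sum_x += area * b.mvx
        sum_y += area * b.mvy

    if total == 0:
        return (0.0, 0.0)  # no motion information available
    # Average in input coordinates, then rescale to output resolution.
    return (sum_x / total / scale, sum_y / total / scale)


# Four 16x16 input macroblocks, down-scaled by a ratio of 1.5.
blocks = [
    InputBlock(0, 0, 16, 16, 3.0, 0.0),
    InputBlock(16, 0, 16, 16, 6.0, 0.0),
    InputBlock(0, 16, 16, 16, 3.0, 0.0),
    InputBlock(16, 16, 16, 16, 6.0, 0.0),
]
mv = compose_mv(blocks, 0, 0, 16, 16, 1.5)
```

A vector composed this way would serve only as a starting point; the abstract states that the predictive vectors are later refined by half-pixel motion estimation.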
Funding: Project supported by the National Natural Science Foundation of China (No. 60502033), the Natural Science Foundation of Shanghai (No. 04ZR14084), and the Research Fund for the Doctoral Program of Higher Education (No. 20040248047), China
Abstract: The H.264/MPEG-4 AVC standard is highly competitive owing to its high efficiency, flexibility, and error resilience. Applications such as universal multimedia access, statistical multiplexing, and adaptive video content delivery create an immense demand for converting large volumes of existing multimedia content from other formats into H.264/AVC and vice versa. In this work, we study the remultiplexing and resynchronization problem in system coding after transcoding, aiming to preserve the management and timing information destroyed by transcoding and to enable synchronized decoding from the decoder buffers over a wide range of retrieval or reception conditions. Because the multiplexing and synchronization mechanisms in the system coding of different standards share a common purpose, this paper takes the most widely used format, the MPEG-2 transport stream (TS), as an example and presents a software system and the key technologies for time stamp mapping and the associated buffer management. The solution reuses information contained in the input streams to remultiplex and resynchronize the output in accordance with the prescribed coding and composition structure. Experimental results show that our solution efficiently preserves multimedia presentation performance.
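As a rough illustration of the time stamp mapping problem mentioned in the abstract, the following Python sketch remaps MPEG-2 TS time stamps onto a new output timeline. It relies only on the standard facts that PTS/DTS values are 33-bit counters at 90 kHz and that a PCR combines a 33-bit base with a 9-bit extension at 27 MHz. The function names and the simple offset-based mapping are assumptions for illustration, not the method of the paper.

```python
PTS_MODULUS = 1 << 33      # PTS/DTS are 33-bit counters (90 kHz clock)
PCR_EXT_PER_BASE = 300     # 27 MHz system clock / 90 kHz base clock


def remap_timestamp(ts_in, first_in, first_out):
    """Map an input PTS/DTS onto the output timeline.

    Assumption: the output stream keeps the input's relative timing, so a
    time stamp is shifted by a constant offset, modulo the 33-bit wrap.
    """
    return (ts_in - first_in + first_out) % PTS_MODULUS


def wrapped_diff(a, b):
    """Signed difference a - b in 90 kHz ticks, tolerant of one wraparound."""
    d = (a - b) % PTS_MODULUS
    return d - PTS_MODULUS if d >= PTS_MODULUS // 2 else d


def pcr_to_27mhz(base, ext):
    """Combine the PCR base and extension fields into 27 MHz ticks."""
    return base * PCR_EXT_PER_BASE + ext


# The input stream starts 100 ticks before the counter wraps; the output
# timeline is restarted at zero.
first_in = PTS_MODULUS - 100
mapped_before_wrap = remap_timestamp(PTS_MODULUS - 10, first_in, 0)
mapped_after_wrap = remap_timestamp(5, first_in, 0)
```

The modular arithmetic matters because a long-running stream wraps its 33-bit counter roughly every 26.5 hours, and a naive subtraction across the wrap would yield a large negative interval.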
Abstract: Although the coding modes of H.264 coded video are changed by spatial-resolution-reduction transcoding, the prediction modes and prediction directions of the input and output video remain well correlated. In this paper, we first introduce a new spatial-resolution-reduction transcoding architecture for intra-coded frames in which the distortion can be calculated directly in the compressed domain. We then propose a fast mode decision algorithm that needs only a small part of the rate-distortion optimization (RDO) calculation. For 4×4 luma blocks, the proposed scheme saves 21.3% of the computation on average compared with the cascaded pixel-domain transcoding scheme using the fast intra mode decision algorithm proposed in JVT-G013. For 16×16 luma blocks, our scheme avoids RDO calculation entirely, whereas the scheme in JVT-G013 needs two RDO calculations. Experimental results show that our scheme outperforms that of JVT-G013, achieving significant computational savings with negligible PSNR loss.
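The correlation between input and output prediction modes suggests restricting RDO to a few candidate modes. The Python sketch below shows one hypothetical way to do that for a 4×4 luma block: it ranks the Intra 4×4 modes of the co-located input blocks by frequency and always keeps DC as a fallback. The mode numbering follows the H.264 standard, but the selection rule, the function name `candidate_modes`, and the parameter `max_candidates` are assumptions; the abstract does not describe the authors' actual decision rule.

```python
from collections import Counter

# H.264 Intra 4x4 luma prediction modes (numbering from the standard).
INTRA_4X4_MODES = {
    0: "vertical",
    1: "horizontal",
    2: "DC",
    3: "diagonal down-left",
    4: "diagonal down-right",
    5: "vertical-right",
    6: "horizontal-down",
    7: "vertical-left",
    8: "horizontal-up",
}
DC_MODE = 2


def candidate_modes(input_modes, max_candidates=3):
    """Pick the modes on which full RDO would be run for one output block.

    Assumption: the most frequent modes among the co-located input blocks
    are the likeliest winners, and DC is always kept as a safe fallback.
    """
    candidates = [DC_MODE]
    for mode, _count in Counter(input_modes).most_common():
        if len(candidates) >= max_candidates:
            break
        if mode not in candidates:
            candidates.append(mode)
    return candidates


# Four co-located input blocks: three vertical, one horizontal.
selected = candidate_modes([0, 0, 1, 0])
```

With at most three candidates instead of nine, the number of RDO evaluations per block drops accordingly; the actual savings depend on how often the true best mode falls inside the candidate set.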