Wyner-Ziv Video Coding (WZVC) is considered as a promising video coding scheme for Wireless Video Sensor Networks (WVSNs) due to its high compression efficiency and error resilience functionalities, as well as its...Wyner-Ziv Video Coding (WZVC) is considered as a promising video coding scheme for Wireless Video Sensor Networks (WVSNs) due to its high compression efficiency and error resilience functionalities, as well as its low encoding complex- ity. To achieve a good Rate-Distortion (R-D) per- formance, the current WZVC paradi^prls usually a- dopt an end-to-end rate control scheme in which the decoder repeatedly requests the additional deco- ding data from the encoder for decoding Wyner-Ziv frames. Therefore, the waiting time of the additional decoding data is especially long in multihop WVSNs. In this paper, we propose a novel pro- gressive in-network rate control scheme for WZVC. The proposed in-network puncturing-based rate control scheme transfers the partial channel codes puncturing task from the encoder to the relay nodes. Then, the decoder can request the addition- al decoding data from the relay nodes instead of the encoder, and the total waiting time for deco- ding Wyner-Ziv frames is reduced consequently. Simulation results validate the proposed rate con- trol scheme.展开更多
This paper presents a streaming system using scalable video coding based on H.264/AVC. The system provides a congestion control algorithm supported by channel bandwidth estimation of the client. It uses retransmission...This paper presents a streaming system using scalable video coding based on H.264/AVC. The system provides a congestion control algorithm supported by channel bandwidth estimation of the client. It uses retransmission only for packets of the base layer to disburden the congested network. The bandwidth estimation allows for adjusting the transmission rate quickly to the current available bandwidth of the network. Compared to binomial congestion control, the proposed system allows for shorter start-up times and data rate adaptation. The paper describes the components of this streaming system and the results of experiments showing that the proposed approach works effectively for streaming video.展开更多
Asymmetric stereoscopic video coding can take advantage of binocular suppression in human vision by representing one of the two views in lower quality.This paper proposes a bit allocation strategy for asymmetric stere...Asymmetric stereoscopic video coding can take advantage of binocular suppression in human vision by representing one of the two views in lower quality.This paper proposes a bit allocation strategy for asymmetric stereoscopic video coding.In order to improve the accuracy of bit allocation and rate control in the left view,a proportionalintegral-derivative controller is adopted.Meanwhile,to control the quality fluctuation between consecutive frames of the left view,a quality controller is adopted.Besides,a fuzzy controller is proposed to control the variation in quality between the left and right views by comparing the PSNR disparity of two views with a fixed threshold,which is used to quantize the binocular psycho-visual redundancy and adjust the quantization parameter (QP) of the right view correspondingly.The proposed algorithm has been implemented in H.264/AVC video codec,and the experimental results show its effectiveness in rate control while keeping a good quality for the left view,and fewer bits are allocated for the right view so that the overall bit rate is saved by 7.2% at most without the loss of subjective visual quality for stereoscopic video.展开更多
For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model...For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model, a novel MAD model was developed according to the hierarchical structure. The experimental results demonstrate that the proposed algorithm provides better performance, in terms of average peak signal-to-noise ratio (PSNR) and quality smoothness, than the H.264 reference model, JM14.2, under various sequences.展开更多
This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an e...This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an extension of set partitioning in hierarchical trees algorithm is employed to quantize the wavelet coefficients. Then, the output rate of the coder is controlled at group of frame scale, ensuring that it conforms to the parameters of a leaky bucket controller. Several leaky buckets with different sizes are discussed too. Simulation shows the efficiency of this codec and the effectiveness of the proposed rate control scheme.展开更多
Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable cha...Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.展开更多
Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate...Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate-distortion (R-D) model. We reasonably allocate bit-rate among views based on the correlation analysisl The proposed algorithm consists of three levels to control the rate bits more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.展开更多
Nowadays video coding approach is a major key in many applications for easy transmission and storage consumption. The process of transformation is based on the empirical wavelet transform (EWT). The encoding process o...Nowadays video coding approach is a major key in many applications for easy transmission and storage consumption. The process of transformation is based on the empirical wavelet transform (EWT). The encoding process of video data provides secure and less consumption of storage and the reconstruction process consists of the reverse process with the extraction. In this paper, the coding of video is carried out at a very low bit rate with the enhancement of performance by proposing an approach of modified Set Partitioning in Hierarchical Tree (MSPIHT). This method encodes the high frequency frames with the scheduling of wavelet transform for efficient performances of encoding and improves the ability of both the frequency and time. By applying empirical wavelet transform on each video frame, the component of video frequency is extracted and the low frequency frame is encoded by the H.264/AVC standard. The low coefficient values are ignored in applying the threshold and in the reconstruction process, HBLPCE method is used for imaging enhancement. The simulation of the proposed approach analysis shows better performance in reliable process and efficiency when compared to existing.展开更多
We describe a system for multipoint videoconferencing that offers extremely low end-to-end delay, low cost and complexity, and high scalability, alongside standard features associated with high-end solutions such as r...We describe a system for multipoint videoconferencing that offers extremely low end-to-end delay, low cost and complexity, and high scalability, alongside standard features associated with high-end solutions such as rate matching and per- sonal video layout. The system accommodates heterogeneous receivers and networks based on the Internet Protocol and relies on scalable video coding to provide a coded representation of a source video signal at multiple temporal and spatial resolutions as well as quality levels. These are represented by distinct bitstream components which are created at each end-user encoder. Depending on the specific conferencing environment, some or all of these components are transmitted to a Scalable Video Conferencing Server (SVCS). The SVCS redirects these components to one or more recipients depending on, e.g., the available network con- ditions and user preferences. The scalable aspect of the video coding technique allows the system to adapt to different network conditions, and also accommodates different end-user requirements (e.g., a user may elect to view another user at a high or low spatial resolution). Performance results concerning flexibility, video quality and delay of the system are presented using the Joint Scalable Video Model (JSVM) of the forthcoming SVC (H.264 Annex G) standard, demonstrating that scalable coding outper- forms existing state-of-the-art systems and offers the right platform for building next-generation multipoint videoconferencing systems.展开更多
In this paper, we propose a new method for very low bit-rate video coding that combines H.264/AVC standard and two-dimensional discrete wavelet transform. In this method, first a two dimensional wavelet transform is a...In this paper, we propose a new method for very low bit-rate video coding that combines H.264/AVC standard and two-dimensional discrete wavelet transform. In this method, first a two dimensional wavelet transform is applied on each video frame independently to extract the low frequency components for each frame and then the low frequency parts of all frames are coded using H.264/AVC codec. On the other hand, the high frequency parts of the video frames are coded by Run Length Coding algorithm, after applying a threshold to neglect the low value coefficients. Experiments show that our proposed method can achieve better rate-distortion performance at very low bit-rate applications below 16 kbits/s compared to applying H.264/AVC standard directly to all frames. Applications of our proposed video coding technique include video telephony, video-conferencing, transmitting or receiving video over half-rate traffic channels of GSM networks.展开更多
The dilemma of the quantization parameter (QP) being involved in both rate control and rate-distortion optimization (RDO) prevents using the traditional rate control scheme. Although some rate control schemes are prop...The dilemma of the quantization parameter (QP) being involved in both rate control and rate-distortion optimization (RDO) prevents using the traditional rate control scheme. Although some rate control schemes are proposed to circumvent the dilemma, the inaccurate prediction model and improper bit allocation deter H.264 application on low bandwidth channel. To resolve this issue, this paper proposes a novel rate control scheme by considering the macroblock (MB) encoding complexity variation and buffer variation and by exploiting the spatio-temporal correlation sufficiently well. Simulations showed that this scheme improves the perceptual quality of the pictures with similar or smaller PSNR deviations when compared to that of rate control in JVT-O016.展开更多
In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on...In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.展开更多
This letter proposes a rate control algorithm for H.264 video encoder, which is based on block activity and buffer state. Experimental results indicate that it has an excellent performance by providing much accurate b...This letter proposes a rate control algorithm for H.264 video encoder, which is based on block activity and buffer state. Experimental results indicate that it has an excellent performance by providing much accurate bit rate and better coding efficiency compared with H.264. The computational complexity of the algorithm is reduced by adopting a novel block activity description method using the Sum of Absolute Difference (SAD) of 16× 16 mode, and its robustness is enhanced by introducing a feedback circuit at frame layer.展开更多
The design and realization of a videoconference system based on international recommendation are introduced in this paper, and the hardware implementation of video codec based on ITU-T H. 261 is briefly discussed. Fur...The design and realization of a videoconference system based on international recommendation are introduced in this paper, and the hardware implementation of video codec based on ITU-T H. 261 is briefly discussed. Furthermore, the buffer control method and the adaptive control strategy for quantization are proposed, which are adaptive and robust. This system can be operated under the transmission rate ranging from 128kb/s to 2Mb/s. With these strategies for the videoconference system, the high quality image is obtained. The time delay of the system is about half a second.展开更多
A new video watermarking method for the Audio Video coding Standard (AVS) is proposed. According to human visual masking properties, this method determines the region of interest for watermark embedding by analyzing v...A new video watermarking method for the Audio Video coding Standard (AVS) is proposed. According to human visual masking properties, this method determines the region of interest for watermark embedding by analyzing video semantics, and generates dynamic robust watermark according to video motion semantics, and embeds watermarks in the Intermediate Frequency (IF) Discrete Cosine Transform (DCT) coefficients of the luminance sub-block prediction residual in the region of interest. This method controls watermark embedding strength adaptively by video textures semantics. Ex- periments show that this method is robust not only to various conventional attacks, but also to re-frame, frame cropping, frame deletion and other video-specific attacks.展开更多
Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached t...Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached the international standard. Its coding efficiency is 2 to 3 times greater than that of MPEG -2. This technical solution is more simple, and it can greatly save channel resource. After more than ten years' development, AVS has achieved great success. The latest version of the AVS audio coding standard is ongoing and mainly aims at the increasing demand for low bitrate and high quality audio services. The paper reviews the history and recent development of AVS audio coding standard in terms of basic features, key techniques and performance. Finally, the future development of AVS audio coding standard is discussed.展开更多
Block matching has been used for motion estimation and motion compensation in the Au- dio and Video Standard (AVS) for years. While having an acceptable performance in describing mo- tion between frames, it requires q...Block matching has been used for motion estimation and motion compensation in the Au- dio and Video Standard (AVS) for years. While having an acceptable performance in describing mo- tion between frames, it requires quite a few bits to represent the motion vectors. In certain circum- stances, the use of global motion estimation and compensation would perform equally well or even better than the block matching in terms of motion accuracy, while it results in the coding of global motion model parameters. In this letter, we modify an AVS coder by adding (1) six global motion model parameters to the frame header, and (2) mode selection among INTRA, SKIP, INTER-16×16, INTER-16×8, INTER-8×16, INTER-8×8, and Global Motion Compensation (GMC) modes by La- grange optimal rate-distortion criteria. Simulation results demonstrate that over 0.1dB improvement in PSNR is obtained compared to the AVS coder for an average coded P-frame with the same bit- rate.展开更多
The next generation space internet (NGSI) is based on all-IP-based mobile network that merges land-based network, sea-based network, sky-based network, spacebased network, deep space-based network together using exi...The next generation space internet (NGSI) is based on all-IP-based mobile network that merges land-based network, sea-based network, sky-based network, spacebased network, deep space-based network together using existing assess network technologies. There are high signal propagation delays, high error rate, bandwidth variation and time-variety in NGSh In order to adapt to various space communication environment constraints and bandwidth variation, we propose a reduced dimension scalable video coding scheme based on CCSDS IDCS algorithm and quality of service (QoS) control method by cross layer design (CLD). The experimental result shows that this new method has better performance than that of existing algorithms, and can be adaptive to the bandwidth variation dynamically.展开更多
An approach based on adaptive congestion control and adaptive error recovery with RS (Reed-Solomon) coding method is presented for efficient video transmission over the Internet. Featured by weighted moving average ra...An approach based on adaptive congestion control and adaptive error recovery with RS (Reed-Solomon) coding method is presented for efficient video transmission over the Internet. Featured by weighted moving average rate control and TCP-friendliness, AVSP, a novel adaptive video streaming protocol, is designed with adjustable rate control parameters so as to respond quickly to the QoS status fluctuation during video transmission over the Internet. Combined with congestion control policy, an adaptive RS coding error recovery scheme with variable parameters is presented to enhance the robustness of MPEG video transmission over the Internet with restriction to the total system bandwidth .展开更多
基金This paper was supported by the National Key Basic Re- search Program of China under Grant No. 2011 CB302701 the National Natural Science Foundation of China under Grants No. 60833009, No. 61133015+2 种基金 the China National Funds for Distinguished Young Scientists under Grant No. 60925010 the Funds for Creative Research Groups of China under Grant No. 61121001 the Program for Changjiang Scholars and Innovative Research Team in University under Grant No. IRT1049.
文摘Wyner-Ziv Video Coding (WZVC) is considered as a promising video coding scheme for Wireless Video Sensor Networks (WVSNs) due to its high compression efficiency and error resilience functionalities, as well as its low encoding complex- ity. To achieve a good Rate-Distortion (R-D) per- formance, the current WZVC paradi^prls usually a- dopt an end-to-end rate control scheme in which the decoder repeatedly requests the additional deco- ding data from the encoder for decoding Wyner-Ziv frames. Therefore, the waiting time of the additional decoding data is especially long in multihop WVSNs. In this paper, we propose a novel pro- gressive in-network rate control scheme for WZVC. The proposed in-network puncturing-based rate control scheme transfers the partial channel codes puncturing task from the encoder to the relay nodes. Then, the decoder can request the addition- al decoding data from the relay nodes instead of the encoder, and the total waiting time for deco- ding Wyner-Ziv frames is reduced consequently. Simulation results validate the proposed rate con- trol scheme.
文摘This paper presents a streaming system using scalable video coding based on H.264/AVC. The system provides a congestion control algorithm supported by channel bandwidth estimation of the client. It uses retransmission only for packets of the base layer to disburden the congested network. The bandwidth estimation allows for adjusting the transmission rate quickly to the current available bandwidth of the network. Compared to binomial congestion control, the proposed system allows for shorter start-up times and data rate adaptation. The paper describes the components of this streaming system and the results of experiments showing that the proposed approach works effectively for streaming video.
基金Supported by National Natural Science Foundation of China(No.60972054)National High Technology Research and Development Program of China("863"Program,No.2009AA011507)
文摘Asymmetric stereoscopic video coding can take advantage of binocular suppression in human vision by representing one of the two views in lower quality.This paper proposes a bit allocation strategy for asymmetric stereoscopic video coding.In order to improve the accuracy of bit allocation and rate control in the left view,a proportionalintegral-derivative controller is adopted.Meanwhile,to control the quality fluctuation between consecutive frames of the left view,a quality controller is adopted.Besides,a fuzzy controller is proposed to control the variation in quality between the left and right views by comparing the PSNR disparity of two views with a fixed threshold,which is used to quantize the binocular psycho-visual redundancy and adjust the quantization parameter (QP) of the right view correspondingly.The proposed algorithm has been implemented in H.264/AVC video codec,and the experimental results show its effectiveness in rate control while keeping a good quality for the left view,and fewer bits are allocated for the right view so that the overall bit rate is saved by 7.2% at most without the loss of subjective visual quality for stereoscopic video.
基金National Natural Science Foundations of China (No. 60972035,No. 61074009)Natural Science Foundation Program of Shanghai,China ( No. 10ZR1432800)
文摘For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model was proposed based on mean absolute differences (MADs) in different temporal levels (TLs). In the proposed R-Q model, a novel MAD model was developed according to the hierarchical structure. The experimental results demonstrate that the proposed algorithm provides better performance, in terms of average peak signal-to-noise ratio (PSNR) and quality smoothness, than the H.264 reference model, JM14.2, under various sequences.
文摘This paper presents a new video coding system based on wavelet transform and its rate control scheme over ATM networks. First, three dimensional wavelet transform is performed for the original image sequence, and an extension of set partitioning in hierarchical trees algorithm is employed to quantize the wavelet coefficients. Then, the output rate of the coder is controlled at group of frame scale, ensuring that it conforms to the parameters of a leaky bucket controller. Several leaky buckets with different sizes are discussed too. Simulation shows the efficiency of this codec and the effectiveness of the proposed rate control scheme.
基金supported by ZTE Industry-Academia-Research Cooperation Funds under Grant No.CON1503180004the Postdoctoral Science Foundation of China under Gant No.2014M552342the Foundation of Science and Technology Department of Sichuan Province,China under Grant No.2014GZ0005
文摘Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.
基金supported by the National Natural Science Foundation of China (Grant Nos.60832003,60672052,60902085,60972137)the Key Project of Shanghai Municipal Education Commission (Grant No.09ZZ90)+2 种基金the Natural Science Foundation of Shanghai(Grant No.09ZR1412500)the Innovation Foundation of Shanghai University (Grants Nos.10YZ09,SHUCX091061)the Shuguang Plan of Shanghai Education Development Foundation (Grant No.06SG43)
文摘Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate-distortion (R-D) model. We reasonably allocate bit-rate among views based on the correlation analysisl The proposed algorithm consists of three levels to control the rate bits more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.
文摘Nowadays video coding approach is a major key in many applications for easy transmission and storage consumption. The process of transformation is based on the empirical wavelet transform (EWT). The encoding process of video data provides secure and less consumption of storage and the reconstruction process consists of the reverse process with the extraction. In this paper, the coding of video is carried out at a very low bit rate with the enhancement of performance by proposing an approach of modified Set Partitioning in Hierarchical Tree (MSPIHT). This method encodes the high frequency frames with the scheduling of wavelet transform for efficient performances of encoding and improves the ability of both the frequency and time. By applying empirical wavelet transform on each video frame, the component of video frequency is extracted and the low frequency frame is encoded by the H.264/AVC standard. The low coefficient values are ignored in applying the threshold and in the reconstruction process, HBLPCE method is used for imaging enhancement. The simulation of the proposed approach analysis shows better performance in reliable process and efficiency when compared to existing.
文摘We describe a system for multipoint videoconferencing that offers extremely low end-to-end delay, low cost and complexity, and high scalability, alongside standard features associated with high-end solutions such as rate matching and per- sonal video layout. The system accommodates heterogeneous receivers and networks based on the Internet Protocol and relies on scalable video coding to provide a coded representation of a source video signal at multiple temporal and spatial resolutions as well as quality levels. These are represented by distinct bitstream components which are created at each end-user encoder. Depending on the specific conferencing environment, some or all of these components are transmitted to a Scalable Video Conferencing Server (SVCS). The SVCS redirects these components to one or more recipients depending on, e.g., the available network con- ditions and user preferences. The scalable aspect of the video coding technique allows the system to adapt to different network conditions, and also accommodates different end-user requirements (e.g., a user may elect to view another user at a high or low spatial resolution). Performance results concerning flexibility, video quality and delay of the system are presented using the Joint Scalable Video Model (JSVM) of the forthcoming SVC (H.264 Annex G) standard, demonstrating that scalable coding outper- forms existing state-of-the-art systems and offers the right platform for building next-generation multipoint videoconferencing systems.
文摘In this paper, we propose a new method for very low bit-rate video coding that combines H.264/AVC standard and two-dimensional discrete wavelet transform. In this method, first a two dimensional wavelet transform is applied on each video frame independently to extract the low frequency components for each frame and then the low frequency parts of all frames are coded using H.264/AVC codec. On the other hand, the high frequency parts of the video frames are coded by Run Length Coding algorithm, after applying a threshold to neglect the low value coefficients. Experiments show that our proposed method can achieve better rate-distortion performance at very low bit-rate applications below 16 kbits/s compared to applying H.264/AVC standard directly to all frames. Applications of our proposed video coding technique include video telephony, video-conferencing, transmitting or receiving video over half-rate traffic channels of GSM networks.
文摘The dilemma of the quantization parameter (QP) being involved in both rate control and rate-distortion optimization (RDO) prevents using the traditional rate control scheme. Although some rate control schemes are proposed to circumvent the dilemma, the inaccurate prediction model and improper bit allocation deter H.264 application on low bandwidth channel. To resolve this issue, this paper proposes a novel rate control scheme by considering the macroblock (MB) encoding complexity variation and buffer variation and by exploiting the spatio-temporal correlation sufficiently well. Simulations showed that this scheme improves the perceptual quality of the pictures with similar or smaller PSNR deviations when compared to that of rate control in JVT-O016.
基金supported by National Natural Science Foundation of China under Grant No.610700800973 Sub-Program Projects under Grant No.2009CB320906+3 种基金National Science and Technology of Major Special Projects under Grant No.2010ZX03004-003S&T Planning Project of Hubei Provincial Department of Education under Grant No. Q20112805H&SPlanning Project of Hubei Provincial Department of Education under Grant No.2011jyte142Science Foundation of HubeiProvincial under Grant No.2010CDB05103
文摘In order to further improve the efficiency of video compression, we introduce a perceptual characteristics of Human Visual System (HVS) to video coding, and propose a novel video coding rate control algorithm based on human visual saliency model in H.264/AVC. Firstly, we modifie Itti's saliency model. Secondly, target bits of each frame are allocated through the correlation of saliency region between the current and previous frame, and the complexity of each MB is modified through the saliency value and its Mean Absolute Difference (MAD) value. Lastly, the algorithm was implemented in JVT JM12.2. Simulation results show that, comparing with traditional rate control algorithm, the proposed one can reduce the coding bit rate and improve the reconstructed video subjective quality, especially for visual saliency region. It is very suitable for wireless video transmission.
基金the National Nature Science Foundation of China(No.90104013) 863 Project(No.2002AA119010, 2001AA121061 and 2002AA123041)
文摘This letter proposes a rate control algorithm for H.264 video encoder, which is based on block activity and buffer state. Experimental results indicate that it has an excellent performance by providing much accurate bit rate and better coding efficiency compared with H.264. The computational complexity of the algorithm is reduced by adopting a novel block activity description method using the Sum of Absolute Difference (SAD) of 16× 16 mode, and its robustness is enhanced by introducing a feedback circuit at frame layer.
基金the High Technology Research and Development Programme of China
文摘The design and realization of a videoconference system based on international recommendation are introduced in this paper, and the hardware implementation of video codec based on ITU-T H. 261 is briefly discussed. Furthermore, the buffer control method and the adaptive control strategy for quantization are proposed, which are adaptive and robust. This system can be operated under the transmission rate ranging from 128kb/s to 2Mb/s. With these strategies for the videoconference system, the high quality image is obtained. The time delay of the system is about half a second.
基金Supported by the Natural Science Foundation of Shaanxi Province (SJ08F15)the Industry Tackling Project of Shaanxi Province (2010K06-20)the National Natural Science Foundation of China and Civil Aviation Ad-ministration of China (No. 61072110)
文摘A new video watermarking method for the Audio Video coding Standard (AVS) is proposed. According to human visual masking properties, this method determines the region of interest for watermark embedding by analyzing video semantics, and generates dynamic robust watermark according to video motion semantics, and embeds watermarks in the Intermediate Frequency (IF) Discrete Cosine Transform (DCT) coefficients of the luminance sub-block prediction residual in the region of interest. This method controls watermark embedding strength adaptively by video textures semantics. Ex- periments show that this method is robust not only to various conventional attacks, but also to re-frame, frame cropping, frame deletion and other video-specific attacks.
文摘Audio Video Coding Standard (AVS) is a second-generation source coding standard and the first standard for audio and video coding in China with independent intellectual property rights. Its performance has reached the international standard. Its coding efficiency is 2 to 3 times greater than that of MPEG -2. This technical solution is more simple, and it can greatly save channel resource. After more than ten years' development, AVS has achieved great success. The latest version of the AVS audio coding standard is ongoing and mainly aims at the increasing demand for low bitrate and high quality audio services. The paper reviews the history and recent development of AVS audio coding standard in terms of basic features, key techniques and performance. Finally, the future development of AVS audio coding standard is discussed.
基金Supported by the National Natural Science Foundation of China (60673092)Natural science Foundation of Ji-angsu Province (No.BK2003029)the Key Laboratory Project of Information Science & Engineering of Railway of the Ministry of Railways, China (No.TDXX0501).
文摘Block matching has been used for motion estimation and motion compensation in the Au- dio and Video Standard (AVS) for years. While having an acceptable performance in describing mo- tion between frames, it requires quite a few bits to represent the motion vectors. In certain circum- stances, the use of global motion estimation and compensation would perform equally well or even better than the block matching in terms of motion accuracy, while it results in the coding of global motion model parameters. In this letter, we modify an AVS coder by adding (1) six global motion model parameters to the frame header, and (2) mode selection among INTRA, SKIP, INTER-16×16, INTER-16×8, INTER-8×16, INTER-8×8, and Global Motion Compensation (GMC) modes by La- grange optimal rate-distortion criteria. Simulation results demonstrate that over 0.1dB improvement in PSNR is obtained compared to the AVS coder for an average coded P-frame with the same bit- rate.
基金the National Natural Science Foundation of China (Grant Nos. 60773137 and 60573112)the National 863 Project (Grant No. 2007AA01Z297)the National Foundation (Grant No. 1320061262)
文摘The next generation space internet (NGSI) is based on all-IP-based mobile network that merges land-based network, sea-based network, sky-based network, spacebased network, deep space-based network together using existing assess network technologies. There are high signal propagation delays, high error rate, bandwidth variation and time-variety in NGSh In order to adapt to various space communication environment constraints and bandwidth variation, we propose a reduced dimension scalable video coding scheme based on CCSDS IDCS algorithm and quality of service (QoS) control method by cross layer design (CLD). The experimental result shows that this new method has better performance than that of existing algorithms, and can be adaptive to the bandwidth variation dynamically.
文摘An approach based on adaptive congestion control and adaptive error recovery with RS (Reed-Solomon) coding method is presented for efficient video transmission over the Internet. Featured by weighted moving average rate control and TCP-friendliness, AVSP, a novel adaptive video streaming protocol, is designed with adjustable rate control parameters so as to respond quickly to the QoS status fluctuation during video transmission over the Internet. Combined with congestion control policy, an adaptive RS coding error recovery scheme with variable parameters is presented to enhance the robustness of MPEG video transmission over the Internet with restriction to the total system bandwidth .