Abstract: A two-stage automatic key frame selection method is proposed to improve stitching speed and quality for UAV aerial videos. In the first stage, to reduce redundancy, the overlapping rate of the UAV aerial video sequence within the sampling period is calculated, and Lagrange interpolation is used to fit the overlapping rate curve of the sequence. An empirical overlapping rate threshold is then applied to select candidate key frames from the sequence. In the second stage, the principle of minimizing remapping spots is used to dynamically adjust the selection and determine each final key frame near its candidate key frame. Comparative experiments show that the proposed method improves stitching speed and accuracy by more than 40%.
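The first stage described above can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes the overlapping rate relative to the current key frame has already been measured at a few sampled frame indices, fits those samples with a Lagrange polynomial, and returns the first frame whose fitted overlap drops to the threshold. The function names, the sample values, and the threshold of 0.65 are all hypothetical.

```python
def lagrange_interpolate(xs, ys, x):
    """Evaluate the Lagrange polynomial through the points (xs[i], ys[i]) at x."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def first_candidate_key_frame(sample_idx, sample_overlap, threshold):
    """Return the first frame index whose fitted overlap is <= threshold, else None."""
    for frame in range(sample_idx[0], sample_idx[-1] + 1):
        if lagrange_interpolate(sample_idx, sample_overlap, frame) <= threshold:
            return frame
    return None


# Hypothetical overlap measurements at every 10th frame (assumption, not from the paper).
sample_idx = [0, 10, 20, 30]
sample_overlap = [1.0, 0.8, 0.6, 0.4]
candidate = first_candidate_key_frame(sample_idx, sample_overlap, threshold=0.65)
```

Keeping the number of samples per period small matters here, because a high-degree Lagrange polynomial can oscillate between sample points.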
Funding: Supported by the National Natural Science Foundation of China under Grant No. 610700800; 973 Sub-Program Projects under Grant No. 2009CB320906; National Science and Technology of Major Special Projects under Grant No. 2010ZX03004-003; S&T Planning Project of Hubei Provincial Department of Education under Grant No. Q20112805; H&S Planning Project of Hubei Provincial Department of Education under Grant No. 2011jyte142; Science Foundation of Hubei Provincial under Grant No. 2010CDB05103.
Abstract: To further improve the efficiency of video compression, we introduce the perceptual characteristics of the Human Visual System (HVS) into video coding and propose a novel rate control algorithm for H.264/AVC based on a human visual saliency model. First, we modify Itti's saliency model. Second, the target bits of each frame are allocated according to the correlation of the salient regions between the current and previous frames, and the complexity of each macroblock (MB) is adjusted using its saliency value and its Mean Absolute Difference (MAD) value. Finally, the algorithm is implemented in JVT JM12.2. Simulation results show that, compared with the traditional rate control algorithm, the proposed one reduces the coding bit rate and improves the subjective quality of the reconstructed video, especially in visually salient regions. It is well suited to wireless video transmission.
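The idea of adjusting MB complexity with saliency and MAD can be illustrated with a small sketch. The abstract does not give the actual formula, so the weighting used here, MAD multiplied by (1 + k * saliency), is an assumption chosen only for illustration, as are the function name and the parameter `k`.

```python
def allocate_mb_bits(frame_bits, mads, saliency, k=1.0):
    """Split a frame's bit budget across MBs in proportion to MAD * (1 + k * saliency).

    The weighting rule is a hypothetical stand-in for the paper's complexity measure.
    """
    weights = [mad * (1.0 + k * s) for mad, s in zip(mads, saliency)]
    total = sum(weights)
    if total == 0:
        # Completely flat content: fall back to an even split.
        return [frame_bits / len(mads)] * len(mads)
    return [frame_bits * w / total for w in weights]


# Two MBs with equal MAD; the second is fully salient and receives more bits.
bits = allocate_mb_bits(600, [2.0, 2.0], [0.0, 1.0])
```

With equal MAD values, the salient MB gets twice the bits of the non-salient one, which matches the stated goal of protecting quality in salient regions.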
Funding: Project (No. CCR-0325639) partially supported by the National Science Foundation, USA
Abstract: Supporting multiple video streams in an ad-hoc wireless network requires appropriate routing and rate allocation measures that determine the set of links for transmitting each stream and the encoding rate of the video to be delivered over the chosen links. The routing and rate allocation procedures affect both the sustained quality of each video stream, measured as the mean squared error (MSE) distortion at the receiver, and the overall network congestion, measured as queuing delay per link. We study the trade-off between these two competing objectives in a convex optimization formulation, and discuss both centralized and distributed solutions for joint routing and rate allocation for multiple streams. For each stream, the optimal allocated rate strikes a balance between the selfish motive of minimizing video distortion and the global good of minimizing network congestion, while the routes are chosen over the least-congested links in the network. In addition to detailed analysis, network simulation results using ns-2 are presented to study the optimal choice of parameters and to confirm the effectiveness of the proposed measures.
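The balance between distortion and congestion can be illustrated for the simplest possible case: one stream on one link. The sketch below is not the paper's formulation. It assumes a distortion term theta / (R - R0) and a congestion term lam / (C - R), both convex on R0 < R < C, and minimizes their sum in closed form. All symbols and function names are assumptions.

```python
import math


def cost(rate, theta, r0, capacity, lam):
    """Assumed distortion term plus weighted congestion term for one stream on one link."""
    return theta / (rate - r0) + lam / (capacity - rate)


def optimal_rate(theta, r0, capacity, lam):
    """Minimizer of cost() on r0 < rate < capacity.

    Setting the derivative to zero gives (capacity - rate) / (rate - r0) = sqrt(lam / theta).
    Requires theta > 0 and lam > 0.
    """
    s = math.sqrt(lam / theta)
    return (capacity + s * r0) / (1.0 + s)


# Equal weights on both objectives place the rate halfway between r0 and capacity.
balanced = optimal_rate(theta=1.0, r0=0.0, capacity=10.0, lam=1.0)
```

Raising `lam` pushes the rate down toward `r0` (less congestion), while raising `theta` pushes it up toward the link capacity (less distortion).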
Abstract: A new rate allocation method for fine-granular scalability (FGS) coded bitstreams is presented. It aims to achieve smooth reconstructed frame quality under channel conditions with a wide range of bandwidth variation and to improve the average PSNR of the whole sequence. Building on a quality-weighted bit allocation method, a sliding-window rate allocation method is proposed for the first time, in which the window slides along the video sequence with a given step. Experimental results show that, under dynamic bandwidth conditions, the proposed method simultaneously improves the average PSNR of the whole video sequence and reduces the quality fluctuations between adjacent frames, both by a large margin.
Abstract: Two wavelet-transform-based video coding schemes that achieve very low bit rates are presented in this paper. The first is a hybrid motion-compensated wavelet transform (MC WT) system, which behaves better at very low bit rates than the block DCT residual coder. The second is a new, efficient coding system based on a simple frame-differencing wavelet transform (FD WT), which performs well in both PSNR and visual quality with substantially reduced complexity.
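The frame-differencing idea can be sketched in a few lines. The abstract does not specify the wavelet or the decomposition depth, so this illustration assumes a one-level Haar transform applied along each row of the frame difference only. The function names are hypothetical and rows are assumed to have even length.

```python
def haar_1d(row):
    """One-level Haar analysis: pairwise averages followed by pairwise half-differences."""
    avg = [(row[i] + row[i + 1]) / 2 for i in range(0, len(row) - 1, 2)]
    diff = [(row[i] - row[i + 1]) / 2 for i in range(0, len(row) - 1, 2)]
    return avg + diff


def fd_wt(prev_frame, cur_frame):
    """Subtract the previous frame, then transform each row of the residual."""
    residual = [
        [c - p for c, p in zip(cur_row, prev_row)]
        for cur_row, prev_row in zip(cur_frame, prev_frame)
    ]
    return [haar_1d(row) for row in residual]


# A static frame yields all-zero coefficients, which are cheap to code.
static = fd_wt([[10, 10, 10, 10]], [[10, 10, 10, 10]])
moving = fd_wt([[10, 10, 10, 10]], [[12, 10, 10, 14]])
```

Because no motion estimation is involved, the cost per frame is one subtraction plus the transform, which is consistent with the reduced complexity the abstract reports for the FD WT scheme.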
Funding: Supported by ZTE Industry-Academia-Research Cooperation Funds under Grant No. CON1503180004; the Postdoctoral Science Foundation of China under Grant No. 2014M552342; and the Foundation of Science and Technology Department of Sichuan Province, China under Grant No. 2014GZ0005.
Abstract: Rate control plays a critical role in achieving good perceived video quality under variable bit rates, limited buffer sizes, and low-delay applications. Since a rate control system exhibits non-linear and unpredictable characteristics, it is difficult to establish a very accurate rate-distortion (R-D) model and obtain effective rate control performance. Considering the excellent control ability and low computational complexity of fuzzy logic in non-linear systems, this paper proposes a bitrate control algorithm based on a fuzzy controller, named the Fuzzy Rate Control Algorithm (FRCA), for All-Intra (AI) and low-delay (LD) video source coding. The contributions of the proposed FRCA consist mainly of four aspects. First, fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder (HRD). Second, a fast lookup table is employed in the fuzzy rate control, which reduces the computing cost of the control process. Third, an input domain determination scheme is proposed to improve the precision of the fuzzy controller. Fourth, a novel scene change detection is introduced and integrated into the FRCA to adaptively adjust the Group-of-Pictures (GOP) length when the source content fluctuates. The FRCA can be ported to and implemented in various industry coders. Extensive experiments show that the FRCA provides accurate variable bit-rate control and maintains a steady buffer size during encoding. Compared with default-configuration encoding under AI and LD, the proposed FRCA achieves the target bit rates more accurately in various classical encoders.
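The first two contributions, fuzzy control of the buffer deviation and a precomputed lookup table, can be illustrated with a toy controller. This is a sketch under stated assumptions, not the FRCA itself: the three fuzzy sets, the rule outputs of -2, 0, and +2 QP steps, the table size, and all names are hypothetical. The input `e` is the buffer deviation (actual minus target) normalized to [-1, 1].

```python
def infer_qp_delta(e):
    """Fuzzy inference for a normalized buffer deviation e in [-1, 1]."""
    e = max(-1.0, min(1.0, e))
    # Membership degrees of three fuzzy sets: Negative, Zero, Positive.
    neg = max(0.0, -e)
    zero = 1.0 - abs(e)
    pos = max(0.0, e)
    # Hypothetical rule outputs: lower QP when under target, raise QP when over target.
    rules = [(neg, -2.0), (zero, 0.0), (pos, 2.0)]
    # Weighted-average defuzzification (the memberships always sum to 1 here).
    return sum(w * out for w, out in rules) / sum(w for w, _ in rules)


# Precompute the controller output once so that encoding only needs a table read.
TABLE_SIZE = 21
LOOKUP = [infer_qp_delta(-1.0 + 2.0 * i / (TABLE_SIZE - 1)) for i in range(TABLE_SIZE)]


def qp_delta_from_table(e):
    """Quantize e to the nearest table entry and return the stored QP adjustment."""
    e = max(-1.0, min(1.0, e))
    index = round((e + 1.0) * (TABLE_SIZE - 1) / 2.0)
    return LOOKUP[index]
```

The table trades a small quantization error in `e` for a constant-time lookup, which is the kind of saving the abstract attributes to its fast lookup table.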
Funding: Supported by the National Natural Science Foundation of China (No. 61071091, No. 60802021) and the Research and Innovation Program for University Postgraduates of Jiangsu Province (CX10B_188Z).
Abstract: When wireless hosts transmit data at different rates in an IEEE 802.11 network, the network exhibits a performance anomaly that severely decreases the throughput of all the higher-rate hosts, which is harmful to video transmission. Considering that video is very sensitive to packet delivery delay but can tolerate some packet loss, we propose a novel cross-layer scheme that takes these two characteristics into account. First, the maximum number of retransmissions for a video Medium Access Control (MAC) frame is computed in the MAC layer according to the video frame rate requirement of the application layer and the current access delay of the MAC layer. Second, within the tolerable Packet Loss Rate (PLR) of the application layer, several video MAC frames are allowed to be dropped, so that the transmission rate for the remaining video MAC frames can be adaptively selected as high as possible given the current channel quality and the maximum number of retransmissions. Experimental results show that the proposed method reduces the delay and jitter of the video service and improves the throughput of fast hosts. It therefore improves the quality of the reconstructed video to a certain extent and effectively relieves the performance anomaly of the network.
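The first step, deriving a retransmission limit from the frame rate and the access delay, might look like the sketch below. The abstract does not state the formula, so this is an assumption: each video frame must be delivered within one frame interval, that interval is shared equally by its MAC frames, and every transmission attempt costs one access delay. The function name, parameters, and the upper bound `retry_cap` are hypothetical.

```python
import math


def max_retransmissions(frame_rate_fps, packets_per_frame, access_delay_s, retry_cap=7):
    """Largest retry count that keeps one MAC frame inside its share of the frame interval."""
    # Time budget for one MAC frame (assumed equal split of the frame interval).
    budget_s = 1.0 / (frame_rate_fps * packets_per_frame)
    # Total attempts that fit in the budget; the first attempt is not a retransmission.
    attempts = math.floor(budget_s / access_delay_s)
    return max(0, min(retry_cap, attempts - 1))


# 25 fps, 4 MAC frames per video frame, 3 ms access delay: 10 ms budget, 3 attempts.
limit = max_retransmissions(25, 4, 0.003)
```

As the access delay grows under contention, the limit shrinks toward zero, so late frames are dropped instead of delaying the frames behind them.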
Abstract: This paper presents a new video coding system based on the wavelet transform, together with its rate control scheme for ATM networks. First, a three-dimensional wavelet transform is applied to the original image sequence, and an extension of the set partitioning in hierarchical trees (SPIHT) algorithm is employed to quantize the wavelet coefficients. Then, the output rate of the coder is controlled at the group-of-frames scale, ensuring that it conforms to the parameters of a leaky bucket controller. Several leaky buckets of different sizes are also discussed. Simulations show the efficiency of the codec and the effectiveness of the proposed rate control scheme.
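Conformance to a leaky bucket at the group-of-frames scale can be checked with a short simulation. This sketch assumes one common convention: the bucket fills with the bits of each group of frames, drains by a fixed amount per group interval, and never goes below empty. The function names and numbers are hypothetical.

```python
def bucket_fullness_trace(gof_bits, drain_per_gof):
    """Bucket fullness after each group of frames, starting from an empty bucket."""
    fullness = 0
    trace = []
    for bits in gof_bits:
        fullness = max(0, fullness + bits - drain_per_gof)
        trace.append(fullness)
    return trace


def conforms(gof_bits, drain_per_gof, bucket_size):
    """True if the bucket never overflows for this sequence of group sizes."""
    return max(bucket_fullness_trace(gof_bits, drain_per_gof), default=0) <= bucket_size


trace = bucket_fullness_trace([100, 300, 100], drain_per_gof=150)
```

The peak of the trace is the smallest bucket size that the sequence conforms to, which gives one way to compare buckets of different sizes.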
Funding: Project (No. STE1093/1-1) supported by the German Research Foundation, Germany
Abstract: We propose a Rate-Distortion (RD) optimized strategy for frame dropping and scheduling of multi-user conversational and streaming videos. We consider a scenario where conversational and streaming videos share the forwarding resources at a network node. Two buffers are set up on the node to temporarily store the packets for these two types of video applications. For streaming video, a large buffer is used because the associated delay constraint of the application is moderate, while a very small buffer is used for conversational video to ensure that the forwarding delay of every packet is limited. A scheduler located behind these two buffers dynamically assigns transmission slots on the outgoing link to the two buffers. Rate-distortion side information is used to perform RD-optimized frame dropping in case of node overload. The data rate on the outgoing link is shared between the conversational and the streaming videos based either on the fullness of the two associated buffers or on the mean incoming rates of the respective videos. Simulation results show that the proposed RD-optimized frame dropping and scheduling approach provides significant performance improvements over the popular priority-based random dropping (PRD) technique.
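Frame dropping driven by rate-distortion side information can be sketched with a greedy heuristic. This is not the paper's algorithm: it assumes each frame carries its size and the distortion its loss would cause, ignores decoding dependencies between frames, and drops the frames with the lowest distortion per saved bit until the load fits. The names and numbers are hypothetical.

```python
def frames_to_drop(frames, capacity_bits):
    """Greedy RD-based dropping.

    frames: list of (frame_id, size_bits, distortion_if_dropped) tuples.
    Returns the ids of dropped frames, in the order they were dropped.
    """
    load = sum(size for _, size, _ in frames)
    dropped = []
    # Visit frames in order of increasing distortion per saved bit.
    for frame_id, size, distortion in sorted(frames, key=lambda f: f[2] / f[1]):
        if load <= capacity_bits:
            break
        dropped.append(frame_id)
        load -= size
    return dropped


# Hypothetical side information for three queued frames.
queue = [("I", 400, 100.0), ("P", 200, 20.0), ("B", 100, 2.0)]
```

For example, `frames_to_drop(queue, 600)` drops only the B frame, whereas a random-dropping policy could just as easily discard the I frame and cause far more distortion for the same overload.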
Abstract: In this paper we discuss the source rate control problem of adapting variable bit-rate (VBR) compressed video to constant bit-rate (CBR) channels. First, we formulate it as an optimal control problem of a discrete linear system with state and control constraints. Then we apply the discrete maximum principle to obtain the optimal solution. Experimental results are given at the end. Compared with traditional algorithms, the proposed algorithm is suitable for coders with continuous output rates and achieves a better solution. The algorithm can be used in both off-line and on-line coding.
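The abstract does not give the system equations. One common way to write an encoder buffer as a discrete linear system with state and control constraints is shown below as an assumed illustration. Here $B_k$ is the buffer occupancy before frame $k$, $R_k$ is the number of bits the coder produces for frame $k$ (the control), and $C$ is the number of bits the CBR channel removes per frame interval. All symbols, including the bounds, are assumptions and not taken from the paper.

```latex
\begin{aligned}
B_{k+1} &= B_k + R_k - C, \\
0 \le B_k &\le B_{\max}, \\
R_{\min} \le R_k &\le R_{\max}.
\end{aligned}
```

The state constraint prevents buffer overflow and underflow, and the control constraint reflects the range of output rates the coder can produce.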
Funding: Supported by National Natural Science Foundation of China (60434030, 60673178, and 60472076) and National Basic Research Program of China (973 Program) (2007CB307106)
Abstract: An improved version of Goh's 3-D wavelet transform (WT) coding scheme is presented in this paper. Compared with other existing 3-D WT coding methods and 2-D WT based coding methods, the new scheme offers a simple code structure, low computation cost, and good performance in PSNR, compression ratio, and visual quality of the reconstructions. The new 3-D WT coding scheme is suitable for very low bit rate video coding.
Abstract: A new motion-compensated 3-D wavelet transform (MC 3DWT) video coding scheme is presented in this paper. The new coding scheme performs well in average PSNR, compression ratio, and visual quality of the reconstructions compared with existing 3-D wavelet transform (3DWT) coding methods and the motion-compensated 2-D wavelet transform (MC WT) coding method. The new MC 3DWT coding scheme is suitable for very low bit rate video coding.
Abstract: Purpose: The objective of this study was to determine whether video-assisted anesthesia induction reduces pediatric patients’ stress. Methods: With approval from the local ethics committee and parental informed consent, 75 children undergoing minor surgery were investigated in this prospective observational study. Patients were divided into three groups: group 1 was aged two to three years, group 2 four to six years, and group 3 seven to ten years. The following three characteristics were evaluated: 1) the pulse rate at four points (the ward, the entrance to the operating room, mask notification, and mask fit); 2) the behavioral score in the operating room; 3) the amount of painkillers given after the operation. Results: In group 1 (N = 20), there was a significant difference between the control group and the video-assisted group in the percentage change in pulse rate, relative to the value in the children’s ward, when the patients looked at the mask. In group 2 (N = 26), there was no significant difference at any point. In group 3 (N = 29), there was a significant difference between the control group and the video-assisted group in the percentage change in pulse rate, relative to the value in the children’s ward, at all points. In addition, the behavioral score differed significantly between the control group and the video-assisted group at all ages. However, there was no significant difference in the use of NSAIDs in the postoperative period between the control group and the video-assisted group. Conclusion: These results show that video-assisted anesthesia induction is effective for pediatric patients.
Funding: National Natural Science Foundations of China (No. 60972035, No. 61074009); Natural Science Foundation Program of Shanghai, China (No. 10ZR1432800)
Abstract: For rate control (RC) of hierarchical structure coding, an independent rate-quantization (R-Q) model is proposed based on the mean absolute differences (MADs) in different temporal levels (TLs). Within the proposed R-Q model, a novel MAD model is developed according to the hierarchical structure. Experimental results demonstrate that the proposed algorithm performs better than the H.264 reference model, JM14.2, in terms of average peak signal-to-noise ratio (PSNR) and quality smoothness across various sequences.
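Keeping an independent R-Q model per temporal level can be sketched as follows. The abstract does not give the model's form, so this illustration assumes the quadratic R-Q relation often used in H.264 rate control, R = x1 * MAD / Q + x2 * MAD / Q^2, with a separate parameter pair for each level. The parameter values, the dictionary `TL_PARAMS`, and the function names are hypothetical.

```python
import math


def q_step_for_target(target_bits, mad, x1, x2):
    """Solve target_bits = x1*mad/q + x2*mad/q**2 for the positive root q.

    Requires target_bits > 0, mad > 0, and non-negative x1, x2 (not both zero).
    """
    a = target_bits
    b = -x1 * mad
    c = -x2 * mad
    return (-b + math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)


# Hypothetical (x1, x2) pairs, one per temporal level, updated independently.
TL_PARAMS = {0: (1.0, 0.0), 1: (0.0, 1.0)}


def q_step_for_level(level, target_bits, predicted_mad):
    """Pick the quantization step for a frame using its own temporal level's model."""
    x1, x2 = TL_PARAMS[level]
    return q_step_for_target(target_bits, predicted_mad, x1, x2)
```

Separating the parameters by level keeps the statistics of frames with long prediction distances from distorting the model used for frames with short ones.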