Super-Resolution (SR) technique means to reconstruct High-Resolution (HR) images from a sequence of Low-Resolution (LR) observations,which has been a great focus for compressed video. Based on the theory of Projection...Super-Resolution (SR) technique means to reconstruct High-Resolution (HR) images from a sequence of Low-Resolution (LR) observations,which has been a great focus for compressed video. Based on the theory of Projection Onto Convex Set (POCS),this paper constructs Quantization Constraint Set (QCS) using the quantization information extracted from the video bit stream. By combining the statistical properties of image and the Human Visual System (HVS),a novel Adaptive Quantization Constraint Set (AQCS) is proposed. Simulation results show that AQCS-based SR al-gorithm converges at a fast rate and obtains better performance in both objective and subjective quality,which is applicable for compressed video.展开更多
Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed video...Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed videos is proposed to analyze the video events according to six low-level features. After the detailed analysis of video events, the pattern of global motion and five features in foreground—the principal parts of videos, are employed as the observations of the Hidden Markov Model to classify events in videos. The applications of the proposed framework in some video event detections demonstrate the promising success of the proposed framework on semantic video analysis.展开更多
With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract d...With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract developers' interests to transfer video encoding from specialized hardware to more flexible software. In this paper, the encoding structure is set up first to support complexity scalability; then a lot of high performance algorithms are used on the key time-consuming modules in coding process; finally, at programming level, processor characteristics are considered to improve data access efficiency and processing parallelism. Other programming methods such as lookup table are adopted to reduce the computational complexity. Simulation results showed that these ideas could not only improve the global performance of video coding, but also provide great flexibility in complexity regulation.展开更多
Smweillance system using active tracking camera has no distance limitation of surveillance range compared to supersonic or sound sensors. However, complex motion tracking algorithm requires huge amount of computation,...Smweillance system using active tracking camera has no distance limitation of surveillance range compared to supersonic or sound sensors. However, complex motion tracking algorithm requires huge amount of computation, and it often requires exfmasive DSPs or embedded processors. This paper proposes a novel motion tracking trait based on different image for fast and simple motion tracking. It uses configuration factor to avoid noise and inaccuracy. It reduces the required computation significantly, so as to be implemented on Field Programmable Gate Array(FFGAs ) instead of expensive Digital Signal Processing(DSPs). It also performs calculation for motion estimation in video compression, so it can be easily combined with surveil system with video recording functionality based on video compression. The proposed motion tracking system implemented on Xilinx Vertex-4 FPGA can process 48 frames per second, and operating frequency of motion tracking trait is 100 MHz.展开更多
Straightforward techniques for spatial domain digital video editing (DVE) of compressed video via decompression and recompression are computationally expensive. In this paper, a novel algorithm was proposed for mirror...Straightforward techniques for spatial domain digital video editing (DVE) of compressed video via decompression and recompression are computationally expensive. In this paper, a novel algorithm was proposed for mirror-image special effect editing in compressed video without full frame decompression and motion estimation. The results show that with the reducing of computational complexity, the quality of edited video in compressed domain is still close to the quality of the edited video in uncompressed domain at the same bit rate.展开更多
With relatively high transmission capacity and usually unconstrained connections, IEEE802.11 WLANs provide the ideal infrastructure for pervasive video content sharing and communications. However, the delivery of high...With relatively high transmission capacity and usually unconstrained connections, IEEE802.11 WLANs provide the ideal infrastructure for pervasive video content sharing and communications. However, the delivery of high-performance video streams over 802.11 WLANs remains a challenging task due to the inherent characteristics of compressed video and dynamic channels. In this paper, we present a brief survey of various recent innovations that have been developed to enhance the Quality of Service (QoS) performance for video over WLANs. Based on the application scenarios, the solutions have focused mainly on three network layers, that is, Application layer (APP), Media Access Control layer (MAC), and Physical layer (PHY). After reviewing the video compression technology, we first examine various single-layer solutions for video over WLANs. We then discuss several cross-layer solutions that take advantage of mutual interactions between different network layers. Finally, several technical issues beyond QoS performance, including energy and security, are also addressed. We conclude that the application of video over WLANs will continue to increase in future.展开更多
This letter proposes a novel method of compressed video super-resolution reconstruction based on MAP-POCS (Maximum Posterior Probability-Projection Onto Convex Set). At first assuming the high-resolution model subject...This letter proposes a novel method of compressed video super-resolution reconstruction based on MAP-POCS (Maximum Posterior Probability-Projection Onto Convex Set). At first assuming the high-resolution model subject to Poisson-Markov distribution, then constructing the projecting convex based on MAP. According to the characteristics of compressed video, two different convexes are constructed based on integrating the inter-frame and intra-frame information in the wavelet-domain. The results of the experiment demonstrate that the new method not only outperforms the traditional algorithms on the aspects of PSNR (Peak Signal-to-Noise Ratio), MSE (Mean Square Error) and reconstruction vision effect, but also has the advantages of rapid convergence and easy extension.展开更多
A novel Compressed-Sensing-based(CS-based)Distributed Video Coding(DVC)system,called Distributed Adaptive Compressed Video Sensing(DISACOS),is proposed in this paper.In this system,the input frames are divided into ke...A novel Compressed-Sensing-based(CS-based)Distributed Video Coding(DVC)system,called Distributed Adaptive Compressed Video Sensing(DISACOS),is proposed in this paper.In this system,the input frames are divided into key frames and non-key frames,which are encoded by block CS sampling.The key frames are encoded as CS measurements at substantially higher rates than the non-key frames and decoded by the Smoothed Projected Landweber(SPL)algorithm using multi-hypothesis predictions.For the non-key frames,a small number of CS measurements are first transmitted to detect blocks having low-quality Side Information(SI)generated by the conventional interpolation or extrapolation at the decoder;then,another group of CS measurements are sampled again upon the decoder’s request.To fully utilise the CS measurements,we adaptively allocate these measurements to each block in terms of different edge features.Finally,the residual frame is reconstructed using the SPL algorithm and the decoded non-key frame is simply determined as the sum of the residual frame and the SI.Experimental results have revealed that our CS-based DVC system yields better rate-distortion performance when compared with other schemes.展开更多
Video reconstruction quality largely depends on the ability of employed sparse domain to adequately represent the underlying video in Distributed Compressed Video Sensing (DCVS). In this paper, we propose a novel dyna...Video reconstruction quality largely depends on the ability of employed sparse domain to adequately represent the underlying video in Distributed Compressed Video Sensing (DCVS). In this paper, we propose a novel dynamic global-Principal Component Analysis (PCA) sparse representation algorithm for video based on the sparse-land model and nonlocal similarity. First, grouping by matching is realized at the decoder from key frames that are previously recovered. Second, we apply PCA to each group (sub-dataset) to compute the principle components from which the sub-dictionary is constructed. Finally, the non-key frames are reconstructed from random measurement data using a Compressed Sensing (CS) reconstruction algorithm with sparse regularization. Experimental results show that our algorithm has a better performance compared with the DCT and K-SVD dictionaries.展开更多
Pornographic image/video recognition plays a vital role in network information surveillance and management. In this paper, its key techniques, such as skin detection, key frame extraction, and classifier design, etc.,...Pornographic image/video recognition plays a vital role in network information surveillance and management. In this paper, its key techniques, such as skin detection, key frame extraction, and classifier design, etc., are studied in compressed domain. A skin detection method based on data-mining in compressed domain is proposed firstly and achieves the higher detection accuracy as well as higher speed. Then, a cascade scheme of pornographic image recognition based on selective decision tree ensemble is proposed in order to improve both the speed and accuracy of recognition. A pornographic video oriented key frame extraction solution in compressed domain and an approach of pornographic video recognition are discussed respectively in the end.展开更多
Rate-distortion optimization greatly improves the performance of compression coding system so that it pervades all of the source coding from an informationtheoretic standpoint and for the design of practical coding sy...Rate-distortion optimization greatly improves the performance of compression coding system so that it pervades all of the source coding from an informationtheoretic standpoint and for the design of practical coding systems. For the case of rate-distortion optimization, Lagrange multiplier method provides the efficient and nearly optimal solution. In this paper, a fast and efficient algorithm is proposed to solve the optimal slope λ* of the rate-distortion curve at the given bit budget. Based on Lagrange multiplier method, the presented algorithm find λ* using the golden-ratio search. Compared with the Bisection method that only adapts to the system with the dense operational points on the rate-distortion curve, the proposed algorithm can be adapted to the system whether the operational points are populated densely or not. Thus it can be applied to both the wavelet coding system and the video coding standards such as H. 264, where Bisection method can not work well. In particular, the algorithm has been verified on the platform of the quadtree classified and trellis coded quantized (QTCQ) wavelet image compression system and the newest video coding standard H. 264. The experimental results are provided to demonstrate the efficiency of the algorithm. The proposed algorithm can improve the performance. A gain abour 0.6 - 0.7 dB can be achieved with the same rate in H. 264. In addition, it converges as fast as Bisection method, with almost the same ctinplexity.展开更多
In this paper, a rate-distortion based quantization level adjustment (RDQLA) algorithm is presented. Based on the rate-distortion criterion, the quantization level adjustment algorithm effectively improves coding ef...In this paper, a rate-distortion based quantization level adjustment (RDQLA) algorithm is presented. Based on the rate-distortion criterion, the quantization level adjustment algorithm effectively improves coding efficiency by adaptively optimizing quantization levels of the signals near the boundaries of quantization cells and adjusting quantization levels per block. In addition, it has no overhead and is fully compatible with the existing compression standards. The proposed algorithm can be applied in any block based image and video coding method. In particular, the algorithm has been verified on the platform of H.264. Experimental results show that the proposed algorithm improves objective and subjective performances substantially. It is shown that the proposed algorithm has a gain of several dB comparing with the newest H.264 standard for high bit rates.展开更多
基金the Natural Science Foundation of Jiangsu Province (No.BK2004151).
文摘Super-Resolution (SR) technique means to reconstruct High-Resolution (HR) images from a sequence of Low-Resolution (LR) observations,which has been a great focus for compressed video. Based on the theory of Projection Onto Convex Set (POCS),this paper constructs Quantization Constraint Set (QCS) using the quantization information extracted from the video bit stream. By combining the statistical properties of image and the Human Visual System (HVS),a novel Adaptive Quantization Constraint Set (AQCS) is proposed. Simulation results show that AQCS-based SR al-gorithm converges at a fast rate and obtains better performance in both objective and subjective quality,which is applicable for compressed video.
基金Supported in part by the National Natural Science Foundation of China (No. 60572045)the Ministry of Education of China Ph.D. Program Foundation (No.20050698033)Cooperation Project (2005.7-2007.6) with Microsoft Research Asia.
文摘Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed videos is proposed to analyze the video events according to six low-level features. After the detailed analysis of video events, the pattern of global motion and five features in foreground—the principal parts of videos, are employed as the observations of the Hidden Markov Model to classify events in videos. The applications of the proposed framework in some video event detections demonstrate the promising success of the proposed framework on semantic video analysis.
文摘With the development of general-purpose processors (GPP) and video signal processing algorithms, it is possible to implement a software-based real-time video encoder on GPP, and its low cost and easy upgrade attract developers' interests to transfer video encoding from specialized hardware to more flexible software. In this paper, the encoding structure is set up first to support complexity scalability; then a lot of high performance algorithms are used on the key time-consuming modules in coding process; finally, at programming level, processor characteristics are considered to improve data access efficiency and processing parallelism. Other programming methods such as lookup table are adopted to reduce the computational complexity. Simulation results showed that these ideas could not only improve the global performance of video coding, but also provide great flexibility in complexity regulation.
基金sponsored by the MKE(The Ministry of Knowledge Economy,Korea),the ITRC(Information Technology Research Center)support program(NIPA-2009-(C1090-0902-0007))the System Semiconductor Industry Development Center,Human Resource Development Project for IT SOC Architecture
文摘Smweillance system using active tracking camera has no distance limitation of surveillance range compared to supersonic or sound sensors. However, complex motion tracking algorithm requires huge amount of computation, and it often requires exfmasive DSPs or embedded processors. This paper proposes a novel motion tracking trait based on different image for fast and simple motion tracking. It uses configuration factor to avoid noise and inaccuracy. It reduces the required computation significantly, so as to be implemented on Field Programmable Gate Array(FFGAs ) instead of expensive Digital Signal Processing(DSPs). It also performs calculation for motion estimation in video compression, so it can be easily combined with surveil system with video recording functionality based on video compression. The proposed motion tracking system implemented on Xilinx Vertex-4 FPGA can process 48 frames per second, and operating frequency of motion tracking trait is 100 MHz.
文摘Straightforward techniques for spatial domain digital video editing (DVE) of compressed video via decompression and recompression are computationally expensive. In this paper, a novel algorithm was proposed for mirror-image special effect editing in compressed video without full frame decompression and motion estimation. The results show that with the reducing of computational complexity, the quality of edited video in compressed domain is still close to the quality of the edited video in uncompressed domain at the same bit rate.
文摘With relatively high transmission capacity and usually unconstrained connections, IEEE802.11 WLANs provide the ideal infrastructure for pervasive video content sharing and communications. However, the delivery of high-performance video streams over 802.11 WLANs remains a challenging task due to the inherent characteristics of compressed video and dynamic channels. In this paper, we present a brief survey of various recent innovations that have been developed to enhance the Quality of Service (QoS) performance for video over WLANs. Based on the application scenarios, the solutions have focused mainly on three network layers, that is, Application layer (APP), Media Access Control layer (MAC), and Physical layer (PHY). After reviewing the video compression technology, we first examine various single-layer solutions for video over WLANs. We then discuss several cross-layer solutions that take advantage of mutual interactions between different network layers. Finally, several technical issues beyond QoS performance, including energy and security, are also addressed. We conclude that the application of video over WLANs will continue to increase in future.
基金Supported by the Natural Science Foundation of Jiangsu Province (No. BK2004151).
文摘This letter proposes a novel method of compressed video super-resolution reconstruction based on MAP-POCS (Maximum Posterior Probability-Projection Onto Convex Set). At first assuming the high-resolution model subject to Poisson-Markov distribution, then constructing the projecting convex based on MAP. According to the characteristics of compressed video, two different convexes are constructed based on integrating the inter-frame and intra-frame information in the wavelet-domain. The results of the experiment demonstrate that the new method not only outperforms the traditional algorithms on the aspects of PSNR (Peak Signal-to-Noise Ratio), MSE (Mean Square Error) and reconstruction vision effect, but also has the advantages of rapid convergence and easy extension.
基金supported by the Graduate Student Research Innovation Project of Jiangsu Province China under Grants No. CXZZ12_0466, No. CXZZ11_0390the National Natural Science Foundation of China under Grants No. 61071091, No. 61271240+2 种基金the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province China under Grant No. 12KJB510019the Nanjing University of Posts and Telecommunications Natural Science Foundation under Grant No. NY212015the Technology Research Program of Hubei Provincial Department of Education under Grant No. D20121408
文摘A novel Compressed-Sensing-based(CS-based)Distributed Video Coding(DVC)system,called Distributed Adaptive Compressed Video Sensing(DISACOS),is proposed in this paper.In this system,the input frames are divided into key frames and non-key frames,which are encoded by block CS sampling.The key frames are encoded as CS measurements at substantially higher rates than the non-key frames and decoded by the Smoothed Projected Landweber(SPL)algorithm using multi-hypothesis predictions.For the non-key frames,a small number of CS measurements are first transmitted to detect blocks having low-quality Side Information(SI)generated by the conventional interpolation or extrapolation at the decoder;then,another group of CS measurements are sampled again upon the decoder’s request.To fully utilise the CS measurements,we adaptively allocate these measurements to each block in terms of different edge features.Finally,the residual frame is reconstructed using the SPL algorithm and the decoded non-key frame is simply determined as the sum of the residual frame and the SI.Experimental results have revealed that our CS-based DVC system yields better rate-distortion performance when compared with other schemes.
基金supported by the Innovation Project of Graduate Students of Jiangsu Province, China under Grants No. CXZZ12_0466, No. CXZZ11_0390the National Natural Science Foundation of China under Grants No. 61071091, No. 61271240, No. 61201160, No. 61172118+2 种基金the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province, China under Grant No. 12KJB510019the Science and Technology Research Program of Hubei Provincial Department of Education under Grants No. D20121408, No. D20121402the Program for Research Innovation of Nanjing Institute of Technology Project under Grant No. CKJ20110006
文摘Video reconstruction quality largely depends on the ability of employed sparse domain to adequately represent the underlying video in Distributed Compressed Video Sensing (DCVS). In this paper, we propose a novel dynamic global-Principal Component Analysis (PCA) sparse representation algorithm for video based on the sparse-land model and nonlocal similarity. First, grouping by matching is realized at the decoder from key frames that are previously recovered. Second, we apply PCA to each group (sub-dataset) to compute the principle components from which the sub-dictionary is constructed. Finally, the non-key frames are reconstructed from random measurement data using a Compressed Sensing (CS) reconstruction algorithm with sparse regularization. Experimental results show that our algorithm has a better performance compared with the DCT and K-SVD dictionaries.
基金Supported by the National Natural Science Foundation of China (No.60772069)863 High-Tech Project (2008AA01A313)
文摘Pornographic image/video recognition plays a vital role in network information surveillance and management. In this paper, its key techniques, such as skin detection, key frame extraction, and classifier design, etc., are studied in compressed domain. A skin detection method based on data-mining in compressed domain is proposed firstly and achieves the higher detection accuracy as well as higher speed. Then, a cascade scheme of pornographic image recognition based on selective decision tree ensemble is proposed in order to improve both the speed and accuracy of recognition. A pornographic video oriented key frame extraction solution in compressed domain and an approach of pornographic video recognition are discussed respectively in the end.
基金Special Foundation of Outstanding Young Teacher of ShanghaiShanghai Educational Development Foundation,China (No.2007CG66)+1 种基金Shanghai Key Research Project,China ( No.071605125,No.08160510600)Innovation Program of Shanghai Municipal Education Commission,China(No.09ZZ185,No.09YZ337)
文摘Rate-distortion optimization greatly improves the performance of compression coding system so that it pervades all of the source coding from an informationtheoretic standpoint and for the design of practical coding systems. For the case of rate-distortion optimization, Lagrange multiplier method provides the efficient and nearly optimal solution. In this paper, a fast and efficient algorithm is proposed to solve the optimal slope λ* of the rate-distortion curve at the given bit budget. Based on Lagrange multiplier method, the presented algorithm find λ* using the golden-ratio search. Compared with the Bisection method that only adapts to the system with the dense operational points on the rate-distortion curve, the proposed algorithm can be adapted to the system whether the operational points are populated densely or not. Thus it can be applied to both the wavelet coding system and the video coding standards such as H. 264, where Bisection method can not work well. In particular, the algorithm has been verified on the platform of the quadtree classified and trellis coded quantized (QTCQ) wavelet image compression system and the newest video coding standard H. 264. The experimental results are provided to demonstrate the efficiency of the algorithm. The proposed algorithm can improve the performance. A gain abour 0.6 - 0.7 dB can be achieved with the same rate in H. 264. In addition, it converges as fast as Bisection method, with almost the same ctinplexity.
基金the Special Foundation of Outstanding Young Teacher of Shanghai
文摘In this paper, a rate-distortion based quantization level adjustment (RDQLA) algorithm is presented. Based on the rate-distortion criterion, the quantization level adjustment algorithm effectively improves coding efficiency by adaptively optimizing quantization levels of the signals near the boundaries of quantization cells and adjusting quantization levels per block. In addition, it has no overhead and is fully compatible with the existing compression standards. The proposed algorithm can be applied in any block based image and video coding method. In particular, the algorithm has been verified on the platform of H.264. Experimental results show that the proposed algorithm improves objective and subjective performances substantially. It is shown that the proposed algorithm has a gain of several dB comparing with the newest H.264 standard for high bit rates.