Abstract: This paper studies, through computer simulation, a 3-D video encoding scheme suitable for digital TV/HDTV (high-definition television). The encoding scheme is designed to match the characteristics of human vision: low-frequency luminance information is transmitted at the full frame rate for good motion rendition, while the high-frequency luminance signal is transmitted at a reduced frame rate for good detail in static images.
Funding: Project (60772080) supported by the National Natural Science Foundation of China; Project (3240120) supported by Tianjin Subway Safety System, Honeywell Limited, China
Abstract: For intelligent transportation surveillance, a novel background model based on a Marr wavelet kernel and a background subtraction technique based on binary discrete wavelet transforms were introduced. The background model kept a sample of intensity values for each pixel in the image and used this sample to estimate the probability density function of the pixel intensity. The density function was estimated using a new Marr wavelet kernel density estimation technique. Since this approach was quite general, the model could approximate any distribution of the pixel intensity without any assumptions about the underlying distribution shape. The background and the current frame were transformed into the binary discrete wavelet domain, and background subtraction was performed in each sub-band. After the foreground was obtained, shadows were eliminated by an edge detection method. Experimental results show that the proposed method has much lower computational complexity and extracts moving objects with an accuracy ratio higher than 90%, indicating that it is an effective algorithm for intelligent transportation systems.
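The per-pixel density model described in this abstract can be illustrated with a minimal sketch. It is not the authors' method: a standard Gaussian kernel is swapped in for the Marr wavelet kernel (whose exact form is not given here), the wavelet-domain subtraction and shadow removal are omitted, and the names `pixel_density` and `is_foreground`, the bandwidth, and the threshold are all hypothetical.

```python
import math


def gaussian_kernel(u):
    """Standard Gaussian kernel (an assumed stand-in for the Marr wavelet kernel)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def pixel_density(value, samples, bandwidth):
    """Kernel density estimate of one pixel's intensity from its stored samples."""
    total = sum(gaussian_kernel((value - s) / bandwidth) for s in samples)
    return total / (len(samples) * bandwidth)


def is_foreground(value, samples, bandwidth=5.0, threshold=1e-3):
    """Label a pixel as foreground when its intensity is unlikely under the model.

    bandwidth and threshold are illustrative values, not taken from the paper.
    """
    return pixel_density(value, samples, bandwidth) < threshold


# Recent intensity samples kept for a single background pixel (made-up values).
background_samples = [100, 102, 98, 101, 99, 100, 103, 97]

print(is_foreground(100, background_samples))  # intensity close to the samples
print(is_foreground(200, background_samples))  # intensity far from the samples
```

Because the estimate is built directly from the stored samples, no parametric shape (such as a single Gaussian) is imposed on the pixel's intensity distribution.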
Funding: Supported by the National Natural Science Foundation of China (Nos. 60472036, 60402036), the Natural Science Foundation of Beijing (No. 4042008), and the Ph.D. Foundation of Ministry of Education (No. 20040005015).
Abstract: This letter presents a novel spatial error concealment algorithm for H.264 video coding. The algorithm is based on directional interpolation: the Mojette transform is used to estimate the orientation features of the damaged blocks, and the image is interpolated in the appropriate directions. The proposed method is compared with the bilinear interpolation algorithm in the reference implementation of H.264 and with all directional interpolation. Experimental results show that the proposed algorithm has better subjective and objective image reconstruction quality.
Funding: Supported by the China Aviation Fund (No. 02153071)
Abstract: In H.263 video codec systems, motion estimation and the Discrete Cosine Transform (DCT) have the highest computational requirements. To reduce encoder complexity and free resources for other functions, an Improved All Zero Block Finding (IAZBF) method based on the statistical characteristics of DCT coefficients is proposed after a study of existing methods. Compared with existing methods, IAZBF improves the detection efficiency by about 50% without introducing much extra computation. Because it is computed with additions and shifts instead of complicated multiplications, IAZBF has low computational complexity, which suits low-end processors in particular. In addition, IAZBF upholds picture fidelity and remains compatible with the H.263 bitstream standard.
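The idea of detecting all-zero blocks before the DCT can be sketched with a classic sufficient condition. This is not IAZBF, whose decision rule is not given in the abstract. The sketch assumes a quantizer that maps a coefficient to zero whenever its magnitude is below 2·QP; the function names are hypothetical. Since every 8x8 DCT coefficient is bounded in magnitude by SAD/4, a block with SAD < 8·QP must quantize to all zeros, and the check needs only additions and one shift.

```python
import math

N = 8  # block size used by the 8x8 DCT


def dct_8x8(block):
    """Reference 2-D DCT-II of an 8x8 block, used here only to verify the shortcut."""
    def c(k):
        return 1.0 / math.sqrt(2.0) if k == 0 else 1.0

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            acc = 0.0
            for x in range(N):
                for y in range(N):
                    acc += (block[x][y]
                            * math.cos((2 * x + 1) * u * math.pi / 16)
                            * math.cos((2 * y + 1) * v * math.pi / 16))
            out[u][v] = 0.25 * c(u) * c(v) * acc
    return out


def is_all_zero_block(residual, qp):
    """Sufficient (not necessary) test: SAD < 8*QP implies every |coefficient| < 2*QP.

    Assumes a coefficient quantizes to zero when its magnitude is below 2*QP.
    """
    sad = 0
    for row in residual:
        for value in row:
            sad += abs(value)
    return sad < (qp << 3)  # 8*QP computed with a shift


flat_residual = [[1] * N for _ in range(N)]  # made-up low-energy residual
qp = 10
if is_all_zero_block(flat_residual, qp):
    largest = max(abs(coef) for row in dct_8x8(flat_residual) for coef in row)
    print("skipped DCT; largest coefficient", largest, "is below", 2 * qp)
```

The condition is conservative: some blocks that fail it would still quantize to all zeros, which is the kind of gap that improved detection methods aim to narrow.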
Abstract: Somatic cell count (SCC) levels indicate the occurrence of infections in goat udders and are related to the productivity of goat milk, cheese and yoghurt. This work presents a segmentation method for counting somatic cells in goat milk images, with the aim of detecting an infection known as mastitis, which is the major cause of loss in dairy farming. The image segmentation procedure uses the Lab color space and the watershed transform. A large number of samples under variable preparation conditions are treated with the proposed method, and a comparison between manual counting and the proposed technique is presented. Promising results indicate that video-microscopy systems may be employed to develop automated SCC for goat milk.
Funding: Project supported by the National Natural Science Foundation of China (Nos. 61371162 and 61431015)
Abstract: The Karhunen-Loeve transform (KLT) is the optimal transform that minimizes distortion at a given bit allocation for a Gaussian source. As a KLT matrix usually contains non-integers, integer-KLT design is a classical problem. In this paper, a joint reversibility-gain (R-G) model is proposed for integer-KLT design in video coding. Specifically, the 'reversibility' is modeled according to a distortion analysis of applying the forward and inverse integer transforms without quantization. It not only measures how invertible a transform is, but also bounds the distortion introduced by the non-orthonormal integer transform process. The 'gain' means transform coding gain (TCG), which is a widely used criterion for transform design in video coding. Since the KLT maximizes the TCG under some assumptions, the TCG loss ratio (LR) is defined to measure how much coding gain an integer-KLT loses compared with the original KLT. Thus, the R-G model can be explained as follows: subject to a certain TCG LR, the integer-KLT with the best reversibility is the optimal integer transform for a given non-integer KLT. Experimental results show that the R-G model can guide the design of integer-KLTs with good performance.
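The 'gain' criterion mentioned in this abstract can be illustrated with the textbook definition of transform coding gain for an orthonormal transform: the ratio of the arithmetic mean to the geometric mean of the coefficient variances, in dB. The sketch below is an assumption-laden illustration, not the paper's R-G model. The two-sample source with correlation 0.9 and the function names are hypothetical, and the paper's loss ratio is not reproduced because its definition is not given here.

```python
import math


def coefficient_variances(transform, covariance):
    """Variance of each transform coefficient: the diagonal of T R T^T."""
    n = len(transform)
    return [
        sum(row[i] * covariance[i][j] * row[j] for i in range(n) for j in range(n))
        for row in transform
    ]


def transform_coding_gain_db(variances):
    """Textbook TCG for an orthonormal transform: arithmetic / geometric mean, in dB."""
    n = len(variances)
    arithmetic_mean = sum(variances) / n
    geometric_mean = math.exp(sum(math.log(v) for v in variances) / n)
    return 10.0 * math.log10(arithmetic_mean / geometric_mean)


# Hypothetical two-sample source with unit variance and correlation rho.
rho = 0.9
covariance = [[1.0, rho], [rho, 1.0]]

s = 1.0 / math.sqrt(2.0)
klt = [[s, s], [s, -s]]              # eigenvectors of this covariance matrix
identity = [[1.0, 0.0], [0.0, 1.0]]  # no transform at all

gain_klt = transform_coding_gain_db(coefficient_variances(klt, covariance))
gain_identity = transform_coding_gain_db(coefficient_variances(identity, covariance))
print(round(gain_klt, 3), round(gain_identity, 3))
```

For this source the KLT packs the energy into one coefficient (variances 1.9 and 0.1), while the identity leaves both variances at 1 and therefore yields no gain. An integer approximation of a KLT would fall between these two extremes, which is the loss that the abstract's LR is meant to quantify.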