Abstract: Given the limited bandwidth available for underwater video transmission, a low-bit-rate underwater video compression coding method is proposed. A preprocessing stage based on the wavelet transform and coefficient down-sampling removes the visual redundancy of underwater images and reduces the number of coefficients to compute and bits to code. In addition, multi-level wavelet decomposition, inter-frame motion compensation, entropy coding and other methods are combined and applied according to the characteristics of each frame type, which reduces the number of calculations and improves coding efficiency. The experimental results show that the reconstructed image quality meets visual requirements and that the average compression ratio of the underwater video meets the transmission-rate requirements of the underwater acoustic channel.
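The preprocessing idea in this abstract (wavelet transform followed by coefficient down-sampling) can be illustrated with a minimal sketch. The abstract does not name the wavelet or the down-sampling rule, so the one-level Haar transform with averaging normalization and the "keep only the low-low band" rule below are assumptions chosen for brevity, and the function names are hypothetical.

```python
def haar_rows(rows):
    """One-level Haar step along each row: pairwise averages, then pairwise details."""
    out = []
    for row in rows:
        avg = [(row[i] + row[i + 1]) / 2 for i in range(0, len(row), 2)]
        diff = [(row[i] - row[i + 1]) / 2 for i in range(0, len(row), 2)]
        out.append(avg + diff)
    return out


def transpose(matrix):
    """Swap rows and columns of a list-of-lists matrix."""
    return [list(col) for col in zip(*matrix)]


def haar2d(image):
    """One-level 2-D Haar transform (rows first, then columns); sides must be even."""
    return transpose(haar_rows(transpose(haar_rows(image))))


def keep_ll(coeffs):
    """Assumed down-sampling rule: keep only the low-low (top-left) quadrant."""
    half_h, half_w = len(coeffs) // 2, len(coeffs[0]) // 2
    return [row[:half_w] for row in coeffs[:half_h]]


# A 4x4 toy frame; real underwater frames would be far larger.
frame = [
    [10, 12, 20, 22],
    [10, 12, 20, 22],
    [50, 52, 60, 62],
    [50, 52, 60, 62],
]
ll = keep_ll(haar2d(frame))  # 4 coefficients are kept out of 16
```

With this normalization each retained coefficient is the mean of a 2x2 block, so only a quarter of the coefficients go on to the later coding stages.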
Abstract: Two wavelet-transform-based video coding schemes that achieve very low bit rates are presented in this paper. The first is a hybrid motion-compensated wavelet transform (MC-WT) system, which performs better at very low bit rates than the block DCT residual coder. The second is a new, efficient coding system based on a simple frame-differencing wavelet transform (FD-WT), which performs well in both PSNR and visual quality with substantially reduced complexity.
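As a rough illustration of the frame-differencing idea and of the PSNR metric mentioned above, the sketch below codes a frame as a quantized difference from the previous frame and measures the reconstruction error. It is a simplification under stated assumptions: the wavelet stage that the FD-WT scheme applies to the residual is omitted, the uniform quantizer and its step size are hypothetical, and the toy frames are invented for the example.

```python
import math


def frame_difference(current, previous):
    """Residual between the current frame and the previous frame."""
    return [[c - p for c, p in zip(cr, pr)] for cr, pr in zip(current, previous)]


def quantize(residual, step):
    """Uniform quantizer (an assumption; the abstract does not specify one)."""
    return [[round(v / step) for v in row] for row in residual]


def reconstruct(previous, levels, step):
    """Decoder side: add the dequantized residual back to the previous frame."""
    return [[p + q * step for p, q in zip(pr, qr)] for pr, qr in zip(previous, levels)]


def psnr(original, decoded, peak=255):
    """Peak signal-to-noise ratio in dB for 8-bit samples."""
    count = sum(len(row) for row in original)
    mse = sum(
        (a - b) ** 2 for ro, rd in zip(original, decoded) for a, b in zip(ro, rd)
    ) / count
    return math.inf if mse == 0 else 10 * math.log10(peak ** 2 / mse)


previous = [[100, 100], [100, 100]]
current = [[104, 100], [97, 111]]
levels = quantize(frame_difference(current, previous), step=4)
decoded = reconstruct(previous, levels, step=4)
quality = psnr(current, decoded)  # roughly 51 dB for this toy example
```

A larger step gives fewer distinct levels to code and a lower PSNR, which is the trade-off a very-low-bit-rate coder has to balance.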
Abstract: A new motion-compensated 3-D wavelet transform (MC-3DWT) video coding scheme is presented in this paper. The new scheme performs well in average PSNR, compression ratio and visual quality of the reconstructions compared with the existing 3-D wavelet transform (3DWT) coding methods and the motion-compensated 2-D wavelet transform (MC-WT) coding method. The new MC-3DWT coding scheme is suitable for very-low-bit-rate video coding.
Abstract: An improved version of Goh's 3-D wavelet transform (WT) coding scheme is presented in this paper. The new scheme offers a simple code structure, low computation cost and good performance in PSNR, compression ratio and visual quality of the reconstructions when compared with the other existing 3-D WT coding methods and the 2-D WT based coding methods. The new 3-D WT coding scheme is suitable for very-low-bit-rate video coding.
Abstract: ZTE Corporation announced on 1 March that its innovative IPTV low-bit-rate high-definition transcoding solution has been nominated for the World's Best Component or Enabler Award by the IPTV World Forum. The ZTE solution is on display at the Mobile World Congress 2012 (MWC 2012) in Barcelona.
Funding: Project (No. 2004144013) supported by the Chinese Government Scholarship Council, China
Abstract: In this paper, a more efficient, low-complexity and reliable region-of-interest (ROI) image codec for compressing smooth, low-texture remote sensing images is proposed. We explore the efficiency of the modified ROI codec with respect to a selected set of convenient wavelet filters, which is a novel method. This ROI coding experiment analysis, covering low-bit-rate lossy to high-quality lossless reconstruction together with timing analysis, is useful for improving the efficiency of remote sensing ground truth surveillance in terms of time and quality. The subjective results [i.e. fair evaluations by five observers (HVS) using enhanced 3D picture view Hyper memory display technology] and the objective results revealed that, for faster ground truth ROI coding applications, the Symlet-4 adaptation performs better than Biorthogonal 4.4 and Biorthogonal 6.8. However, the discrete Meyer wavelet adaptation is the best solution for delayed ROI image reconstructions.
Funding: Supported by the National Natural Science Foundation of China (61102152)
Abstract: In medium- or long-distance (> 10 km) underwater acoustic speech communication, the information transfer rate is constrained by the complicated, time-varying channel and the limited bandwidth, so the bit rate of speech coding must be as low as possible. The time delay of underwater acoustic wave propagation can be exploited for low-bit-rate speech coding. After investigating the Mixed Excitation Linear Prediction (MELP) standard and taking auditory perceptual features into account, a variable and adjustable bit rate speech codec algorithm is proposed, whose average bit rate is about 600 bps. The average Perceptual Evaluation of Speech Quality Mean Opinion Score (PESQ MOS) of the synthesized speech is about 2.8. Computer simulation and a sea trial show that the proposed algorithm performs well and is robust when the bit error rate is no more than 10^-3. The synthesized speech is vivid and intelligible, and it preserves the main individual characteristics of the speaker.
Funding: Supported by the National Basic Research ("973") Program of China (2009CB320902) and the National Natural Science Foundation of China (60902057)
Abstract: Visual search has been a long-standing problem in applications such as location recognition and product search. Much research has been done on image representation, matching, indexing, and retrieval. Key component technologies for visual search have been developed, and numerous real-world applications are emerging. To ensure application interoperability, the Moving Picture Experts Group (MPEG) has begun standardizing visual search technologies and is developing the compact descriptors for visual search (CDVS) standard. MPEG seeks to develop a collaborative platform for evaluating existing visual search technologies. Peking University has participated in this standardization since the 94th MPEG meeting, and significant progress has been made with the various proposals. A test model (TM) has been selected to determine the basic pipeline and key components of visual search. However, the first-version TM has high computational complexity and imperfect retrieval and matching. Core experiments have therefore been set up to improve the TM. In this article, we summarize key technologies for visual search and report the progress of MPEG CDVS. We discuss Peking University's efforts in CDVS and also discuss unresolved issues.
Abstract: An edge-oriented image sequence coding scheme is presented. On the basis of edge detection, an image is divided into the sensitized region and the smooth region. In this scheme, the architecture of the sensitized region is approximated with linear segments. A rectangular belt is then constructed for each segment. Finally, the gray value distribution in the region is fitted by normal-form polynomials. Model matching and motion analysis are also based on the architecture of the sensitized region. For the smooth region, run-length scanning and linear approximation are used. The images are compressed by means of normal-form polynomial fitting and motion prediction by matching. Simulations show that the subjective quality of the reconstructed picture is excellent at 0.0075 bit per pel.
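To put the reported 0.0075 bit per pel into perspective, the short calculation below converts it into bits per frame and into a compression ratio. The frame size (QCIF, 176 x 144) and the 8-bit grayscale source are assumptions made only for this illustration; the abstract does not state either.

```python
def bits_per_frame(width, height, bits_per_pel):
    """Total coded bits for one frame at a given bit rate per pel."""
    return width * height * bits_per_pel


def compression_ratio(source_bits_per_pel, coded_bits_per_pel):
    """Ratio of the uncompressed to the compressed bit rate per pel."""
    return source_bits_per_pel / coded_bits_per_pel


# Assumed QCIF frame (176 x 144 pels) with 8-bit gray values.
frame_bits = bits_per_frame(176, 144, 0.0075)  # about 190 bits per frame
ratio = compression_ratio(8, 0.0075)           # roughly 1067 to 1
```

Under these assumptions an entire frame is described with fewer than 200 bits, which shows how aggressive a model-based scheme has to be at this rate.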
Abstract: In real-time applications, low delay is an important requirement of video coding. We propose a simple low-delay rate control method for such applications. In this method, the target bits in the frame layer are divided into two parts: uncontrolled bits and controlled bits. The first part is assigned to the header, syntax and motion vectors according to the bits spent on them in the previously encoded frame. The second part is assigned to the DCT coefficients by employing a rate model for macroblock QP determination. Experiments show that the proposed method achieves better performance than the test model TMN5 of H.263, and slightly worse performance, but with lower computational complexity, than the TMN8 of H.263+.
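The two-part bit split described in this abstract can be sketched as follows. This is not the authors' algorithm: the abstract does not give the rate model, so the inverse-proportional model (bits per macroblock roughly equal to a constant divided by QP), the constant itself, and the choice of a single QP for the whole frame are hypothetical simplifications. The QP range of 1 to 31 follows H.263.

```python
import math


def split_frame_budget(bit_rate, frame_rate, previous_overhead_bits):
    """Split the frame target into uncontrolled (header, syntax, motion vector)
    bits, estimated from the previous frame, and controlled (DCT) bits."""
    target = bit_rate / frame_rate
    uncontrolled = min(previous_overhead_bits, target)
    controlled = target - uncontrolled
    return uncontrolled, controlled


def pick_qp(controlled_bits, num_macroblocks, model_constant, qp_min=1, qp_max=31):
    """Hypothetical rate model: bits per macroblock ~= model_constant / QP.
    Returns the smallest QP whose predicted cost fits the budget."""
    bits_per_mb = controlled_bits / num_macroblocks
    if bits_per_mb <= 0:
        return qp_max
    qp = math.ceil(model_constant / bits_per_mb)
    return max(qp_min, min(qp_max, qp))


# Example: 24 kbit/s at 10 frames/s, 600 overhead bits in the previous frame,
# and a QCIF frame with 99 macroblocks.
uncontrolled, controlled = split_frame_budget(24000, 10, 600)
qp = pick_qp(controlled, num_macroblocks=99, model_constant=190)
```

Estimating the uncontrolled part from the previous frame avoids a second encoding pass, which is where the low delay of such a scheme would come from.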
Funding: Supported by the National Natural Science Foundation of China (69572023) and by the Key Project from the Shanghai Education Comm
Abstract: In the context of object-oriented video coding, the encoding of segmentation maps defined by contour networks is particularly critical. In this paper, we present a lossy contour network encoding algorithm in which rate-distortion contour encoding based on the maximum operator and the prediction error for the current frame based on a quadratic motion model are combined into an optimal polygon contour network compression scheme. The bit rate for the contour network can be reduced by about a further 20% compared with the optimal polygonal boundary encoding scheme that uses the maximum operator in the rate-distortion sense.
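One common reading of a maximum-operator distortion for polygonal contour approximation is the largest distance from any original contour point to the polygon edge that replaces it. The sketch below computes that measure; the abstract does not define the operator, so this interpretation, the function names and the toy contour are all assumptions, and the rate-distortion optimization and motion model are not shown.

```python
import math


def point_segment_distance(p, a, b):
    """Euclidean distance from point p to the segment from a to b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0:
        return math.hypot(px - ax, py - ay)
    # Clamp the projection parameter so the closest point stays on the segment.
    t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / length_sq))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))


def max_distortion(contour, vertex_indices):
    """Largest distance from any contour point to the polygon edge covering it."""
    worst = 0.0
    for start, end in zip(vertex_indices, vertex_indices[1:]):
        a, b = contour[start], contour[end]
        for p in contour[start:end + 1]:
            worst = max(worst, point_segment_distance(p, a, b))
    return worst


contour = [(0, 0), (1, 1), (2, 0), (3, 0), (4, 0)]
coarse = max_distortion(contour, [0, 4])       # one edge: cheap but less accurate
fine = max_distortion(contour, [0, 1, 2, 4])   # more vertices: exact but costlier
```

Fewer vertices cost fewer bits but raise the maximum distortion, and a rate-distortion scheme picks the vertex set that balances the two.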