Funding: Sponsored by the National Natural Science Foundation of China (Grant Nos. 61834005, 61772417, 61802304, 61602377, and 61634004), the Shaanxi Province Coordination Innovation Project of Science and Technology (Grant No. 2016KTZDGY02-04-02), the Shaanxi Provincial Key R&D Plan (Grant No. 2017GY-060), and the Shaanxi International Science and Technology Cooperation Program (Grant No. 2018KW-006).
Abstract: The Rate Distortion Optimization (RDO) algorithm in High Efficiency Video Coding (HEVC) has many iterations and a large number of calculations. In order to decrease the calculation time and meet the requirements of fast switching between RDO algorithms of different scales, an RDO dynamic reconfigurable structure is proposed. First, the Quantization Parameter (QP) and bit rate values are loaded through an H-tree Configurable Network (HCN), and the execution status of the array is detected in real time. When a switching request of the RDO algorithm is detected, the corresponding configuration information is delivered. This self-reconfiguration implementation method improves the flexibility and utilization of hardware. Experimental results show that when the control bit width is increased by only 31.25%, the designed configuration network can increase the number of controllable processing units by 32 times, and the execution cycle is 50% lower than that of the same type of design. Compared with the previous RDO algorithm, the RDO algorithm implemented on the reconfigurable array based on the configuration network has an average operating frequency increase of 12.5% and an area reduction of 56.4%.
Funding: Research Project of High-Level Talents of Jiangsu Police Institute (No. 2911118010).
Abstract: An electromagnetic pulse (EMP) is a transient electromagnetic phenomenon with a short leading-edge rise time and a wide spectrum, which usually disrupts communications and damages electronic equipment and systems. It is challenging for an EMP sensor to measure a wideband electromagnetic pulse without distortion across the whole spectrum. Therefore, analyzing the distortion of EMP measurement is crucial to evaluating the sensor distortion characteristics and correcting the measurement results. Waveform fidelity is usually employed to evaluate the distortion of an antenna. However, this metric depends on specific signal waveforms and is thus unsuitable for evaluating and analyzing the distortion of EMP sensors. In this paper, an associated-Hermite-function based distortion analysis method, including system transfer matrices and distortion rates, is proposed, which is general and independent of individual waveforms. The system transfer matrix and distortion rate can be straightforwardly calculated from the signal orthogonal transformation coefficients using associated-Hermite functions. The distortion of a sensor versus frequency is then visualized via the system transfer matrix, which is convenient for quantitative analysis of the distortion. Measurements of a current probe, a coaxial pulse voltage probe, and a B-field sensor were performed, based on which the feasibility and effectiveness of the proposed distortion analysis method are verified.
Funding: Partly supported by NSFC under Grants No. 62293481 and No. 62201505, and partly by the SUTD-ZJU IDEA Grant (SUTD-ZJU (VP) 202102).
Abstract: As conventional communication systems based on classic information theory have closely approached Shannon capacity, semantic communication is emerging as a key enabling technology for the further improvement of communication performance. However, it is still unsettled how to represent semantic information and characterise the theoretical limits of semantic-oriented compression and transmission. In this paper, we consider a semantic source which is characterised by a set of correlated random variables whose joint probability distribution can be described by a Bayesian network. We give the information-theoretic limit on the lossless compression of the semantic source and introduce a low-complexity encoding method that exploits conditional independence. We further characterise the limits on lossy compression of the semantic source and the upper and lower bounds of the rate-distortion function. We also investigate the lossy compression of the semantic source with side information at both the encoder and the decoder, and obtain the corresponding rate-distortion function. We prove that the optimal code of the semantic source is the combination of the optimal codes of each conditionally independent set given the side information.
Funding: Supported by the National Natural Science Foundation of China under Grants No. 61071166, No. 61001152, and No. 61071091.
Abstract: Rate control is one of the key factors influencing multi-view video transmission. However, there is no rate control algorithm in the existing Joint Multi-view Video Coding Model. In this paper, an efficient rate control algorithm and a bit allocation strategy for multi-view video coding are proposed. In order to obtain consistent view quality, a bit allocation model based on the Lagrange optimum algorithm is first proposed. Second, considering the encoding statistical characteristics of different view types, a view weighting factor is introduced, which helps improve the precision of bit allocation among views. Compared with the fixed QP control strategy, experimental results show that the proposed algorithm can efficiently control the bit rate and obtain more consistent views, with improved visual quality.
Funding: Supported by the National Natural Science Foundation of China (60372057).
Abstract: This paper presents an improved rate control method for H.264. First, scene changes are detected by the average absolute difference of the brightness histograms between adjacent frames. Then, the bit allocation and quantization parameters are adjusted using a certain threshold. In addition, the calculation of the mean absolute difference (MAD) is modified, which makes the rate distortion optimization (RDO) more accurate. Extensive simulation results show that the proposed method, compared with G012, can improve the average peak signal-to-noise ratio (PSNR) and moderate the image quality.
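The histogram-based scene-change test described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's implementation: frames are taken to be equally sized 2-D lists of 8-bit luma values, the difference is normalised by the pixel count, and the function names and the `threshold` value are hypothetical.

```python
def luma_histogram(frame, bins=256):
    """Count how many pixels fall at each 8-bit brightness level."""
    hist = [0] * bins
    for row in frame:
        for pixel in row:
            hist[pixel] += 1
    return hist


def histogram_difference(frame_a, frame_b, bins=256):
    """Absolute histogram difference between two frames, per pixel.

    Normalising by the pixel count is an assumption of this sketch;
    the result lies between 0.0 (identical histograms) and 2.0.
    Both frames are assumed to have the same dimensions.
    """
    hist_a = luma_histogram(frame_a, bins)
    hist_b = luma_histogram(frame_b, bins)
    pixel_count = sum(hist_a)
    return sum(abs(a - b) for a, b in zip(hist_a, hist_b)) / pixel_count


def is_scene_change(frame_a, frame_b, threshold=0.5):
    """Flag a scene change when the difference exceeds a (hypothetical) threshold."""
    return histogram_difference(frame_a, frame_b) > threshold


# Two tiny 4x4 test frames: one uniformly dark, one uniformly bright.
dark = [[10] * 4 for _ in range(4)]
bright = [[200] * 4 for _ in range(4)]
```

In an encoder, a positive result would typically trigger the bit allocation and quantization parameter adjustment the abstract mentions; how that adjustment is made is not specified here.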
Funding: This project was supported by the National "863" High Technology Program of China (2002AA1Z1420).
Abstract: An improved rate distortion optimization (RDO) algorithm for JPEG2000 is proposed. The proposed algorithm is suitable for integrated circuit (IC) implementation and can reduce computational cost by 30%. A hardware architecture, which includes a control unit, memory, a divider, and a data converter, is also given to implement the algorithm. The circuit based on the improved algorithm is tested on FPGAs and integrated in a JPEG2000 chip codec core.
Funding: National Natural Science Foundation of China (60372018).
Abstract: In order to reduce the encoding complexity of macroblock coding mode decision in H.264/AVC, a selective smaller block-size searching algorithm and a selective intra coding mode searching algorithm are proposed, exploiting the high correlation among coding modes in the spatial and temporal domains of a video sequence. Simulation results demonstrate that the proposed algorithms significantly reduce computational requirements, with negligible picture quality degradation and a slight bit rate increase.
Abstract: Many modern video encoders use the Lagrangian rate-distortion optimization (RDO) algorithm for mode decisions during the compression procedure. For each encoding stage, this approach involves minimizing a cost, which is a function of rate, distortion, and a multiplier called Lambda. This paper proposes to improve the RDO process by applying two modifications. The first modification is to increase the accuracy of rate estimation, which is achieved by computing a non-integer number of bits for arithmetic coding of the syntax elements. This leads to a more accurate cost computation and therefore a better mode decision. The second modification is to search and adjust the value of Lambda based on the characteristics of each coding stage. For the encoder used, this paper proposes to search multiple values of Lambda for the intra-4x4 mode decision. Moreover, a simple shift in the Lambda value is proposed for motion estimation. Each of these modifications offers a certain gain in RDO performance, and, when all are combined, an average bit-rate saving of up to 7.0% can be achieved for the H.264/AVC codec, while the same concept is applicable to the H.265/HEVC codec as well. The added complexity is contained to a certain level and is also adjustable according to the available processing resources.
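The cost minimisation this abstract refers to is commonly written as J = D + Lambda * R. A minimal sketch of a mode decision built on that cost follows; the `Candidate` type, the mode names, and all distortion and rate figures are made up for illustration and do not come from the paper. The rate field is a float so that a non-integer bit estimate, as the abstract proposes, fits naturally.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    mode: str          # hypothetical mode label
    distortion: float  # e.g. sum of squared differences
    rate_bits: float   # estimated bits; may be fractional


def rd_cost(candidate, lam):
    """Lagrangian cost J = D + lambda * R."""
    return candidate.distortion + lam * candidate.rate_bits


def best_mode(candidates, lam):
    """Pick the candidate with the smallest Lagrangian cost."""
    return min(candidates, key=lambda c: rd_cost(c, lam))


# Illustrative numbers only.
candidates = [
    Candidate("intra16x16", distortion=120.0, rate_bits=10.0),
    Candidate("intra4x4", distortion=60.0, rate_bits=30.5),
]
low_lambda_choice = best_mode(candidates, lam=1.0).mode
high_lambda_choice = best_mode(candidates, lam=4.0).mode
```

With these numbers a small Lambda favours the lower-distortion mode (costs 130 versus 90.5), while a larger Lambda penalises its extra bits and flips the decision (costs 160 versus 182), which is why the choice of Lambda per coding stage matters.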
Abstract: To combat packet loss and realize robust video transmission over the Internet and wireless networks, a new multiple description (MD) video coding method is proposed. In the method, two descriptions for each video frame are first created by group of blocks (GOB) alternation. Motion information is then duplicated in both descriptions, and a process called low quality macroblock update is designed to redundantly encode textures in each frame using standard bit stream syntax. In this way, the output bit streams are standard compliant and better trade-offs between redundancy and single channel reconstruction distortion are achieved. The proposed method performs much better than the well-known MD transform coding (MDTC) method, both in terms of redundancy rate distortion and in the packet loss scenario.
Funding: Supported by the Aviation Science Fund of China (2009ZC15001).
Abstract: A fast motion estimation algorithm for variable block sizes using the "line scan and block merge procedure" is proposed for airborne image compression modules. Full hardware implementation via FPGA is discussed in detail. The proposed pipelined architecture based on the line scan algorithm is capable of calculating, in parallel, the required 41 motion vectors of the various block sizes supported by H.264 within a 16 × 16 block. An adaptive rate distortion cost function is used for the block-size decision. The motion vectors of adjacent small blocks are merged to predict the motion vectors of larger blocks in order to reduce computation. Experimental results show that the proposed method has lower computational complexity than the full search algorithm, with a slight quality decrease and a small bit rate increase. Due to its high real-time processing speed, it can be easily realized in hardware.
Funding: Supported by the National Natural Science Foundation of China (69572023) and by the Key Project from the Shanghai Education Comm
Abstract: In the context of object-oriented video coding, the encoding of segmentation maps defined by contour networks is particularly critical. In this paper, we present a lossy contour network encoding algorithm in which both the rate distortion contour encoding based on the maximum operator and the prediction error for the current frame based on a quadratic motion model are combined into an optimal polygon contour network compression scheme. The bit rate for the contour network can be further reduced by about 20% in comparison with that of the optimal polygonal boundary encoding scheme using the maximum operator in the rate distortion sense.
Funding: Science and Technology Project of State Grid Corporation (GY7111053) [www.sgcc.com.cn].
Abstract: In recent years, more than 50% of the cities in the inland area of our country have suffered from moderate pollution. It can affect the distribution of the electric field along the surface of an insulator and the pollution characteristics of its surface area, and can even endanger the safe operation of transmission lines. This paper takes the XSP-160 porcelain three-umbrella insulator under the action of 0–±30 kV DC voltage as the research object, establishes a physical model of the insulator, and uses COMSOL software to simulate the electric field of the insulator. Comparison with the results of the wind tunnel test of North China Electric Power University verifies the rationality of the simulation method. In view of the medium pollution environment which often occurs in the inland area, the force on the polluted particles deposited on the insulator under 110 kV DC voltage was analyzed. The distribution characteristics of the electric field along the surface in the clean environment and in three kinds of pollution environment (light, medium, and heavy) were simulated and compared. The results showed that: 1) In the moderate pollution environment, the influence of fluid drag force on the movement of polluted particles is greater at the initial stage (0.28–0.33 s); at the late stage (after 0.33 s), the influence of the electric field force begins to increase and gradually plays a major role in the process of fouling. 2) In the three different polluted environments, the potential along the surface of each umbrella skirt of the insulator increases non-linearly with increasing pollution concentration, and there are apparent potential inflection points near b and l. 3) The pollution particle concentration has the most obvious influence on the electric field along the low-voltage umbrella skirt, whose potential distortion rate can be up to 220 times in the heavily polluted environment.
Funding: Supported by the Science and Technology Project of Beijing in 2015 from the Beijing Municipal Science & Technology Commission.
Abstract: A high-medium pressure gas regulator is characterized by few fault samples, many fault types, and complex fault features. In order to improve the accuracy of fault precaution, this paper proposes a fault diagnosis method based on the combination of an improved wavelet packet and the pressure harmonic distortion rate. On the basis of the harmonic distortion rate of the outlet pressure and the energy value of each frequency band obtained by improved wavelet packet decomposition, the rules for fault data are summarized. Finally, a safety precaution model of the high-medium pressure gas regulator is established.
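The abstract does not define its harmonic distortion rate. As a rough illustration only, the sketch below uses the conventional total harmonic distortion formula (root of the summed squared harmonic amplitudes divided by the fundamental amplitude), which is an assumption and may differ from the paper's definition. The function name and the amplitude values are hypothetical.

```python
import math


def harmonic_distortion_rate(fundamental, harmonics):
    """Conventional THD: sqrt(sum of squared harmonic amplitudes) / fundamental.

    Assumes the amplitudes have already been extracted from the
    outlet-pressure signal (for example by a spectral analysis step
    not shown here).
    """
    if fundamental <= 0:
        raise ValueError("fundamental amplitude must be positive")
    return math.sqrt(sum(a * a for a in harmonics)) / fundamental


# Made-up amplitudes: fundamental of 10 with two harmonics of 3 and 4.
example_rate = harmonic_distortion_rate(10.0, [3.0, 4.0])
```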
Abstract: This paper proposes a multiple description (MD) mesh-based motion coding method, which generates two descriptions for mesh-based motion by subsampling the nodes of a right-angled triangular mesh and dividing them into two groups. Motion vectors associated with the mesh nodes in each group are transmitted over distinct channels. With the nodes in each group, two other regular triangular meshes besides the original one can be constructed, and three different prediction images can be reconstructed according to the descriptions available. The proposed MD mesh-based motion coding method is then combined with the pairwise correlating transform (PCT), and a complete MD video coding scheme is proposed. Further measures are taken to reduce the mismatch between the encoder and decoder that occurs when only one description is received and the decoder reconstruction differs from that of the encoder. The performance of the proposed scheme is evaluated using computer simulations, and the results show that, compared to Reibman's MD transform coding (MDTC) method, the proposed scheme achieves better redundancy rate distortion (RRD) performance. In the packet loss scenario, the proposed scheme also outperforms the MDTC method.
Abstract: The design of video encoders involves implementing a fast mode decision (FMD) algorithm to reduce computational complexity while maintaining coding performance. Although H.264/scalable video coding (SVC) achieves high scalability and coding efficiency, its exhaustive computation is highly complex to implement. In this paper, a novel algorithm is proposed to reduce the redundant candidate modes by making use of the correlation among layers. A desired mode list is created based on the probability of each mode being the best mode for each block in the base layer, and candidate modes in the enhancement layer are selected using the correlations of modes between the reference frame and the current frame. Our algorithm is implemented in the joint scalable video model (JSVM) 9.19.15 reference software, and the performance is evaluated based on the average encoding time, peak signal-to-noise ratio (PSNR), and bit rate. The experimental results show a 41.89% improvement in encoding time with a minimal loss of 0.02 dB in PSNR and a 0.05% increase in bit rate.
Funding: Supported by the National High Technology Research and Development Program of China (863 Program) (2006AA040307).
Abstract: H.264 improves coding efficiency by adopting rate distortion optimization (RDO); however, this yields high computational complexity. In order to reduce the intra-frame coding time, a fast and effective intra-prediction mode selection algorithm is proposed. The minimum sum of absolute differences between a sub-block and its marginal weighted pixels is used to select the candidate prediction modes, which speeds up the intra-prediction process together with an early termination criterion. Experimental results show that the proposed method can reduce the encoding time by more than 63% compared with the JM (Joint Model) software and outperforms previous algorithms, with negligible loss of coding performance.
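To illustrate the idea of shortlisting intra-prediction modes by sum of absolute differences (SAD) before running the full RDO, here is a minimal sketch. It is not the paper's algorithm: the weighting of marginal pixels and the early termination criterion are not specified in the abstract, so the sketch uses plain SAD against hypothetical prediction blocks, and the mode names, pixel values, and `keep` parameter are invented for the example.

```python
def sad(block, prediction):
    """Sum of absolute differences between two equally sized 2-D blocks."""
    return sum(
        abs(b - p)
        for row_b, row_p in zip(block, prediction)
        for b, p in zip(row_b, row_p)
    )


def shortlist_modes(block, predictions, keep=2):
    """Return the `keep` mode names whose predictions have the smallest SAD.

    Only these candidates would then go through the expensive RDO
    cost evaluation; the rest are skipped.
    """
    ranked = sorted(predictions, key=lambda mode: sad(block, predictions[mode]))
    return ranked[:keep]


# Tiny 2x2 example with made-up pixel values and mode names.
block = [[10, 10], [20, 20]]
predictions = {
    "vertical": [[10, 10], [10, 10]],    # SAD = 20
    "horizontal": [[10, 10], [20, 20]],  # SAD = 0
    "dc": [[14, 14], [16, 16]],          # SAD = 16
}
candidates = shortlist_modes(block, predictions, keep=2)
```

The saving comes from the fact that SAD needs only subtractions and additions, whereas a full RDO evaluation requires transform, quantization, and entropy coding for every candidate mode.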