A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is p...A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is proposed. Each input frame is classified into one of 4 phonetic classes. Non-speech frames are represented with Bark-band noise model. The extracted CWs become rapidly evolving waveforms (REWs) or slowly evolving waveforms (SEWs) in the cases of unvoiced or stationary voiced frames respectively, while mixed voiced frames use the same CW decomposition as that in the conventional CWI. Experimental results show that the proposed codec can eliminate most buzzy and noisy artifacts existing in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec, the average bit rate can be much lower, and its reconstructed speech quality is much better than FS 1 016 CELP at 4.8 kbit/s and similar to G. 723.1 ACELP at 5.3 kbit/s.展开更多
A new motion compensated 3 D wavelet transform (MC 3DWT) video coding scheme is presented in this paper. The new coding scheme has a good performance in average PSNR, compression ratio and visual quality of reconst...A new motion compensated 3 D wavelet transform (MC 3DWT) video coding scheme is presented in this paper. The new coding scheme has a good performance in average PSNR, compression ratio and visual quality of reconstructions compared with the existing 3 D wavelet transform (3DWT) coding methods and motion compensated 2 D wavelet transform (MC WT) coding method. The new MC 3DWT coding scheme is suitable for very low bit rate video coding.展开更多
Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable cha...Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.展开更多
Two video coding schemes based on wavelet transform achieving very low bit rate are presented in this paper. The first is a hybrid motion compensated wavelet transform(MC WT)system which behaves better at very low ...Two video coding schemes based on wavelet transform achieving very low bit rate are presented in this paper. The first is a hybrid motion compensated wavelet transform(MC WT)system which behaves better at very low bit rates than the block DCT residual coder. The second is a new efficient coding system based on a simple frame differencing wavelet transform(FD WT)which performs well in both PSNR and visual quality with substantially reduced complexity.展开更多
A new improved Goh's 3 D wavelet transform(WT) coding scheme is presented in this paper. The new scheme has great advantages including a simple code structure, low computation cost and good performance in PSNR, c...A new improved Goh's 3 D wavelet transform(WT) coding scheme is presented in this paper. The new scheme has great advantages including a simple code structure, low computation cost and good performance in PSNR, compression ratios and visual quality of reconstructions, when compared to the other existing 3 D WT coding methods and the 2 D WT based coding methods. The new 3 D WT coding scheme is suitable for very low bit rate video coding.展开更多
利用混合激励线性预测(mixed excitation linear prediction,MELP)算法和码激励线性预测(code excitation linear prediction,CELP)算法的优点,提出了一种混合MELP/CELP语音编码模型。编码端对强浊音帧采用MELP编码,对弱浊音帧和清音帧...利用混合激励线性预测(mixed excitation linear prediction,MELP)算法和码激励线性预测(code excitation linear prediction,CELP)算法的优点,提出了一种混合MELP/CELP语音编码模型。编码端对强浊音帧采用MELP编码,对弱浊音帧和清音帧进行CELP编码。MELP编码器采用相位对齐技术提取强浊音帧的相位参数,解决了合成语音与原始语音在时间上不同步的问题。对实现的4 kbit/s混合MELP/CELP声码器进行客观MOS(mean opinion score)值和主观DRT(diagnostic rhythm test)清晰度测试,结果表明,该声码器的合成语音具有较高的可懂度和清晰度。展开更多
为了研究不同码型的卷积码在水下湍流信道中的误码率(BER)性能,采用接受-拒绝采样模拟湍流信道乘性干扰,并选择二进制相移键控(BPSK)调制方式,建立Gamma-Gamma湍流信道通信系统仿真模型。仿真结果表明:在不同强度的湍流信道中,采用卷积...为了研究不同码型的卷积码在水下湍流信道中的误码率(BER)性能,采用接受-拒绝采样模拟湍流信道乘性干扰,并选择二进制相移键控(BPSK)调制方式,建立Gamma-Gamma湍流信道通信系统仿真模型。仿真结果表明:在不同强度的湍流信道中,采用卷积码编码均能提升系统的BER性能;卷积码的码率越小,系统BER性能提升越显著;随着信噪比(SNR)增大,记忆深度越长,系统BER下降速度越快;采用软译码比采用硬译码时增益至少提升2.82 d B;卷积码的解码不仅受当前信息的影响,还与之前的码元信息有关。展开更多
Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate...Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate-distortion (R-D) model. We reasonably allocate bit-rate among views based on the correlation analysisl The proposed algorithm consists of three levels to control the rate bits more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.展开更多
Puncturing is the predominant strategy to construct high code rate turbo codes. Puncturing tables are crucial to the performance of punctured turbo codes(PTC). This paper developed a new searching algorithm of optimal...Puncturing is the predominant strategy to construct high code rate turbo codes. Puncturing tables are crucial to the performance of punctured turbo codes(PTC). This paper developed a new searching algorithm of optimal puncturing tables based on average distance spectrum(ADS) criterion. Consequently, some optimal puncturing tables were presented as the searching results. Finally, it presented the performance comparison among some optimal and bad puncturing tables by simulation.展开更多
Turbo codes can achieve excellent performance at low signal-to-noise ratio (SNR), but the performance can be severely degraded if no trellis termination is employed. This paper proved that if trellis termination bits ...Turbo codes can achieve excellent performance at low signal-to-noise ratio (SNR), but the performance can be severely degraded if no trellis termination is employed. This paper proved that if trellis termination bits were appended to RSC1, trellis of RSC2 could be terminated by designing the interleaver properly, consequently, derived the designing condition of such self-terminated interleaver (STI). Then we presented an algorithm of implementing a kind of STI, which could terminate RSC2 as well on condition that the RSC1 was terminated. We verified the performance of STI for turbo codes by simulation, and the simulation results showed that turbo codes with STI outperformed interleavers that could not terminate RSC2 as well.展开更多
This work investigates the performance of various forward error correction codes, by which the MIMO-OFDM system is deployed. To ensure fair investigation, the performance of four modulations, namely, binary phase shif...This work investigates the performance of various forward error correction codes, by which the MIMO-OFDM system is deployed. To ensure fair investigation, the performance of four modulations, namely, binary phase shift keying(BPSK), quadrature phase shift keying(QPSK), quadrature amplitude modulation(QAM)-16 and QAM-64 with four error correction codes(convolutional code(CC), Reed-Solomon code(RSC)+CC, low density parity check(LDPC)+CC, Turbo+CC) is studied under three channel models(additive white Guassian noise(AWGN), Rayleigh, Rician) and three different antenna configurations(2×2, 2×4, 4×4). The bit error rate(BER) and the peak signal to noise ratio(PSNR) are taken as the measures of performance. The binary data and the color image data are transmitted and the graphs are plotted for various modulations with different channels and error correction codes. Analysis on the performance measures confirm that the Turbo + CC code in 4×4 configurations exhibits better performance.展开更多
The application of protograph low density parity check (LDPC) codes involves the encoding complexity problem. Since the generator matrices are dense, and if the positions of "1" s are irregularity, the encoder nee...The application of protograph low density parity check (LDPC) codes involves the encoding complexity problem. Since the generator matrices are dense, and if the positions of "1" s are irregularity, the encoder needs to store every "1" of the generator matrices by using huge chip area. In order to solve this problem, we need to design the protograph LDPC codes with circular generator matrices. A theorem concerning the circulating property of generator matrices of nonsingular protograph LDPC codes is proposed. The circulating property of generator matrix of nonsingular protograph LDPC codes can be obtained from the corresponding quasi-cyclic parity check matrix. This paper gives a scheme of constructing protograph LDPC codes with circulating generator matrices, and it reveals that the fast encoding algorithm of protograph LDPC codes has lower encoding complexity under the condition of the proposed theorem. Simulation results in ad- ditive white Gaussian noise (AWGN) channels show that the bit error rate (BER) performance of the designed codes based on the proposed theorem is much better than that of GB20600 LDPC codes and Tanner LDPC codes.展开更多
Currently puncturing is the predominant strategy to construct high code rate turbo codes. The puncturing period and puncturing patterns, which have important effect on the performance of punctured turbo codes (PTC), y...Currently puncturing is the predominant strategy to construct high code rate turbo codes. The puncturing period and puncturing patterns, which have important effect on the performance of punctured turbo codes (PTC), yet have not received complete investigations, are addressed in this paper. Proposes on selecting puncturing period and puncturing patterns are presented. Since puncturing will alter the distance spectrum of turbo codes, the performance of PTC needs further consideration. We derive an analytical upper bound for PTC, based on the assumption of uniform puncturing defined in this paper. Finally, we present some numeric results on the performance of PTC.展开更多
文摘A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec with about 1.8 kbit/s average bit rate which integrates phonetic classification into characteristic waveform (CW) decomposition is proposed. Each input frame is classified into one of 4 phonetic classes. Non-speech frames are represented with Bark-band noise model. The extracted CWs become rapidly evolving waveforms (REWs) or slowly evolving waveforms (SEWs) in the cases of unvoiced or stationary voiced frames respectively, while mixed voiced frames use the same CW decomposition as that in the conventional CWI. Experimental results show that the proposed codec can eliminate most buzzy and noisy artifacts existing in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec, the average bit rate can be much lower, and its reconstructed speech quality is much better than FS 1 016 CELP at 4.8 kbit/s and similar to G. 723.1 ACELP at 5.3 kbit/s.
文摘A new motion compensated 3 D wavelet transform (MC 3DWT) video coding scheme is presented in this paper. The new coding scheme has a good performance in average PSNR, compression ratio and visual quality of reconstructions compared with the existing 3 D wavelet transform (3DWT) coding methods and motion compensated 2 D wavelet transform (MC WT) coding method. The new MC 3DWT coding scheme is suitable for very low bit rate video coding.
基金supported by ZTE Industry-Academia-Research Cooperation Funds under Grant No.CON1503180004the Postdoctoral Science Foundation of China under Gant No.2014M552342the Foundation of Science and Technology Department of Sichuan Province,China under Grant No.2014GZ0005
文摘Rate control plays a critical role in achieving perceivable video quality under a variable bit rate,limited buffer sizes and low delay applications.Since a rate control system exhibits non-linear and unpredictable characteristics,it is difficult to establish a very accurate rate-distortion(R-D)model and acquire effective rate control performance.Considering the excellent control ability and low computing complexity of the fuzzy logic in non-linear systems,this paper proposes a bitrate control algorithm based on a fuzzy controller,named the Fuzzy Rate Control Algorithm(FRCA),for All-Intra(AI)and low-delay(LD)video source coding.Contributions of the proposed FRCA mainly consist of four aspects.First,fuzzy logic is adopted to minimize the deviation between the actual and the target buffer size in the hypothetical reference decoder(HRD).Second,a fast lookup table is employed in fuzzy rate control,which reduces computing cost of the control process.Third,an input domain determination scheme is proposed to improve the precision of the fuzzy controller.Fourth,a novel scene change detection is introduced and integrated in the FRCA to adaptively adjust the Group-of-Pictures(GOP)length when the source content fluctuates.The FRCA can be transplanted and implemented in various industry coders.Extensive experiments show that the FRCA has accurate variable bit-rate control ability and maintains a steady buffer size during the encoding processes.Compared with the default configuration encoding under AI and LD,the proposed FRCA can achieve the target bit rates more accurately in various classical encoders.
文摘Two video coding schemes based on wavelet transform achieving very low bit rate are presented in this paper. The first is a hybrid motion compensated wavelet transform(MC WT)system which behaves better at very low bit rates than the block DCT residual coder. The second is a new efficient coding system based on a simple frame differencing wavelet transform(FD WT)which performs well in both PSNR and visual quality with substantially reduced complexity.
文摘A new improved Goh's 3 D wavelet transform(WT) coding scheme is presented in this paper. The new scheme has great advantages including a simple code structure, low computation cost and good performance in PSNR, compression ratios and visual quality of reconstructions, when compared to the other existing 3 D WT coding methods and the 2 D WT based coding methods. The new 3 D WT coding scheme is suitable for very low bit rate video coding.
文摘为了研究不同码型的卷积码在水下湍流信道中的误码率(BER)性能,采用接受-拒绝采样模拟湍流信道乘性干扰,并选择二进制相移键控(BPSK)调制方式,建立Gamma-Gamma湍流信道通信系统仿真模型。仿真结果表明:在不同强度的湍流信道中,采用卷积码编码均能提升系统的BER性能;卷积码的码率越小,系统BER性能提升越显著;随着信噪比(SNR)增大,记忆深度越长,系统BER下降速度越快;采用软译码比采用硬译码时增益至少提升2.82 d B;卷积码的解码不仅受当前信息的影响,还与之前的码元信息有关。
基金supported by the National Natural Science Foundation of China (Grant Nos.60832003,60672052,60902085,60972137)the Key Project of Shanghai Municipal Education Commission (Grant No.09ZZ90)+2 种基金the Natural Science Foundation of Shanghai(Grant No.09ZR1412500)the Innovation Foundation of Shanghai University (Grants Nos.10YZ09,SHUCX091061)the Shuguang Plan of Shanghai Education Development Foundation (Grant No.06SG43)
文摘Current multi-view video coding (MVC) reference model in joint video team (JVT) does not provide efficient rate control schemes. This paper presents a rate control algorithm for MVC by improving the quadratic rate-distortion (R-D) model. We reasonably allocate bit-rate among views based on the correlation analysisl The proposed algorithm consists of three levels to control the rate bits more accurately, of which the frame layer allocates bits according to the frame complexity and the temporal activity. Extensive experiments show that the proposed algorithm can control the bit rate efficiently.
文摘Puncturing is the predominant strategy to construct high code rate turbo codes. Puncturing tables are crucial to the performance of punctured turbo codes(PTC). This paper developed a new searching algorithm of optimal puncturing tables based on average distance spectrum(ADS) criterion. Consequently, some optimal puncturing tables were presented as the searching results. Finally, it presented the performance comparison among some optimal and bad puncturing tables by simulation.
文摘Turbo codes can achieve excellent performance at low signal-to-noise ratio (SNR), but the performance can be severely degraded if no trellis termination is employed. This paper proved that if trellis termination bits were appended to RSC1, trellis of RSC2 could be terminated by designing the interleaver properly, consequently, derived the designing condition of such self-terminated interleaver (STI). Then we presented an algorithm of implementing a kind of STI, which could terminate RSC2 as well on condition that the RSC1 was terminated. We verified the performance of STI for turbo codes by simulation, and the simulation results showed that turbo codes with STI outperformed interleavers that could not terminate RSC2 as well.
文摘This work investigates the performance of various forward error correction codes, by which the MIMO-OFDM system is deployed. To ensure fair investigation, the performance of four modulations, namely, binary phase shift keying(BPSK), quadrature phase shift keying(QPSK), quadrature amplitude modulation(QAM)-16 and QAM-64 with four error correction codes(convolutional code(CC), Reed-Solomon code(RSC)+CC, low density parity check(LDPC)+CC, Turbo+CC) is studied under three channel models(additive white Guassian noise(AWGN), Rayleigh, Rician) and three different antenna configurations(2×2, 2×4, 4×4). The bit error rate(BER) and the peak signal to noise ratio(PSNR) are taken as the measures of performance. The binary data and the color image data are transmitted and the graphs are plotted for various modulations with different channels and error correction codes. Analysis on the performance measures confirm that the Turbo + CC code in 4×4 configurations exhibits better performance.
基金supported by Beijing Natural Science Foundation(4102050)the National Natural Science of Foundation of China(NSFC)-Korea Science and Engineering Foundation (KOSF) Joint Research Project of China and Korea (60811140343)
文摘The application of protograph low density parity check (LDPC) codes involves the encoding complexity problem. Since the generator matrices are dense, and if the positions of "1" s are irregularity, the encoder needs to store every "1" of the generator matrices by using huge chip area. In order to solve this problem, we need to design the protograph LDPC codes with circular generator matrices. A theorem concerning the circulating property of generator matrices of nonsingular protograph LDPC codes is proposed. The circulating property of generator matrix of nonsingular protograph LDPC codes can be obtained from the corresponding quasi-cyclic parity check matrix. This paper gives a scheme of constructing protograph LDPC codes with circulating generator matrices, and it reveals that the fast encoding algorithm of protograph LDPC codes has lower encoding complexity under the condition of the proposed theorem. Simulation results in ad- ditive white Gaussian noise (AWGN) channels show that the bit error rate (BER) performance of the designed codes based on the proposed theorem is much better than that of GB20600 LDPC codes and Tanner LDPC codes.
基金This work is supported by National 863 Project of China (No. 2002 AA123046)
文摘Currently puncturing is the predominant strategy to construct high code rate turbo codes. The puncturing period and puncturing patterns, which have important effect on the performance of punctured turbo codes (PTC), yet have not received complete investigations, are addressed in this paper. Proposes on selecting puncturing period and puncturing patterns are presented. Since puncturing will alter the distance spectrum of turbo codes, the performance of PTC needs further consideration. We derive an analytical upper bound for PTC, based on the assumption of uniform puncturing defined in this paper. Finally, we present some numeric results on the performance of PTC.