Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.Th...Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.The multichannel audio is represented with 3-order tensor space and is decomposed into core tensor with three factor matrices in the way of channel,time and frequency.Only the truncated core tensor is transmitted which will be multiplied by the pre-trained factor matrices to reconstruct the original tensor space.Objective and subjective experiments have been done to show a very noticeable compression capability with an acceptable output quality.The novelty of the proposed compression method is that it enables both high compression capability and backward compatibility with limited signal distortion to the hearing.展开更多
A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation accordin...A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation according to the algorithm and gets satisfactory quality of the reconstructed music signal. The estimation of operation steps and simulation of decoder finished by a TMS 320C548 simulator are presented. The result is the same as that of the C language simulation.展开更多
Recently, several digital watermarking techniques have been proposed for hiding data in the frequency domain of audio files in order to protect their copyrights. In general, there is a tradeoff between the quality of ...Recently, several digital watermarking techniques have been proposed for hiding data in the frequency domain of audio files in order to protect their copyrights. In general, there is a tradeoff between the quality of watermarked audio and the tolerance of watermarks to signal processing methods, such as compression. In previous research, we simultaneously improved the performance of both by developing a multipurpose optimization problem for deciding the positions of watermarks in the frequency domain of audio data and obtaining a near-optimum solution to the problem. This solution was obtained using a wavelet transform and a genetic algorithm. However, obtaining the near-optimum solution was very time consuming. To overcome this issue essentially, we have developed an authentication method for digital audio using a discrete wavelet transform. In contrast to digital watermarking, no additional information is inserted into the original audio by the proposed method, and the audio is authenticated using features extracted by the wavelet transform and characteristic coding in the proposed method. Accordingly, one can always use copyright-protected original audio. The experimental results show that the method has high tolerance of authentication to all types of MP3, AAC, and WMA compression. In addition, the processing time of the method is acceptable for every-day use.展开更多
为了面向低延时的浅压缩场景提供更加适配的编码方案,并降低硬件实现成本,提出一种基于数字音视频编解码技术标准(Audio Video coding Standard,AVS)浅压缩算法的帧内预测模式优化以及快速率失真优化算法。该算法通过减少原有算法帧内...为了面向低延时的浅压缩场景提供更加适配的编码方案,并降低硬件实现成本,提出一种基于数字音视频编解码技术标准(Audio Video coding Standard,AVS)浅压缩算法的帧内预测模式优化以及快速率失真优化算法。该算法通过减少原有算法帧内预测所需的预测循环次数,以及打破各块之间的数据依赖关系等措施,克服了原始方案不适合硬件流水并行处理的限制,提高了编码的效率和稳定性,从而既保障了算法的视频质量,又使新的硬件实现方案更符合实际应用需求。实验结果表明,该算法优化方案能够有效改善实际面向低延时浅压缩场景下的编码效果。展开更多
基金This work was partially supported by the National Natural Science Foundation of China under Grants No.11161140319,No.61001188,the Specialized Research Fund for the Doctoral Program of Higher Education under Grant No.20101101110020,the Fund for Basic Research from Beijing Institute of Technology under Grant No.20120542011,the Fund for Beijing Higher Education Young Elite Teacher Project under Grant No.YETP1202
文摘Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.The multichannel audio is represented with 3-order tensor space and is decomposed into core tensor with three factor matrices in the way of channel,time and frequency.Only the truncated core tensor is transmitted which will be multiplied by the pre-trained factor matrices to reconstruct the original tensor space.Objective and subjective experiments have been done to show a very noticeable compression capability with an acceptable output quality.The novelty of the proposed compression method is that it enables both high compression capability and backward compatibility with limited signal distortion to the hearing.
文摘A Hi Fi audio coding technology for ISDN and Internet is introduced. It is the ISO/MPEG Audio Layer III digital audio compression scheme coding at 64 kbit/s. First, the paper implements C language simulation according to the algorithm and gets satisfactory quality of the reconstructed music signal. The estimation of operation steps and simulation of decoder finished by a TMS 320C548 simulator are presented. The result is the same as that of the C language simulation.
文摘Recently, several digital watermarking techniques have been proposed for hiding data in the frequency domain of audio files in order to protect their copyrights. In general, there is a tradeoff between the quality of watermarked audio and the tolerance of watermarks to signal processing methods, such as compression. In previous research, we simultaneously improved the performance of both by developing a multipurpose optimization problem for deciding the positions of watermarks in the frequency domain of audio data and obtaining a near-optimum solution to the problem. This solution was obtained using a wavelet transform and a genetic algorithm. However, obtaining the near-optimum solution was very time consuming. To overcome this issue essentially, we have developed an authentication method for digital audio using a discrete wavelet transform. In contrast to digital watermarking, no additional information is inserted into the original audio by the proposed method, and the audio is authenticated using features extracted by the wavelet transform and characteristic coding in the proposed method. Accordingly, one can always use copyright-protected original audio. The experimental results show that the method has high tolerance of authentication to all types of MP3, AAC, and WMA compression. In addition, the processing time of the method is acceptable for every-day use.
文摘为了面向低延时的浅压缩场景提供更加适配的编码方案,并降低硬件实现成本,提出一种基于数字音视频编解码技术标准(Audio Video coding Standard,AVS)浅压缩算法的帧内预测模式优化以及快速率失真优化算法。该算法通过减少原有算法帧内预测所需的预测循环次数,以及打破各块之间的数据依赖关系等措施,克服了原始方案不适合硬件流水并行处理的限制,提高了编码的效率和稳定性,从而既保障了算法的视频质量,又使新的硬件实现方案更符合实际应用需求。实验结果表明,该算法优化方案能够有效改善实际面向低延时浅压缩场景下的编码效果。