高通量图像编码中的端到端量化参数优化方法

End-to-end quantization parameter optimization for high-throughput image coding in real time

下载PDF

导出

摘要高通量的图像传输可以获得更多的图像细节信息.在传输带宽受限和图像间时域相关性很低的条件下,图像编码的输出受到实时性和码率两方面的约束.有损图像编码的量化参数对输出码率和图像质量都有非常重要的影响.该文不同于基于图像复杂性特征的量化参数确定方法,提出了端到端的卷积神经网络深度模型、直接从图像预测最佳量化系数的方法.考虑编码实时性和算法泛化能力,在Inria aerial image labeling dataset数据集上训练,得到了优化的网络结构.实验结果表明,该文提出的端到端量化参数预测方法相比较相位一致性参数、SATD、图像信息熵等图像特征参数方法,码率预测准确度相较线性回归方法提高了10.31%,相较多层感知器方法提高了8.57%. In recent years,high throughput image transmission has become widely used due to its ability to obtain graphic details.Under the circumstances of limited bandwidth and low temporal correlation between images,image coding strategies should satisfy with the real-time requirement and bandwidth available.In lossy compression algorithms,the quantization parameter(QP)affects greatly both output bitrate and image quality.Different from QP optimization strategies based on numerical image features such as the sum of absolute transformed difference(SATD),phase congruency(PC),structural similarity(SSIM),a CNN-based end-to-end rate control solution was proposed,which predict the optimal QP directly from images.Trained on Inria Aerial Image Labeling Dataset,the refined rate control model is robust under real-time scenes.Experimental results show that the proposed end-to-end rate control method can achieve the target bitrates by 10.31%bit rate accuracy(BRA)more accurately than the original rate control algorithms based on numerical image features.The proposed method also achieves 8.57%BRA gain compared to multilayer perceptron(MLP)method.

作者李铮徐永昌乾方圆艾浩军 LI Zheng;XU Yongchang;QIAN Fangyuan;AI Haojun(School of Mathematical Sciences,Tongji University,Shanghai 200092,China;National School of Cyber Security,Wuhan University,Wuhan 430072,China)

机构地区同济大学数学科学学院武汉大学国家网络安全学院

出处《华中师范大学学报（自然科学版）》 CAS CSCD 北大核心 2022年第6期963-969,共7页 Journal of Central China Normal University：Natural Sciences

基金国家自然科学基金项目(61971316).

关键词图像编码码率控制量化参数机器学习端到端 image coding bit-rate control quantization parameter machine learning end to end

分类号 TP3-0 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1高芳,舒远仲,朱雯雯.基于改进Deeplabv3+的遥感图像语义分割研究[J].南昌航空大学学报（自然科学版）,2022,36(2):24-31. 被引量：4
2王一琛,刘慧,王海涛,钱育蓉.面向遥感图像的建筑物轻量化语义分割方法[J].计算机工程与设计,2022,43(9):2646-2653.
3吴从中,董浩,方静.基于注意力机制的自适应滤波遥感图像分割网络[J].计算机工程与科学,2022,44(11):2010-2018. 被引量：4
4张立峰,王智.基于递归图的两相流流动特性分析与流型识别[J].计量学报,2022,43(11):1438-1444. 被引量：4
5郭杨亮,马瑞娟,韩子清.基于多尺度条件生成对抗网络(MSR-cGAN)的高分辨率遥感图像目标区域检测[J].河南科学,2022,40(9):1377-1383.
6李文书,李绅皓,赵朋.基于注意力门残差网络的遥感影像道路提取[J].智能计算机与应用,2022,12(10):31-35. 被引量：1
7郑仁鹏,郑雪钦,黄维彪.采用改进K-means算法的退役动力电池快速分选方法[J].厦门理工学院学报,2022,30(5):74-81. 被引量：1
8白岩.微课在聋校初中数学中的运用研究[J].世纪之星—初中版,2022(12):178-180.
9樊文杰.漾濞M S6.4地震前震源机制一致性参数演化特征[J].大地测量与地球动力学,2023,43(1):82-88.
10武耀星.尊重学生,爱护学生—关于高中班主任管理的有效性探讨[J].中华活页文选（高中版）,2022(20):12-14.

华中师范大学学报（自然科学版）

2022年第6期

浏览历史

内容加载中请稍等...

高通量图像编码中的端到端量化参数优化方法

相关作者

相关机构

相关主题

浏览历史