No-Reference Screen Content Image Quality Assessment Based on Edge Assistance and Multi-Scale Transformer

Chinese title: 基于边缘辅助和多尺度Transformer的无参考屏幕内容图像质量评估

Abstract: Unlike natural images captured from real-world scenes, screen content images (SCIs) are synthetic images, typically composed of computer-generated text, graphics, animations, and other multimedia content. Existing SCI quality assessment methods usually fail to fully consider the impact of image edge structure and global context on the perceived quality of SCIs. To address this, the paper proposes a no-reference SCI quality assessment model based on edge assistance and a multi-scale Transformer. First, an edge structure map consisting of the high-frequency information of the distorted SCI is constructed with the Laplacian of Gaussian (LoG) operator. A convolutional neural network (CNN) then extracts and fuses multi-scale features from the distorted SCI and its edge structure map, so that the edge structure provides additional information gain for model training. In addition, a Transformer-based multi-scale feature encoding module is built on top of the local features obtained by the CNN, to better model the global context of image and edge features at different scales. Experimental results show that the proposed method outperforms other existing no-reference and full-reference SCI quality assessment methods on the evaluation metrics and achieves higher consistency with subjective visual perception.
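The edge-map step named in the abstract can be illustrated with a minimal sketch of Laplacian of Gaussian filtering. The abstract does not give the filter scale, kernel size, border handling, or any post-processing, so `sigma`, `radius`, the replicated borders, the zero-mean correction, and the absolute value taken at the end are all assumptions made here for illustration; the function names `log_kernel` and `edge_structure_map` are hypothetical and not the authors' code.

```python
import math


def log_kernel(sigma: float, radius: int):
    """Sample the 2-D Laplacian of Gaussian on a (2*radius+1)^2 grid."""
    size = 2 * radius + 1
    kernel = [[0.0] * size for _ in range(size)]
    for y in range(-radius, radius + 1):
        for x in range(-radius, radius + 1):
            q = (x * x + y * y) / (2.0 * sigma ** 2)
            # LoG(x, y) = -(1 / (pi * sigma^4)) * (1 - q) * exp(-q)
            kernel[y + radius][x + radius] = -(1.0 - q) * math.exp(-q) / (math.pi * sigma ** 4)
    # Assumption: subtract the mean so that flat regions give a (near) zero response.
    mean = sum(map(sum, kernel)) / (size * size)
    return [[v - mean for v in row] for row in kernel]


def edge_structure_map(image, sigma: float = 1.0, radius: int = 3):
    """Return |LoG * image| for a grayscale image given as a list of rows."""
    h, w = len(image), len(image[0])
    kernel = log_kernel(sigma, radius)
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for dy in range(-radius, radius + 1):
                yy = min(max(i + dy, 0), h - 1)  # replicate border rows
                for dx in range(-radius, radius + 1):
                    xx = min(max(j + dx, 0), w - 1)  # replicate border columns
                    acc += kernel[dy + radius][dx + radius] * image[yy][xx]
            out[i][j] = abs(acc)  # assumption: keep response magnitude only
    return out


# Toy input: a 12x12 image with a vertical step edge between columns 5 and 6.
step_image = [[0.0] * 6 + [255.0] * 6 for _ in range(12)]
edge_map = edge_structure_map(step_image)
print([round(v, 2) for v in edge_map[5]])
```

The response is close to zero in the flat regions and large next to the step, which is the high-frequency behaviour the abstract attributes to the edge structure map. In practice a library routine such as `scipy.ndimage.gaussian_laplace` would normally replace the explicit loops.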

Authors: CHEN Yu-zhong (陈羽中); CHEN You-kun (陈友昆); LIN Min-hu (林闽沪); NIU Yu-zhen (牛玉贞). College of Computer and Data Science, Fuzhou University, Fuzhou, Fujian 350108, China; Fujian Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou, Fujian 350108, China; Big Data Intelligence Engineering Research Center of the Ministry of Education, Fuzhou, Fujian 350108, China.

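Affiliations: College of Computer and Data Science, Fuzhou University; Fujian Key Laboratory of Network Computing and Intelligent Information Processing (Fuzhou University); Big Data Intelligence Engineering Research Center of the Ministry of Education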

Source: Acta Electronica Sinica (《电子学报》; indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2024, No. 7, pp. 2242-2256 (15 pages)

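Funding: National Natural Science Foundation of China (No. U21A20472, No. 61972097); National Key Research and Development Program of China (No. 2021YFB3600503); Fujian Provincial Science and Technology Major Project (No. 2021HZ022007); Natural Science Foundation of Fujian Province (No. 2021J01612, No. 2020J01494); University-Industry Cooperation Project of the Fujian Provincial Department of Science and Technology (No. 2021H6022).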

Keywords: no-reference screen content image quality assessment; Laplacian of Gaussian; convolutional neural network; Transformer; multi-scale features

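Classification (CLC): TN911.73 [Electronics and Telecommunications: Communication and Information Systems]; TP391 [Automation and Computer Technology: Computer Application Technology]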


