No-Reference Screen Content Image Quality Assessment Based on Edge Assistance and Multi-Scale Transformer
Abstract Unlike natural images captured from real-world scenes, screen content images (SCIs) are synthetic images, typically composed of computer-generated text, graphics, animations, and other multimedia content. Existing SCI quality assessment methods usually fail to fully account for the influence of edge structure information and global context information on the perceived quality of SCIs. To address this, the paper proposes a no-reference SCI quality assessment model based on edge assistance and a multi-scale Transformer. First, a Laplacian of Gaussian (LoG) operator is used to construct an edge structure map composed of the high-frequency information of the distorted SCI. A convolutional neural network (CNN) then extracts and fuses multi-scale features from the input distorted SCI and its corresponding edge structure map, so that the edge structure information provides additional information gain for model training. In addition, a Transformer-based multi-scale feature encoding module is built to better model, on top of the local features obtained by the CNN, the global context of image and edge features at different scales. Experimental results show that the proposed method outperforms existing no-reference and full-reference SCI quality assessment methods on the evaluation metrics and achieves higher consistency between objective predictions and subjective visual perception.
Authors CHEN Yu-zhong; CHEN You-kun; LIN Min-hu; NIU Yu-zhen (College of Computer and Data Science, Fuzhou University, Fuzhou, Fujian 350108, China; Fujian Key Laboratory of Network Computing and Intelligent Information Processing, Fuzhou University, Fuzhou, Fujian 350108, China; Big Data Intelligence Engineering Research Center of the Ministry of Education, Fuzhou, Fujian 350108, China)
Source Acta Electronica Sinica (《电子学报》), 2024, No. 7, pp. 2242-2256 (15 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
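Funding National Natural Science Foundation of China (No. U21A20472, No. 61972097); National Key Research and Development Program of China (No. 2021YFB3600503); Fujian Provincial Science and Technology Major Project (No. 2021HZ022007); Natural Science Foundation of Fujian Province (No. 2021J01612, No. 2020J01494); University-Industry Cooperation Project of the Fujian Provincial Department of Science and Technology (No. 2021H6022).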
Keywords no-reference screen content image quality assessment; Laplacian of Gaussian; convolutional neural network; Transformer; multi-scale features
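
The first step described in the abstract, building an edge structure map from the high-frequency content of a distorted image with a Laplacian of Gaussian operator, can be illustrated with a minimal NumPy sketch. The abstract does not give the kernel scale, kernel size, or any post-processing, so `sigma=1.0`, the 3-sigma kernel radius, the reflect padding, and the function names `log_kernel` and `edge_structure_map` are assumptions made here for illustration and are not the authors' implementation.

```python
import numpy as np


def log_kernel(sigma: float = 1.0) -> np.ndarray:
    """Sample a Laplacian-of-Gaussian kernel on a square grid (assumed 3-sigma radius)."""
    radius = int(np.ceil(3 * sigma))
    ax = np.arange(-radius, radius + 1, dtype=np.float64)
    xx, yy = np.meshgrid(ax, ax)
    r2 = xx ** 2 + yy ** 2
    # Analytic LoG: ((x^2 + y^2 - 2*sigma^2) / sigma^4) * G(x, y)
    gauss = np.exp(-r2 / (2 * sigma ** 2)) / (2 * np.pi * sigma ** 2)
    kernel = (r2 - 2 * sigma ** 2) / sigma ** 4 * gauss
    # Shift to zero sum so that flat regions produce no response.
    return kernel - kernel.mean()


def edge_structure_map(gray: np.ndarray, sigma: float = 1.0) -> np.ndarray:
    """Return the absolute LoG response of a 2-D grayscale image."""
    kernel = log_kernel(sigma)
    radius = kernel.shape[0] // 2
    padded = np.pad(gray.astype(np.float64), radius, mode="reflect")
    # One (k x k) window per pixel; the kernel is symmetric, so
    # correlation and convolution give the same result.
    windows = np.lib.stride_tricks.sliding_window_view(padded, kernel.shape)
    response = np.einsum("ijkl,kl->ij", windows, kernel)
    return np.abs(response)


if __name__ == "__main__":
    # Synthetic test image: a vertical step edge, loosely mimicking a text stroke.
    image = np.zeros((32, 32))
    image[:, 16:] = 1.0
    edges = edge_structure_map(image)
    print(edges.shape, float(edges.max()))
```

In the pipeline the abstract outlines, a map like this would be fed to the CNN alongside the distorted image; the multi-scale fusion and Transformer encoding stages are not sketched here because the abstract gives no architectural details.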