Self-adaptive semantic awareness network for blind image quality assessment


Abstract. Objective: Blind image quality assessment (BIQA) has important practical significance in the field of image quality control. Although current BIQA methods achieve reasonable results on naturally distorted images, their accuracy still leaves room for improvement. Method: A BIQA method based on a self-adaptive semantic awareness network (SSA-Net) is proposed, which improves prediction accuracy by understanding the content of the distorted image and perceiving the type of distortion. First, a deep convolutional neural network (DCNN) extracts the semantic features of each stage, and a multi-head position attention (MPA) module strengthens the understanding of image content by aggregating long-range semantic information from the feature maps. Next, a self-adaptive feature awareness (SFA) module based on multi-scale kernels perceives the distortion type and, combined with the image content, captures the global and local distortions of the image. Finally, a multi-level supervision regression (MSR) network obtains the prediction score by using low-level semantic features to assist the high-level semantic features. Result: The method is compared with 11 different methods on 7 databases. On the four naturally distorted image databases LIVEC (LIVE in the Wild Image Quality Challenge), BID (blurred image database), KonIQ-10k (Konstanz authentic image quality 10k database), and SPAQ (smartphone photography attribute and quality), it reaches Spearman rank order correlation coefficient (SRCC) values of 0.867, 0.877, 0.913, and 0.915, respectively, the best among all compared methods. It also ranks in the top two by SRCC on two synthetically distorted image databases. The experimental results show that the method outperforms other advanced methods on naturally distorted image quality assessment databases. Conclusion: By combining image content understanding with the perception of different distortion types, the method adapts better to the distortions of natural images and improves assessment accuracy.

Objective The rapid development of imaging technology has been accompanied by continuous updates in acquisition equipment and related technologies over the past few decades. However, image quality is susceptible to interference at various stages, including acquisition, processing, transmission, and storage, which introduces different types (e.g., JPEG2000 compression, JPEG compression, white Gaussian noise, Gaussian blur, fast fading distortion, and contrast distortion) and degrees of distortion that degrade image quality. Therefore, blind image quality assessment (BIQA) has practical significance in the field of image quality control and is helpful for subsequent image processing and analysis. Although many methods have achieved reasonable results in the blind quality assessment of degraded images, their accuracy warrants further improvement when dealing with the distortions of natural images. The challenges in assessing natural image distortions include the following: 1) natural image distortions are much more complex than synthetic image distortions because they contain not only global distortion (e.g., out of focus and Gaussian noise) but also local distortion (e.g., overexposure and motion blur), which increases the difficulty of image quality assessment; 2) among the semantic features extracted by a deep convolutional neural network (DCNN), the lower-level features contain less semantic information and cannot provide a comprehensive understanding of the image, which hinders networks from coping with the distortions of natural images with diverse contents; and 3) although the high-level semantic features obtained by a DCNN contain rich semantic information, their lack of local detail easily makes the whole network overlook local distortions. To address these problems, this paper proposes a blind image quality assessment method called the self-adaptive semantic awareness network (SSA-Net).

Method First, images from different databases are not uniform in size and tend to be large, whereas deep-learning-based networks usually require input images of a fixed size. Therefore, each input image is randomly cropped 25 times to represent the content of the original image. Second, to enable the network to extract rich semantic features, a 50-layer deep residual network (ResNet-50) with weights pre-trained on ImageNet is used for feature extraction and captures the semantic features of the images at each stage. Third, a multi-head position attention (MPA) module is designed to address the content diversity of naturally degraded images. By adding absolute position encoding to the multi-head attention to acquire fixed distortion position information, it improves the understanding of image content and the accuracy of the subsequent perception of distortion types. Fourth, a self-adaptive feature awareness (SFA) module is presented to address the diversity of distortion types in naturally degraded images. This module combines the understanding of image content with pooling kernels of different sizes to capture the global and local distortions in images. Fifth, a multi-level supervision regression (MSR) network with learnable parameters, which uses lower-level semantic features to assist the higher-level semantic features, is proposed to derive prediction scores that are in line with the human visual system.

Result Experiments are conducted on 7 databases with 11 different methods for comparison. The proposed method achieves the best performance on four natural distortion image databases, with Spearman rank order correlation coefficient (SRCC) values of 0.867, 0.877, 0.913, and 0.915 on the LIVE in the Wild Image Quality Challenge (LIVEC) database, the blurred image database (BID), the Konstanz authentic image quality 10k database (KonIQ-10k), and the smartphone photography attribute and quality (SPAQ) database, respectively. It also obtains the highest Pearson linear correlation coefficient (PLCC) values on these databases: 0.886, 0.881, 0.923, and 0.921. On two synthetic distortion image databases, the Laboratory for Image & Video Engineering (LIVE) database and the categorical subjective image quality (CSIQ) database, its SRCC values rank in the top two. In cross-validation, SSA-Net achieves competitive results on several natural distortion image quality databases and reasonable results across synthetic and natural image quality databases. SSA-Net also shows better generalization performance than the self-adaptive hyper network and the visual compensation restoration network on the Waterloo Exploration database. The experimental results show that the proposed method outperforms the state-of-the-art methods on natural distortion image quality assessment databases and demonstrates stronger generalization performance.

Conclusion The proposed method acquires accurate image distortion information by combining the understanding of image content with the perception of different distortion types. The network fuses information from different stages through an improved deep supervision mechanism and learnable parameters, which lets it adapt efficiently to the distortions of natural images and improves image quality assessment accuracy.
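The results above are reported as SRCC and PLCC between predicted quality scores and subjective scores. As a rough illustration of what these two numbers measure, the following is a minimal pure-Python sketch; it is not the authors' evaluation code, and the function names and the sample scores are hypothetical. Some evaluation protocols fit a nonlinear mapping to the predictions before computing PLCC; the abstract does not say whether that is done, and this sketch omits it.

```python
from math import sqrt
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson linear correlation coefficient (PLCC) of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the mean of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared = (start + end) / 2 + 1  # mean of ranks start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = shared
        start = end + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman rank order correlation coefficient (SRCC): PLCC of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical predicted scores and mean opinion scores for four images.
predicted = [0.2, 0.5, 0.4, 0.9]
subjective = [1.0, 3.0, 2.5, 4.8]
srcc = spearman(predicted, subjective)
plcc = pearson(predicted, subjective)
```

SRCC depends only on the ordering of the scores, so it measures prediction monotonicity, while PLCC measures how close the relationship is to linear. In the hypothetical data above the two score lists are ordered identically, so the SRCC is 1 even though the PLCC is not.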

Authors: Chen Jian (陈健); Wan Jiaze (万佳泽); Lin Li (林丽); Li Zuoyong (李佐勇) (School of Electronic, Electrical Engineering and Physics, Fujian University of Technology, Fuzhou 350118, China; Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University), Fuzhou 350121, China)

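Affiliations: School of Electronic, Electrical Engineering and Physics, Fujian University of Technology; Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University)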

Source: Journal of Image and Graphics (《中国图象图形学报》), indexed in CSCD and the Peking University Core Journals list, 2023, No. 11, pp. 3400-3414 (15 pages)

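Funding: National Natural Science Foundation of China (61972187); Natural Science Foundation of Fujian Province (2020J02024, 2022J01952); Open Project of the Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University) (MJUKF-IPIC202110).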

Keywords: image quality assessment (IQA); blind image quality assessment (BIQA); deep learning; self-adaptive semantic awareness network (SSA-Net); multi-level supervision regression (MSR)

Classification number: TP391.41 [Automation and Computer Technology: Computer Application Technology]

