Ablation based correspondence analysis of objects in deep learning interpretable heatmap

Abstract

[Objective] Although deep learning has achieved remarkable success, criticism of its stability, interpretability, and fairness remains. In particular, it is well known to be a correspondence-driven machine learning method, and its trained models, even large models, carry some bias. According to the theory of stable learning, these biases, which are induced by false correspondences, cause problems of stability and interpretability. Correspondence analysis that discriminates true from false correspondences is therefore considered a promising solution. This work addresses the problem for the most widely used deep learning model, the convolutional neural network (CNN) image classification model, by analyzing the correspondence among the objects within a sample.

[Methods] Whereas existing research focuses primarily on extracting objects to produce an interpretable heatmap, we hold that the correspondence among the objects in the heatmap should also be studied, and we present ablation correspondence analysis (Ablation-CA). Ablation-CA first applies superpixel segmentation to an input image to obtain objects. The classification contributions of these objects are then quantified with a sensitivity analysis (SA) algorithm, and an interpretable heatmap is drawn from the contribution values. Synchronous ablation followed by correlation calculation yields correlation values among the objects in the heatmap. Finally, the correlated object groups are output as a list sorted by correlation value combined with classification importance.

[Results] Tests on pre-trained CNN classification models (Inception-v3) and standard image data (PASCAL VOC2012, CIFAR-10, and CSDN, among others) show that Ablation-CA can output heatmaps that carry more semantics and are easier to interpret than those of the main traditional methods, including local interpretable model-agnostic explanations (LIME), randomized input sampling for explanation (RISE), class activation mapping (CAM), saliency, deep Taylor decomposition (DTD), layer-wise relevance propagation (LRP), XRAI (a region-based attribution method), guided backpropagation (GBP), and integrated gradients (IG). The advantage is attributed mainly to the superpixel segmentation and the Monte Carlo method used in Ablation-CA. The experiments also show that Ablation-CA can effectively calculate the correspondence among objects for a CNN classification model, so an Ablation-CA heatmap carries labels of the linear correspondence among objects, which existing methods do not provide. Some room for improvement remains. In the experimental instances, Ablation-CA works well on images with simple content, whose linear relationships can be analyzed rapidly; for images with complex content and nonlinear correlation, its performance is not yet satisfactory. Because the size of the superpixel segmentation blocks is the hyperparameter that most affects the effectiveness of Ablation-CA, we tested the maximum correlation value of the top 10 images in PASCAL VOC2012 that contain linearly correlated objects. The correlation value first increases and then decreases as the number of segmentation blocks grows. For the test dataset, the maximum is reached when the number of blocks lies between 30 and 50, after which the value gradually decreases. Our analysis indicates that finer superpixel segmentation can remove some classification interference (related experiments show that the classification probability obtained after ablating interfering superpixels can even exceed that of the original image). Overly fine segmentation, however, damages the semantic information of the image objects and causes the model to misrecognize them. The number of segmentation blocks must therefore be kept within a reasonable range.

[Conclusions] This paper discusses one dimension of correspondence analysis (CA), namely the correspondence among objects within the samples of a CNN image classification model, which sets it apart from existing explanation methods for CNNs. Preliminary experiments demonstrate the feasibility and effectiveness of Ablation-CA. The correspondence it outputs may serve many applications, including false-correspondence discrimination for stable learning, image semantic analysis, object-relation drawing for the automatic generation of knowledge graphs, and regularization for model evolution. Several aspects of Ablation-CA still need improvement. To discover more and deeper correspondences in CNNs, more complex correlation algorithms should be added. The relationship between block number and correlation value needs to be explored so that semantics and analysis remain balanced. Faster algorithms are also required to cope with the high computational cost of large graphs.
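The pipeline described in the abstract (segment, score contributions by ablation, correlate ablation responses, rank object pairs) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the exact formulas, so everything below is an assumption made for illustration. `grid_segments` uses rectangular blocks in place of real superpixels, `toy_score` is a hypothetical stand-in for a CNN class probability, the pairwise correlation is taken as the Pearson correlation of the score drops of two objects under the same Monte Carlo ablation contexts, and the ranking key (absolute correlation times summed contribution) is one possible way to combine correlation with classification importance.

```python
import itertools
import numpy as np

def grid_segments(h, w, rows, cols):
    """Rectangular blocks used as a stand-in for superpixel labels."""
    r = np.arange(h) * rows // h
    c = np.arange(w) * cols // w
    return r[:, None] * cols + c[None, :]

def toy_score(image, segments, n_seg):
    """Hypothetical class score in (0, 1); a real CNN would be called here."""
    m = np.array([image[segments == k].mean() for k in range(n_seg)])
    # Assumed toy behaviour: segments 0 and 1 only matter jointly.
    logit = 4.0 * m[0] * m[1] + 1.5 * m[2] + 0.5 * m[3] - 2.0
    return 1.0 / (1.0 + np.exp(-logit))

def score_with(image, segments, keep):
    """Score after zeroing every segment whose `keep` flag is False."""
    return toy_score(image * keep[segments], segments, keep.size)

def contributions(image, segments, n_seg):
    """Sensitivity-style contribution: score drop when one segment is ablated."""
    full = score_with(image, segments, np.ones(n_seg, dtype=bool))
    out = np.empty(n_seg)
    for i in range(n_seg):
        keep = np.ones(n_seg, dtype=bool)
        keep[i] = False
        out[i] = full - score_with(image, segments, keep)
    return out

def pair_correlation(image, segments, contexts, i, j):
    """Pearson correlation of the score drops of i and j over shared contexts."""
    drops = np.empty((len(contexts), 2))
    for t, ctx in enumerate(contexts):
        both = ctx.copy()
        both[[i, j]] = True          # both objects present in the baseline
        base = score_with(image, segments, both)
        for col, seg in enumerate((i, j)):
            keep = both.copy()
            keep[seg] = False        # ablate one object of the pair
            drops[t, col] = base - score_with(image, segments, keep)
    return np.corrcoef(drops[:, 0], drops[:, 1])[0, 1]

rng = np.random.default_rng(0)
n_seg = 4
image = np.ones((8, 8))
segments = grid_segments(8, 8, 2, 2)

contrib = contributions(image, segments, n_seg)
heatmap = contrib[segments]                      # per-pixel contribution map
contexts = rng.random((64, n_seg)) < 0.5         # Monte Carlo ablation masks

corr = np.eye(n_seg)
pairs = list(itertools.combinations(range(n_seg), 2))
for i, j in pairs:
    corr[i, j] = corr[j, i] = pair_correlation(image, segments, contexts, i, j)

# Rank object pairs by correlation strength weighted by joint importance.
ranked = sorted(pairs,
                key=lambda p: abs(corr[p]) * (contrib[p[0]] + contrib[p[1]]),
                reverse=True)
print(ranked[0], np.round(corr, 2))
```

In a real setting the grid would be replaced by a superpixel algorithm such as SLIC and `toy_score` by the softmax output of the classifier for the target class. Each pair costs three model evaluations per context, which is consistent with the abstract's remark that faster algorithms are needed for large inputs.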

Authors: WANG Xiaodong (王晓东); ZHANG Gaiqun (张盖群); HU Yuqi (胡钰琪); LI Mengjue (李孟珏) (School of Information Science and Technology, Xiamen University Tan Kah Kee College, Zhangzhou 363105, China; School of Electronic Science and Engineering, Xiamen University, Xiamen 361005, China)

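Affiliations: School of Information Science and Technology, Xiamen University Tan Kah Kee College; School of Electronic Science and Engineering, Xiamen University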

Source: Journal of Xiamen University: Natural Science (《厦门大学学报（自然科学版）》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2024, No. 3, pp. 562-569 (8 pages)

Funding: Natural Science Foundation of Fujian Province (2023J01035); Natural Science Foundation of Xiamen (3502Z20227326).

Keywords: deep learning; correspondence; interpretability; ablation analysis; heatmap

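Classification: TN911.73 [Electronics and Telecommunications: Communication and Information Systems]; TP183 [Automation and Computer Technology: Control Theory and Control Engineering]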


