Interpretability of Neural Networks Based on Game-theoretic Interactions

导出

摘要 This paper introduces the system of game-theoretic interactions,which connects both the explanation of knowledge encoded in a deep neural networks(DNN)and the explanation of the representation power of a DNN.In this system,we define two gametheoretic interaction indexes,namely the multi-order interaction and the multivariate interaction.More crucially,we use these interaction indexes to explain feature representations encoded in a DNN from the following four aspects:(1)Quantifying knowledge concepts encoded by a DNN;(2)Exploring how a DNN encodes visual concepts,and extracting prototypical concepts encoded in the DNN;(3)Learning optimal baseline values for the Shapley value,and providing a unified perspective to compare fourteen different attribution methods;(4)Theoretically explaining the representation bottleneck of DNNs.Furthermore,we prove the relationship between the interaction encoded in a DNN and the representation power of a DNN(e.g.,generalization power,adversarial transferability,and adversarial robustness).In this way,game-theoretic interactions successfully bridge the gap between“the explanation of knowledge concepts encoded in a DNN”and"the explanation of the representation capacity of a DNN"as a unified explanation.

作者 Huilin Zhou Jie Ren Huiqi Deng Xu Cheng Jinpeng Zhang Quanshi Zhang

机构地区 School of Electronic Information and Electrical Engineering XLAB

出处《Machine Intelligence Research》 EI CSCD 2024年第4期718-739,共22页 机器智能研究（英文版）

基金 supported by National Science and Technology Major Project(No.2021ZD0111602) the National Nature Science Foundation of China(Nos.62276165 and U19B2043) Shanghai Natural Science Foundation,China(Nos.21JC1403800 and 21ZR1434600).

关键词 Model interpretability and transparency explainable AI game theory INTERACTION deep learning.

分类号 TP183 [自动化与计算机技术—控制理论与控制工程] TP391.3 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1罗余,王建波,李平,杜占玮,许小可.基于多阶邻居传播度量和拓扑特征的高影响力节点识别[J].中国科学：信息科学,2024,54(4):944-959.
2Haodong Wang,Xiaoxu Lai,Chi Chen,Pei Shi,Houzhao Wan,Hao Wang,Xingguang Chen,Dan Sun.Novel 2D bifunctional layered rare-earth hydroxides@GO catalyst as a functional interlayer for improved liquid-solid conversion of polysulfides in lithium-sulfur batteries[J].Chinese Chemical Letters,2024,35(5):434-440.
3刘博,任建新,吴翔宇,陈帅东,毛雅亚,赵利.Neural network equalization based on delta-sigma modulation[J].Chinese Optics Letters,2024,22(4):8-13.
4Fangfang Shan,Mengyao Liu,Menghan Zhang,Zhenyu Wang.Fake News Detection Based on Cross-Modal Message Aggregation and Gated Fusion Network[J].Computers, Materials & Continua,2024,80(7):1521-1542.

Machine Intelligence Research

2024年第4期

浏览历史

内容加载中请稍等...

Interpretability of Neural Networks Based on Game-theoretic Interactions

相关作者

相关机构

相关主题

浏览历史