Visual Superordinate Abstraction for Robust Concept Learning

导出

摘要 Concept learning constructs visual representations that are connected to linguistic semantics, which is fundamental to vision-language tasks. Although promising progress has been made, existing concept learners are still vulnerable to attribute perturbations and out-of-distribution compositions during inference. We ascribe the bottleneck to a failure to explore the intrinsic semantic hierarchy of visual concepts, e.g., {red, blue,···} ∈“color” subspace yet cube ∈“shape”. In this paper, we propose a visual superordinate abstraction framework for explicitly modeling semantic-aware visual subspaces(i.e., visual superordinates). With only natural visual question answering data, our model first acquires the semantic hierarchy from a linguistic view and then explores mutually exclusive visual superordinates under the guidance of linguistic hierarchy. In addition, a quasi-center visual concept clustering and superordinate shortcut learning schemes are proposed to enhance the discrimination and independence of concepts within each visual superordinate. Experiments demonstrate the superiority of the proposed framework under diverse settings, which increases the overall answering accuracy relatively by 7.5% for reasoning with perturbations and 15.6% for compositional generalization tests.

作者 Qi Zheng Chao-Yue Wang Dadong Wang Da-Cheng Tao

机构地区 University of Sydney JD Explore Academy DATA

出处《Machine Intelligence Research》 EI CSCD 2023年第1期79-91,共13页 机器智能研究（英文版）

基金 supported in part by the Australian Research Council(ARC)(Nos.FL-170100117,DP-180103424,IC-190100031 and LE-200100049).

关键词 Concept learning visual question answering weakly-supervised learning multi-modal learning curriculum learning

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1陈春芳,边小勇,费雄君,杨博,张晓龙.弱监督多示例子概念学习的遥感场景分类[J].小型微型计算机系统,2022,43(1):76-83. 被引量：1
2Gustavo de Bem Silveira,Alexandre Pastoris Muller,Ricardo Andrez Machado-de-Ávila,Paulo Cesar Lock Silveira.Advance in the use of gold nanoparticles in the treatment of neurodegenerative diseases:new perspectives[J].Neural Regeneration Research,2021,16(12):2425-2426.
3Rui LIU,Yahong HAN.Instance-sequence reasoning for video question answering[J].Frontiers of Computer Science,2022,16(6):93-101. 被引量：1
4段立娟,孙启超,乔元华,陈军成,崔国勤.基于注意力感知和语义感知的RGB-D室内图像语义分割算法[J].计算机学报,2021,44(2):275-291. 被引量：16
5Christina Stoiber,Davide Ceneda,Markus Wagner,Victor Schetinger,Theresia Gschwandtner,Marc Streit,Silvia Miksch,Wolfgang Aigner.Perspectives of visualization onboarding and guidance in VA[J].Visual Informatics,2022,6(1):68-83.
6Hugo Ladret,Laurent Perrinet.AB009.Learning dynamics in a neural network model of the primary visual cortex[J].Annals of Eye Science,2019(1):184-184.
7Zhongyi HAN,Le-Wen CAI,Wang-Zhou DAI,Yu-Xuan HUANG,Benzheng WEI,Wei WANG,Yilong YIN.Abductive subconcept learning[J].Science China(Information Sciences),2023,66(2):110-122.
8Dis Colon Rectum 2023年1期摘要[J].中华胃肠外科杂志,2023,26(1).
9Fei-Long Chen,Du-Zhen Zhang,Ming-Lun Han,Xiu-Yi Chen,Jing Shi,Shuang Xu,Bo Xu.VLP:A Survey on Vision-language Pre-training[J].Machine Intelligence Research,2023,20(1):38-56. 被引量：5
10An-An Liu,Xiaowen Wang,Ning Xu,Junbo Guo,Guoqing Jin,Quan Zhang,Yejun Tang,Shenyuan Zhang.A review of feature fusion-based media popularity prediction methods[J].Visual Informatics,2022,6(4):78-89.

Machine Intelligence Research

2023年第1期

浏览历史

内容加载中请稍等...

Visual Superordinate Abstraction for Robust Concept Learning

相关作者

相关机构

相关主题

浏览历史