期刊文献+

融合金字塔和注意力机制的文物子图检索模型

A cultural relic sub-image retrieval algorithm based on folded multi-hollow pyramid pooling and attention mechanism
下载PDF
导出
摘要 随着中国文化研究工作的深入以及数字化文物采集技术的发展,文化资源数据和文化数字内容的数量也随之增长,如何对文化数据进行有效存储、管理以及检索成为一项重要的工作。针对文物图像数据检索任务中因尺度变化和特征选择造成检索精度不高的问题,提出了一种融合折叠多空洞金字塔池化和注意力机制的文物子图检索模型。模型为提高不同尺度的文物子图检索精度,通过在图像特征提取模块使用优化后的折叠多空洞金字塔池化提取图像的多尺度信息;为避免密集局部特征和无关特征影响检索准确率,使用注意力机制对局部特征进行关键特征选择。最后在所构建的文物数据集上进行了消融实验和性能对比实验,实验结果取得了良好的效果,mAP达到85.3%。 With the deepening of research on Chinese culture and the development of digital cultural relics collection technology,the amount of cultural resource data and cultural digital content has also increased,so how to store,manage and retrieve cultural data has become an important task.In order to solve the problem of low retrieval accuracy caused by scale change and feature selection in cultural relic image retrieval tasks,in this paper a cultural relic sub-image retrieval algorithm based on folded multi-hollow pyramid pooling and attention mechanism(FMHPPA)was proposed.In order to solve the problem of scale change in sub-image retrieval,FMHPPA model extracted multi-scale information from image feature extraction module by optimizing folded multi-hollow pyramid pooling.In order to avoid the impact of dense local features and irrelevant features on retrieval performance and accuracy,FMHPPAM model used attention mechanism to select key features for local features.The model ablation experiment and performance comparison experiment were carried out on the constructed sub-image dataset,and the experimental results achieved better results as the mAP reached 85.3%.
作者 彭宏 侯小刚 曾凡璐 吴萌 PENG Hong;HOU Xiaogang;ZENG Fanlu;WU Meng(Center for Ethnic and Folk Literature and Art Development,Ministry of Culture and Tourism,PRC,Beijing 100007,China;National Museum of China,Beijing 100006,China;School of Artificial Intelligence,Beijing University of Posts and Telecommunications,Beijing 100876,China;The Palace Museum in Beijing,Beijing 100006,China)
出处 《中国传媒大学学报(自然科学版)》 2024年第2期19-26,共8页 Journal of Communication University of China:Science and Technology
基金 国家重点研发计划项目(2022YFF0904304) 内蒙古自治区科技计划(2023YFSW0021)。
关键词 子图检索 空洞金字塔 注意力机制 特征选择 图像检索 sub-image retrieval hollow pyramid attention mechanism feature selection image retrieval
  • 相关文献

参考文献3

二级参考文献20

  • 1Zarghili A, Kharroubi J, Benslimane R. Arabo-Moresque decor images retrieval system based on spatial relationships indexing[J]. Journal of Cultural Heritage, 2008, 9(3):317-325.
  • 2Koutsoudis A, Pavlidis G, Arnaoutoglou F, et al. Qp: A tool for generating 3D models of ancient Greek pottery[J]. Journal of Cultural Heritage, 2009, 10(2) :281-295.
  • 3Koutsoudis A, Pavlidis G, Liami V, et al. 3D pottery content-based retrieval based on pose normalisation and segmentation[J]. Journal of Cultural Heritage, 2010, 11(3):329-338.
  • 4Koutsoudis A,Chamzas C. 3D pottery shape matching using depth map images[J]. Journal of Cultural Heritage, 2011, 12(2):128-133.
  • 5Zhu Q, Wang X, Keogh E, et al. An efficient and effective similarity measure to enable data mining of petroglyphs[J]. Data Mining and Knowledge Discovery, 2011, 23(1) :91-127.
  • 6Zhu Q, Wang X, Keogh E, et al. Augmenting the generalized hough transform to enable the mining of petroglyphs[C]//Proceedings of the 15th ACM SIGKDD International Conference On Knowledge Discovery And Data Mining, 2009, Los Angeles: ACM, 2009 : 1057-1066.
  • 7Kaplan C S. Computer generated islamic star patterns[C] // Bridges 2000 Proceedings: Mathematical Connections in Art, Music and Science, Winfield, 2000:105-112.
  • 8Kaplan C S,Salesin D H. Islamic star patterns in absolute geometry[J]. ACM Transactions on Graphics, 2004, 23(2) :97-119.
  • 9Michel J B, Shen Y K, Aiden A P, et al. Quantitative analysis of culture using millions of digitized books[J]. Science, 2011, 14: 176-182.
  • 10Larabi S. Textual description of shapes[J]. Journal of Visual Communication and Image Representation, 2009, 20(8):563-584.

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部