期刊文献+

基于DMN的跨模态目标实例分割方法

Cross-Modal Target Instance Segmentation Method Based on DMN
下载PDF
导出
摘要 在DMN的基础上提出一种跨模态目标实例分割方法,旨在结合自然语言表达,利用不同模态信息从图像中分割所描述对象。在视觉特征提取网络DPN92中引入CBAM注意力机制,关注空间和通道上的有用信息;将BN层替换为联合BN和FRN的正则化,减少批次量和通道数对提取特征网络性能的影响,提高网络的泛化能力;在三个通用数据集ReferIt、GRef和UNC上进行仿真实验。实验结果显示,提出的引入CBAM注意力机制和联合正则化改进模型在mIou评价指标上,ReferIt和GRef上分别提升了1.85和0.52个百分点,在UNC三个验证集上分别提升了1.98、2.22和2.75个百分点。表明改进模型在预测准确度方面优于已有模型。 A cross-modal target instance segmentation method based on DMN,which aims to segment the objects described by natural language expression from the image,is proposed in this paper.First of all,the CBAM attention mechanism is introduced in the visual feature extraction network DPN92,which pays attention to the useful information in space and channel.Secondly,the BN layer is replaced with the normalization of the union of BN and FRN,which reduces the influ-ence batch volume and number of channels in the performance of the extraction characteristic network,and improves the generalization ability of the network.Finally,the proposed scheme is simulated based on three common datasets,ReferIt,GRef and UNC.Simulation results indicate that the mIou evaluation index,which the introduction of CBAM attention mechanism and the joint normalization model,is improved by 1.85 and 0.52 percentage points respectively on the formal two datasets,and is improved by 1.98,2.22 and 2.75 percentage points on the three validation sets split by UNC,and the improved model is better than the existing model.
作者 熊珺瑶 宋振峰 王蓉 XIONG Junyao;SONG Zhenfeng;WANG Rong(School of Information Technology and Network Security,People’s Public Security University of China,Beijing 100038,China)
出处 《计算机工程与应用》 CSCD 北大核心 2022年第20期117-123,共7页 Computer Engineering and Applications
基金 国家自然科学基金面上项目(62076246)。
关键词 跨模态 自然语言处理 目标实例分割 注意力机制 联合正则化 cross-modal natural language processing target instance segmentation attention mechanisms union normalization
  • 相关文献

参考文献2

二级参考文献9

  • 1Kittler J, lllingworth J. Minimum Error Thresholding [J]. Pattern Recognition (S0031-3203), 1986, 19(1): 41-47.
  • 2Pal N R, Pal S K. Image Model, Poisson Distribution and Object Extraction [J]. International Journal of Pattern Recognition and Artificial Intelligence (S0218-0014), 1991, 5(3): 459-483.
  • 3Li C H, Lee C K. Minimum Cross Entropy Thresholding [J]. Pattern Recognition (S0031-3203 ), 1993, 26(4): 617-625.
  • 4C-I Chang, K Chert, J Wang, M LG Althouse. A Relative Entropy-Based Approach to Image Thresholding [J]. Pattern Recognition (S0031-3203), 1994, 27(9): 1275-1289.
  • 5Lee S S, Homg S-J, Tsai H-R. Entropy Thresholding and Its Parallel Algorithm on the Reconfigurable Array of Processors with Wide Bus Networks [J]. IEEE Transactions on Image Processing (S 1057-7149), 1999, 8(9): 1229-1242.
  • 6Ramac L C, Varshney P K. Image Thresholding Based on Ali-Silvey Distance Measures [J]. Pattern Recognition (S0031-3203), 1997, 30(7): 1161-1174.
  • 7Lee S-K, Lo C-S, Wang C-M, Chung P-C. A Computer-Aided Design Mammography Screening System for Detection And Classification Of Microcalcifications [J]. International Journal of Medical Informatics (S 1386-5056), 2000, 60(1): 29-57.
  • 8C-I Chang, Y Du, J Wang, S-M Guo, P D Thouin. Survey and Comparative Analysis of Entropy and Relative Entropy Thresholding Techniques [J]. Vision, Image and Signal Processing, IEE Proceedings, (S 1350-245X), 2006, 153(6): 837-850.
  • 9I El-Feghi, N Adem. Improved Co-occurrence Matrix as a Feature Space for Relative Entropy-based Image Thresholding [C]// Proceedings of the Computer Graphics, Imaging and Visualization, 10.1109/CGIV.2007.49, 314-320.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部