Research on the identification of desert plants in Xinjiang based on CNN and Swin Transformer
Abstract Under the combined influence of climate and environment, the desert regions of Xinjiang are prone to drought disasters that affect agricultural and animal husbandry production and hinder the sustainable development of the regional economy. Identifying Xinjiang desert plants is the basis on which plant researchers assess plant growth, and a prerequisite for ecological conservation research and the implementation of management measures. The task is difficult because images of Xinjiang desert plants exhibit inter-class similarity, complex backgrounds and imbalanced class sizes. To improve recognition accuracy, localize important local features accurately and take complex global information into account, this paper proposes a plant image recognition method that fuses a convolutional neural network (CNN) with a Swin Transformer network. The method combines the strength of CNNs in extracting local features with the strength of Swin Transformer in capturing global representations. An improved Convolutional Block Attention Module (CBAM) is embedded in the CNN branch to extract discriminative local key features, and the Focal Loss function is used to address the sample imbalance. Experimental results show that, on the Xinjiang desert plant dataset, the proposed fusion method extracts image features more fully than single-branch networks, reaching a recognition accuracy of 97.99%, with precision, recall and F1 score all better than those of existing methods. Visualization analysis and confusion matrices further corroborate the effectiveness of the method.
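The abstract names Focal Loss as the remedy for class imbalance but gives no formula or hyperparameters. As a rough illustration only, the sketch below implements the commonly used multi-class form, FL = -alpha_t * (1 - p_t)^gamma * log(p_t), in plain Python. The function names, the default gamma = 2.0, the optional per-class alpha weights and the example logits are all assumptions made for this illustration and are not taken from the paper.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0, alpha=None):
    """Focal loss for one sample (hypothetical helper, not the authors' code).

    logits: raw scores, one per class
    target: index of the true class
    gamma:  focusing parameter (assumed value; gamma = 0 gives cross-entropy)
    alpha:  optional list of per-class weights (assumed, e.g. larger for rare classes)
    """
    p_t = max(softmax(logits)[target], 1e-12)  # clamp to avoid log(0)
    weight = 1.0 if alpha is None else alpha[target]
    return -weight * (1.0 - p_t) ** gamma * math.log(p_t)


def cross_entropy(logits, target):
    """Plain cross-entropy, expressed as focal loss with gamma = 0."""
    return focal_loss(logits, target, gamma=0.0)


# An easy sample (confidently correct) and a hard sample (true class scored low).
easy_logits, hard_logits = [4.0, 0.0, 0.0], [0.0, 1.0, 0.0]
for name, logits in (("easy", easy_logits), ("hard", hard_logits)):
    ce = cross_entropy(logits, 0)
    fl = focal_loss(logits, 0)
    print(f"{name}: cross-entropy={ce:.4f} focal={fl:.4f} ratio={fl / ce:.4f}")
```

The printed ratio is the modulating factor (1 - p_t)^gamma: it is close to zero for the easy sample and much larger for the hard one, so well-classified samples from abundant classes contribute little to the total loss and training concentrates on hard, often under-represented, samples.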
Authors XU Chuntao; QIAN Yurong; FAN Yingying; DU Zhenyu; SHAO Youpeng (School of Software, Xinjiang University, Urumqi 830000, Xinjiang, China; Key Laboratory of Signal Detection and Processing, Xinjiang Uygur Autonomous Region, Urumqi 830046, Xinjiang, China; Key Laboratory of Software Engineering, Xinjiang University, Urumqi 830000, Xinjiang, China; School of Information Management, Xinjiang University of Finance and Economics, Urumqi 830012, Xinjiang, China)
Source Microelectronics & Computer, 2023, No. 6, pp. 33-41 (9 pages)
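Funding National Natural Science Foundation of China (61966035); International Cooperation Project of the Autonomous Region Department of Science and Technology (2020E01023); National Natural Science Foundation of China Joint Fund, Key Project (U1803261); Xinjiang University of Finance and Economics University-Level Research Fund Project (2017XYB015).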
Keywords plant identification; convolutional neural network; Swin Transformer; attention mechanism