期刊文献+
共找到1,029篇文章
< 1 2 52 >
每页显示 20 50 100
引入上下文信息和Attention Gate的GUS-YOLO遥感目标检测算法 被引量:1
1
作者 张华卫 张文飞 +2 位作者 蒋占军 廉敬 吴佰靖 《计算机科学与探索》 CSCD 北大核心 2024年第2期453-464,共12页
目前基于通用YOLO系列的遥感目标检测算法存在并未充分利用图像的全局上下文信息,在特征融合金字塔部分并未充分考虑缩小融合特征之间的语义鸿沟、抑制冗余信息干扰的缺点。在结合YOLO算法优点的基础上提出GUS-YOLO算法,其拥有一个能够... 目前基于通用YOLO系列的遥感目标检测算法存在并未充分利用图像的全局上下文信息,在特征融合金字塔部分并未充分考虑缩小融合特征之间的语义鸿沟、抑制冗余信息干扰的缺点。在结合YOLO算法优点的基础上提出GUS-YOLO算法,其拥有一个能够充分利用全局上下文信息的骨干网络Global Backbone。除此之外,该算法在融合特征金字塔自顶向下的结构中引入Attention Gate模块,可以突出必要的特征信息,抑制冗余信息。另外,为Attention Gate模块设计了最佳的网络结构,提出了网络的特征融合结构U-Net。最后,为克服ReLU函数可能导致模型梯度不再更新的问题,该算法将Attention Gate模块的激活函数升级为可学习的SMU激活函数,提高模型鲁棒性。在NWPU VHR-10遥感数据集上,该算法相较于YOLOV7算法取得宽松指标mAP^(0.50)1.64个百分点和严格指标mAP^(0.75)9.39个百分点的性能提升。相较于目前主流的七种检测算法,该算法取得较好的检测性能。 展开更多
关键词 遥感图像 Global Backbone Attention Gate SMU U-neck
下载PDF
改进注意力机制嵌入PR-Net模型的水稻病害识别仿真
2
作者 路阳 刘鹏飞 +3 位作者 许思源 刘启旺 顾福谦 王鹏 《系统仿真学报》 CAS CSCD 北大核心 2024年第6期1322-1333,共12页
针对现有的CNN模型在水稻叶部病害的识别中准确率较低的问题,提出了一种结合并行结构和残差结构的混合卷积神经网络模型PRC-Net(parallel residual with coordinate attention network)。引入并行结构,提高卷积的感受野;结合残差结构,... 针对现有的CNN模型在水稻叶部病害的识别中准确率较低的问题,提出了一种结合并行结构和残差结构的混合卷积神经网络模型PRC-Net(parallel residual with coordinate attention network)。引入并行结构,提高卷积的感受野;结合残差结构,使特征信息完整的连续传递;在骨干模型PR-Net中嵌入改进的空间注意力机制,增强对不同尺度病斑特征信息的凝聚程度;为进一步提升病害识别的准确率,并减少模型的训练时间和推理时间,通过改变加权方式对模型结构进行优化。仿真结果表明:与InceptionResNetV2等分类模型相比,PRC-Net具有更少的训练参数、更短的训练时间和更高的识别精度,性能优于其他作物病害识别模型。 展开更多
关键词 水稻叶部病害 PRC-Net(parallel residual with coordinate attention network) 卷积神经网络 注意力机制 图像识别
下载PDF
基于深度学习提取时空信息的流域内库水位预测模型研究
3
作者 周兰庭 陈思思 孙永明 《水电能源科学》 北大核心 2024年第4期133-136,132,共5页
为了解决流域连通水库增多,库水位影响因素复杂且具有非平稳性,难以直接通过水文计算预测的问题,对流域水文站点日降雨序列进行分析,首先将时间序列经小波变换去噪,在此基础上采用最大信息系数(MIC)相关性分析筛选与日水位序列相关性,... 为了解决流域连通水库增多,库水位影响因素复杂且具有非平稳性,难以直接通过水文计算预测的问题,对流域水文站点日降雨序列进行分析,首先将时间序列经小波变换去噪,在此基础上采用最大信息系数(MIC)相关性分析筛选与日水位序列相关性,增加了输入时序降雨与预测水位相关的信息密度,并提出将强相关性序列输入引入Attention机制的长短期记忆(LSTM)预测模型,提高LSTM神经网络选择和提取序列特征的能力。以福建某流域站点实测日降雨序列为例进行试验,结果表明该方法的均方预测误差仅为0.1908,相比LSTM模型有更高的预测精度,为水库水情调度及防洪减灾管理提供了决策依据。 展开更多
关键词 库水位预测 相关性分析 小波变换 Attention机制 LSTM
下载PDF
基于FasterNet和YOLOv5改进的玻璃绝缘子自爆缺陷快速检测方法
4
作者 邬开俊 徐泽浩 单宏全 《高电压技术》 EI CAS CSCD 北大核心 2024年第5期1865-1876,共12页
为了实现对电力输电线路中绝缘子缺陷实时快速的巡检需求,提出了一种结合FasterNet-tiny和YOLOv5-s-v6.1网络模型改进的缺陷快速检测算法FasterNet-YOLOv5。首先引入参数量小推理速度更快的FasterNet网络替换原先的CSPDarkNet53主干网络... 为了实现对电力输电线路中绝缘子缺陷实时快速的巡检需求,提出了一种结合FasterNet-tiny和YOLOv5-s-v6.1网络模型改进的缺陷快速检测算法FasterNet-YOLOv5。首先引入参数量小推理速度更快的FasterNet网络替换原先的CSPDarkNet53主干网络,加快网络的检测速度。然后结合由GhostNetv2网络提出的解耦全连接注意力机制(decoupled fully connected,DFC),在主干特征提取网络中设计了DFC-FasterNet模块,模块中的DFC Attention机制可以在特征提取过程中增大感受野,提升网络的检测精度。最后针对玻璃绝缘子自爆缺陷目标较小和背景较复杂的情况,重新设计Neck模块,提出BiFPN-F特征融合模块,使网络更精确地定位绝缘子缺陷区域。实验结果表明:改进后的算法可以快速精准定位,其均值平均精度(mean average precision,mAP)达到93.3%,相较于改进前提升5.67%,检测速度达到45.7 Hz,较改进前提升近1倍。同时与最新的YOLOv8n和YOLOv7-tiny相比,改进后的FasterNet-YOLOv5在自爆缺陷上的检测精度和速度更具优势,该文所提算法能够更快速地对绝缘子及其自爆缺陷实时定位识别。 展开更多
关键词 缺陷检测 BiFPN-F FasterNet YOLOv5s DFC Attention PConv
下载PDF
一种采用记忆神经网络和曲线形状修正的负荷预测方法
5
作者 张家安 李凤贤 +1 位作者 王铁成 郝妍 《电力工程技术》 北大核心 2024年第1期117-126,共10页
针对分布式电源和新型负荷容量累积造成负荷影响因素多元化和不确定性特性增强的问题,文中提出一种采用记忆神经网络和曲线形状修正的负荷预测方法。在负荷峰值预测中,采用最大信息系数计算负荷峰值与影响因素的非线性相关性,实现对输... 针对分布式电源和新型负荷容量累积造成负荷影响因素多元化和不确定性特性增强的问题,文中提出一种采用记忆神经网络和曲线形状修正的负荷预测方法。在负荷峰值预测中,采用最大信息系数计算负荷峰值与影响因素的非线性相关性,实现对输入特征的筛选;综合考虑负荷峰值序列的长短期自相关性和输入特征与负荷峰值的不同程度相关性,结合Attention机制和双向长短时记忆(bidirectional long short-term memory,BiLSTM)神经网络建立负荷峰值预测模型。在负荷标幺曲线预测中,通过误差倒数法组合相似日和相邻日,建立负荷标幺曲线预测模型;针对预测偏差的非平稳特征,利用自适应噪声的完全集成经验模态分解和BiLSTM网络建立误差预测模型,对曲线形状进行修正。应用中国北方某城市的区域电网负荷数据为算例,验证了所提模型的有效性。 展开更多
关键词 超短期负荷预测 Attention机制 双向长短时记忆(BiLSTM)神经网络 负荷峰值 负荷标幺曲线 曲线形状修正
下载PDF
基于改进残差网络的油气柱高度预测
6
作者 杜睿山 程永昌 孟令东 《海南大学学报(自然科学版)》 CAS 2024年第1期19-29,共11页
针对目前油气柱高度预测技术局限于传统的地质方法且预测效果不太理想的现状,展开一种基于改进残差神经网络的油气柱高度预测的研究.该模型从断层解释和油藏解剖提取的圈闭结构化特征数据中提取特征信息,以估计油气柱高度.模型将原始残... 针对目前油气柱高度预测技术局限于传统的地质方法且预测效果不太理想的现状,展开一种基于改进残差神经网络的油气柱高度预测的研究.该模型从断层解释和油藏解剖提取的圈闭结构化特征数据中提取特征信息,以估计油气柱高度.模型将原始残差块中的串行连接网络变成多个并行连接的网络,可以在多个尺度上同时进行卷积再聚合,能提取到不同尺度的特征,使其变成一个稀疏性、高计算性能的网络结构;同时保留了网络中跳跃连接的结构,缓解了在深度神经网络中增加深度带来了梯度消失和网络退化的问题,通过直接将输入信息绕道传到输出,保护信息的完整性;并在模型的首层和尾层增加注意力模块,来捕获集中于某个局部信息,使模型其能更快地收敛.此外对机器学习中常用的RF和BP神经网络以及深度学习中CNN、GoogleNet、ResNet和ResNet+Atten在圈闭数据上的应用进行了比较和分析.实验结果表明,改进的ResNet对油气柱高度预测有更加准确的结果 . 展开更多
关键词 油气柱高度 ResNet GoogleNet Attention机制 预测
下载PDF
基于Coordinate Attention和空洞卷积的异物识别 被引量:1
7
作者 王春霖 吴春雷 +1 位作者 李灿伟 朱明飞 《计算机系统应用》 2024年第3期178-186,共9页
在我国工厂的工业化生产中,带式运输机占有重要的地位,但是在其运输物料的过程中,常有木板、金属管、大型金属片等混入物料中,从而对带式运输机的传送带造成损毁,引起巨大的经济损失.为了检测出传送带上的不规则异物,设计了一种新的异... 在我国工厂的工业化生产中,带式运输机占有重要的地位,但是在其运输物料的过程中,常有木板、金属管、大型金属片等混入物料中,从而对带式运输机的传送带造成损毁,引起巨大的经济损失.为了检测出传送带上的不规则异物,设计了一种新的异物检测方法.针对传统异物检测方法中存在的对于图像特征提取能力不足以及网络感受野相对较小的问题,我们提出了一种基于coordinate attention和空洞卷积的单阶段异物识别方法.首先,网络利用coordinate attention机制,使网络更加关注图像的空间信息,并对图像中的重要特征进行了增强,增强了网络的性能;其次,在网络提取多尺度特征的部分,将原网络的静态卷积变为空洞卷积,有效减少了常规卷积造成的信息损失;除此之外,我们还使用了新的损失函数,进一步提高了网络的性能.实验结果证明,我们提出的网络能有效识别出传送带上的异物,较好地完成异物检测任务. 展开更多
关键词 coordinate attention 异物检测 空洞卷积 损失函数 目标识别
下载PDF
基于深度学习域适应的飞机结冰图像气泡提取方法
8
作者 赵红梅 彭博 +1 位作者 周志宏 易贤 《南京航空航天大学学报》 CAS CSCD 北大核心 2024年第2期291-299,共9页
针对采用深度学习方法提取结冰显微图像中的气泡需要大量标注数据,但人工标注气泡任务较为困难的问题,提出了一种基于风格迁移网络CycleGAN和图像分割网络Attention U-Net的域适应提取方法。该方法通过程序模拟气泡形态生成的图像为源域... 针对采用深度学习方法提取结冰显微图像中的气泡需要大量标注数据,但人工标注气泡任务较为困难的问题,提出了一种基于风格迁移网络CycleGAN和图像分割网络Attention U-Net的域适应提取方法。该方法通过程序模拟气泡形态生成的图像为源域,结冰显微图像为目标域,通过CycleGAN将源域图像转为目标域风格,采用风格转换后的源域数据集训练Attention U-Net网络。通过对比实验对无标注结冰图像和少量标注图像两种情况进行验证。实验结果表明,在无标注图像的情况下,可实现无监督的结冰显微图像的气泡提取;在只有少量标注图像的情况下,该方法可实现更精确的气泡提取。 展开更多
关键词 动态结冰 气泡提取 图像分割 域适应 Attention U-Net
下载PDF
基于改进BiLSTM的电力工程数据智能分析算法设计
9
作者 毛华 房向阳 +1 位作者 王斌 孙岳 《电子设计工程》 2024年第2期69-73,共5页
针对目前电力工程费用计算复杂、时间成本较高且准确性低的问题,文中开展了基于改进BiLSTM的电力工程数据智能分析算法设计研究。从技术、工程量和费用三个维度构建了电力工程数据智能分析指标体系,进而提出了一种基于BiLSTM与Attentio... 针对目前电力工程费用计算复杂、时间成本较高且准确性低的问题,文中开展了基于改进BiLSTM的电力工程数据智能分析算法设计研究。从技术、工程量和费用三个维度构建了电力工程数据智能分析指标体系,进而提出了一种基于BiLSTM与Attention联合模型的电力工程费用预测算法。该算法将电力工程数据作为BiLSTM的模型输入,并采用Attention机制提高了对重要数据的关注程度。通过引入数据指标与电力工程费用的自动关联分析技术,实现了对电力工程费用的精准预测。仿真算例分析结果表明,与LSTM及BiLSTM算法相比,所提算法具有更高的预测准确性,平均预测误差小于5%。 展开更多
关键词 电力工程 费用计算 深度学习 BiLSTM算法 Attention模型
下载PDF
基于ALBERT-Seq2Seq-Attention模型的数字化档案多标签分类
10
作者 王少阳 成新民 +3 位作者 王瑞琴 陈静雯 周阳 费志高 《湖州师范学院学报》 2024年第2期65-72,共8页
针对现有的数字化档案多标签分类方法存在分类标签之间缺少关联性的问题,提出一种用于档案多标签分类的深层神经网络模型ALBERT-Seq2Seq-Attention.该模型通过ALBERT(A Little BERT)预训练语言模型内部多层双向的Transfomer结构获取进... 针对现有的数字化档案多标签分类方法存在分类标签之间缺少关联性的问题,提出一种用于档案多标签分类的深层神经网络模型ALBERT-Seq2Seq-Attention.该模型通过ALBERT(A Little BERT)预训练语言模型内部多层双向的Transfomer结构获取进行文本特征向量的提取,并获得上下文语义信息;将预训练提取的文本特征作为Seq2Seq-Attention(Sequence to Sequence-Attention)模型的输入序列,构建标签字典以获取多标签间的关联关系.将分类模型在3种数据集上分别进行对比实验,结果表明:模型分类的效果F1值均超过90%.该模型不仅能提高档案文本的多标签分类效果,也能关注标签之间的相关关系. 展开更多
关键词 ALBERT Seq2Seq ATTENTION 多标签分类 数字化档案
下载PDF
融合MacBERT和Talking⁃Heads Attention实体关系联合抽取模型
11
作者 王春亮 姚洁仪 李昭 《现代电子技术》 北大核心 2024年第5期127-131,共5页
针对现有的医学文本关系抽取任务模型在训练过程中存在语义理解能力不足,可能导致关系抽取的效果不尽人意的问题,文中提出一种融合MacBERT和Talking⁃Heads Attention的实体关系联合抽取模型。该模型首先利用MacBERT语言模型来获取动态... 针对现有的医学文本关系抽取任务模型在训练过程中存在语义理解能力不足,可能导致关系抽取的效果不尽人意的问题,文中提出一种融合MacBERT和Talking⁃Heads Attention的实体关系联合抽取模型。该模型首先利用MacBERT语言模型来获取动态字向量表达,MacBERT作为改进的BERT模型,能够减少预训练和微调阶段之间的差异,从而提高模型的泛化能力;然后,将这些动态字向量表达输入到双向门控循环单元(BiGRU)中,以便提取文本的上下文特征。BiGRU是一种改进的循环神经网络(RNN),具有更好的长期依赖捕获能力。在获取文本上下文特征之后,使用Talking⁃Heads Attention来获取全局特征。Talking⁃Heads Attention是一种自注意力机制,可以捕获文本中不同位置之间的关系,从而提高关系抽取的准确性。实验结果表明,与实体关系联合抽取模型GRTE相比,该模型F1值提升1%,precision值提升0.4%,recall值提升1.5%。 展开更多
关键词 MacBERT BiGRU 关系抽取 医学文本 Talking⁃Heads Attention 深度学习 全局特征 神经网络
下载PDF
基于上下文语义联合YOLOv7的分心驾驶检测算法
12
作者 李富 徐凯 +2 位作者 朱灵龙 沈昊君 王泉 《国外电子测量技术》 2024年第3期121-128,共8页
针对分心驾驶检测方法存在实时性差、精度低、可部署性差的问题,提出了一种基于上下文语义增强联合YOLOv7的分心驾驶检测算法。首先将模型backbone和head部分的ELAN模块替换成语义上下文增强模块(contextual transformer,CoT),提高上下... 针对分心驾驶检测方法存在实时性差、精度低、可部署性差的问题,提出了一种基于上下文语义增强联合YOLOv7的分心驾驶检测算法。首先将模型backbone和head部分的ELAN模块替换成语义上下文增强模块(contextual transformer,CoT),提高上下文语义信息的捕获能力。其次,将语义关联增强机制(triplet attention)融入卷积块中,插入backbone和head的连接头之间以及融合MP2模块,强化目标间的关联关系以及提升目标特征提取能力。最后,将自注意力双向Transformer模块(Biformer)模块融合SPPCSPC模块,提升模型对分心驾驶中的复杂场景和遮挡目标的处理能力。改进的YOLOv7算法在分心驾驶数据集下平均精度均值(mean average precision,mAP)达到了87.3%,比原算法提高了4.3%,模型参数量减少了4.7%,每秒传输帧数达到了90 fps,具有较好的检测精度与速度。 展开更多
关键词 YOLOv7 分心驾驶检测 COT Biformer Triplet Attention
下载PDF
基于多模态融合的情绪识别
13
作者 廖强 王宇 《科技传播》 2024年第7期12-15,共4页
多模态情绪识别利用语音、文本、图像等多种数据形式,通过综合分析这些数据中的情绪信息来识别人类的情绪状态。相较于单一模态的情绪识别,这种方法能更全面地捕捉情绪的复杂性和微妙差异。为提高从特征提取到情绪分类的高效性,提出了... 多模态情绪识别利用语音、文本、图像等多种数据形式,通过综合分析这些数据中的情绪信息来识别人类的情绪状态。相较于单一模态的情绪识别,这种方法能更全面地捕捉情绪的复杂性和微妙差异。为提高从特征提取到情绪分类的高效性,提出了一种端对端的网络结构,直接利用音频与视频数据进行特征提取和情绪识别,并通过实验比较了三种不同的特征融合方法。实验结果表明,改进中间注意力融合方法在RAVDESS测试集上取得了71.67%的最高平均准确率,显示出优于其他融合策略的性能。 展开更多
关键词 情绪识别 TRANSFORMER ATTENTION 多模态融合
下载PDF
Image Inpainting Technique Incorporating Edge Prior and Attention Mechanism
14
作者 Jinxian Bai Yao Fan +1 位作者 Zhiwei Zhao Lizhi Zheng 《Computers, Materials & Continua》 SCIE EI 2024年第1期999-1025,共27页
Recently,deep learning-based image inpainting methods have made great strides in reconstructing damaged regions.However,these methods often struggle to produce satisfactory results when dealing with missing images wit... Recently,deep learning-based image inpainting methods have made great strides in reconstructing damaged regions.However,these methods often struggle to produce satisfactory results when dealing with missing images with large holes,leading to distortions in the structure and blurring of textures.To address these problems,we combine the advantages of transformers and convolutions to propose an image inpainting method that incorporates edge priors and attention mechanisms.The proposed method aims to improve the results of inpainting large holes in images by enhancing the accuracy of structure restoration and the ability to recover texture details.This method divides the inpainting task into two phases:edge prediction and image inpainting.Specifically,in the edge prediction phase,a transformer architecture is designed to combine axial attention with standard self-attention.This design enhances the extraction capability of global structural features and location awareness.It also balances the complexity of self-attention operations,resulting in accurate prediction of the edge structure in the defective region.In the image inpainting phase,a multi-scale fusion attention module is introduced.This module makes full use of multi-level distant features and enhances local pixel continuity,thereby significantly improving the quality of image inpainting.To evaluate the performance of our method.comparative experiments are conducted on several datasets,including CelebA,Places2,and Facade.Quantitative experiments show that our method outperforms the other mainstream methods.Specifically,it improves Peak Signal-to-Noise Ratio(PSNR)and Structure Similarity Index Measure(SSIM)by 1.141~3.234 db and 0.083~0.235,respectively.Moreover,it reduces Learning Perceptual Image Patch Similarity(LPIPS)and Mean Absolute Error(MAE)by 0.0347~0.1753 and 0.0104~0.0402,respectively.Qualitative experiments reveal that our method excels at reconstructing images with complete structural information and clear texture details.Furthermore,our model exhibits impressive performance in terms of the number of parameters,memory cost,and testing time. 展开更多
关键词 Image inpainting TRANSFORMER edge prior axial attention multi-scale fusion attention
下载PDF
An Approach for Human Posture Recognition Based on the Fusion PSE-CNN-BiGRU Model
15
作者 Xianghong Cao Xinyu Wang +2 位作者 Xin Geng Donghui Wu Houru An 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第7期385-408,共24页
This study proposes a pose estimation-convolutional neural network-bidirectional gated recurrent unit(PSECNN-BiGRU)fusion model for human posture recognition to address low accuracy issues in abnormal posture recognit... This study proposes a pose estimation-convolutional neural network-bidirectional gated recurrent unit(PSECNN-BiGRU)fusion model for human posture recognition to address low accuracy issues in abnormal posture recognition due to the loss of some feature information and the deterioration of comprehensive performance in model detection in complex home environments.Firstly,the deep convolutional network is integrated with the Mediapipe framework to extract high-precision,multi-dimensional information from the key points of the human skeleton,thereby obtaining a human posture feature set.Thereafter,a double-layer BiGRU algorithm is utilized to extract multi-layer,bidirectional temporal features from the human posture feature set,and a CNN network with an exponential linear unit(ELU)activation function is adopted to perform deep convolution of the feature map to extract the spatial feature of the human posture.Furthermore,a squeeze and excitation networks(SENet)module is introduced to adaptively learn the importance weights of each channel,enhancing the network’s focus on important features.Finally,comparative experiments are performed on available datasets,including the public human activity recognition using smartphone dataset(UCIHAR),the public human activity recognition 70 plus dataset(HAR70PLUS),and the independently developed home abnormal behavior recognition dataset(HABRD)created by the authors’team.The results show that the average accuracy of the proposed PSE-CNN-BiGRU fusion model for human posture recognition is 99.56%,89.42%,and 98.90%,respectively,which are 5.24%,5.83%,and 3.19%higher than the average accuracy of the five models proposed in the comparative literature,including CNN,GRU,and others.The F1-score for abnormal posture recognition reaches 98.84%(heartache),97.18%(fall),99.6%(bellyache),and 98.27%(climbing)on the self-builtHABRDdataset,thus verifying the effectiveness,generalization,and robustness of the proposed model in enhancing human posture recognition. 展开更多
关键词 Posture recognition mediapipe BiGRU CNN ELU ATTENTION
下载PDF
End-to-End Paired Ambisonic-Binaural Audio Rendering
16
作者 Yin Zhu Qiuqiang Kong +5 位作者 Junjie Shi Shilei Liu Xuzhou Ye Ju-Chiang Wang Hongming Shan Junping Zhang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第2期502-513,共12页
Binaural rendering is of great interest to virtual reality and immersive media. Although humans can naturally use their two ears to perceive the spatial information contained in sounds, it is a challenging task for ma... Binaural rendering is of great interest to virtual reality and immersive media. Although humans can naturally use their two ears to perceive the spatial information contained in sounds, it is a challenging task for machines to achieve binaural rendering since the description of a sound field often requires multiple channels and even the metadata of the sound sources. In addition, the perceived sound varies from person to person even in the same sound field. Previous methods generally rely on individual-dependent head-related transferred function(HRTF)datasets and optimization algorithms that act on HRTFs. In practical applications, there are two major drawbacks to existing methods. The first is a high personalization cost, as traditional methods achieve personalized needs by measuring HRTFs. The second is insufficient accuracy because the optimization goal of traditional methods is to retain another part of information that is more important in perception at the cost of discarding a part of the information. Therefore, it is desirable to develop novel techniques to achieve personalization and accuracy at a low cost. To this end, we focus on the binaural rendering of ambisonic and propose 1) channel-shared encoder and channel-compared attention integrated into neural networks and 2) a loss function quantifying interaural level differences to deal with spatial information. To verify the proposed method, we collect and release the first paired ambisonic-binaural dataset and introduce three metrics to evaluate the content information and spatial information accuracy of the end-to-end methods. Extensive experimental results on the collected dataset demonstrate the superior performance of the proposed method and the shortcomings of previous methods. 展开更多
关键词 Ambisonic ATTENTION binaural rendering neural network
下载PDF
Strengthening Network Security: Deep Learning Models for Intrusion Detectionwith Optimized Feature Subset and Effective Imbalance Handling
17
作者 Bayi Xu Lei Sun +2 位作者 Xiuqing Mao Chengwei Liu Zhiyi Ding 《Computers, Materials & Continua》 SCIE EI 2024年第2期1995-2022,共28页
In recent years,frequent network attacks have highlighted the importance of efficient detection methods for ensuring cyberspace security.This paper presents a novel intrusion detection system consisting of a data prep... In recent years,frequent network attacks have highlighted the importance of efficient detection methods for ensuring cyberspace security.This paper presents a novel intrusion detection system consisting of a data prepro-cessing stage and a deep learning model for accurately identifying network attacks.We have proposed four deep neural network models,which are constructed using architectures such as Convolutional Neural Networks(CNN),Bi-directional Long Short-Term Memory(BiLSTM),Bidirectional Gate Recurrent Unit(BiGRU),and Attention mechanism.These models have been evaluated for their detection performance on the NSL-KDD dataset.To enhance the compatibility between the data and the models,we apply various preprocessing techniques and employ the particle swarm optimization algorithm to perform feature selection on the NSL-KDD dataset,resulting in an optimized feature subset.Moreover,we address class imbalance in the dataset using focal loss.Finally,we employ the BO-TPE algorithm to optimize the hyperparameters of the four models,maximizing their detection performance.The test results demonstrate that the proposed model is capable of extracting the spatiotemporal features of network traffic data effectively.In binary and multiclass experiments,it achieved accuracy rates of 0.999158 and 0.999091,respectively,surpassing other state-of-the-art methods. 展开更多
关键词 Intrusion detection CNN BiLSTM BiGRU ATTENTION
下载PDF
Point Cloud Classification Using Content-Based Transformer via Clustering in Feature Space
18
作者 Yahui Liu Bin Tian +2 位作者 Yisheng Lv Lingxi Li Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期231-239,共9页
Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to est... Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space(content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. Especially, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectN N. Source code of this paper is available at https://github.com/yahuiliu99/PointC onT. 展开更多
关键词 Content-based Transformer deep learning feature aggregator local attention point cloud classification
下载PDF
Self-supervised recalibration network for person re-identification
19
作者 Shaoqi Hou Zhiming Wang +4 位作者 Zhihua Dong Ye Li Zhiguo Wang Guangqiang Yin Xinzhong Wang 《Defence Technology(防务技术)》 SCIE EI CAS CSCD 2024年第1期163-178,共16页
The attention mechanism can extract salient features in images,which has been proved to be effective in improving the performance of person re-identification(Re-ID).However,most of the existing attention modules have ... The attention mechanism can extract salient features in images,which has been proved to be effective in improving the performance of person re-identification(Re-ID).However,most of the existing attention modules have the following two shortcomings:On the one hand,they mostly use global average pooling to generate context descriptors,without highlighting the guiding role of salient information on descriptor generation,resulting in insufficient ability of the final generated attention mask representation;On the other hand,the design of most attention modules is complicated,which greatly increases the computational cost of the model.To solve these problems,this paper proposes an attention module called self-supervised recalibration(SR)block,which introduces both global and local information through adaptive weighted fusion to generate a more refined attention mask.In particular,a special"Squeeze-Excitation"(SE)unit is designed in the SR block to further process the generated intermediate masks,both for nonlinearizations of the features and for constraint of the resulting computation by controlling the number of channels.Furthermore,we combine the most commonly used Res Net-50 to construct the instantiation model of the SR block,and verify its effectiveness on multiple Re-ID datasets,especially the mean Average Precision(m AP)on the Occluded-Duke dataset exceeds the state-of-the-art(SOTA)algorithm by 4.49%. 展开更多
关键词 Person re-identification Attention mechanism Global information Local information Adaptive weighted fusion
下载PDF
Attention Markets of Blockchain-Based Decentralized Autonomous Organizations
20
作者 Juanjuan Li Rui Qin +3 位作者 Sangtian Guan Wenwen Ding Fei Lin Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第6期1370-1380,共11页
The attention is a scarce resource in decentralized autonomous organizations(DAOs),as their self-governance relies heavily on the attention-intensive decision-making process of“proposal and voting”.To prevent the ne... The attention is a scarce resource in decentralized autonomous organizations(DAOs),as their self-governance relies heavily on the attention-intensive decision-making process of“proposal and voting”.To prevent the negative effects of pro-posers’attention-capturing strategies that contribute to the“tragedy of the commons”and ensure an efficient distribution of attention among multiple proposals,it is necessary to establish a market-driven allocation scheme for DAOs’attention.First,the Harberger tax-based attention markets are designed to facilitate its allocation via continuous and automated trading,where the individualized Harberger tax rate(HTR)determined by the pro-posers’reputation is adopted.Then,the Stackelberg game model is formulated in these markets,casting attention to owners in the role of leaders and other competitive proposers as followers.Its equilibrium trading strategies are also discussed to unravel the intricate dynamics of attention pricing.Moreover,utilizing the single-round Stackelberg game as an illustrative example,the existence of Nash equilibrium trading strategies is demonstrated.Finally,the impact of individualized HTR on trading strategies is investigated,and results suggest that it has a negative correlation with leaders’self-accessed prices and ownership duration,but its effect on their revenues varies under different conditions.This study is expected to provide valuable insights into leveraging attention resources to improve DAOs’governance and decision-making process. 展开更多
关键词 ATTENTION decentralized autonomous organizations Harberger tax Stackelberg game.
下载PDF
上一页 1 2 52 下一页 到第
使用帮助 返回顶部