期刊文献+
共找到843篇文章
< 1 2 43 >
每页显示 20 50 100
Remaining Useful Life Prediction of Rail Based on Improved Pulse Separable Convolution Enhanced Transformer Encoder
1
作者 Zhongmei Wang Min Li +2 位作者 Jing He Jianhua Liu Lin Jia 《Journal of Transportation Technologies》 2024年第2期137-160,共24页
In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is di... In order to prevent possible casualties and economic loss, it is critical to accurate prediction of the Remaining Useful Life (RUL) in rail prognostics health management. However, the traditional neural networks is difficult to capture the long-term dependency relationship of the time series in the modeling of the long time series of rail damage, due to the coupling relationship of multi-channel data from multiple sensors. Here, in this paper, a novel RUL prediction model with an enhanced pulse separable convolution is used to solve this issue. Firstly, a coding module based on the improved pulse separable convolutional network is established to effectively model the relationship between the data. To enhance the network, an alternate gradient back propagation method is implemented. And an efficient channel attention (ECA) mechanism is developed for better emphasizing the useful pulse characteristics. Secondly, an optimized Transformer encoder was designed to serve as the backbone of the model. It has the ability to efficiently understand relationship between the data itself and each other at each time step of long time series with a full life cycle. More importantly, the Transformer encoder is improved by integrating pulse maximum pooling to retain more pulse timing characteristics. Finally, based on the characteristics of the front layer, the final predicted RUL value was provided and served as the end-to-end solution. The empirical findings validate the efficacy of the suggested approach in forecasting the rail RUL, surpassing various existing data-driven prognostication techniques. Meanwhile, the proposed method also shows good generalization performance on PHM2012 bearing data set. 展开更多
关键词 Equipment Health Prognostics Remaining Useful Life Prediction Pulse separable convolution Attention Mechanism Transformer Encoder
下载PDF
MSSTNet:Multi-scale facial videos pulse extraction network based on separable spatiotemporal convolution and dimension separable attention
2
作者 Changchen ZHAO Hongsheng WANG Yuanjing FENG 《Virtual Reality & Intelligent Hardware》 2023年第2期124-141,共18页
Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale regi... Background The use of remote photoplethysmography(rPPG)to estimate blood volume pulse in a noncontact manner has been an active research topic in recent years.Existing methods are primarily based on a singlescale region of interest(ROI).However,some noise signals that are not easily separated in a single-scale space can be easily separated in a multi-scale space.Also,existing spatiotemporal networks mainly focus on local spatiotemporal information and do not emphasize temporal information,which is crucial in pulse extraction problems,resulting in insufficient spatiotemporal feature modelling.Methods Here,we propose a multi-scale facial video pulse extraction network based on separable spatiotemporal convolution(SSTC)and dimension separable attention(DSAT).First,to solve the problem of a single-scale ROI,we constructed a multi-scale feature space for initial signal separation.Second,SSTC and DSAT were designed for efficient spatiotemporal correlation modeling,which increased the information interaction between the long-span time and space dimensions;this placed more emphasis on temporal features.Results The signal-to-noise ratio(SNR)of the proposed network reached 9.58dB on the PURE dataset and 6.77dB on the UBFC-rPPG dataset,outperforming state-of-the-art algorithms.Conclusions The results showed that fusing multi-scale signals yielded better results than methods based on only single-scale signals.The proposed SSTC and dimension-separable attention mechanism will contribute to more accurate pulse signal extraction. 展开更多
关键词 Remote photoplethysmography Heart rate separable spatiotemporal convolution Dimension separable attention MULTI-SCALE Neural network
下载PDF
A Lightweight Convolutional Neural Network with Hierarchical Multi-Scale Feature Fusion for Image Classification
3
作者 Adama Dembele Ronald Waweru Mwangi Ananda Omutokoh Kube 《Journal of Computer and Communications》 2024年第2期173-200,共28页
Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware reso... Convolutional neural networks (CNNs) are widely used in image classification tasks, but their increasing model size and computation make them challenging to implement on embedded systems with constrained hardware resources. To address this issue, the MobileNetV1 network was developed, which employs depthwise convolution to reduce network complexity. MobileNetV1 employs a stride of 2 in several convolutional layers to decrease the spatial resolution of feature maps, thereby lowering computational costs. However, this stride setting can lead to a loss of spatial information, particularly affecting the detection and representation of smaller objects or finer details in images. To maintain the trade-off between complexity and model performance, a lightweight convolutional neural network with hierarchical multi-scale feature fusion based on the MobileNetV1 network is proposed. The network consists of two main subnetworks. The first subnetwork uses a depthwise dilated separable convolution (DDSC) layer to learn imaging features with fewer parameters, which results in a lightweight and computationally inexpensive network. Furthermore, depthwise dilated convolution in DDSC layer effectively expands the field of view of filters, allowing them to incorporate a larger context. The second subnetwork is a hierarchical multi-scale feature fusion (HMFF) module that uses parallel multi-resolution branches architecture to process the input feature map in order to extract the multi-scale feature information of the input image. Experimental results on the CIFAR-10, Malaria, and KvasirV1 datasets demonstrate that the proposed method is efficient, reducing the network parameters and computational cost by 65.02% and 39.78%, respectively, while maintaining the network performance compared to the MobileNetV1 baseline. 展开更多
关键词 MobileNet Image Classification Lightweight convolutional Neural Network Depthwise Dilated separable convolution Hierarchical Multi-Scale Feature Fusion
下载PDF
Validation Research on the Application of Depthwise Separable Convolutional Al Facial Expression Recognition in Non-pharmacological Treatment of BPSD
4
作者 Xiangyu Liu 《Journal of Clinical and Nursing Research》 2021年第4期31-37,共7页
One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence... One of the most obvious clinical reasons of dementia or The Behavioral and Psychological Symptoms of Dementia(BPSD)are the lack of emotional expression,the increased frequency of negative emotions,and the impermanence of emotions.Observing the reduction of BPSD in dementia through emotions can be considered effective and widely used in the field of non-pharmacological therapy.At present,this article will verify whether the image recognition artificial intelligence(AI)system can correctly reflect the emotional performance of the elderly with dementia through a questionnaire survey of three professional elderly nursing staff.The ANOVA(sig.=0.50)is used to determine that the judgment given by the nursing staff has no obvious deviation,and then Kendall's test(0.722**)and spearman's test(0.863**)are used to verify the judgment severity of the emotion recognition system and the nursing staff unanimously.This implies the usability of the tool.Additionally,it can be expected to be further applied in the research related to BPSD elderly emotion detection. 展开更多
关键词 depth-wise separable convolution EMOTION BPSD DEMENTIA Nursing
下载PDF
SepFE:Separable Fusion Enhanced Network for Retinal Vessel Segmentation
5
作者 Yun Wu Ge Jiao Jiahao Liu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第9期2465-2485,共21页
The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation... The accurate and automatic segmentation of retinal vessels fromfundus images is critical for the early diagnosis and prevention ofmany eye diseases,such as diabetic retinopathy(DR).Existing retinal vessel segmentation approaches based on convolutional neural networks(CNNs)have achieved remarkable effectiveness.Here,we extend a retinal vessel segmentation model with low complexity and high performance based on U-Net,which is one of the most popular architectures.In view of the excellent work of depth-wise separable convolution,we introduce it to replace the standard convolutional layer.The complexity of the proposed model is reduced by decreasing the number of parameters and calculations required for themodel.To ensure performance while lowering redundant parameters,we integrate the pre-trained MobileNet V2 into the encoder.Then,a feature fusion residual module(FFRM)is designed to facilitate complementary strengths by enhancing the effective fusion between adjacent levels,which alleviates extraneous clutter introduced by direct fusion.Finally,we provide detailed comparisons between the proposed SepFE and U-Net in three retinal image mainstream datasets(DRIVE,STARE,and CHASEDB1).The results show that the number of SepFE parameters is only 3%of U-Net,the Flops are only 8%of U-Net,and better segmentation performance is obtained.The superiority of SepFE is further demonstrated through comparisons with other advanced methods. 展开更多
关键词 Retinal vessel segmentation U-Net depth-wise separable convolution feature fusion
下载PDF
A WEIGHTED GENERAL DISCRETE FOURIER TRANSFORM FOR THE FREQUENCY-DOMAIN BLIND SOURCE SEPARATION OF CONVOLUTIVE MIXTURES 被引量:1
6
作者 Wang Chao Fang Yong Feng Jiuchao 《Journal of Electronics(China)》 2008年第6期830-833,共4页
This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform... This letter deals with the frequency domain Blind Source Separation of Convolutive Mixtures (CMBSS). From the frequency representation of the "overlap and save", a Weighted General Discrete Fourier Transform (WGDFT) is derived to replace the traditional Discrete Fourier Transform (DFT). The mixing matrix on each frequency bin could be estimated more precisely from WGDFT coefficients than from DFT coefficients, which improves separation performance. Simulation results verify the validity of WGDFT for frequency domain blind source separation of convolutive mixtures. 展开更多
关键词 频域盲源分离 离散傅里叶变换 频率表征 重叠
下载PDF
A Framework of Lightweight Deep Cross-Connected Convolution Kernel Mapping Support Vector Machines
7
作者 Qi Wang Zhaoying Liu +3 位作者 Ting Zhang Shanshan Tu Yujian Li Muhammad Waqas 《Journal on Artificial Intelligence》 2022年第1期37-48,共12页
Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classifi... Deep kernel mapping support vector machines have achieved good results in numerous tasks by mapping features from a low-dimensional space to a high-dimensional space and then using support vector machines for classification.However,the depth kernel mapping support vector machine does not take into account the connection of different dimensional spaces and increases the model parameters.To further improve the recognition capability of deep kernel mapping support vector machines while reducing the number of model parameters,this paper proposes a framework of Lightweight Deep Convolutional Cross-Connected Kernel Mapping Support Vector Machines(LC-CKMSVM).The framework consists of a feature extraction module and a classification module.The feature extraction module first maps the data from low-dimensional to high-dimensional space by fusing the representations of different dimensional spaces through cross-connections;then,it uses depthwise separable convolution to replace part of the original convolution to reduce the number of parameters in the module;The classification module uses a soft margin support vector machine for classification.The results on 6 different visual datasets show that LC-CKMSVM obtains better classification accuracies on most cases than the other five models. 展开更多
关键词 convolutional neural network cross-connected lightweight framework depthwise separable convolution
下载PDF
AN NMF ALGORITHM FOR BLIND SEPARATION OF CONVOLUTIVE MIXED SOURCE SIGNALS WITH LEAST CORRELATION CONSTRAINS
8
作者 Zhang Ye Fang Yong 《Journal of Electronics(China)》 2009年第4期557-563,共7页
Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual stati... Most of the existing algorithms for blind sources separation have a limitation that sources are statistically independent. However, in many practical applications, the source signals are non- negative and mutual statistically dependent signals. When the observations are nonnegative linear combinations of nonnegative sources, the correlation coefficients of the observations are larger than these of source signals. In this letter, a novel Nonnegative Matrix Factorization (NMF) algorithm with least correlated component constraints to blind separation of convolutive mixed sources is proposed. The algorithm relaxes the source independence assumption and has low-complexity algebraic com- putations. Simulation results on blind source separation including real face image data indicate that the sources can be successfully recovered with the algorithm. 展开更多
关键词 矩阵分解算法 信号分离 卷积 混源 非负矩阵分解 统计独立 盲源分离 甲基甲酰胺
下载PDF
Coal/Gangue Volume Estimation with Convolutional Neural Network and Separation Based on Predicted Volume and Weight
9
作者 Zenglun Guan Murad S.Alfarzaeai +2 位作者 Eryi Hu Taqiaden Alshmeri Wang Peng 《Computers, Materials & Continua》 SCIE EI 2024年第4期279-306,共28页
In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using new... In the coal mining industry,the gangue separation phase imposes a key challenge due to the high visual similaritybetween coal and gangue.Recently,separation methods have become more intelligent and efficient,using newtechnologies and applying different features for recognition.One such method exploits the difference in substancedensity,leading to excellent coal/gangue recognition.Therefore,this study uses density differences to distinguishcoal from gangue by performing volume prediction on the samples.Our training samples maintain a record of3-side images as input,volume,and weight as the ground truth for the classification.The prediction process relieson a Convolutional neural network(CGVP-CNN)model that receives an input of a 3-side image and then extractsthe needed features to estimate an approximation for the volume.The classification was comparatively performedvia ten different classifiers,namely,K-Nearest Neighbors(KNN),Linear Support Vector Machines(Linear SVM),Radial Basis Function(RBF)SVM,Gaussian Process,Decision Tree,Random Forest,Multi-Layer Perceptron(MLP),Adaptive Boosting(AdaBosst),Naive Bayes,and Quadratic Discriminant Analysis(QDA).After severalexperiments on testing and training data,results yield a classification accuracy of 100%,92%,95%,96%,100%,100%,100%,96%,81%,and 92%,respectively.The test reveals the best timing with KNN,which maintained anaccuracy level of 100%.Assessing themodel generalization capability to newdata is essential to ensure the efficiencyof the model,so by applying a cross-validation experiment,the model generalization was measured.The useddataset was isolated based on the volume values to ensure the model generalization not only on new images of thesame volume but with a volume outside the trained range.Then,the predicted volume values were passed to theclassifiers group,where classification reported accuracy was found to be(100%,100%,100%,98%,88%,87%,100%,87%,97%,100%),respectively.Although obtaining a classification with high accuracy is the main motive,this workhas a remarkable reduction in the data preprocessing time compared to related works.The CGVP-CNN modelmanaged to reduce the data preprocessing time of previous works to 0.017 s while maintaining high classificationaccuracy using the estimated volume value. 展开更多
关键词 Coal coal gangue convolutional neural network CNN object classification volume estimation separation system
下载PDF
基于语义分割的车位检测算法研究
10
作者 李伟东 李冰 +1 位作者 朱旭浩 李乐 《大连理工大学学报》 CAS CSCD 北大核心 2024年第1期96-103,共8页
作为自动泊车系统中至关重要的一环,车位检测算法的精度直接决定自动泊车系统的好坏.目前,基于语义分割的车位检测算法主要有两个问题:一是分割网络参数量较大,难以满足移动端部署;二是后处理提取算法复杂,难以满足实时检测要求.针对这... 作为自动泊车系统中至关重要的一环,车位检测算法的精度直接决定自动泊车系统的好坏.目前,基于语义分割的车位检测算法主要有两个问题:一是分割网络参数量较大,难以满足移动端部署;二是后处理提取算法复杂,难以满足实时检测要求.针对这两个问题,设计一种通过检测车位线来获取停车位的车位检测算法.采用深度可分离卷积和非对称卷积相结合的方式设计车位线分割网络UFAC-Net,并提出一种更为简洁的车位线提取算法.实验结果表明:UFAC-Net模型(UFAC-Net2)分割的平均像素精度为83.07%,平均交并比为73.05%,模型参数量为3.1 MB,达到目前PSV datasets上最好的分割精度;车位检测算法可检测复杂情况下的平行、垂直、倾斜3种类型的车位,在自定义测试集中精准率为99.23%,召回率为99.12%,单张图像检测时间为32.2 ms,具有良好的检测性能. 展开更多
关键词 车位检测 语义分割 深度可分离卷积 非对称卷积
下载PDF
改进U_Net网络的钢结构表面锈蚀图像分割方法
11
作者 陈法法 董海飞 +1 位作者 何向阳 陈保家 《电子测量与仪器学报》 CSCD 北大核心 2024年第2期49-57,共9页
为实现锈蚀图像分割网络模型轻量化,同时消除非单一特征背景和锈液等类似特征背景干扰,本文将U_Net网络模型的编码部分替换为MobilenetV3_Large网络,导入基于ImageNet数据集的MobilenetV3_Large网络预训练权重,将U_Net网络模型解码部分... 为实现锈蚀图像分割网络模型轻量化,同时消除非单一特征背景和锈液等类似特征背景干扰,本文将U_Net网络模型的编码部分替换为MobilenetV3_Large网络,导入基于ImageNet数据集的MobilenetV3_Large网络预训练权重,将U_Net网络模型解码部分的普通卷积替换为深度可分离残差卷积,并在上采样的过程中添加注意力导向AG模块和Dropout机制。经实验验证表明,本文设计的改进U_Net网络模型在非单一特征背景和锈液等类似特征背景干扰下,具有明显的锈蚀图像分割优势,相比于原U_Net网络模型,模型大小减少了81.18%,浮点计算量减少了98.34%,检测效率提升了3.27倍,即从原来不足6 fps,提升至19 fps。网络模型实现轻量化的同时,网络模型的准确率达95.54%,相比于原U_Net网络模型提升了5.04%。 展开更多
关键词 锈蚀区域分割 MobilenetV3 U_Net 注意力导向 深度可分离残差卷积
下载PDF
基于改进ResNet50的钨矿石双能X射线图像分选方法
12
作者 刘志锋 曾灵锋 +2 位作者 彭芳伟 魏振华 张寰宇 《现代电子技术》 北大核心 2024年第13期87-92,共6页
文中提出一种基于深度扩张可分离卷积和注意力机制的残差网络模型(DWAtt-ResNet),通过实验对比表明,该模型在钨矿石双能X射线图像数据集上准确率、F1分数、AUC值和AP值均优于ConvNeXt、DenseNet121和EfficientNet_b4等主流的图像分类模... 文中提出一种基于深度扩张可分离卷积和注意力机制的残差网络模型(DWAtt-ResNet),通过实验对比表明,该模型在钨矿石双能X射线图像数据集上准确率、F1分数、AUC值和AP值均优于ConvNeXt、DenseNet121和EfficientNet_b4等主流的图像分类模型。通过消融实验表明,该模型准确率达到87.4%,计算量为2.7GFLOPs,参数量为16.95M,相比ResNet50准确率提高3%,计算量降低1.42 GFLOPs,参数量降低6.56M,准确率提升的同时,效率大幅提升,更适合工业生产的矿石快速分拣需求。 展开更多
关键词 钨矿石 双能X射线 图像分类 ResNet50 深度扩张可分离卷积 注意力机制
下载PDF
基于改进SE-Net和深度可分离残差的高光谱图像分类
13
作者 王燕 王振宇 《兰州理工大学学报》 CAS 北大核心 2024年第2期87-95,共9页
针对目前常见的用于高光谱图像分类的卷积神经网络参数数量多,训练时间长,对样本数量依赖性大的问题,提出一种适用于有限训练样本条件下基于改进压缩激活网络和深度可分离残差的分类网络MDSR&SE-Net.首先使用主成分分析对原始高光... 针对目前常见的用于高光谱图像分类的卷积神经网络参数数量多,训练时间长,对样本数量依赖性大的问题,提出一种适用于有限训练样本条件下基于改进压缩激活网络和深度可分离残差的分类网络MDSR&SE-Net.首先使用主成分分析对原始高光谱图像进行通道降维,然后通过三维卷积神经网络连接多特征残差结构,同时嵌入改进的SE模块提取高光谱图像的空间和光谱细节特征,最后将提取到的特征数据输入Softmax分类器激活分类.为了使网络更加轻量,通过在残差结构中使用深度可分离卷积和引入全局平均池化减少参数数量.实验结果显示,使用有限训练样本在三种常见高光谱数据集上总体分类精度均达到99%以上. 展开更多
关键词 高光谱图像 深度可分离卷积 残差网络 压缩激活网络
下载PDF
多尺度特征金字塔融合的街景图像语义分割
14
作者 曲海成 王莹 +1 位作者 董康龙 刘万军 《计算机系统应用》 2024年第3期73-84,共12页
针对街景图像语义分割任务中的目标尺寸差异大、多尺度特征难以高效提取的问题,本文提出了一种语义分割网络(LDPANet).首先,将空洞卷积与引入残差学习单元的深度可分离卷积结合,来优化编码器结构,在降低了计算复杂度的同时缓解梯度消失... 针对街景图像语义分割任务中的目标尺寸差异大、多尺度特征难以高效提取的问题,本文提出了一种语义分割网络(LDPANet).首先,将空洞卷积与引入残差学习单元的深度可分离卷积结合,来优化编码器结构,在降低了计算复杂度的同时缓解梯度消失的问题.然后利用层传递的迭代空洞空间金字塔,将自顶向下的特征信息依次融合,提高了上下文信息的有效交互能力;在多尺度特征融合之后引入属性注意力模块,使网络抑制冗余信息,强化重要特征.再者,以通道扩展上采样代替双线插值上采样作为解码器,进一步提升了特征图的分辨率.最后,LDPANet方法在Cityscapes和CamVid数据集上的精度分别达到了91.8%和87.52%,与近几年网络模型相比,本文网络模型可以精确地提取像素的位置信息以及空间维度信息,提高了语义分割的准确率. 展开更多
关键词 语义分割 MDSDC IDCP-LC 属性注意力 通道扩展上采样 特征融合
下载PDF
基于深度学习的机织物起毛起球客观评级分析
15
作者 吴骏 徐天 于坤 《现代纺织技术》 北大核心 2024年第1期1-8,共8页
为了有效克服目前人工检测速度慢、误差大、主观性强的问题,更快速、准确、客观地对机织物起毛起球样本评级,提出了一种多尺度特征融合的Wide-SqueezeNet网络。首先制作了两种成分不同的机织物起毛起球数据集。根据机织物起毛起球图像... 为了有效克服目前人工检测速度慢、误差大、主观性强的问题,更快速、准确、客观地对机织物起毛起球样本评级,提出了一种多尺度特征融合的Wide-SqueezeNet网络。首先制作了两种成分不同的机织物起毛起球数据集。根据机织物起毛起球图像中毛球形状大小不一以及分布不均的特点,在网络中改进Fire模块,其中增加了短连接来解决训练中梯度发散等问题,在短连接中使用两个3×3小卷积核来减少计算量并且获取不同尺度的特征图信息,增强网络的特征提取能力来提高准确率;其次为了减少整体的计算量,在网络预处理时将图像统一到224×224大小,并且将网络中普通卷积替换为深度可分离卷积。结果表明,通过使用多尺度特征融合和深度可分离卷积来改进网络,机织物起毛起球的客观评级准确率可以达到99.333%。相比于基础网络SqueezeNe、Resnet、MobileNet、DenseNet、ShuffleNet,该方法的提升分别为2.220%、1.777%、2.666%、1.333%和2.220%。与人工检测需要几分钟到十几分钟不等相比,该网络只需要0.072 s即可检测一幅图像,检测速度大大提高。 展开更多
关键词 机织物 起毛起球 特征融合 深度可分离卷积
下载PDF
基于轻量化YOLOv4的死淘鸡目标检测算法
16
作者 漆海霞 李承杰 黄桂珍 《中国农机化学报》 北大核心 2024年第5期195-201,共7页
针对目前死淘鸡目标检测研究较少,高精度检测算法体积大难以部署至移动式设备等问题,提出一种基于YOLOv4的轻量化死淘鸡目标检测算法。采集大规模蛋鸡养殖工厂笼中死淘鸡图片,建立目标检测数据集;在算法中引入MobileNetv3主干提取网络... 针对目前死淘鸡目标检测研究较少,高精度检测算法体积大难以部署至移动式设备等问题,提出一种基于YOLOv4的轻量化死淘鸡目标检测算法。采集大规模蛋鸡养殖工厂笼中死淘鸡图片,建立目标检测数据集;在算法中引入MobileNetv3主干提取网络与深度可分离卷积来降低模型体积;并在最大池化层前添加自注意力机制模块,增强算法对全局语义信息的捕获。在自建数据集中的试验结果表明,改进算法在死淘鸡目标检测任务中有更高的准确度,其mAP值与召回率分别达到97.74%和98.15%,模型大小缩小至原算法的1/5,在GPU加速下帧数达到77帧/s,检测速度提高1倍,能够满足嵌入式部署需求。 展开更多
关键词 死淘鸡识别 深度学习 轻量化网络 MobileNet 深度可分离卷积
下载PDF
煤矿工业物联网设备识别模型
17
作者 郝秦霞 李慧敏 《工矿自动化》 CSCD 北大核心 2024年第3期99-107,共9页
煤矿工业物联网(IIoT)设备计算与存储资源受限,易遭受非法网络入侵,造成敏感数据泄露或恶意篡改,威胁煤矿生产安全。精准识别煤矿IIoT设备可实现有效管理并维护设备正常运转,提高设备安全防护能力,然而现有设备识别算法存在特征构造复... 煤矿工业物联网(IIoT)设备计算与存储资源受限,易遭受非法网络入侵,造成敏感数据泄露或恶意篡改,威胁煤矿生产安全。精准识别煤矿IIoT设备可实现有效管理并维护设备正常运转,提高设备安全防护能力,然而现有设备识别算法存在特征构造复杂、内存与计算需求较高导致难以部署在资源受限的煤矿IIoT设备中等问题。针对上述问题,提出了一种煤矿IIoT设备识别模型。首先,对支持TCP/IP协议传输的流量数据进行流量切分、无关字段去除、去重、定长字段截取操作后转换为IDX格式存储;其次,使用卷积块注意力模块(CBAM)优化深度可分离卷积(DSC),从而搭建轻量级DSC−CBAM模型来过滤Non−IIoT设备;然后,利用带有阶段惩罚的Wasserstein生成对抗网络(WGAN−GP)扩充流量较少的煤矿IIoT设备数据,达到平衡偏移流量数据的目的;最后,在DSC−CBAM基础上引入多尺度特征融合(MFF)技术捕获浅层全局特征信息,并增加Mish激活函数提高模型训练稳定性,建立优化混合模态识别(MDCM)模型,实现煤矿IIoT设备精准识别。实验结果表明,该模型收敛速度快,准确率、召回率、精确率与F1−score指标均高达99.98%,且参数量小,能精准、高效识别煤矿IIoT设备。 展开更多
关键词 煤矿工业物联网 设备识别 深度可分离卷积 注意力机制 生成对抗网络
下载PDF
基于改进SegNet的鸡只检测算法
18
作者 吉训生 孙贝贝 夏圣奎 《计算机工程与设计》 北大核心 2024年第1期102-109,共8页
为实现智能化检测出鸡场中死亡鸡只,提出一种基于改进语义分割模型AT-SegNet的鸡只检测算法。基于对称编码解码结构SegNet,利用空洞卷积在解码前聚合不同感受野的上下文信息,设计一种三尺度注意力级联融合模块,以并联方式嵌入编、解码器... 为实现智能化检测出鸡场中死亡鸡只,提出一种基于改进语义分割模型AT-SegNet的鸡只检测算法。基于对称编码解码结构SegNet,利用空洞卷积在解码前聚合不同感受野的上下文信息,设计一种三尺度注意力级联融合模块,以并联方式嵌入编、解码器间,丰富解码器信息。利用多层深度可分离卷积替代标准卷积,提取深层次语义信息,减少计算量提高实时性。将鸡群图像分割结果交并比与阈值对比判别鸡只状态。实验结果表明,改进的AT-SegNet较原算法的检测精度提高了25.17%,能够在复杂鸡群环境中准确、高效地发现死亡鸡只。 展开更多
关键词 深度学习 鸡只检测 语义分割 编码解码结构 注意力机制 软池化 深度可分离卷积
下载PDF
面向小目标检测的轻量化改进CenterNet算法
19
作者 张伟丰 《湖北汽车工业学院学报》 2024年第2期57-63,70,共8页
为提高传统目标检测算法的实时性,并解决小目标检测效果不佳及漏检率高的问题,提出了改进CenterNet算法。首先将特征提取网络由ResNet50改为SqueezeNet,卷积计算的部分用深度可分离卷积代替;接着使用双阈值改进NMS算法替代单阈值-非极... 为提高传统目标检测算法的实时性,并解决小目标检测效果不佳及漏检率高的问题,提出了改进CenterNet算法。首先将特征提取网络由ResNet50改为SqueezeNet,卷积计算的部分用深度可分离卷积代替;接着使用双阈值改进NMS算法替代单阈值-非极大值抑制算法,通过DIoU计算损失函数。结果表明:改进算法在安全帽和口罩检测数据集的检测精度分别为91.3%和85.5%,与CenterNet算法相比,性能分别提升了2.35%和3.76%,同时具有更快的检测速度。 展开更多
关键词 目标检测 SqueezeNet 深度可分离卷积 CenterNet
下载PDF
基于锚点的快速三维手部关键点检测算法
20
作者 秦晓飞 何文 +2 位作者 班东贤 郭宏宇 于景 《电子科技》 2024年第4期77-86,共10页
在人机协作任务中,手部关键点检测为机械臂提供目标点坐标,A2J(Anchor-to-Joint)是具有代表性的一种利用锚点进行关键点检测的方法。A2J以深度图为输入,可实现较好的检测效果,但对全局特征获取能力不足。文中设计了全局-局部特征融合模... 在人机协作任务中,手部关键点检测为机械臂提供目标点坐标,A2J(Anchor-to-Joint)是具有代表性的一种利用锚点进行关键点检测的方法。A2J以深度图为输入,可实现较好的检测效果,但对全局特征获取能力不足。文中设计了全局-局部特征融合模块(Global-Local Feature Fusion,GLFF)对骨干网络浅层和深层的特征进行融合。为了提升检测速度,文中将A2J的骨干网络替换为ShuffleNetv2并对其进行改造,用5×5深度可分离卷积替换3×3深度可分离卷积,增大感受野,有效提升了骨干网络对全局特征的提取能力。文中在锚点权重估计分支引入高效通道注意力模块(Efficient Channel Attention,ECA),提升了网络对重要锚点的关注度。在主流数据集ICVL和NYU上进行的训练和测试结果表明,相比于A2J,文中所提方法的平均误差分别降低了0.09 mm和0.15 mm。在GTX1080Ti显卡上实现了151 frame·s^(-1)的检测速率,满足人机协作任务对于实时性的要求。 展开更多
关键词 人机协作 三维手部关键点检测 锚点 深度图 全局-局部特征融合 ShuffleNetv2 深度可分离卷积 高效通道注意力
下载PDF
上一页 1 2 43 下一页 到第
使用帮助 返回顶部