期刊文献+
共找到420篇文章
< 1 2 21 >
每页显示 20 50 100
An Expert System to Detect Political Arabic Articles Orientation Using CatBoost Classifier Boosted by Multi-Level Features
1
作者 Saad M.Darwish Abdul Rahman M.Sabri +1 位作者 Dhafar Hamed Abd Adel A.Elzoghabi 《Computer Systems Science & Engineering》 2024年第6期1595-1624,共30页
The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orient... The number of blogs and other forms of opinionated online content has increased dramatically in recent years.Many fields,including academia and national security,place an emphasis on automated political article orientation detection.Political articles(especially in the Arab world)are different from other articles due to their subjectivity,in which the author’s beliefs and political affiliation might have a significant influence on a political article.With categories representing the main political ideologies,this problem may be thought of as a subset of the text categorization(classification).In general,the performance of machine learning models for text classification is sensitive to hyperparameter settings.Furthermore,the feature vector used to represent a document must capture,to some extent,the complex semantics of natural language.To this end,this paper presents an intelligent system to detect political Arabic article orientation that adapts the categorical boosting(CatBoost)method combined with a multi-level feature concept.Extracting features at multiple levels can enhance the model’s ability to discriminate between different classes or patterns.Each level may capture different aspects of the input data,contributing to a more comprehensive representation.CatBoost,a robust and efficient gradient-boosting algorithm,is utilized to effectively learn and predict the complex relationships between these features and the political orientation labels associated with the articles.A dataset of political Arabic texts collected from diverse sources,including postings and articles,is used to assess the suggested technique.Conservative,reform,and revolutionary are the three subcategories of these opinions.The results of this study demonstrate that compared to other frequently used machine learning models for text classification,the CatBoost method using multi-level features performs better with an accuracy of 98.14%. 展开更多
关键词 Political articles orientation detection CatBoost classifier multi-level features context-based classification social networks machine learning stylometric features
下载PDF
Feature-Based Aggregation and Deep Reinforcement Learning:A Survey and Some New Implementations 被引量:15
2
作者 Dimitri P.Bertsekas 《IEEE/CAA Journal of Automatica Sinica》 EI CSCD 2019年第1期1-31,共31页
In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinfor... In this paper we discuss policy iteration methods for approximate solution of a finite-state discounted Markov decision problem, with a focus on feature-based aggregation methods and their connection with deep reinforcement learning schemes. We introduce features of the states of the original problem, and we formulate a smaller "aggregate" Markov decision problem, whose states relate to the features. We discuss properties and possible implementations of this type of aggregation, including a new approach to approximate policy iteration. In this approach the policy improvement operation combines feature-based aggregation with feature construction using deep neural networks or other calculations. We argue that the cost function of a policy may be approximated much more accurately by the nonlinear function of the features provided by aggregation, than by the linear function of the features provided by neural networkbased reinforcement learning, thereby potentially leading to more effective policy improvement. 展开更多
关键词 REINFORCEMENT learning dynamic programming Markovian decision problems aggregation feature-based ARCHITECTURES policy ITERATION DEEP neural networks rollout algorithms
下载PDF
Point Cloud Classification Using Content-Based Transformer via Clustering in Feature Space 被引量:2
3
作者 Yahui Liu Bin Tian +2 位作者 Yisheng Lv Lingxi Li Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期231-239,共9页
Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to est... Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention,but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space(content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. Especially, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectN N. Source code of this paper is available at https://github.com/yahuiliu99/PointC onT. 展开更多
关键词 Content-based Transformer deep learning feature aggregator local attention point cloud classification
下载PDF
Modelling the temporal-varied nonlinear velocity profile of debris flow using a stratification aggregation algorithm in 3D-HBP-SPH framework
4
作者 HAN Zheng XIE Wendu +5 位作者 ZENG Chuicheng LI Yange CHEN Guangqi CHEN Ningsheng HU Guisheng WANG Weidong 《Journal of Mountain Science》 SCIE CSCD 2024年第12期3945-3960,共16页
Estimation of velocity profile within mud depth is a long-standing and essential problem in debris flow dynamics.Until now,various velocity profiles have been proposed based on the fitting analysis of experimental mea... Estimation of velocity profile within mud depth is a long-standing and essential problem in debris flow dynamics.Until now,various velocity profiles have been proposed based on the fitting analysis of experimental measurements,but these are often limited by the observation conditions,such as the number of configured sensors.Therefore,the resulting linear velocity profiles usually exhibit limitations in reproducing the temporal-varied and nonlinear behavior during the debris flow process.In this study,we present a novel approach to explore the debris flow velocity profile in detail upon our previous 3D-HBPSPH numerical model,i.e.,the three-dimensional Smoothed Particle Hydrodynamic model incorporating the Herschel-Bulkley-Papanastasiou rheology.Specifically,we propose a stratification aggregation algorithm for interpreting the details of SPH particles,which enables the recording of temporal velocities of debris flow at different mud depths.To analyze the velocity profile,we introduce a logarithmic-based nonlinear model with two key parameters,that a controlling the shape of velocity profile and b concerning its temporal evolution.We verify the proposed velocity profile and explore its sensitivity using 34 sets of velocity data from three individual flume experiments in previous literature.Our results demonstrate that the proposed temporalvaried nonlinear velocity profile outperforms the previous linear profiles. 展开更多
关键词 Debris flow Velocity profile Temporal varied feature NONLINEAR Stratification aggregation algorithm
下载PDF
Online identification and extraction method of regional large-scale adjustable load-aggregation characteristics
5
作者 Siwei Li Liang Yue +1 位作者 Xiangyu Kong Chengshan Wang 《Global Energy Interconnection》 EI CSCD 2024年第3期313-323,共11页
This article introduces the concept of load aggregation,which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems.The online ide... This article introduces the concept of load aggregation,which involves a comprehensive analysis of loads to acquire their external characteristics for the purpose of modeling and analyzing power systems.The online identification method is a computer-involved approach for data collection,processing,and system identification,commonly used for adaptive control and prediction.This paper proposes a method for dynamically aggregating large-scale adjustable loads to support high proportions of new energy integration,aiming to study the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction methods.The experiment selected 300 central air conditioners as the research subject and analyzed their regulation characteristics,economic efficiency,and comfort.The experimental results show that as the adjustment time of the air conditioner increases from 5 minutes to 35 minutes,the stable adjustment quantity during the adjustment period decreases from 28.46 to 3.57,indicating that air conditioning loads can be controlled over a long period and have better adjustment effects in the short term.Overall,the experimental results of this paper demonstrate that analyzing the aggregation characteristics of regional large-scale adjustable loads using online identification techniques and feature extraction algorithms is effective. 展开更多
关键词 Load aggregation Regional large-scale Online recognition feature extraction method
下载PDF
Multi-Level Feature-Based Ensemble Model for Target-Related Stance Detection
6
作者 Shi Li Xinyan Cao Yiting Nan 《Computers, Materials & Continua》 SCIE EI 2020年第10期777-788,共12页
Stance detection is the task of attitude identification toward a standpoint.Previous work of stance detection has focused on feature extraction but ignored the fact that irrelevant features exist as noise during highe... Stance detection is the task of attitude identification toward a standpoint.Previous work of stance detection has focused on feature extraction but ignored the fact that irrelevant features exist as noise during higher-level abstracting.Moreover,because the target is not always mentioned in the text,most methods have ignored target information.In order to solve these problems,we propose a neural network ensemble method that combines the timing dependence bases on long short-term memory(LSTM)and the excellent extracting performance of convolutional neural networks(CNNs).The method can obtain multi-level features that consider both local and global features.We also introduce attention mechanisms to magnify target information-related features.Furthermore,we employ sparse coding to remove noise to obtain characteristic features.Performance was improved by using sparse coding on the basis of attention employment and feature extraction.We evaluate our approach on the SemEval-2016Task 6-A public dataset,achieving a performance that exceeds the benchmark and those of participating teams. 展开更多
关键词 ATTENTION sparse coding multi-level features ensemble model
下载PDF
MIA-UNet:Multi-Scale Iterative Aggregation U-Network for Retinal Vessel Segmentation 被引量:2
7
作者 Linfang Yu Zhen Qin +1 位作者 Yi Ding Zhiguang Qin 《Computer Modeling in Engineering & Sciences》 SCIE EI 2021年第11期805-828,共24页
As an important part of the new generation of information technology,the Internet of Things(IoT)has been widely concerned and regarded as an enabling technology of the next generation of health care system.The fundus ... As an important part of the new generation of information technology,the Internet of Things(IoT)has been widely concerned and regarded as an enabling technology of the next generation of health care system.The fundus photography equipment is connected to the cloud platform through the IoT,so as to realize the realtime uploading of fundus images and the rapid issuance of diagnostic suggestions by artificial intelligence.At the same time,important security and privacy issues have emerged.The data uploaded to the cloud platform involves more personal attributes,health status and medical application data of patients.Once leaked,abused or improperly disclosed,personal information security will be violated.Therefore,it is important to address the security and privacy issues of massive medical and healthcare equipment connecting to the infrastructure of IoT healthcare and health systems.To meet this challenge,we propose MIA-UNet,a multi-scale iterative aggregation U-network,which aims to achieve accurate and efficient retinal vessel segmentation for ophthalmic auxiliary diagnosis while ensuring that the network has low computational complexity to adapt to mobile terminals.In this way,users do not need to upload the data to the cloud platform,and can analyze and process the fundus images on their own mobile terminals,thus eliminating the leakage of personal information.Specifically,the interconnection between encoder and decoder,as well as the internal connection between decoder subnetworks in classic U-Net are redefined and redesigned.Furthermore,we propose a hybrid loss function to smooth the gradient and deal with the imbalance between foreground and background.Compared with the UNet,the segmentation performance of the proposed network is significantly improved on the premise that the number of parameters is only increased by 2%.When applied to three publicly available datasets:DRIVE,STARE and CHASE DB1,the proposed network achieves the accuracy/F1-score of 96.33%/84.34%,97.12%/83.17%and 97.06%/84.10%,respectively.The experimental results show that the MIA-UNet is superior to the state-of-the-art methods. 展开更多
关键词 Retinal vessel segmentation security and privacy redesigned skip connection feature maps aggregation hybrid loss function
下载PDF
Traffic Scene Captioning with Multi-Stage Feature Enhancement
8
作者 Dehai Zhang Yu Ma +3 位作者 Qing Liu Haoxing Wang Anquan Ren Jiashu Liang 《Computers, Materials & Continua》 SCIE EI 2023年第9期2901-2920,共20页
Traffic scene captioning technology automatically generates one or more sentences to describe the content of traffic scenes by analyzing the content of the input traffic scene images,ensuring road safety while providi... Traffic scene captioning technology automatically generates one or more sentences to describe the content of traffic scenes by analyzing the content of the input traffic scene images,ensuring road safety while providing an important decision-making function for sustainable transportation.In order to provide a comprehensive and reasonable description of complex traffic scenes,a traffic scene semantic captioningmodel withmulti-stage feature enhancement is proposed in this paper.In general,the model follows an encoder-decoder structure.First,multilevel granularity visual features are used for feature enhancement during the encoding process,which enables the model to learn more detailed content in the traffic scene image.Second,the scene knowledge graph is applied to the decoding process,and the semantic features provided by the scene knowledge graph are used to enhance the features learned by the decoder again,so that themodel can learn the attributes of objects in the traffic scene and the relationships between objects to generate more reasonable captions.This paper reports extensive experiments on the challenging MS-COCO dataset,evaluated by five standard automatic evaluation metrics,and the results show that the proposed model has improved significantly in all metrics compared with the state-of-the-art methods,especially achieving a score of 129.0 on the CIDEr-D evaluation metric,which also indicates that the proposed model can effectively provide a more reasonable and comprehensive description of the traffic scene. 展开更多
关键词 Traffic scene captioning sustainable transportation feature enhancement encoder-decoder structure multi-level granularity scene knowledge graph
下载PDF
Supervised Feature Learning for Offline Writer Identification Using VLAD and Double Power Normalization
9
作者 Dawei Liang Meng Wu Yan Hu 《Computers, Materials & Continua》 SCIE EI 2023年第7期279-293,共15页
As an indispensable part of identity authentication,offline writer identification plays a notable role in biology,forensics,and historical document analysis.However,identifying handwriting efficiently,stably,and quick... As an indispensable part of identity authentication,offline writer identification plays a notable role in biology,forensics,and historical document analysis.However,identifying handwriting efficiently,stably,and quickly is still challenging due to the method of extracting and processing handwriting features.In this paper,we propose an efficient system to identify writers through handwritten images,which integrates local and global features from similar handwritten images.The local features are modeled by effective aggregate processing,and global features are extracted through transfer learning.Specifically,the proposed system employs a pre-trained Residual Network to mine the relationship between large image sets and specific handwritten images,while the vector of locally aggregated descriptors with double power normalization is employed in aggregating local and global features.Moreover,handwritten image segmentation,preprocessing,enhancement,optimization of neural network architecture,and normalization for local and global features are exploited,significantly improving system performance.The proposed system is evaluated on Computer Vision Lab(CVL)datasets and the International Conference on Document Analysis and Recognition(ICDAR)2013 datasets.The results show that it represents good generalizability and achieves state-of-the-art performance.Furthermore,the system performs better when training complete handwriting patches with the normalization method.The experimental result indicates that it’s significant to segment handwriting reasonably while dealing with handwriting overlap,which reduces visual burstiness. 展开更多
关键词 Writer identification power normalization vector of locally aggregated descriptors feature extraction
下载PDF
EGSNet:An Efficient Glass Segmentation Network Based on Multi-Level Heterogeneous Architecture and Boundary Awareness
10
作者 Guojun Chen Tao Cui +1 位作者 Yongjie Hou Huihui Li 《Computers, Materials & Continua》 SCIE EI 2024年第12期3969-3987,共19页
Existing glass segmentation networks have high computational complexity and large memory occupation,leading to high hardware requirements and time overheads for model inference,which is not conducive to efficiency-see... Existing glass segmentation networks have high computational complexity and large memory occupation,leading to high hardware requirements and time overheads for model inference,which is not conducive to efficiency-seeking real-time tasks such as autonomous driving.The inefficiency of the models is mainly due to employing homogeneous modules to process features of different layers.These modules require computationally intensive convolutions and weight calculation branches with numerous parameters to accommodate the differences in information across layers.We propose an efficient glass segmentation network(EGSNet)based on multi-level heterogeneous architecture and boundary awareness to balance the model performance and efficiency.EGSNet divides the feature layers from different stages into low-level understanding,semantic-level understanding,and global understanding with boundary guidance.Based on the information differences among the different layers,we further propose the multi-angle collaborative enhancement(MCE)module,which extracts the detailed information from shallow features,and the large-scale contextual feature extraction(LCFE)module to understand semantic logic through deep features.The models are trained and evaluated on the glass segmentation datasets HSO(Home-Scene-Oriented)and Trans10k-stuff,respectively,and EGSNet achieves the best efficiency and performance compared to advanced methods.In the HSO test set results,the IoU,Fβ,MAE(Mean Absolute Error),and BER(Balance Error Rate)of EGSNet are 0.804,0.847,0.084,and 0.085,and the GFLOPs(Giga Floating Point Operations Per Second)are only 27.15.Experimental results show that EGSNet significantly improves the efficiency of the glass segmentation task with better performance. 展开更多
关键词 Image segmentation multi-level heterogeneous architecture feature differences
下载PDF
基于层次特征增强的细粒度点云分类
11
作者 白静 刘路 +1 位作者 郑虎 蒋金哲 《浙江大学学报(理学版)》 北大核心 2025年第1期70-80,共11页
针对粗粒度点云分类方法在细粒度数据集中局部特征提取不足的问题,提出了一种基于层次特征增强的三维细粒度点云分类网络(HFE-Net)。基于Veronese映射的点特征增强模块(V-PE)对点云数据进行数据增强,辅助网络学习法线和姿态高阶信息;基... 针对粗粒度点云分类方法在细粒度数据集中局部特征提取不足的问题,提出了一种基于层次特征增强的三维细粒度点云分类网络(HFE-Net)。基于Veronese映射的点特征增强模块(V-PE)对点云数据进行数据增强,辅助网络学习法线和姿态高阶信息;基于多尺度上下文感知的簇内特征增强模块(CA-IntraCE),利用不同尺度的K近邻(K-nearest neighbors,KNN)算法以及交叉注意力实现不同尺度特征的增强,以消除最大池化带来的信息丢失;基于分组稀疏采样的簇间特征增强模块(GSS-InterCE),利用最远点采样(FPS)算法获得稀疏点,并采用交叉注意力实验不同簇间的特征增强,从而提高网络的细粒度判别能力。在FG3D数据集Airplane、Car和Chair 3个类别上的实验结果显示,HFE-Net的总体准确率分别达97.40%,80.53%和83.83%,已超过现有最优方法DC-Net、FGPNet的分类框架,说明HFE-Net的分类性能具有一定的优越性。 展开更多
关键词 三维点云 细粒度分类 交叉注意力 特征增强
下载PDF
基于多尺度融合金字塔焦点网络的接触网零部件检测
12
作者 朱新宇 崔浩锐 宋洋 《工程科学学报》 EI 北大核心 2025年第2期315-327,共13页
作为高铁牵引供电系统的重要组成部分,接触网系统承担着向动车组传输电能的重要功能.实际工程运营表明,受弓网交互产生的持续冲击以及外部环境的影响,接触网支撑部件可能会出现“松、脱、断、裂”等缺陷,导致接触网结构可靠性下降,严重... 作为高铁牵引供电系统的重要组成部分,接触网系统承担着向动车组传输电能的重要功能.实际工程运营表明,受弓网交互产生的持续冲击以及外部环境的影响,接触网支撑部件可能会出现“松、脱、断、裂”等缺陷,导致接触网结构可靠性下降,严重影响接触网系统稳定运行.因此,及时精确定位接触网支撑部件(CSCs),对保障高铁安全运行和完善接触网检修维护策略具有重大意义.然而,CSCs的检测通常面临着零部件种类多、尺度差异大、部分零部件微小的问题.针对以上问题,本文提出一种基于多尺度融合金字塔焦点网络的接触网零部件检测算法,将平衡模块和特征金字塔模块相结合,提高对小目标的检测性能.首先,设计了可分离残差金字塔聚合模块(SRPAM),用于优化模型多尺度特征提取能力、扩大感受野,缓解CSCs检测的多尺度问题;其次,设计了一种基于平衡特征金字塔的路径聚合网络(PA-BFPN),用于提升跨层特征融合效率和小目标检测性能.最后,通过对比试验、可视化实验和消融实验证明了所提方法的有效性和优越性.其中,所提的MFPFCOS在CSCs数据集上的检测精度(mAP)能够在达到48.6%的同时,实现30的FLOPs(Floating point operations per second),表明所提方法能够在检测精度和检测速度之间保持良好的平衡. 展开更多
关键词 深度学习 目标检测 接触网支撑组件(CSCs) 路径聚合特征金字塔(PA-FPN) 空洞空间卷积池化金字塔(ASPP)
下载PDF
基于多边特征引导聚合网络的变化检测算法
13
作者 冯星宇 朱灵龙 +3 位作者 张永宏 阚希 曹海啸 马光义 《计算机工程与应用》 北大核心 2025年第3期264-274,共11页
现有的遥感图像变化检测方法主要依赖卷积神经网络(convolutional neural network,CNN)或Transformer进行构建,但这些方法通常未能充分平衡这两种技术的优缺点,并且往往没有专门针对变化检测的任务特性(对变化区域特征信息进行提取学习... 现有的遥感图像变化检测方法主要依赖卷积神经网络(convolutional neural network,CNN)或Transformer进行构建,但这些方法通常未能充分平衡这两种技术的优缺点,并且往往没有专门针对变化检测的任务特性(对变化区域特征信息进行提取学习)进行优化设计。针对这一问题,充分利用了Transformer的全局信息处理能力和CNN的局部信息捕获能力,提出了一种充分结合两者各自优势并由多条支路组成的多边特征引导聚合网络模型,该模型通过基于Transformer的主网络来对图像的全局信息进行提取,通过设计的基于CNN的多尺度特征提取模块来对图像的局部信息进行提取,通过特征聚合网络将图像的全局信息分别与变化和未变化区域信息进行聚合后输出得到预测结果图。为了验证模型的有效性,构建了一个包含多种地表覆盖类型,涵盖不同季节的新的遥感图像变化检测数据集。同时在实验部分也利用两个公开数据集来进一步验证了模型的泛化性和鲁棒性。实验结果表明,与现有先进方法相比,所提算法在三个数据集上的平均交并比(mean intersection over union,MIoU)指标分别提高了0.83、0.71、0.7个百分点。 展开更多
关键词 遥感图像 变化检测 卷积神经网络 TRANSFORMER 特征聚合
下载PDF
基于自适应差异化图卷积的图注意力网络表示学习算法
14
作者 吴誉兰 舒建文 《现代电子技术》 北大核心 2025年第2期51-54,共4页
为解决传统图卷积网络在处理节点间复杂关系时存在的局限性,提出一种基于自适应差异化图卷积的图注意力网络表示学习算法。采用差异化图卷积网络,依据每个节点自身特征和邻居信息进行差异化采样,捕捉节点间的复杂关系;再结合二阶段关键... 为解决传统图卷积网络在处理节点间复杂关系时存在的局限性,提出一种基于自适应差异化图卷积的图注意力网络表示学习算法。采用差异化图卷积网络,依据每个节点自身特征和邻居信息进行差异化采样,捕捉节点间的复杂关系;再结合二阶段关键相邻采样方式优先挖掘重要节点并保留随机性,完成关键邻居节点的采样;然后结合图注意力网络,通过局部关注和自适应学习权重分配将关键邻居节点特征聚合到自身节点上,增强节点的特征表示;最后经网络训练,进一步增强网络表示学习能力。实验结果表明,所提出的算法优化了节点聚合程度和边界清晰度,提高了节点分类的准确性和可视化效果,并且通过关注二阶邻居和使用双头注意力,在网络表示学习上也展现出了优越性能。 展开更多
关键词 网络表示学习 图卷积网络 自适应差异化机制 节点采样 特征聚合 网络训练 图注意力网络
下载PDF
RDG-Net:基于双阶段解码器的结直肠息肉图像分割模型
15
作者 谭山湖 郭小燕 魏伟一 《中国医学物理学杂志》 2025年第1期52-58,共7页
基于深度学习的息肉图像分割可以有效帮助医生评估癌前病变,本文针对结直肠息肉图像中息肉边界不清晰时分割效果不佳、对新样本范化能力不足的问题,提出一种基于双阶段解码器的结直肠息肉图像分割模型RDG-Net。该模型采用Res2Net-50作... 基于深度学习的息肉图像分割可以有效帮助医生评估癌前病变,本文针对结直肠息肉图像中息肉边界不清晰时分割效果不佳、对新样本范化能力不足的问题,提出一种基于双阶段解码器的结直肠息肉图像分割模型RDG-Net。该模型采用Res2Net-50作为编码器以提高图像分割精度。解码器分为两个阶段,第一阶段利用4层多尺度特征聚合模块整合不同阶段编码器提取的特征,第二阶段通过3层并行卷积融合模块增强解码器第一阶段输出的图像特征并解码至更高分辨率作为模型的最终输出结果。采用CVC-ClinicDB和Kvasir-SEG数据集的训练集数据进行模型训练,并采用CVCClinicDB与Kvasir-SEG数据集以及未参与训练的CVC300和ETIS-LaribPolypDB数据集分别对模型进行测试。测试结果显示,CVC-ClinicDB与Kvasir-SEG数据集上准确率、精确度、召回率、Dice系数、交并比和F2的平均值分别为98.41%、94.25%、92.62%、93.42%、87.69%、92.93%,CVC300和ETIS-LaribPolypDB数据集上各评价指标的平均结果分别为99.05%、87.79%、89.13%、88.39%、79.33%、88.82%。实验结果表明RDG-Net模型在结直肠息肉区域的分割任务中表现出色,在新数据集上表现出较好的泛化能力。 展开更多
关键词 图像分割 结直肠息肉 多尺度特征聚合 并行卷积
下载PDF
用于水下图像增强的多层级联融合增强网络
16
作者 王炫钧 邵菲 马彦卿 《电光与控制》 北大核心 2025年第1期68-73,共6页
为了提高水下无人潜航器决策的准确性,构建了一种用于增强水下图像的多层级联融合增强网络。首先,设计注意力引导的色彩增强模块并结合多层级联增强架构,在提取图像多尺度特征的同时,加强特征重用;其次,设计全局调整模块,将Swin Transfo... 为了提高水下无人潜航器决策的准确性,构建了一种用于增强水下图像的多层级联融合增强网络。首先,设计注意力引导的色彩增强模块并结合多层级联增强架构,在提取图像多尺度特征的同时,加强特征重用;其次,设计全局调整模块,将Swin Transformer与扩张卷积相结合,提升网络对于退化图像整体增强效果;最后,将各模块提取到的特征信息经由三重特征聚合模块进行融合增强,得到增强水下图像。为了可以更好地训练模型,构造了联合损失函数。与其他水下图像增强方法的对比实验结果表明,所提方法对于水下图像存在的色偏、模糊等问题均有着良好的增强效果,并对后续特征提取任务的完成有着较大的促进作用。 展开更多
关键词 水下无人潜航器 水下图像增强 多层级联增强架构 全局调整 三重特征聚合
下载PDF
一种针对SAR图像的舰船目标检测算法
17
作者 孟凡龙 齐向阳 范怀涛 《电光与控制》 北大核心 2025年第1期74-79,共6页
由于环境复杂、舰船目标散焦和尺度的多样性,基于SAR图像的舰船目标检测仍然存在一些问题。提出了一种针对SAR图像的舰船目标检测算法。首先,基于可变形卷积构建舰船目标特征细化模块,提高对大长宽比姿态的舰船目标的特征提取能力;其次... 由于环境复杂、舰船目标散焦和尺度的多样性,基于SAR图像的舰船目标检测仍然存在一些问题。提出了一种针对SAR图像的舰船目标检测算法。首先,基于可变形卷积构建舰船目标特征细化模块,提高对大长宽比姿态的舰船目标的特征提取能力;其次,在主干网络末尾引入了舰船空间金字塔聚合结构,增强对舰船目标的全局特征提取能力;最后,设计了尺度扩展特征金字塔网络,增强舰船浅层和深层特征信息的交互,提高对多尺度舰船目标的检测能力。实验结果表明,所提算法在HRSID数据集上的mAP达到了93.72%,F1分数达到了89.70%,优于所有比较算法,具有良好的检测效果。 展开更多
关键词 SAR图像 舰船检测 可变形卷积 舰船空间金字塔聚合结构 尺度扩展特征金字塔网络
下载PDF
基于MYOLOv8的目标检测方法
18
作者 张正勃 曹爱岷 王兴盛 《计算机测量与控制》 2025年第1期93-98,113,共7页
针对当前的目标检测方法难以表征多尺度上下文特征的问题,提出了MYOLOv8算法;为了提高YOLOv8模型对于小、中、大型目标的检测能力,提出了一种分层多尺度提取模块对空间特征进行分层特征聚合来捕获多尺度空间上下文信息;为了进一步提高... 针对当前的目标检测方法难以表征多尺度上下文特征的问题,提出了MYOLOv8算法;为了提高YOLOv8模型对于小、中、大型目标的检测能力,提出了一种分层多尺度提取模块对空间特征进行分层特征聚合来捕获多尺度空间上下文信息;为了进一步提高模型对于空间语义的提取能力,提出了一种自适应的通道注意力机制,该机制通过自适应地学习相邻通道之间的相互依赖关系来促进模型关注有用特征,抑制无用特征;为了提高模型对于边界困难样本的定位能力,提出了一种Slide Loss来处理目标检测中的样本不平衡问题,该方法采用对困难样本进行强加权的方式来促使模型着重优化难分样本;在MS COCO数据集上的实验结果表明,所提出的算法相比于YOLOv8-n和YOLOv8-s,mAP分别提升了3.4%和1.4%,同时具有相似的参数量和计算开销,以及更快的推理速度。 展开更多
关键词 目标检测 多尺度上下文 分层特征聚合 注意力机制 样本不平衡
下载PDF
ST-SIGMA:Spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting
19
作者 Yang Fang Bei Luo +3 位作者 Ting Zhao Dong He Bingbing Jiang Qilie Liu 《CAAI Transactions on Intelligence Technology》 SCIE EI 2022年第4期744-757,共14页
Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges... Scene perception and trajectory forecasting are two fundamental challenges that are crucial to a safe and reliable autonomous driving(AD)system.However,most proposed methods aim at addressing one of the two challenges mentioned above with a single model.To tackle this dilemma,this paper proposes spatio-temporal semantics and interaction graph aggregation for multi-agent perception and trajectory forecasting(STSIGMA),an efficient end-to-end method to jointly and accurately perceive the AD environment and forecast the trajectories of the surrounding traffic agents within a unified framework.ST-SIGMA adopts a trident encoder-decoder architecture to learn scene semantics and agent interaction information on bird’s-eye view(BEV)maps simultaneously.Specifically,an iterative aggregation network is first employed as the scene semantic encoder(SSE)to learn diverse scene information.To preserve dynamic interactions of traffic agents,ST-SIGMA further exploits a spatio-temporal graph network as the graph interaction encoder.Meanwhile,a simple yet efficient feature fusion method to fuse semantic and interaction features into a unified feature space as the input to a novel hierarchical aggregation decoder for downstream prediction tasks is designed.Extensive experiments on the nuScenes data set have demonstrated that the proposed ST-SIGMA achieves significant improvements compared to the state-of-theart(SOTA)methods in terms of scene perception and trajectory forecasting,respectively.Therefore,the proposed approach outperforms SOTA in terms of model generalisation and robustness and is therefore more feasible for deployment in realworld AD scenarios. 展开更多
关键词 feature fusion graph interaction hierarchical aggregation scene perception scene semantics trajectory forecasting
下载PDF
Space-time video super-resolution using long-term temporal feature aggregation
20
作者 Kuanhao Chen Zijie Yue Miaojing Shi 《Autonomous Intelligent Systems》 EI 2023年第1期75-83,共9页
Space-time video super-resolution(STVSR)serves the purpose to reconstruct high-resolution high-frame-rate videos from their low-resolution low-frame-rate counterparts.Recent approaches utilize end-to-end deep learning... Space-time video super-resolution(STVSR)serves the purpose to reconstruct high-resolution high-frame-rate videos from their low-resolution low-frame-rate counterparts.Recent approaches utilize end-to-end deep learning models to achieve STVSR.They first interpolate intermediate frame features between given frames,then perform local and global refinement among the feature sequence,and finally increase the spatial resolutions of these features.However,in the most important feature interpolation phase,they only capture spatial-temporal information from the most adjacent frame features,ignoring modelling long-term spatial-temporal correlations between multiple neighbouring frames to restore variable-speed object movements and maintain long-term motion continuity.In this paper,we propose a novel long-term temporal feature aggregation network(LTFA-Net)for STVSR.Specifically,we design a long-term mixture of experts(LTMoE)module for feature interpolation.LTMoE contains multiple experts to extract mutual and complementary spatial-temporal information from multiple consecutive adjacent frame features,which are then combined with different weights to obtain interpolation results using several gating nets.Next,we perform local and global feature refinement using the Locally-temporal Feature Comparison(LFC)module and bidirectional deformable ConvLSTM layer,respectively.Experimental results on two standard benchmarks,Adobe240 and GoPro,indicate the effectiveness and superiority of our approach over state of the art. 展开更多
关键词 Space-time video super-resolution Mixture of experts Deformable convolutional layer Long-term temporal feature aggregation
原文传递
上一页 1 2 21 下一页 到第
使用帮助 返回顶部