241,780 articles found
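Fault Diagnosis of Wind Turbine Gearboxes Based on MTF-Swin Transformer
1
Authors: 张彬桥, 雷钧, 万刚 《可再生能源》 CAS CSCD PKU Core, 2024, No. 5, pp. 627-633 (7 pages)
To address the low recognition accuracy of traditional fault diagnosis methods for wind turbine gearboxes, whose real operating conditions are complex, variable, and subject to strong noise, this paper proposes an MTF-Swin Transformer fault diagnosis model for wind turbine gearboxes. First, the Markov transition field (MTF) image encoding method converts the raw one-dimensional vibration time series into two-dimensional feature maps that retain temporal correlation information. Next, the feature maps are fed into a Swin Transformer model, which extracts features automatically through self-attention. Finally, the different fault types are classified. Simulation results show that the method reaches a gearbox fault diagnosis accuracy of 99.48%, demonstrating its effectiveness and superiority.
Keywords: Markov transition field (MTF); Swin Transformer; wind turbine gearbox; fault diagnosis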
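The Markov transition field encoding named in this abstract can be illustrated with a short sketch. This is a minimal pure-Python version written for this listing, not the authors' implementation: the function name `markov_transition_field`, the `n_bins` parameter, and the quantile binning rule are all assumptions, and the abstract does not state which settings the paper uses.

```python
from bisect import bisect_right


def markov_transition_field(series, n_bins=4):
    """Encode a 1-D series as a 2-D Markov transition field (list of lists)."""
    if not series:
        return []
    n = len(series)
    ordered = sorted(series)
    # Quantile bin edges: n_bins - 1 interior cut points taken from the sorted data.
    edges = [ordered[(k * n) // n_bins] for k in range(1, n_bins)]
    # Assign each sample to a bin index in [0, n_bins - 1].
    bins = [bisect_right(edges, x) for x in series]
    # Count first-order transitions between consecutive samples.
    counts = [[0] * n_bins for _ in range(n_bins)]
    for a, b in zip(bins, bins[1:]):
        counts[a][b] += 1
    # Row-normalise the counts into transition probabilities.
    trans = []
    for row in counts:
        total = sum(row)
        trans.append([c / total if total else 0.0 for c in row])
    # Entry [i][j] is the probability of moving from the bin of sample i
    # to the bin of sample j, so the image keeps the time ordering on both axes.
    return [[trans[bi][bj] for bj in bins] for bi in bins]


if __name__ == "__main__":
    field = markov_transition_field([0, 1, 2, 3, 0, 1, 2, 3], n_bins=4)
    print(len(field), len(field[0]))
```

The resulting square matrix can be treated as a single-channel image, which is the kind of input an image model such as a Swin Transformer would consume.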
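Text Sentiment Analysis Based on TF-IDF and a Multi-Head Attention Transformer Model (cited by: 9)
2
Authors: 高佳希, 黄海燕 《华东理工大学学报(自然科学版)》 CAS CSCD PKU Core, 2024, No. 1, pp. 129-136 (8 pages)
Text sentiment analysis aims to analyze, process, summarize, and reason about subjective texts that carry emotional coloring, and is an important task in natural language processing. Because existing computational methods cannot adequately handle text datasets with high complexity and confusability, a text sentiment analysis model based on TF-IDF (Term Frequency-Inverse Document Frequency) and a multi-head attention Transformer model is proposed. In the text preprocessing stage, the TF-IDF algorithm performs a preliminary screening of the words that most influence a text's sentiment orientation, discarding common stop words and proper nouns from the text's domain that have little influence on sentiment orientation. The encoder of a multi-head attention Transformer model then extracts features, capturing important semantic information within the text and improving the model's semantic analysis and generalization ability. The model achieves 98.17% accuracy on a multi-domain, multi-type review corpus dataset.
Keywords: text sentiment analysis; natural language processing; multi-head attention mechanism; TF-IDF algorithm; Transformer model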
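The TF-IDF screening step described in this abstract can be sketched roughly as follows. This is an illustrative pure-Python version under stated assumptions, not the paper's procedure: the helper names `tfidf_scores` and `keep_top_terms`, the unsmoothed `log(N / df)` weighting, and the top-`k` cutoff are choices made here, since the abstract does not give the exact formula or threshold.

```python
import math
from collections import Counter


def tfidf_scores(docs):
    """Return one {term: tf-idf} dict per tokenised document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        tf = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return scores


def keep_top_terms(doc, score, k):
    """Keep the k highest-scoring distinct terms, preserving token order."""
    top = set(sorted(score, key=score.get, reverse=True)[:k])
    return [t for t in doc if t in top]


if __name__ == "__main__":
    docs = [
        ["the", "film", "was", "great"],
        ["the", "film", "was", "awful"],
        ["the", "plot", "was", "great"],
    ]
    scores = tfidf_scores(docs)
    print(keep_top_terms(docs[0], scores[0], k=2))
```

Terms that occur in every document (here "the" and "was") score zero and drop out first, which is the stop-word-like filtering effect the abstract refers to.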
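Short-Term Power Load Forecasting for Intelligent Education Platforms Based on SF-Transformer
3
Authors: 冯艳丽, 周宇, 黄福兴, 万俊岭, 袁培森 《华东师范大学学报(自然科学版)》 CAS CSCD PKU Core, 2024, No. 5, pp. 173-182 (10 pages)
Building intelligent education platforms is an important step in making education intelligent, but the artificial intelligence models these platforms rely on consume large amounts of electricity during training, so short-term power load forecasting is important to building such platforms. When multiple attributes are considered in short-term load forecasting, prediction is not accurate enough because some attributes correlate only weakly with the load data and because the Transformer cannot capture the temporal correlation of the load data. To address this, a short-term power load forecasting model, SF-Transformer, is proposed based on the SR (Székely and Rizzo) distance correlation coefficient, a fused temporal-positional encoding, and the Transformer. SF-Transformer uses the SR distance correlation coefficient to screen the attributes that affect the load data, selecting those with a relatively large SR distance correlation coefficient with the load data. It adopts a fused temporal-positional encoding that combines a global time encoding with a local positional encoding, which helps the model capture the temporal and positional information of the load data comprehensively. Experiments on a dataset show that, compared with other models, SF-Transformer achieves lower root mean square error and mean absolute error when forecasting power load over two time horizons.
Keywords: intelligent education platform; short-term power load forecasting; SR distance correlation coefficient; fused temporal-positional encoding; Transformer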
SMSTracker: A Self-Calibration Multi-Head Self-Attention Transformer for Visual Object Tracking
4
Authors: Zhongyang Wang, Hu Zhu, Feng Liu 《Computers, Materials & Continua》 SCIE EI, 2024, No. 7, pp. 605-623 (19 pages)
Visual object tracking plays a crucial role in computer vision. In recent years, researchers have proposed various methods to achieve high-performance object tracking. Among these, methods based on Transformers have become a research hotspot due to their ability to globally model and contextualize information. However, current Transformer-based object tracking methods still face challenges such as low tracking accuracy and the presence of redundant feature information. In this paper, we introduce the self-calibration multi-head self-attention Transformer (SMSTracker) as a solution to these challenges. It employs a hybrid tensor decomposition self-organizing multi-head self-attention Transformer mechanism, which not only compresses and accelerates Transformer operations but also significantly reduces redundant data, thereby enhancing the accuracy and efficiency of tracking. Additionally, we introduce a self-calibration attention fusion block to resolve the attention ambiguities and inconsistencies common in traditional tracking methods, ensuring stable and reliable tracking performance across various scenarios. Experimental results show that SMSTracker achieves competitive performance in visual object tracking, demonstrating its potential to provide more robust and efficient tracking solutions in real-world applications.
Keywords: visual object tracking; tensor decomposition; Transformer; self-attention
Transformation Efficiency of Sulfur for a Mulberry Leaf-Silkworm Cocoon System in the Lower-Middle Reaches of the Yangtze River, China (cited by: 2)
5
Authors: ZHAO Yan-Wen, HU Zheng-Yi, CAO Zhi-Hong, J. D. BEATON, A. M. HENDERSON, M. X. FAN, XU Cheng-Kai 《Pedosphere》 SCIE CAS CSCD, 2005, No. 3, pp. 281-285 (5 pages)
Cocoon samples were collected from fifty-two mulberry gardens with high, intermediate, and low silkworm cocoon productivities in the lower-middle reaches of the Yangtze River, in the six Chinese provinces of Jiangsu, Jiangxi, Anhui, Fujian, Hunan, and Hubei, to determine the transformation efficiency of S from mulberry leaves to silkworm cocoons and to evaluate the sulfur cycle (uptake and output) in the mulberry leaf-silkworm cocoon system of typical mulberry gardens in the region. The transformation efficiency of sulfur (TES) from mulberry leaves into silkworm cocoons in the high-productivity mulberry gardens was significantly lower (P < 0.05) than that in the low-productivity gardens. For the high-productivity mulberry gardens, the TES from mulberry leaves into the cocoon shells was significantly higher (P < 0.05) than that for the low-yield mulberry gardens. Producing 1 kg of dry cocoon in mulberry gardens required uptake of about 20 g S; however, 1 kg of dry cocoon removed only about 4 g S. Therefore, recycling the organic wastes from silkworm cultivation is important for the sulfur balance.
Keywords: mulberry leaves; silkworm cocoon; sulfur; transformation efficiency
Cultivating Rice with Delaying Leaf-Senescence by P_(SAG12)-IPT Gene Transformation (cited by: 7)
6
Authors: 林拥军, 曹孟良, 徐才国, 陈浩, 魏君, 张启发 《Acta Botanica Sinica》 CSCD, 2002, No. 11, pp. 1333-1338 (6 pages)
The P_(SAG12)-IPT gene was introduced into an elite rice (Oryza sativa L. ssp. indica) restorer line, Minghui 63, through the Agrobacterium-mediated transformation method. Out of 61 independent transgenic plants obtained, a few acquired a recognizable phenotype in which leaf senescence was delayed to a great degree. The results of a field plot test on two homozygous transgenic lines indicated that: (1) the stay-green ability of the transgenic plants was significantly improved; (2) both the seed-setting rate and the number of panicles per plant of the transgenic plants were significantly increased compared with those of the non-transgenic plants of Minghui 63; and (3) the plant height of the transgenic plants was significantly reduced.
Keywords: Oryza sativa; Agrobacterium; transformation; P_(SAG12)-IPT; stay-green; agronomic traits
MCIF-Transformer Mask RCNN: Multi-Branch Cross-Scale Interactive Feature Fusion Transformer Model for PET/CT Lung Tumor Instance Segmentation
7
Authors: Huiling Lu, Tao Zhou 《Computers, Materials & Continua》 SCIE EI, 2024, No. 6, pp. 4371-4393 (23 pages)
The precise detection and segmentation of tumor lesions are very important for computer-aided diagnosis of lung cancer. However, in PET/CT (Positron Emission Tomography/Computed Tomography) lung images, the lesion shapes are complex, the edges are blurred, and the sample numbers are unbalanced. To solve these problems, this paper proposes a Multi-branch Cross-scale Interactive Feature fusion Transformer model (MCIF-Transformer Mask RCNN) for PET/CT lung tumor instance segmentation. The main innovations are as follows. First, a ResNet-Transformer backbone network extracts global and local features from lung images, and pixel dependence relationships are established in local and non-local fields to improve the model's perception ability. Second, a Cross-scale Interactive Feature Enhancement auxiliary network is designed to provide shallow features to the deep features, and the cross-scale interactive feature enhancement module (CIFEM) enhances attention to fine-grained features. Third, the Cross-scale Interactive Feature fusion FPN network (CIF-FPN) is constructed to realize bidirectional interactive fusion between deep and shallow features, and the low-level features are enhanced in deep semantic features. Finally, 4 ablation experiments, 3 detection comparison experiments, 3 segmentation comparison experiments, and 6 comparison experiments with two-stage and single-stage instance segmentation networks are carried out on PET/CT lung medical image datasets. The results show that the APdet, APseg, ARdet, and ARseg indexes improve by 5.5%, 5.15%, 3.11%, and 6.79%, respectively, compared with Mask RCNN (ResNet50). On this basis, the paper achieves precise detection and segmentation of the lesion region, and the method has positive significance for the detection of lung tumors.
Keywords: PET/CT images; instance segmentation; Mask RCNN; interactive fusion; Transformer
Efficient Vision Transformers for Autonomous Off-Road Perception Systems
8
Authors: Max H. Faykus III, Adam Pickeral, Ethan Marquez, Melissa C. Smith, Jon C. Calhoun 《Journal of Computer and Communications》 2024, No. 9, pp. 188-207 (20 pages)
The development of autonomous vehicles has become one of the greatest research endeavors in recent years. These vehicles rely on many complex systems working in tandem to make decisions. For practical use and safety reasons, these systems must not only be accurate, but also quickly detect changes in the surrounding environment. In autonomous vehicle research, the environment perception system is one of the key components of development. Environment perception systems allow the vehicle to understand its surroundings, using cameras and light detection and ranging (LiDAR) together with other sensor systems and modalities. Deep learning computer vision algorithms have been shown to be the strongest tool for translating camera data into accurate and safe traversability decisions regarding the environment surrounding a vehicle. For a vehicle to safely traverse an area in real time, these computer vision algorithms must be accurate and have low latency. While much research has studied autonomous driving in well-structured urban environments, limited research exists evaluating perception system improvements in off-road settings. This research investigates the adaptability of several existing deep-learning architectures for semantic segmentation in off-road environments. Previous studies of two Convolutional Neural Network (CNN) architectures are included for comparison with a new evaluation of Vision Transformer (ViT) architectures for semantic segmentation. Our results demonstrate the viability of ViT architectures for off-road perception systems, having a strong segmentation accuracy, lower inference speed and memory footprint compared to previous results with CNN architectures.
Keywords: semantic segmentation; off-road; Vision Transformers; CNNs; autonomous driving
DECAY RATE OF FOURIER TRANSFORMS OF SOME SELF-SIMILAR MEASURES
9
Authors: 高翔, 马际华 《Acta Mathematica Scientia》 SCIE CSCD, 2017, No. 6, pp. 1607-1618 (12 pages)
This paper is concerned with the Diophantine properties of the sequence {ξθ^n}, where 1 ≤ ξ < θ and θ is a rational or an algebraic integer. We establish a combinatorial proposition which can be used to study these two cases in the same manner. It is shown that the decay rate of the Fourier transforms of self-similar measures μ_λ with λ = θ^(-1) as the uniform contractive ratio is logarithmic. This generalizes some results of Kershner and Bufetov-Solomyak, who consider the case of Bernoulli convolutions. As an application, we prove that μ_λ-almost every x is normal to any base b ≥ 2, which implies that there exist infinitely many absolutely normal numbers on the corresponding self-similar set. This can be seen as a complementary result to the well-known Cassels-Schmidt theorem.
Keywords: self-similar measures; Fourier transforms; decay rate; normal numbers
Stage IV malignant transformation of mature cystic teratoma palliatively treated with concurrent chemoradiotherapy: A case report
10
Authors: Saori Kondo, Takashi Suzuki, Kanato Yoshiike, Sakura Yamanaka, Kenta Sonehara, Hiroshi Nabeshima, Osamu Oguchi 《World Journal of Clinical Cases》 SCIE, 2025, No. 1, pp. 56-61 (6 pages)
BACKGROUND: Malignant transformation (MT) of mature cystic teratoma (MCT) has a poor prognosis, especially in advanced cases. Concurrent chemoradiotherapy (CCRT) has an inhibitory effect on MT. CASE SUMMARY: Herein, we present a case in which CCRT had a reduction effect preoperatively. A 73-year-old woman with pyelonephritis was referred to our hospital. Computed tomography revealed right hydronephrosis and a 6-cm pelvic mass. Endoscopic ultrasound-guided fine-needle biopsy (EUS-FNB) revealed squamous cell carcinoma. The patient was diagnosed with MT of MCT. Due to her poor general condition and renal malfunction, we selected CCRT, expecting fewer adverse effects. After CCRT, her performance status improved and the tumor size was reduced; surgery was performed. Five months postoperatively, the patient developed dissemination and lymph node metastases. Palliative chemotherapy was ineffective. She died 18 months after treatment initiation. CONCLUSION: EUS-FNB was useful in the diagnosis of MT of MCT; CCRT suppressed the disease and improved quality of life.
Keywords: mature cystic teratoma; malignant transformation; squamous cell carcinoma; concurrent chemoradiotherapy; endoscopic ultrasound-guided fine-needle biopsy; case report
FOURIER TRANSFORMATION AND SINGULAR INTEGRALS ON SELF-SIMILAR MEASURE
11
Authors: Wu Baoyi, Su Weiyi (Department of Mathematics, Nanjing University, Nanjing 210093, PRC) 《Analysis in Theory and Applications》 1998, No. 4, pp. 102-114 (13 pages)
This paper serves two purposes. One is to modify Strichartz's results on the asymptotic averages of the Fourier transform of μ, the self-similar measure defined by Hutchinson. The other is to consider a singular integral operator on μ and show that this operator is of type (p, p) (1 < p < ∞).
Keywords: Fourier transformation; singular integrals; self-similar measure
Transforming growth factor-beta 1 enhances discharge activity of cortical neurons
12
Authors: Zhihui Ren, Tian Li, Xueer Liu, Zelin Zhang, Xiaoxuan Chen, Weiqiang Chen, Kangsheng Li, Jiangtao Sheng 《Neural Regeneration Research》 SCIE CAS, 2025, No. 2, pp. 548-556 (9 pages)
Transforming growth factor-beta 1 (TGF-β1) has been extensively studied for its pleiotropic effects on central nervous system diseases. The neuroprotective or neurotoxic effects of TGF-β1 in specific brain areas may depend on the pathological process and cell types involved. Voltage-gated sodium channels (VGSCs) are essential ion channels for the generation of action potentials in neurons, and are involved in various neuroexcitation-related diseases. However, the effects of TGF-β1 on the functional properties of VGSCs and firing properties in cortical neurons remain unclear. In this study, we investigated the effects of TGF-β1 on VGSC function and firing properties in primary cortical neurons from mice. We found that TGF-β1 increased VGSC current density in a dose- and time-dependent manner, which was attributable to the upregulation of Nav1.3 expression. Increased VGSC current density and Nav1.3 expression were significantly abolished by preincubation with inhibitors of mitogen-activated protein kinase kinase (PD98059), p38 mitogen-activated protein kinase (SB203580), and Jun NH2-terminal kinase 1/2 (SP600125). Interestingly, TGF-β1 significantly increased the firing threshold of action potentials but did not change their firing rate in cortical neurons. These findings suggest that TGF-β1 can increase Nav1.3 expression through activation of the ERK1/2-JNK-MAPK pathway, which leads to a decrease in the firing threshold of action potentials in cortical neurons under pathological conditions. Thus, this contributes to the occurrence and progression of neuroexcitatory-related diseases of the central nervous system.
Keywords: central nervous system; cortical neurons; ERK; firing properties; JNK; Nav1.3; p38; transforming growth factor-beta 1; traumatic brain injury; voltage-gated sodium currents
Self-Similar Transformation and Vertex Configurations of the Octagonal Ammann-Beenker Tiling
13
Authors: Hong-Mei Zhang, Cheng Cai, Xiu-Jun Fu 《Chinese Physics Letters》 SCIE CAS CSCD, 2018, No. 6, pp. 41-44 (4 pages)
Based on the matching rules for squares and rhombuses, we study the self-similar transformation and the vertex configurations of the Ammann-Beenker tiling. The structural properties of the configurations and their relations during the self-similar transformation are obtained. Our results reveal the distribution correlations of the configurations, which provide an intuitive understanding of the octagonal quasi-periodic structure and also have implications for growing perfect quasi-periodic tilings according to the local rules.
Keywords: self-similar transformation; vertex configurations; octagonal Ammann-Beenker tiling
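Fault Diagnosis Method for Oil-Immersed Transformers Based on MF-SAE-SSA-KELM
14
Authors: 黄旭, 许冬云 《工业控制计算机》 2024, No. 10, pp. 126-128 (3 pages)
Traditional dissolved gas analysis methods for diagnosing faults in oil-immersed transformers are slow. To address this, an oil-immersed transformer fault diagnosis method based on a multiscale fusion stacked auto-encoder (MF-SAE) is proposed. First, infrared inspection images of the transformer's high-voltage bushings are acquired, then cropped and converted to grayscale. The grayscale images are flattened into one-dimensional feature vectors and fed into SAEs; by setting different numbers of hidden layers, the encoder part of each auto-encoder extracts features from the data. These features are accumulated to give hidden-layer features at different scales. The features are then fed into a kernel extreme learning machine classification model optimized by the sparrow search algorithm for fault diagnosis. Case analysis shows that the proposed method achieves high fault diagnosis accuracy.
Keywords: oil-immersed transformer; MF-SAE; sparrow search algorithm; kernel extreme learning machine; fault diagnosis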
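An Empirical Study of the Long-Run Neutrality of Money in China: Estimation Based on the F-S Method (cited by: 4)
15
Authors: 赵国庆, 林梦瑶 《财经问题研究》 CSSCI PKU Core, 2011, No. 5, pp. 60-64 (5 pages)
This paper applies the Fisher-Seater test model of long-run monetary neutrality to Chinese data from the first quarter of 1997 to the fourth quarter of 2009. It finds that the long-run derivative of money in China diverges over the sample period, indicating that money is non-neutral in the long run. On this basis, we conclude that China's monetary policy in recent years has been effective in influencing the real economy.
Keywords: long-run neutrality of money; F-S method; unit root test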
Wavelet transform and gradient direction based feature extraction method for off-line handwritten Tibetan letter recognition (cited by: 3)
16
Authors: 黄鹤鸣, 达飞鹏, 韩晓旭 《Journal of Southeast University(English Edition)》 EI CAS, 2014, No. 1, pp. 27-31 (5 pages)
To improve the recognition accuracy of off-line handwritten Tibetan characters, local gradient direction histograms based on the wavelet transform are proposed as the recognition features. First, for a Tibetan character sample image, the first-level approximation component of the Haar wavelet transform is calculated. Second, the approximation component is partitioned into several equal-sized zones. Finally, the gradient direction histogram of each zone is calculated, and the local direction histograms of the approximation component are taken as the features of the character sample image. The proposed method is tested on a recently developed off-line Tibetan handwritten character sample database. The experimental results demonstrate the effectiveness and efficiency of the proposed feature extraction method. Furthermore, compared with the detail components, the approximation component contributes more to the recognition accuracy.
Keywords: pattern recognition; wavelet transform; gradient direction; Tibetan; handwritten character
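F-S Optimal Discriminant Plane Based on Spectral Decomposition and Its Application to Ship Recognition (cited by: 2)
17
Authors: 吴小俊, 杨静宇, 王士同, 刘同明 《船舶力学》 EI, 2003, No. 2, pp. 116-120 (5 pages)
Fisher optimal discriminant analysis has been applied successfully to many pattern recognition problems. It rests on optimizing the Fisher optimal discriminant criterion. Using a spectral decomposition of the within-class scatter matrix S_w, this paper proposes a new method for solving the F-S optimal discriminant plane in the null space of S_w. We apply the method to feature extraction and recognition of infrared ship images. Experimental results demonstrate its effectiveness.
Keywords: spectral decomposition; ship recognition; feature extraction; Fisher optimal discriminant analysis; infrared image recognition; F-S optimal discriminant plane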
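Multi-Object Tracking Algorithm with CNN-Transformer Feature Fusion (cited by: 4)
18
Authors: 张英俊, 白小辉, 谢斌红 《计算机工程与应用》 CSCD PKU Core, 2024, No. 2, pp. 180-190 (11 pages)
In convolutional neural networks (CNNs), convolution extracts local features of a target efficiently but struggles to capture global representations; in vision Transformers, the attention mechanism captures long-range feature dependencies but overlooks local feature details. To address these problems, a multi-object tracking algorithm, CTMOT (CNN-Transformer multi-object tracking), is proposed that performs feature extraction and fusion with a CNN-Transformer dual-branch backbone. A backbone with parallel CNN and Transformer branches extracts the local and global features of the image, respectively. A two-way bridge module (TBM) fuses the two kinds of features thoroughly. The fused features are fed into two parallel groups of decoders. The detection boxes and tracking boxes output by the decoders are matched to complete the multi-object tracking task. Evaluated on the multi-object tracking datasets MOT17, MOT20, KITTI, and UA-DETRAC, CTMOT reaches SOTA results on the MOTP and IDs metrics on all four datasets, with MOTA of 76.4%, 66.3%, 92.36%, and 88.57%, respectively; its results are comparable to SOTA methods on the MOT datasets and reach SOTA on KITTI. Because it performs object detection and association simultaneously, it tracks objects end to end at up to 35 FPS, indicating that CTMOT strikes a good balance between real-time performance and accuracy and has considerable potential.
Keywords: multi-object tracking; Transformer; feature fusion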
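Image Classification Model Based on Depth-wise Convolution and Vision Transformer (cited by: 3)
19
Authors: 张峰, 黄仕鑫, 花强, 董春茹 《计算机科学》 CSCD PKU Core, 2024, No. 2, pp. 196-204 (9 pages)
Image classification is a common visual recognition task with broad applications. Traditional approaches typically use convolutional neural networks, but their limited receptive field makes it hard to model global relational representations of an image, leading to low classification accuracy and difficulty handling complex, diverse image data. To model global relations, some researchers have applied the Transformer to image classification, but to meet the Transformer's serialization and parallelization requirements the image must be split into equal-sized, non-overlapping patches, which destroys the local information between adjacent patches. In addition, because the Transformer has little prior knowledge, models often need pre-training on large-scale datasets, so computational cost is high. To model the local information between adjacent patches while making full use of the image's global information, a vision Transformer model based on depth-wise convolution, the Efficient Pyramid Vision Transformer (EPVT), is proposed. EPVT extracts local and global information between adjacent image patches at low computational cost. It has three key components: a local perceptron module (LPM), a spatial information fusion (SIF) module, and a convolution feed-forward network (CFFN). The LPM captures local correlations in the image. The SIF module fuses local information between adjacent patches and exploits long-range dependencies between different patches, improving the model's feature representation ability and letting it learn the semantic information of output features in different dimensions. The CFFN encodes positional information and reshapes tensors. On the ImageNet-1K image classification dataset, the proposed model outperforms existing vision Transformer classifiers of comparable size, achieving 82.6% classification accuracy, which shows that it is competitive on large-scale datasets.
Keywords: deep learning; image classification; depth-wise convolution; vision Transformer; attention mechanism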
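Sequential Recommendation Method Based on RoBERTa and Graph-Enhanced Transformer (cited by: 2)
20
Authors: 王明虎, 石智奎, 苏佳, 张新生 《计算机工程》 CAS CSCD PKU Core, 2024, No. 4, pp. 121-131 (11 pages)
Since recommender systems first appeared, limited data has constrained the further development of recommendation algorithms. To reduce the impact of data sparsity and make better use of non-rating data, neural-network-based text recommendation models have been proposed one after another, but mainstream convolutional and recurrent neural networks have clear weaknesses in understanding text semantics and capturing long-range relations. To better mine the deep latent features between users and items and further improve recommendation quality, a sequential recommendation model based on RoBERTa and a graph-enhanced Transformer (RGT) is proposed. The model introduces review text data. It first uses a pre-trained RoBERTa model to capture word-level semantic features in the review text and build an initial model of the user's personalized interests. Then, from the historical interactions between users and items, it builds an item-association graph attention network model with temporal characteristics; through the graph-enhanced Transformer approach, the feature representations of the items learned by the graph model are fed into the Transformer encoder layer as a sequence. Finally, the resulting output vector, the previously captured semantic representation, and the computed whole-graph representation of the item-association graph are fed into a fully connected layer to capture the user's global interest preferences and predict the user's rating of an item. Experiments on three real, public Amazon datasets show that, compared with classic text recommendation models such as DeepFM and ConvMF, RGT improves markedly on root mean square error (RMSE) and mean absolute error (MAE), with maximum improvements over the best baseline of 4.7% and 5.3%, respectively.
Keywords: recommendation algorithm; review text; RoBERTa model; graph attention mechanism; Transformer mechanism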