期刊文献+
共找到237,516篇文章
< 1 2 250 >
每页显示 20 50 100
Dual-Path Vision Transformer用于急性缺血性脑卒中辅助诊断
1
作者 张桃红 郭学强 +4 位作者 郑瀚 罗继昌 王韬 焦力群 唐安莹 《电子科技大学学报》 EI CAS CSCD 北大核心 2024年第2期307-314,共8页
急性缺血性脑卒中是由于脑组织血液供应障碍导致的脑功能障碍,数字减影脑血管造影(DSA)是诊断脑血管疾病的金标准。基于患者的正面和侧面DSA图像,对急性缺血性脑卒中的治疗效果进行分级评估,构建基于Vision Transformer的双路径图像分... 急性缺血性脑卒中是由于脑组织血液供应障碍导致的脑功能障碍,数字减影脑血管造影(DSA)是诊断脑血管疾病的金标准。基于患者的正面和侧面DSA图像,对急性缺血性脑卒中的治疗效果进行分级评估,构建基于Vision Transformer的双路径图像分类智能模型DPVF。为了提高辅助诊断速度,基于EdgeViT的轻量化设计思想进行了模型的构建;为了使模型保持轻量化的同时具有较高的精度,提出空间-通道自注意力模块,促进Transformer模型捕获更全面的特征信息,提高模型的表达能力;此外,对于DPVF的两分支的特征融合,构建交叉注意力模块对两分支输出进行交叉融合,促使模型提取更丰富的特征,从而提高模型表现。实验结果显示DPVF在测试集上的准确率达98.5%,满足实际需求。 展开更多
关键词 急性缺血性脑卒中 视觉transformer 双分支网络 特征融合
下载PDF
SVMD-PE-BP-Transformer短期光伏功率预测
2
作者 王瑞 靳鑫鑫 逯静 《电网与清洁能源》 CSCD 北大核心 2024年第8期141-150,共10页
考虑到光伏功率受气象因素变化影响而波动性大难以预测的问题,将逐次变分模态分解SVMD-排列熵PE与BPTransformer相结合,给出了一种组合预测方法,以下简称SPBT模型。在去除非相关因子的基础上,利用SOM聚类方法,对全年光伏数据进行3种类... 考虑到光伏功率受气象因素变化影响而波动性大难以预测的问题,将逐次变分模态分解SVMD-排列熵PE与BPTransformer相结合,给出了一种组合预测方法,以下简称SPBT模型。在去除非相关因子的基础上,利用SOM聚类方法,对全年光伏数据进行3种类型的分类;针对光伏发电初始时序中所蕴含的重要信息,利用SVMD自适应K值的方法,对其进行分解。再利用PE方法计算各个子序列的熵值,即序列的起伏复杂程度,根据熵的大小,对频率接近的成分进行重构,将其分为两个区间:复杂度低的部分和复杂度高的部分。最后利用BP网络与Transformer分别对其进行预测,并对预测输出进行综合处理。该文以江苏省一光伏电站观测的气象与功率数据为例,通过比较试验验证了该模型的优势,该模型具有较低的预测误差,有助于提高预测精度。 展开更多
关键词 逐次变分模态分解 排列熵 transformER 功率预测
下载PDF
A new insight into LPSO phase transformation and mechanical properties uniformity of large-scale Mg-Gd-Y-Zn-Zr alloy prepared by multi-pass friction stir processing 被引量:1
3
作者 Xiaohu Guan Wen Wang +7 位作者 Ting Zhang Pai Peng Qiang Liu Peng Han Ke Qiao Jun Cai Liqiang Wang Kuaishe Wang 《Journal of Magnesium and Alloys》 SCIE EI CAS CSCD 2024年第5期2041-2056,共16页
A large-scale fine-grained Mg-Gd-Y-Zn-Zr alloy plate with high strength and ductility was successfully prepared by multi-pass friction stir processing(MFSP)technology in this work.The structure of grains and long peri... A large-scale fine-grained Mg-Gd-Y-Zn-Zr alloy plate with high strength and ductility was successfully prepared by multi-pass friction stir processing(MFSP)technology in this work.The structure of grains and long period stacking ordered(LPSO)phase were characterized,and the mechanical properties uniformity was investigated.Moreover,a quantitative relationship between the microstructure and tensile yield strength was established.The results showed that the grains in the processed zone(PZ)and interfacial zone(IZ)were refined from 50μm to 3μm and 4μm,respectively,and numerous original LPSO phases were broken.In IZ,some block-shaped 18R LPSO phases were transformed into needle-like 14H LPSO phases due to stacking faults and the short-range diffusion of solute atoms.The severe shear deformation in the form of kinetic energy caused profuse stacking fault to be generated and move rapidly,greatly increasing the transformation rate of LPSO phase.After MFSP,the ultimate tensile strength,yield strength and elongation to failure of the large-scale plate were 367 MPa,305 MPa and 18.0% respectively.Grain refinement and LPSO phase strengthening were the major strengthening mechanisms for the MFSP sample.In particularly,the strength of IZ was comparable to that of PZ because the strength contribution of the 14H LPSO phase offsets the lack of grain refinement strengthening in IZ.This result opposes the widely accepted notion that IZ is a weak region in MFSP-prepared large-scale fine-grained plate. 展开更多
关键词 Friction stir processing MULTI-pASS Mg-Gd-Y-Zn-Zr alloy LPSO phase transformation Mechanical properties
下载PDF
考虑多维影响因素的改进Transformer-PSO短期电价预测方法
4
作者 孙欣 王思敏 +2 位作者 谢敬东 江海林 王森 《上海交通大学学报》 EI CAS CSCD 北大核心 2024年第9期1420-1431,共12页
随着多元化电力市场的建设,电价影响因素日益增加,市场环境变化也更加剧烈.为提高市场短期电价的预测精度,提出一种考虑多种电价影响因素的改进Transformer-粒子群优化(PSO)算法短期电价预测方法.首先,在考虑历史电价、负荷的基础上进... 随着多元化电力市场的建设,电价影响因素日益增加,市场环境变化也更加剧烈.为提高市场短期电价的预测精度,提出一种考虑多种电价影响因素的改进Transformer-粒子群优化(PSO)算法短期电价预测方法.首先,在考虑历史电价、负荷的基础上进一步分析电价形成的相关因素,利用自相关函数分析电价的多周期特性并在此基础上调整输入序列,克服了仅采用历史数据以及经验调整输入序列导致预测精度受限的问题.其次,结合长短期记忆(LSTM)、自注意力机制与多层注意力机制并采用多输入结构建立改进Transformer模型,进一步提升LSTM模型捕获不同时间步信息间的长短期依赖关系的能力,克服LSTM的信息利用瓶颈,适应包括历史电价及多种电价成因的复杂多序列输入.此外,还利用PSO智能算法搜索模型不同学习阶段的最佳学习率克服手动调整学习率的局限性.最后,采用PJM市场电价进行算例分析,结果表明所提短期电价预测模型能应用于电价影响因素多、变化剧烈的市场环境,并有效提升短期电价预测精度. 展开更多
关键词 短期电价预测 多维影响因素 自相关分析 改进transformer模型 粒子群优化
下载PDF
Disordered Structure and Reversible Phase Transformation from K-Birnessite to Zn-Buserite Enable High-Performance Aqueous Zinc-lon Batteries
5
作者 Nibagani Naresh Suyoon Eom +4 位作者 Sang Jun Lee Su Hwan Jeong Ji-Won Jung Young Hwa Jung Joo-Hyung Kim 《Energy & Environmental Materials》 SCIE EI CAS CSCD 2024年第3期100-111,共12页
The layeredδ-MnO_(2)(dMO)is an excellent cathode material for rechargeable aqueous zinc-ion batteries owing to its large interlayer distance(~0.7 nm),high capacity,and low cost;however,such cathodes suffer from struc... The layeredδ-MnO_(2)(dMO)is an excellent cathode material for rechargeable aqueous zinc-ion batteries owing to its large interlayer distance(~0.7 nm),high capacity,and low cost;however,such cathodes suffer from structural degradation during the long-term cycling process,leading to capacity fading.In this study,a Co-doped dMO composite with reduced graphene oxide(GC-dMO)is developed using a simple cost-effective hydrothermal method.The degree of disorderness increases owing to the hetero-atom doping and graphene oxide composites.It is demonstrated that layered dMO and GC-dMO undergo a structural transition from K-birnessite to the Zn-buserite phase upon the first discharge,which enhances the intercalation of Zn^(2+)ions,H_(2)O molecules in the layered structure.The GC-dMO cathode exhibits an excellent capacity of 302 mAh g^(-1)at a current density of 100 mAg^(-1)after 100 cycles as compared with the dMO cathode(159 mAhg^(-1)).The excellent electrochemical performance of the GC-dMO cathode owing to Co-doping and graphene oxide sheets enhances the interlayer gap and disorderness,and maintains structural stability,which facilitates the easy reverse intercalation and de-intercalation of Zn^(2+)ions and H_(2)O molecules.Therefore,GC-dMO is a promising cathode material for large-scale aqueous ZIBs. 展开更多
关键词 aqueous zinc-ion batteries BIRNESSITE buserite disordered structure phase transformation
下载PDF
Triple-path feature transform network for ring-array photoacoustic tomography image reconstruction
6
作者 Lingyu Ma Zezheng Qin +1 位作者 Yiming Ma Mingjian Sun 《Journal of Innovative Optical Health Sciences》 SCIE EI CSCD 2024年第3期23-40,共18页
Photoacoustic imaging(PAI)is a noninvasive emerging imaging method based on the photoacoustic effect,which provides necessary assistance for medical diagnosis.It has the characteristics of large imaging depth and high... Photoacoustic imaging(PAI)is a noninvasive emerging imaging method based on the photoacoustic effect,which provides necessary assistance for medical diagnosis.It has the characteristics of large imaging depth and high contrast.However,limited by the equipment cost and reconstruction time requirements,the existing PAI systems distributed with annular array transducers are difficult to take into account both the image quality and the imaging speed.In this paper,a triple-path feature transform network(TFT-Net)for ring-array photoacoustic tomography is proposed to enhance the imaging quality from limited-view and sparse measurement data.Specifically,the network combines the raw photoacoustic pressure signals and conventional linear reconstruction images as input data,and takes the photoacoustic physical model as a prior information to guide the reconstruction process.In addition,to enhance the ability of extracting signal features,the residual block and squeeze and excitation block are introduced into the TFT-Net.For further efficient reconstruction,the final output of photoacoustic signals uses‘filter-then-upsample’operation with a pixel-shuffle multiplexer and a max out module.Experiment results on simulated and in-vivo data demonstrate that the constructed TFT-Net can restore the target boundary clearly,reduce background noise,and realize fast and high-quality photoacoustic image reconstruction of limited view with sparse sampling. 展开更多
关键词 Deep learning feature transformation image reconstruction limited-view measurement photoacoustic tomography.
下载PDF
Major-Countries Competition and Pivot to the Asia-Pacific:the Strategic Transformation and Influence of“Global NATO”
7
作者 Jin Ling 《Contemporary World》 2024年第4期30-33,共4页
As a product of the Cold War,NATO did not disintegrate with the end of it,but rather,it has sought to become“global NATO”through continuous expansion and transformation,in order to hold wider sway in international s... As a product of the Cold War,NATO did not disintegrate with the end of it,but rather,it has sought to become“global NATO”through continuous expansion and transformation,in order to hold wider sway in international security.In its 75-year history,NATO has made numerous moves including adopting confrontational security thinking,building exclusive alliances,conducting humanitarian intervention and crisis management,expanding eastward and northward and involving in the Asia-Pacific. 展开更多
关键词 NATO transformation INTEGRATE
下载PDF
Efficient single-pixel imaging encrypted transmission based on 3D Arnold transformation
8
作者 梁振宇 王朝瑾 +4 位作者 王阳阳 高皓琪 朱东涛 许颢砾 杨星 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第3期378-386,共9页
Single-pixel imaging(SPI)can transform 2D or 3D image data into 1D light signals,which offers promising prospects for image compression and transmission.However,during data communication these light signals in public ... Single-pixel imaging(SPI)can transform 2D or 3D image data into 1D light signals,which offers promising prospects for image compression and transmission.However,during data communication these light signals in public channels will easily draw the attention of eavesdroppers.Here,we introduce an efficient encryption method for SPI data transmission that uses the 3D Arnold transformation to directly disrupt 1D single-pixel light signals and utilizes the elliptic curve encryption algorithm for key transmission.This encryption scheme immediately employs Hadamard patterns to illuminate the scene and then utilizes the 3D Arnold transformation to permutate the 1D light signal of single-pixel detection.Then the transformation parameters serve as the secret key,while the security of key exchange is guaranteed by an elliptic curve-based key exchange mechanism.Compared with existing encryption schemes,both computer simulations and optical experiments have been conducted to demonstrate that the proposed technique not only enhances the security of encryption but also eliminates the need for complicated pattern scrambling rules.Additionally,this approach solves the problem of secure key transmission,thus ensuring the security of information and the quality of the decrypted images. 展开更多
关键词 single-pixel imaging 3D Arnold transformation elliptic curve encryption image encryption
下载PDF
Mucosa color and size may indicate malignant transformation of chicken skin mucosa-positive colorectal neoplastic polyps
9
作者 Ying-Jie Zhang Meng-Xia Yuan +5 位作者 Wu Wen Fan Li Yi Jian Chuan-Ming Zhang Ye Yang Feng-Lin Chen 《World Journal of Gastrointestinal Oncology》 SCIE 2024年第3期750-760,共11页
BACKGROUND Lipid metabolism reprogramming is suspected to exist in pre-cancerous lesions,including colorectal adenoma.Screening colonoscopy frequently reveals chicken skin mucosa(CSM;white or yellow-white speckled muc... BACKGROUND Lipid metabolism reprogramming is suspected to exist in pre-cancerous lesions,including colorectal adenoma.Screening colonoscopy frequently reveals chicken skin mucosa(CSM;white or yellow-white speckled mucosa)surrounding colo-rectal polyps,caused by macrophages engulfing and accumulating the lipids decomposed by colon cells or adjacent tumors.CSM-positive colorectal polyps are associated with various diseases;however,their prognosis varies greatly.Cold snare polypectomy is commonly used to resect lesions up to 10 to 15 mm in diameter without signs of submucosal invasion but is controversial for CSM-positive colorectal polyps.Improved imaging is required to diagnose and treat CSM-positive colorectal polyps.METHODS This retrospective cohort study included 177 patients with CSM-positive colorectal polyps diagnosed using endoscopy.All patient-related information was extracted from the Goldisc soft-clinic DICOM system or electronic medical record system.Based on the pathological results,patients were classified as non-neoplastic polyps(five juvenile polyps),neoplastic polyps,non-invasive high-grade neoplasia(NHGN),or submucosal invasive carcinoma(SM stage cancer).We analyzed and compared the clinical features,suspected risk factors for malignant transformation of neoplastic polyps,and early infiltration of sub-mucosal carcinoma.RESULTS The diameters of NHGN and SM polyps were much smaller than those of neoplastic polyps.Most NHGN polyps had a deeper red mucosal color.On logistic regression analyses,diameter and deeper red mucosal color were independent risk factors for malignant transformation of neoplastic polyps.Type 1 CSM was more common in high-grade intraepithelial neoplasia and SM;type 2 CSM was more common in neoplastic polyps.Logistic regression analyses revealed no significant differences in the malignant transformation of neoplastic polyps or early submucosal invasion of CSM-positive colorectal cancer.Changes in the CSM mucosa surrounding neoplastic polyps and submucosal invasion of colorectal cancer disappeared within 12 months.No tumor recurrence was found during either partial or complete endoscopic resection of the CSM.CONCLUSION CSM-positive colorectal polyps>1 cm in diameter or with deeper red mucosa may be related to NHGN.Resection of CSM surrounding colorectal adenomas did not affect tumor recurrence. 展开更多
关键词 Chicken skin mucosa Colorectal cancer Colorectal polyps Endoscopic resection Malignant transformation White light endoscopy
下载PDF
Online Capacitor Voltage Transformer Measurement Error State Evaluation Method Based on In-Phase Relationship and Abnormal Point Detection
10
作者 Yongqi Liu Wei Shi +2 位作者 Jiusong Hu Yantao Zhao Pang Wang 《Smart Grid and Renewable Energy》 2024年第1期34-48,共15页
The assessment of the measurement error status of online Capacitor Voltage Transformers (CVT) within the power grid is of profound significance to the equitable trade of electric energy and the secure operation of the... The assessment of the measurement error status of online Capacitor Voltage Transformers (CVT) within the power grid is of profound significance to the equitable trade of electric energy and the secure operation of the power grid. This paper advances an online CVT error state evaluation method, anchored in the in-phase relationship and outlier detection. Initially, this method leverages the in-phase relationship to obviate the influence of primary side fluctuations in the grid on assessment accuracy. Subsequently, Principal Component Analysis (PCA) is employed to meticulously disentangle the error change information inherent in the CVT from the measured values and to compute statistics that delineate the error state. Finally, the Local Outlier Factor (LOF) is deployed to discern outliers in the statistics, with thresholds serving to appraise the CVT error state. Experimental results incontrovertibly demonstrate the efficacy of this method, showcasing its prowess in effecting online tracking of CVT error changes and conducting error state assessments. The discernible enhancements in reliability, accuracy, and sensitivity are manifest, with the assessment accuracy reaching an exemplary 0.01%. 展开更多
关键词 Capacitor Voltage transformer Measurement Error Online Monitoring Principal Component Analysis Local Outlier Factor
下载PDF
Stage IV malignant transformation of mature cystic teratoma palliatively treated with concurrent chemoradiotherapy:A case report
11
作者 Saori Kondo Takashi Suzuki +4 位作者 Kanato Yoshiike Sakura Yamanaka Kenta Sonehara Hiroshi Nabeshima Osamu Oguchi 《World Journal of Clinical Cases》 SCIE 2025年第1期56-61,共6页
BACKGROUND Malignant transformation(MT)of mature cystic teratoma(MCT)has a poor prognosis,especially in advanced cases.Concurrent chemoradiotherapy(CCRT)has an inhibitory effect on MT.CASE SUMMARY Herein,we present a ... BACKGROUND Malignant transformation(MT)of mature cystic teratoma(MCT)has a poor prognosis,especially in advanced cases.Concurrent chemoradiotherapy(CCRT)has an inhibitory effect on MT.CASE SUMMARY Herein,we present a case in which CCRT had a reduction effect preoperatively.A 73-year-old woman with pyelonephritis was referred to our hospital.Computed tomography revealed right hydronephrosis and a 6-cm pelvic mass.Endoscopic ultrasound-guided fine-needle biopsy(EUS-FNB)revealed squamous cell carci-noma.The patient was diagnosed with MT of MCT.Due to her poor general con-dition and renal malfunction,we selected CCRT,expecting fewer adverse effects.After CCRT,her performance status improved,and the tumor size was reduced;surgery was performed.Five months postoperatively,the patient developed dis-semination and lymph node metastases.Palliative chemotherapy was ineffective.She died 18 months after treatment initiation.CONCLUSION EUS-FNB was useful in the diagnosis of MT of MCT;CCRT suppressed the disea-se and improved quality of life. 展开更多
关键词 Mature cystic teratoma Malignant transformation Squamous cell carcinoma Concurrent chemoradiotherapy Endoscopic ultrasound-guided fine-needle biopsy Case report
下载PDF
Transforming growth factor-beta 1 enhances discharge activity of cortical neurons
12
作者 Zhihui Ren Tian Li +5 位作者 Xueer Liu Zelin Zhang Xiaoxuan Chen Weiqiang Chen Kangsheng Li Jiangtao Sheng 《Neural Regeneration Research》 SCIE CAS 2025年第2期548-556,共9页
Transforming growth factor-beta 1(TGF-β1)has been extensively studied for its pleiotropic effects on central nervous system diseases.The neuroprotective or neurotoxic effects of TGF-β1 in specific brain areas may de... Transforming growth factor-beta 1(TGF-β1)has been extensively studied for its pleiotropic effects on central nervous system diseases.The neuroprotective or neurotoxic effects of TGF-β1 in specific brain areas may depend on the pathological process and cell types involved.Voltage-gated sodium channels(VGSCs)are essential ion channels for the generation of action potentials in neurons,and are involved in various neuroexcitation-related diseases.However,the effects of TGF-β1 on the functional properties of VGSCs and firing properties in cortical neurons remain unclear.In this study,we investigated the effects of TGF-β1 on VGSC function and firing properties in primary cortical neurons from mice.We found that TGF-β1 increased VGSC current density in a dose-and time-dependent manner,which was attributable to the upregulation of Nav1.3 expression.Increased VGSC current density and Nav1.3 expression were significantly abolished by preincubation with inhibitors of mitogen-activated protein kinase kinase(PD98059),p38 mitogen-activated protein kinase(SB203580),and Jun NH2-terminal kinase 1/2 inhibitor(SP600125).Interestingly,TGF-β1 significantly increased the firing threshold of action potentials but did not change their firing rate in cortical neurons.These findings suggest that TGF-β1 can increase Nav1.3 expression through activation of the ERK1/2-JNK-MAPK pathway,which leads to a decrease in the firing threshold of action potentials in cortical neurons under pathological conditions.Thus,this contributes to the occurrence and progression of neuroexcitatory-related diseases of the central nervous system. 展开更多
关键词 central nervous system cortical neurons ERK firing properties JNK Nav1.3 p38 transforming growth factor-beta 1 traumatic brain injury voltage-gated sodium currents
下载PDF
Fractional S transform-Part 1:Theory 被引量:3
13
作者 胥德平 郭科 《Applied Geophysics》 SCIE CSCD 2012年第1期73-79,116,共8页
The S transform, which is a time-frequency representation known for its local spectral phase properties in signal processing, uniquely combines elements of wavelet transforms and the short-time Fourier transform (STF... The S transform, which is a time-frequency representation known for its local spectral phase properties in signal processing, uniquely combines elements of wavelet transforms and the short-time Fourier transform (STFT). The fractional Fourier transform is a tool for non-stationary signal analysis. In this paper, we define the concept of the fractional S transform (FRST) of a signal, based on the idea of the fractional Fourier transform (FRFT) and S transform (ST), extend the S transform to the time-fractional frequency domain from the time- frequency domain to obtain the inverse transform, and study the FRST mathematical properties. The FRST, which has the advantages of FRFT and ST, can enhance the ST flexibility to process signals. Compared to the S transform, the FRST can effectively improve the signal time- frequency resolution capacity. Simulation results show that the proposed method is effective. 展开更多
关键词 fractional S transform fractional Fourier transform time-frequency analysis
下载PDF
CNN-Transformer特征融合多目标跟踪算法 被引量:4
14
作者 张英俊 白小辉 谢斌红 《计算机工程与应用》 CSCD 北大核心 2024年第2期180-190,共11页
在卷积神经网络(CNN)中,卷积运算能高效地提取目标的局部特征,却难以捕获全局表示;而在视觉Transformer中,注意力机制可以捕获长距离的特征依赖,但会忽略局部特征细节。针对以上问题,提出一种基于CNN-Transformer双分支主干网络进行特... 在卷积神经网络(CNN)中,卷积运算能高效地提取目标的局部特征,却难以捕获全局表示;而在视觉Transformer中,注意力机制可以捕获长距离的特征依赖,但会忽略局部特征细节。针对以上问题,提出一种基于CNN-Transformer双分支主干网络进行特征提取和融合的多目标跟踪算法CTMOT(CNN-transformer multi-object tracking)。使用基于CNN和Transformer双分支并行的主干网络分别提取图像的局部和全局特征。使用双向桥接模块(two-way braidge module,TBM)对两种特征进行充分融合。将融合后的特征输入两组并行的解码器进行处理。将解码器输出的检测框和跟踪框进行匹配,完成多目标跟踪任务。在多目标跟踪数据集MOT17、MOT20、KITTI以及UADETRAC上进行评估,CTMOT算法的MOTP和IDs指标在四个数据集上均达到了SOTA效果,MOTA指标分别达到了76.4%、66.3%、92.36%和88.57%,在MOT数据集上与SOTA方法效果相当,在KITTI数据集上达到SOTA效果。由于同时完成目标检测和关联,能够端到端进行目标跟踪,跟踪速度可达35 FPS,表明CTMOT算法在跟踪的实时性和准确性上达到了较好的平衡,具有较大潜力。 展开更多
关键词 多目标跟踪 transformER 特征融合
下载PDF
基于Depth-wise卷积和视觉Transformer的图像分类模型 被引量:3
15
作者 张峰 黄仕鑫 +1 位作者 花强 董春茹 《计算机科学》 CSCD 北大核心 2024年第2期196-204,共9页
图像分类作为一种常见的视觉识别任务,有着广阔的应用场景。在处理图像分类问题时,传统的方法通常使用卷积神经网络,然而,卷积网络的感受野有限,难以建模图像的全局关系表示,导致分类精度低,难以处理复杂多样的图像数据。为了对全局关... 图像分类作为一种常见的视觉识别任务,有着广阔的应用场景。在处理图像分类问题时,传统的方法通常使用卷积神经网络,然而,卷积网络的感受野有限,难以建模图像的全局关系表示,导致分类精度低,难以处理复杂多样的图像数据。为了对全局关系进行建模,一些研究者将Transformer应用于图像分类任务,但为了满足Transformer的序列化和并行化要求,需要将图像分割成大小相等、互不重叠的图像块,破坏了相邻图像数据块之间的局部信息。此外,由于Transformer具有较少的先验知识,模型往往需要在大规模数据集上进行预训练,因此计算复杂度较高。为了同时建模图像相邻块之间的局部信息并充分利用图像的全局信息,提出了一种基于Depth-wise卷积的视觉Transformer(Efficient Pyramid Vision Transformer,EPVT)模型。EPVT模型可以实现以较低的计算成本提取相邻图像块之间的局部和全局信息。EPVT模型主要包含3个关键组件:局部感知模块(Local Perceptron Module,LPM)、空间信息融合模块(Spatial Information Fusion,SIF)和“+卷积前馈神经网络(Convolution Feed-forward Network,CFFN)。LPM模块用于捕获图像的局部相关性;SIF模块用于融合相邻图像块之间的局部信息,并利用不同图像块之间的远距离依赖关系,提升模型的特征表达能力,使模型学习到输出特征在不同维度下的语义信息;CFFN模块用于编码位置信息和重塑张量。在图像分类数据集ImageNet-1K上,所提模型优于现有的同等规模的视觉Transformer分类模型,取得了82.6%的分类准确度,证明了该模型在大规模数据集上具有竞争力。 展开更多
关键词 深度学习 图像分类 Depth-wise卷积 视觉transformer 注意力机制
下载PDF
基于RoBERTa和图增强Transformer的序列推荐方法 被引量:2
16
作者 王明虎 石智奎 +1 位作者 苏佳 张新生 《计算机工程》 CAS CSCD 北大核心 2024年第4期121-131,共11页
自推荐系统出现以来,有限的数据信息就一直制约着推荐算法的进一步发展。为降低数据稀疏性的影响,增强非评分数据的利用率,基于神经网络的文本推荐模型相继被提出,但主流的卷积或循环神经网络在文本语义理解和长距离关系捕捉方面存在明... 自推荐系统出现以来,有限的数据信息就一直制约着推荐算法的进一步发展。为降低数据稀疏性的影响,增强非评分数据的利用率,基于神经网络的文本推荐模型相继被提出,但主流的卷积或循环神经网络在文本语义理解和长距离关系捕捉方面存在明显劣势。为了更好地挖掘用户与商品之间的深层潜在特征,进一步提高推荐质量,提出一种基于Ro BERTa和图增强Transformer的序列推荐(RGT)模型。引入评论文本数据,首先利用预训练的Ro BERTa模型捕获评论文本中的字词语义特征,初步建模用户的个性化兴趣,然后根据用户与商品的历史交互信息,构建具有时序特性的商品关联图注意力机制网络模型,通过图增强Transformer的方法将图模型学习到的各个商品的特征表示以序列的形式输入Transformer编码层,最后将得到的输出向量与之前捕获的语义表征以及计算得到的商品关联图的全图表征输入全连接层,以捕获用户全局的兴趣偏好,实现用户对商品的预测评分。在3组真实亚马逊公开数据集上的实验结果表明,与Deep FM、Conv MF等经典文本推荐模型相比,RGT模型在均方根误差(RMSE)和平均绝对误差(MAE)2种指标上有显著提升,相较于最优对比模型最高分别提升4.7%和5.3%。 展开更多
关键词 推荐算法 评论文本 RoBERTa模型 图注意力机制 transformer机制
下载PDF
基于Transformer和自适应特征融合的矿井低照度图像亮度提升和细节增强方法 被引量:1
17
作者 田子建 吴佳奇 +4 位作者 张文琪 陈伟 周涛 杨伟 王帅 《煤炭科学技术》 EI CAS CSCD 北大核心 2024年第1期297-310,共14页
高质量矿井影像为矿山安全生产提供保障,也有利于提高后续图像分析技术的性能。矿井影像受低照度环境的影响,易出现亮度低,照度不均,颜色失真,细节信息丢失严重等问题。针对上述问题,提出一种基于Transformer和自适应特征融合的矿井低... 高质量矿井影像为矿山安全生产提供保障,也有利于提高后续图像分析技术的性能。矿井影像受低照度环境的影响,易出现亮度低,照度不均,颜色失真,细节信息丢失严重等问题。针对上述问题,提出一种基于Transformer和自适应特征融合的矿井低照度图像亮度提升和细节增强方法。基于生成对抗思想搭建生成对抗式主体模型框架,使用目标图像域而非单一参考图像驱动判别器监督生成器的训练,实现对低照度图像的充分增强;基于特征表示学习理论搭建特征编码器,将图像解耦为亮度分量和反射分量,避免图像增强过程中亮度与颜色特征相互影响从而导致颜色失真问题;设计CEM-Transformer Encoder通过捕获全局上下文关系和提取局部区域特征,能够充分提升整体图像亮度并消除局部区域照度不均;在反射分量增强过程中,使用结合CEM-Cross-Transformer Encoder的跳跃连接将低级特征与深层网络处特征进行自适应融合,能够有效避免细节特征丢失,并在编码网络中添加ECA-Net,提高浅层网络的特征提取效率。制作矿井低照度图像数据集为矿井低照度图像增强任务提供数据资源。试验显示,在矿井低照度图像数据集和公共数据集中,与5种先进的低照度图像增强算法相比,该算法增强图像的质量指标PSNR、SSIM、VIF平均提高了16.564%,10.998%,16.226%和14.438%,10.888%,14.948%,证明该算法能够有效提升整体图像亮度,消除照度不均,避免颜色失真和细节丢失,实现矿井低照度图像增强。 展开更多
关键词 图像增强 图像识别 生成对抗网络 特征解耦 transformER
下载PDF
融合卷积注意力和Transformer架构的行人重识别方法 被引量:2
18
作者 王静 李沛橦 +2 位作者 赵容锋 张云 马振玲 《北京航空航天大学学报》 EI CAS CSCD 北大核心 2024年第2期466-476,共11页
行人重识别技术是智能安防系统中的重要方法之一,为构建一个适用各种复杂场景的行人重识别模型,基于现有的卷积神经网络和Transformer模型,提出一种融合卷积注意力和Transformer(FCAT)架构的方法,以增强Transformer对局部细节信息的关... 行人重识别技术是智能安防系统中的重要方法之一,为构建一个适用各种复杂场景的行人重识别模型,基于现有的卷积神经网络和Transformer模型,提出一种融合卷积注意力和Transformer(FCAT)架构的方法,以增强Transformer对局部细节信息的关注。所提方法主要将卷积空间注意力和通道注意力嵌入Transformer架构中,分别加强对图像中重要区域的关注和对重要通道特征的关注,以进一步提高Transformer架构对局部细节特征的提取能力。在3个公开行人重识别数据集上的对比消融实验证明,所提方法在非遮挡数据集上取得了与现有方法相当的结果,在遮挡数据集上的性能得到显著提升。所提方法更加轻量化,在不增加额外计算量和模型参数的情况下,推理速度得到了提升。 展开更多
关键词 行人重识别 深度学习 卷积神经网络 transformER 注意力机制
下载PDF
High-performance multi-transform architecture for H.264/AVC
19
作者 王刚 王庆 +1 位作者 李冰 陈锐 《Journal of Southeast University(English Edition)》 EI CAS 2013年第3期276-281,共6页
In order to increase the hardware utilization and minimize the chip area a multi-transform coding architecture which includes 4 ×4 forward integer transform 4 ×4 inverse integer transform 4 ×4 Hadamard ... In order to increase the hardware utilization and minimize the chip area a multi-transform coding architecture which includes 4 ×4 forward integer transform 4 ×4 inverse integer transform 4 ×4 Hadamard transform and 2 ×2 Hadamard transform is proposed. By simplifying these transforms and exploring their similarities the proposed design merges the architectures processing individual transforms into a high-performance multi-transform coding architecture.Using a semiconductor manufacturing international corporation SMIC 0.18 μm complementary metal oxide semiconductor CMOS technology the proposed architecture achieves the maximum operating clock frequency of 200 MHz and the throughput rate of 800 ×106 pixel/s with the hardware cost of 3 704 gates.The results demonstrate that the data throughput rate per unit area DTUA of this design is at least 40.28%higher than that of the reference design.This design can meet the requirements of real-time decoding digital cinema video 4 096 ×2 048@30 Hz at 62.9 MHz which helps to reduce the power consumption. 展开更多
关键词 H. 264/AVC multi-transform architecture Hadamard transform integer transform
下载PDF
基于残差U-Net和自注意力Transformer编码器的磁场预测方法 被引量:1
20
作者 金亮 尹振豪 +2 位作者 刘璐 宋居恒 刘元凯 《电工技术学报》 EI CSCD 北大核心 2024年第10期2937-2952,共16页
利用有限元方法对几何结构复杂的电机和变压器进行磁场分析,存在仿真时间长且无法复用的问题。因此,该文提出一种基于残差U-Net和自注意力Transformer编码器的磁场预测方法。首先建立永磁同步电机(PMSM)和非晶合金变压器(AMT)有限元模型... 利用有限元方法对几何结构复杂的电机和变压器进行磁场分析,存在仿真时间长且无法复用的问题。因此,该文提出一种基于残差U-Net和自注意力Transformer编码器的磁场预测方法。首先建立永磁同步电机(PMSM)和非晶合金变压器(AMT)有限元模型,得到深度学习训练所需的数据集;然后将Transformer模块与U-Net模型结合,并引入短残差机制建立ResUnet-Transformer模型,通过预测图像的像素实现磁场预测;最后通过Targeted Dropout算法和动态学习率调整策略对模型进行优化,解决拟合问题并提高预测精度。计算实例证明,ResUnet-Transformer模型在PMSM和AMT数据集上测试集的平均绝对百分比误差(MAPE)均小于1%,且仅需500组样本。该文提出的磁场预测方法能减少实际工况和多工况下精细模拟和拓扑优化的时间和资源消耗,亦是虚拟传感器乃至数字孪生的关键实现方法之一。 展开更多
关键词 有限元方法 电磁场 深度学习 U-Net transformER
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部