487 articles found
Multi-Modal Medical Image Fusion Based on Improved Parameter Adaptive PCNN and Latent Low-Rank Representation
1
Authors: Zirui Tang, Xianchun Zhou. Instrumentation, 2024, Issue 2, pp. 53-63 (11 pages)
Multimodal medical image fusion can help physicians provide more accurate treatment plans for patients, as unimodal images provide limited valid information. To address the insufficient ability of traditional medical image fusion solutions to protect image details and significant information, a new multimodal medical image fusion method (NSST-PAPCNNLatLRR) is proposed in this paper. First, the high- and low-frequency sub-band coefficients are obtained by decomposing the source image using the non-subsampled shearlet transform (NSST). Then, the latent low-rank representation algorithm is used to process the low-frequency sub-band coefficients, and an improved PAPCNN algorithm is proposed for the fusion of the high-frequency sub-band coefficients. The improved PAPCNN model is based on automatic parameter setting, and an optimized setting method is configured for the time decay factor αe. The experimental results show that, compared with five mainstream fusion algorithms, the new algorithm significantly improves the visual effect, enhances the ability to characterize important information in images, and further improves the protection of detail information; it ranks first on at least four of the six objective indexes.
Keywords: image fusion; improved parameter adaptive PCNN; non-subsampled shear-wave transform; latent low-rank representation
Multimodal Medical Image Fusion Based on Parameter Adaptive PCNN and Latent Low-rank Representation (Cited by: 1)
2
Authors: WANG Wenyan, ZHOU Xianchun, YANG Liangjian. Instrumentation, 2023, Issue 1, pp. 45-58 (14 pages)
Medical image fusion has been developed as an efficient assistive technology in clinical applications such as medical diagnosis and treatment planning. To address the insufficient protection of image contour and detail information by traditional image fusion methods, a new multimodal medical image fusion method is proposed. The method first uses the non-subsampled shearlet transform to decompose the source image into high- and low-frequency subband coefficients, then uses the latent low-rank representation algorithm to fuse the low-frequency subband coefficients and applies the improved PAPCNN algorithm to fuse the high-frequency subband coefficients. Finally, based on automatic parameter setting, the configuration of the time decay factor αe is optimized. The experimental results show that the proposed method resolves the difficult parameter setting and insufficient detail protection of traditional PCNN-based image fusion, while also achieving large improvements in visual quality and objective evaluation indicators.
Keywords: Image fusion; Non-subsampled Shearlet Transform; Parameter Adaptive PCNN; Latent Low-rank representation
Non Sub-Sampled Contourlet with Joint Sparse Representation Based Medical Image Fusion
3
Authors: Kandasamy Kittusamy Latha Shanmuga Vadivu Sampath Kumar. Computer Systems Science & Engineering (SCIE, EI), 2023, Issue 3, pp. 1989-2005 (17 pages)
Medical image fusion is a technology that synthesizes multi-modal medical information using mathematical procedures to improve the visual quality of the image content and produce high-quality output images. It plays an indispensable role in solving complicated medical problems. Recent research has concentrated on preserving medical image details, leaving color distortion and halo artifacts unaddressed. This paper proposes a novel method of fusing Computed Tomography (CT) and Magnetic Resonance Imaging (MRI) images using a hybrid model of the Non Sub-sampled Contourlet Transform (NSCT) and Joint Sparse Representation (JSR). The model meets the need for precise integration of medical images of different modalities, which is essential for diagnosis and for treating patients accordingly. In the proposed model, the medical image is decomposed using NSCT, an efficient shift-invariant decomposition method, and JSR is used to extract the common features of the medical images for the fusion process. The performance analysis indicates that the proposed technique is more efficient, gives better results, and achieves a high level of distinctness by integrating the advantages of complementary images. The comparative analysis indicates that it delivers better quality than existing medical image fusion practices.
Keywords: Medical image fusion; computer tomography; magnetic resonance imaging; non sub-sampled contourlet transform (NSCT); joint sparse representation (JSR)
Intelligent Fusion of Infrared and Visible Image Data Based on Convolutional Sparse Representation and Improved Pulse-Coupled Neural Network (Cited by: 3)
4
Authors: Jingming Xia, Yi Lu, Ling Tan, Ping Jiang. Computers, Materials & Continua (SCIE, EI), 2021, Issue 4, pp. 613-624 (12 pages)
Multi-source information can be obtained by fusing infrared and visible light images, which carry complementary information. However, existing methods for producing fused images suffer from blurred edges, low contrast, and loss of detail. Based on convolutional sparse representation and an improved pulse-coupled neural network, this paper proposes an image fusion algorithm that decomposes the source images into high-frequency and low-frequency subbands with the non-subsampled shearlet transform (NSST). The low-frequency subbands are fused by convolutional sparse representation (CSR), and the high-frequency subbands are fused by an improved pulse-coupled neural network (IPCNN) algorithm, which effectively resolves the difficulty of setting parameters in the traditional PCNN algorithm and improves the performance of sparse representation with detail injection. The results show that the proposed method outperforms existing mainstream fusion algorithms in terms of visual effects and objective indicators.
Keywords: Image fusion; infrared image; visible light image; non-downsampling shear wave transform; improved PCNN; convolutional sparse representation
State Accurate Representation and Performance Prediction Algorithm Optimization for Industrial Equipment Based on Digital Twin
5
Authors: Ying Bai, Xiaoti Ren, Hong Li. Intelligent Automation & Soft Computing (SCIE), 2023, Issue 9, pp. 2999-3018 (20 pages)
The combination of the Industrial Internet of Things (IIoT) and digital twin (DT) technology makes it possible for the DT model to perceive equipment status and performance dynamically. However, conventional digital modeling is weak at fusing and adjusting virtual and real information, and performance prediction based on experience greatly reduces the inclusiveness and accuracy of the model. In this paper, a DT-IIoT optimization model is proposed to improve the real-time representation and prediction of key equipment state. First, a global real-time feedback and dynamic adjustment mechanism is established by combining DT-IIoT with algorithm optimization. Second, a strong screening dual-model optimization (SSDO) prediction method based on Stacking integration and fusion is proposed within the dynamic regulation mechanism. Lightweight screening and multi-round optimization are used to improve the prediction accuracy of the evolution model. Finally, taking the boiler performance of a power plant in Shanxi as an example, accurate representation and evolution prediction of boiler steam quantity are realized. The results show that these methods optimize the real-time state representation and life-cycle performance prediction of large key equipment. The self-lifting ability of the Stacking-based SSDO prediction method is 15.85% on average and 18.16% at best. The optimization model reduces the MSE loss from an initial 0.318 to an optimal 0.1074 and increases R² from an initial 0.731 to an optimal 0.9092. The adaptability and reliability of the model are comprehensively improved, and better prediction and analysis results are achieved. This ensures the stable operation of core equipment and is of great significance for comprehensively understanding equipment status and performance.
Keywords: Digital twin (DT); digital representation; transfer learning; dual model optimization; information fusion
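A Multimodal Feature Fusion Algorithm for Public Opinion Representation Based on SEFusion-MPOR
6
Authors: 郭小宇, 马静. 《情报理论与实践》 (Information Studies: Theory & Application) (CSSCI, PKU Core), 2024, Issue 7, pp. 181-189 (9 pages)
[Purpose/Significance] Multimodal public opinion representation is the basis of multimodal public opinion computation and analysis. This paper explores a public opinion representation algorithm that assigns dynamic weights to the features of different modalities; it captures inter-modal dependencies more precisely, greatly reduces the complexity of multimodal public opinion representation, and lowers computing resource consumption. [Method/Process] Building on features from pre-trained models, the SEFusion-MPOR algorithm constructs squeeze and excitation operators from fully connected layers, a gating mechanism, and activation functions to obtain a dynamic weight for each modality, applies those weights to the corresponding modalities by matrix multiplication, and thereby builds an online public opinion representation algorithm with multimodal feature fusion. [Result/Conclusion] Experiments on two public multimodal public opinion datasets, Memotion 3 and MVSA-multiple, show that, compared with baseline models, the proposed representation method achieves the best results on several subtasks. Using only simple operations, it matches the performance of complex representation algorithms while offering interpretability and extrapolation ability. Its efficient and accurate representation suits not only public opinion intelligence processing but also general-purpose multimodal information representation in intelligence analysis. [Limitations] Validation was limited to bimodal datasets and did not cover datasets with a wider range of modalities.
Keywords: multimodal public opinion; multimodal feature fusion; public opinion representation; pre-trained model; SEfusion-MPOR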
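The squeeze-and-excitation idea described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the layer sizes, the hand-picked weights (`w1`, `b1`, `w2`, `b2`), and the use of a scalar gate per modality instead of a matrix product are all assumptions made to keep the example small.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def dense(vec, weights, bias):
    """Fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]

def se_modality_weights(features, w1, b1, w2, b2):
    """Squeeze each modality to a scalar, then excite to one gate per modality."""
    squeezed = [sum(f) / len(f) for f in features]            # squeeze: mean per modality
    hidden = [max(0.0, h) for h in dense(squeezed, w1, b1)]   # FC layer + ReLU
    return [sigmoid(g) for g in dense(hidden, w2, b2)]        # FC layer + sigmoid gate

def fuse(features, gates):
    """Scale every modality by its gate and concatenate the results."""
    fused = []
    for f, g in zip(features, gates):
        fused.extend(g * x for x in f)
    return fused

# Hypothetical pre-extracted features for two modalities.
text_feat = [0.2, 0.4, 0.6]
image_feat = [1.0, 0.0, -1.0]

# Hypothetical, untrained weights: 2 modalities -> 1 hidden unit -> 2 gates.
w1, b1 = [[0.5, -0.5]], [0.0]
w2, b2 = [[1.0], [-1.0]], [0.0, 0.0]

gates = se_modality_weights([text_feat, image_feat], w1, b1, w2, b2)
fused = fuse([text_feat, image_feat], gates)
print(gates, fused)
```

In a trained model the two dense layers would be learned, so the gates would shift toward whichever modality is more informative for a given sample.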
A Comprehensive Survey on Deep Learning Multi-Modal Fusion:Methods,Technologies and Applications
7
Authors: Tianzhe Jiao, Chaopeng Guo, Xiaoyue Feng, Yuming Chen, Jie Song. Computers, Materials & Continua (SCIE, EI), 2024, Issue 7, pp. 1-35 (35 pages)
Multi-modal fusion technology has gradually become a fundamental task in many fields, such as autonomous driving, smart healthcare, sentiment analysis, and human-computer interaction. It is rapidly becoming a dominant research direction because of its powerful perception and judgment capabilities. In complex scenes, multi-modal fusion uses the complementary characteristics of multiple data streams to fuse different data types and achieve more accurate predictions. However, achieving outstanding performance is challenging because of equipment performance limitations, missing information, and data noise. This paper comprehensively reviews existing methods based on multi-modal fusion techniques and provides a detailed, in-depth analysis. According to the data fusion stage, multi-modal fusion has four primary methods: early fusion, deep fusion, late fusion, and hybrid fusion. The paper surveys the three major multi-modal fusion technologies that can significantly enhance the effect of data fusion and further explores the applications of multi-modal fusion technology in various fields. Finally, it discusses the challenges and explores potential research opportunities. Multi-modal tasks still need intensive study because of data heterogeneity and quality. Preserving complementary information and eliminating redundant information between modalities is critical in multi-modal technology. Invalid data fusion methods may introduce extra noise and lead to worse results. This paper provides a comprehensive and detailed summary in response to these challenges.
Keywords: Multi-modal fusion; representation; translation; alignment; deep learning; comparative analysis
Fake News Detection Based on Cross-Modal Message Aggregation and Gated Fusion Network
8
Authors: Fangfang Shan, Mengyao Liu, Menghan Zhang, Zhenyu Wang. Computers, Materials & Continua (SCIE, EI), 2024, Issue 7, pp. 1521-1542 (22 pages)
Social media has become increasingly significant in modern society, but it has also turned into a breeding ground for the propagation of misleading information, potentially harming public opinion and daily life. Compared to pure text content, multimodal content significantly increases the visibility and shareability of posts. This has made the search for efficient modality representations and cross-modal information interaction methods a key focus in multimodal fake news detection. To address the challenge of accurately detecting fake news on social media, this paper proposes a fake news detection model based on cross-modal message aggregation and a gated fusion network (MAGF). MAGF first uses BERT to extract cumulative textual feature representations and word-level features, applies Faster Region-based Convolutional Neural Network (Faster R-CNN) to obtain image objects, and leverages ResNet-50 and Visual Geometry Group-19 (VGG-19) to obtain image region features and global features. The image region features and word-level text features are then projected into a low-dimensional space to calculate a text-image affinity matrix for cross-modal message aggregation. The gated fusion network combines text and image region features to obtain adaptively aggregated features. The interaction matrix is derived through an attention mechanism and further integrated with global image features using a co-attention mechanism to produce multimodal representations. Finally, these fused features are fed into a classifier for news categorization. Experiments were conducted on two public datasets, Twitter and Weibo. Results show that the proposed model achieves accuracy rates of 91.8% and 88.7% on the two datasets, respectively, significantly outperforming traditional unimodal and existing multimodal models.
Keywords: Fake news detection; cross-modal message aggregation; gate fusion network; co-attention mechanism; multi-modal representation
HOG-VGG:VGG Network with HOG Feature Fusion for High-Precision PolSAR Terrain Classification
9
Authors: Jiewen Li, Zhicheng Zhao, Yanlan Wu, Jiaqiu Ai, Jun Shi. Journal of Harbin Institute of Technology (New Series) (CAS), 2024, Issue 5, pp. 1-15 (15 pages)
This article proposes a VGG network with histogram of oriented gradient (HOG) feature fusion (HOG-VGG) for polarimetric synthetic aperture radar (PolSAR) image terrain classification. VGG-Net has a strong capacity for deep feature extraction and can fully extract the global deep features of different terrains in PolSAR images, so it is widely used in PolSAR terrain classification. However, VGG-Net ignores local edge and shape features, which leaves the feature representation of PolSAR terrains incomplete and limits classification accuracy. In fact, edge and shape features play an important role in PolSAR terrain classification. To solve this problem, a new VGG network with HOG feature fusion is proposed for high-precision PolSAR terrain classification. HOG-VGG extracts both the global deep semantic features and the local edge and shape features of PolSAR terrains, so the completeness of the terrain feature representation is greatly improved. Moreover, HOG-VGG fuses the global deep features and the local edge and shape features to achieve the best classification results. The superiority of HOG-VGG is verified on the Flevoland, San Francisco, and Oberpfaffenhofen datasets. Experiments show that the proposed HOG-VGG achieves much better PolSAR terrain classification performance, with overall accuracies of 97.54%, 94.63%, and 96.07%, respectively.
Keywords: PolSAR terrain classification; high-precision; HOG-VGG; feature representation completeness elevation; multi-level feature fusion
A multi-source image fusion algorithm based on gradient regularized convolution sparse representation
10
Authors: WANG Jian, QIN Chunxia, ZHANG Xiufei, YANG Ke, REN Ping. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2020, Issue 3, pp. 447-459 (13 pages)
Image fusion based on sparse representation (SR) has become the primary research direction among transform-domain methods. However, SR-based image fusion algorithms have high computational complexity and neglect the local features of an image, resulting in limited retention of image detail and high sensitivity to registration misalignment. To overcome these shortcomings and the noise present in images during fusion, this paper proposes a new signal decomposition model, namely a multi-source image fusion algorithm based on gradient regularized convolutional SR (CSR). The main innovation of this work is to use a sparse optimization function to perform a two-scale decomposition of the source image into high-frequency and low-frequency components. The sparse coefficients are obtained with the gradient regularized CSR model, and the maximum of the sparse coefficients is taken to obtain the optimal high-frequency component of the fused image. The best low-frequency component is obtained with a fusion strategy based on the extreme value or the average value. The final fused image is obtained by adding the two optimal components. Experimental results demonstrate that this method greatly improves the ability to maintain image details and reduces sensitivity to image registration.
Keywords: gradient regularization; convolution sparse representation (CSR); image fusion
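The two-scale fusion rule outlined in this abstract (maximum for the high-frequency part, average for the low-frequency part, then sum) can be sketched on a 1-D signal. This is only an illustration under stated assumptions: a moving average stands in for the paper's sparse-optimization decomposition, and the max rule is applied to the high-frequency values directly instead of to CSR sparse coefficients. The function names and the `radius` parameter are hypothetical.

```python
def smooth(signal, radius=1):
    """Moving-average base layer (assumed stand-in for the paper's decomposition)."""
    n = len(signal)
    out = []
    for i in range(n):
        window = signal[max(0, i - radius):min(n, i + radius + 1)]
        out.append(sum(window) / len(window))
    return out

def two_scale_fuse(a, b, radius=1):
    """Fuse two equally sized signals with average (low) and max-abs (high) rules."""
    low_a, low_b = smooth(a, radius), smooth(b, radius)
    high_a = [x - l for x, l in zip(a, low_a)]   # detail layer of a
    high_b = [x - l for x, l in zip(b, low_b)]   # detail layer of b
    low_f = [(x + y) / 2 for x, y in zip(low_a, low_b)]                      # average rule
    high_f = [x if abs(x) >= abs(y) else y for x, y in zip(high_a, high_b)]  # max-abs rule
    return [l + h for l, h in zip(low_f, high_f)]                            # recombine

edge_signal = [0.0, 0.0, 10.0, 0.0, 0.0]   # sharp detail, dark background
flat_signal = [5.0, 5.0, 5.0, 5.0, 5.0]    # no detail, bright background
fused = two_scale_fuse(edge_signal, flat_signal)
print(fused)
```

The fused signal keeps the sharp peak from the first input while its background level moves toward the average of the two inputs, which is the behaviour the average and maximum rules are meant to produce.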
Multi-task Joint Sparse Representation Classification Based on Fisher Discrimination Dictionary Learning (Cited by: 6)
11
Authors: Rui Wang, Miaomiao Shen, Yanping Li, Samuel Gomes. Computers, Materials & Continua (SCIE, EI), 2018, Issue 10, pp. 25-48 (24 pages)
Recently, sparse representation classification (SRC) and Fisher discrimination dictionary learning (FDDL) have emerged as important methods for vehicle classification. In this paper, inspired by recent breakthroughs in discrimination dictionary learning and multi-task joint covariate selection, we address vehicle classification in real-world applications by formulating it as a multi-task joint sparse representation model based on Fisher discrimination dictionary learning, which merges the strengths of multiple features from multiple sensors. To improve classification accuracy in complex scenes, we develop a new method for vehicle classification, called multi-task joint sparse representation classification based on Fisher discrimination dictionary learning. In the proposed method, acoustic and seismic sensor data sets are captured by multiple heterogeneous sensors that measure the same physical event simultaneously, and multi-dimensional frequency spectrum features of the sensor data are extracted using Mel frequency cepstral coefficients (MFCC). Moreover, we extend our model to handle sparse environmental noise. We experimentally demonstrate the benefits of joint information fusion from different sensors, based on Fisher discrimination dictionary learning, in vehicle classification tasks.
Keywords: Multi-sensor fusion; Fisher discrimination dictionary learning (FDDL); vehicle classification; sensor networks; sparse representation classification (SRC)
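A Postoperative Risk Prediction Model Enhanced by Unstructured Data Representation
12
Authors: 王亚强, 杨潇, 朱涛, 郝学超, 舒红平, 陈果. 《中文信息学报》 (Journal of Chinese Information Processing) (CSCD, PKU Core), 2024, Issue 1, pp. 156-165 (10 pages)
Accurate postoperative risk prediction helps with planning clinical resources, preparing emergency plans, and reducing patients' postoperative risk and mortality. At present, postoperative risk prediction relies mainly on structured data such as patients' basic information, preoperative laboratory tests, and intraoperative vital signs; the value of unstructured preoperative diagnoses, which carry rich semantic information, has yet to be verified. To address this, the paper proposes a postoperative risk prediction model enhanced by unstructured data representation, which uses a self-attention mechanism to fuse structured data and preoperative diagnoses through information weighting. On clinical data, the proposed model is compared with statistical machine learning models commonly used for postoperative risk prediction and with recent deep neural networks; its F1 score improves by 9.533% on average across the tasks of pulmonary complication risk prediction, ICU admission risk prediction, and adverse cardiovascular risk prediction, and the model also has good interpretability.
Keywords: postoperative risk prediction; self-attention mechanism; data representation; information fusion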
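Multimodal Sentiment Analysis for Video Data
13
Authors: 武星, 殷浩宇, 姚骏峰, 李卫民, 钱权. 《计算机工程》 (Computer Engineering) (CAS, CSCD, PKU Core), 2024, Issue 6, pp. 218-227 (10 pages)
Multimodal sentiment analysis aims to extract and integrate semantic information from text, image, and audio data in order to identify the emotional state of speakers in online videos. Although multimodal fusion schemes have achieved some success in this area, existing methods still fall short in handling distribution differences between modalities and in fusing relational knowledge. A multimodal sentiment analysis method is therefore proposed. A Multimodal Prompt Gate (MPG) module is designed that converts non-verbal information into prompts fused with the textual context and uses textual information to filter noise from the non-verbal signals, yielding prompts rich in semantic information and strengthening information integration across modalities. In addition, an instance-to-label contrastive learning framework is proposed that separates different labels in the latent space at the semantic level to further optimize the model output. Experimental results on three large-scale sentiment analysis datasets show that the method improves binary classification accuracy by about 0.7% over the second-best model and improves three-class accuracy by more than 2.5%, reaching 0.671. The method provides a reference for bringing multimodal sentiment analysis to areas such as user profiling, video understanding, and AI interviews.
Keywords: multimodal sentiment analysis; semantic information; multimodal fusion; contextual representation; contrastive learning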
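A Mixed-Scale Fusion Algorithm for Dual-Modality Night Vision Images
14
Authors: 刘文强, 姜迈, 乔顺利, 李宏达. 《兵器装备工程学报》 (Journal of Ordnance Equipment Engineering) (CAS, CSCD, PKU Core), 2024, Issue 5, pp. 291-298 (8 pages)
To address the blurred details, reduced contrast, and missing background information of traditional infrared and visible image fusion algorithms, a mixed-scale infrared and visible image fusion method is proposed. The source images are decomposed into low-rank sub-bands and saliency sub-bands by the latent low-rank representation transform; the low-rank sub-bands are further decomposed into low-frequency and high-frequency components by the non-subsampled contourlet transform; the saliency sub-bands are fused with a method based on convolutional sparse representation; the low-frequency components are fused by combining the advantages of the global mean, regional mean, and energy; and the high-frequency components are fused using a weight decision map. Experiments on a self-built dataset and public datasets show that, compared with five other image fusion algorithms, the proposed algorithm fully inherits the useful information of the source images while producing fused images with more balanced overall contrast, noticeably better sharpness, and richer detail, and it achieves better results in both subjective and objective evaluation.
Keywords: image fusion; mixed scale; convolutional sparse representation; infrared image; visible image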
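A Spatiotemporal Fusion Algorithm for Remote Sensing Images Based on a Cross-Scale Similarity Prior
15
Authors: 方帅, 万旗, 曹洋. 《电子学报》 (Acta Electronica Sinica) (EI, CAS, CSCD, PKU Core), 2024, Issue 6, pp. 2037-2052 (16 pages)
The trade-off between spatial and temporal resolution in remote sensing satellite imagery creates a spatiotemporal conflict in image sequences. Spatiotemporal image fusion offers a way to generate images with both high spatial and high temporal resolution to serve various Earth observation applications. Sparse-representation-based spatiotemporal fusion algorithms model the relationship between high- and low-spatial-resolution images through jointly trained dictionaries and sparse coding, providing a unified fusion framework for cases such as phenological change and type change. However, multi-source remote sensing images come from different sensors, so the model relating high- and low-spatial-resolution images implicitly contains the sensor mapping, which makes the model device-dependent. To address this, the paper decomposes the spatiotemporal fusion of multi-source remote sensing images into two subproblems, sensor bias correction and spatiotemporal fusion, that is, a device-dependent part and a device-independent part. Sensor bias correction can serve as a preprocessing module for spatiotemporal fusion, improving fusion accuracy and making the subsequent fusion model more general. When the spatial resolutions of the high- and low-resolution images differ greatly, the fusion error introduced by the assumption that high- and low-spatial-resolution images share the same sparse coefficients becomes pronounced. To address this, the paper proposes a spatiotemporal fusion algorithm for remote sensing images based on a cross-scale similarity prior: it uses cross-scale similar patches to build a sparse-structure prior regularization term that optimizes the sparse representation objective, and it constructs intermediate-scale images to reduce the ambiguity of cross-scale similar patches. The algorithm is compared with other algorithms on three experimental datasets of typical scenes. Relative to the second-best metric values, it improves the Structural SIMilarity (SSIM) by 4.2% and the Spectral Angle Mapper (SAM) by 4.6% on the BOREAS dataset, SSIM by 2.7% and SAM by 12.8% on the CIA dataset, and SSIM by 7.1% and SAM by 16.3% on the LGC dataset, showing that the algorithm performs well in both spatial and spectral terms.
Keywords: remote sensing; spatiotemporal fusion; sparse representation; cross-scale similarity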
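MCM-ICE: A Multimodal Classification Model Combining Independent Encoding and Collaborative Encoding
16
Authors: 郭锐锋, 魏靖烜, 于碧辉, 孙林壮. 《小型微型计算机系统》 (Journal of Chinese Computer Systems) (CSCD, PKU Core), 2024, Issue 9, pp. 2080-2086 (7 pages)
Multimodal data processing is an important research area that can improve model performance by combining several kinds of information, such as text and images. However, because of the heterogeneity between modalities and the difficulty of information fusion, designing an effective multimodal classification model remains challenging. This paper proposes a new multimodal classification model, MCM-ICE, which tackles feature representation and feature fusion by combining independent encoding and collaborative encoding strategies. Experiments on the Fashion-Gen and Hateful Memes Challenge datasets show that the model outperforms existing state-of-the-art methods on both tasks. The paper also examines how the choice of vectors from the Transformer output layer of the collaborative encoding module affects the results, and finds that selecting the [CLS] vector together with the average-pooled vector of the remaining (non-[CLS]) vectors gives the best results. Ablation studies and exploratory analysis support the effectiveness of MCM-ICE for multimodal classification tasks.
Keywords: multimodal data processing; feature representation; feature fusion; collaborative encoding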
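Multimodal Entity Alignment Based on Multi-Level Feature Fusion and Reinforcement Learning
17
Authors: 李华昱, 王翠翠, 张智康, 李海洋. 《中文信息学报》 (Journal of Chinese Information Processing) (CSCD, PKU Core), 2024, Issue 9, pp. 36-47 (12 pages)
To address the problems that traditional entity alignment methods do not make full use of multimodal information and do not consider potential interactions between modalities during feature fusion, this paper proposes a multimodal entity alignment method that aims to fully exploit the different modal features of entities to find equivalent entities across multimodal knowledge graphs. First, embeddings of attributes, relations, images, and graph structure are obtained with separate feature encoders, and a numerical modality is introduced to enrich entity semantics. Second, in the feature fusion stage, cross-modal complementarity and correlation are modeled jointly on top of contrastive learning, and reinforcement learning is introduced to optimize the model output and reduce the heterogeneity gap between the resulting joint embedding and the true modal embeddings. Finally, the cosine similarity between two entities is computed to select candidate aligned entity pairs, which are iteratively added to the alignment seeds to guide further entity alignment. Experimental results show that the proposed method is effective for multimodal entity alignment.
Keywords: multimodal knowledge graph; representation learning; entity alignment; feature fusion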
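The final step mentioned in this abstract, selecting candidate aligned pairs by cosine similarity, can be sketched as follows. The toy embeddings, the entity names, the nearest-neighbour selection, and the 0.9 threshold are all assumptions for illustration; the abstract does not specify how candidates are filtered.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def candidate_pairs(source, target, threshold=0.9):
    """For each source entity, keep its most similar target if the score passes the threshold."""
    pairs = []
    for s_name, s_vec in source.items():
        t_name, score = max(
            ((t, cosine(s_vec, t_vec)) for t, t_vec in target.items()),
            key=lambda item: item[1],
        )
        if score >= threshold:
            pairs.append((s_name, t_name, score))
    return pairs

# Hypothetical joint embeddings from two knowledge graphs.
kg1 = {"e1": [1.0, 0.0, 0.0], "e2": [0.0, 1.0, 1.0]}
kg2 = {"f1": [0.9, 0.1, 0.0], "f2": [0.0, 1.0, 0.9], "f3": [-1.0, 0.0, 0.0]}

pairs = candidate_pairs(kg1, kg2)
print(pairs)
```

In an iterative scheme like the one the abstract describes, the pairs returned here would be added to the seed alignments before the next training round.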
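Construction of a Multimodal Knowledge Graph for the Well Logging Domain
18
Authors: 曹茂俊, 林世友, 肖阳, 王瑞芳, 邱斌鑫. 《计算机技术与发展》 (Computer Technology and Development), 2024, Issue 9, pp. 195-201 (7 pages)
In well logging interpretation, data are multi-source and heterogeneous, are hard to fuse in a complementary way, and cannot be readily applied to risk assessment, interpretation evaluation, and decision support. To address these problems, a method for constructing a multimodal knowledge graph for the well logging domain is proposed. From a well logging perspective, the method organizes knowledge top-down into general knowledge, regional knowledge, auxiliary knowledge, and so on; mines entity, attribute, and relation information in depth from the multimodal materials used in logging interpretation, such as text, images, audio, and video; and builds the ontology layer for the well logging domain. The multimodal knowledge graph is then completed using CasRel joint entity-relation extraction, cosine-similarity-based multimodal knowledge fusion, and TransR multimodal representation learning. Field validation at the Daqing Testing Service Branch shows that the resulting knowledge graph effectively strengthens the integration, interconnection, and sharing of well logging knowledge.
Keywords: well logging; knowledge graph; multimodal; knowledge fusion; knowledge representation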
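Sparse-Interest Network Sequential Recommendation Incorporating Item Feature-Level Information
19
Authors: 胡胜利, 武静雯, 林凯. 《计算机工程与设计》 (Computer Engineering and Design) (PKU Core), 2024, Issue 6, pp. 1743-1749 (7 pages)
Previous sequential recommendation models that extract multi-interest embeddings can discover only a small number of interest concepts through clustering and overlook the influence of feature-level information in the item interaction sequence on the final recommendation. To address this, the traditional multi-interest sequential recommendation model is improved, and a sparse-interest network sequential recommendation model that incorporates item feature-level information is proposed. Experimental results show that, compared with other models, the proposed model better captures users' diverse preferences and alleviates the cold-start problem. On the given datasets, it improves the hit rate by 6.4% on average and the normalized discounted cumulative gain by 8.7% on average over traditional sequential recommendation models.
Keywords: deep learning; sequential recommendation; multi-interest; sparse-interest network; embedding representation; feature-level information; feature fusion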
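Cascaded Recognition of Chinese Medical Named Entities Enhanced by Position Labels
20
Authors: 王旭阳, 赵丽婕, 张继远. 《计算机工程与应用》 (Computer Engineering and Applications) (CSCD, PKU Core), 2024, Issue 2, pp. 121-128 (8 pages)
Named entity recognition methods for general domains cannot be applied directly to Chinese medical entities, and existing related work focuses only on English text and flat-structured medical entities. Based on a study of entity recognition methods for specialized domains and the characteristics of Chinese medical entities, a cascaded recognition method for Chinese medical entities is proposed. The position label of each character relative to the entity is embedded into the model, and a fused entity representation is built by weighting the importance of the different elements within a Chinese medical entity span. A sequence labeling method detects the position labels of characters, the position information guides candidate entity generation, and the entities are then classified semantically. The model is evaluated on recognizing flat entities, nested entities, and discontinuous long entities on the CMeEE and CCKS2018 datasets and a Chinese diabetes research literature dataset, respectively. The results show that the method effectively recognizes entities of different structures in Chinese medical text.
Keywords: Chinese medical named entity; position label embedding; entity fusion representation weighted by element importance; cascaded recognition; linear structure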