12 articles found
Image semantic annotation method based on a multimodal association graph  Cited by: 2
1
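Authors: 郭玉堂, 罗斌. 《计算机应用》, CSCD, Peking University Core, 2010, Issue A12, pp. 3295-3297, 3303 (4 pages)
To improve image annotation performance, an image semantic annotation method based on a multimodal association graph is proposed. The method uses an undirected graph to represent the relationships among image region features, annotation words, and images. It extracts image semantic information by combining the similarity of image region features with the correlation between semantics, which improves annotation precision. Inverse document frequency (IDF) is used to correct the weights of the edges between image nodes and their annotation-word nodes, which overcomes the bias caused by high-frequency words in traditional methods and effectively improves annotation performance. Experiments on the Corel image dataset verify the effectiveness of the method.
Keywords: image semantics; multimodal graph; inverse document frequency; high-frequency words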
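The IDF correction mentioned in the abstract of entry 1 can be illustrated with a minimal sketch. It assumes the textbook definition idf(w) = log(N / df(w)), where N is the number of images and df(w) is the number of images annotated with word w; the abstract does not give the paper's exact weighting formula. The function name `idf_edge_weights` and the toy annotations are hypothetical.

```python
import math
from collections import Counter

def idf_edge_weights(annotations):
    """Return {(image, word): weight}, weighting each image-word edge by IDF.

    annotations: dict mapping image id -> set of annotation words.
    Assumption: idf(w) = log(N / df(w)), the textbook form, which is not
    necessarily the formula used in the paper.
    """
    n_images = len(annotations)
    # Document frequency: number of images annotated with each word.
    df = Counter(word for words in annotations.values() for word in set(words))
    return {
        (image, word): math.log(n_images / df[word])
        for image, words in annotations.items()
        for word in words
    }

# Hypothetical toy data: "sky" labels every image, "tiger" labels only one.
toy = {"img1": {"sky", "tiger"}, "img2": {"sky", "grass"}, "img3": {"sky"}}
weights = idf_edge_weights(toy)
```

Under this assumption, a word attached to every image (here "sky") gets weight 0, while a rare word (here "tiger") gets the largest weight, which is the kind of high-frequency-word bias correction the abstract describes.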
SVD-ICP registration method for multimodal medical images  Cited by: 7
2
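Authors: 余立锋, 俎栋林, 王卫东, 邓元木, 尤江生, 包尚联. 《CT理论与应用研究(中英文)》, 2000, Issue 1, pp. 1-7, 16 (8 pages)
Registration of multimodal medical images plays an important role in medical diagnosis and treatment planning. This paper proposes a contour-feature-based iterative closest point registration method (SVD-ICP). The method combines the advantages of the closed-form SVD optimization and of iterative search to solve the matching of image contour points, and it is applicable to registration between medical images of different modalities. Registration experiments on two-dimensional CT-MRI and PET-MRI images demonstrate the effectiveness of the method.
Keywords: multimodal medical images; registration; fusion; medical diagnosis; CT; SVD-ICP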
SVDD with multimodal coupled feature subspace regularization  Cited by: 1
3
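Authors: 王闯, 胡文军, 刘闯, 王余波. 《湖州师范学院学报》, 2023, Issue 8, pp. 51-61 (11 pages)
To address the problem that traditional support vector data description (SVDD) cannot be used for anomaly detection on multimodal data, a novel SVDD method for multimodal data is proposed. The method maps multimodal data into a common low-dimensional subspace through projection matrices, then regularizes SVDD with a multimodal graph to preserve intra-modal and inter-modal structural relationships, and regularizes SVDD with sparse projection matrices to reduce the influence of feature coupling in the original space. The method is called coupled feature subspace regularized support vector data description (CFSR-SVDD). Experimental results show that the proposed method has advantages in both accuracy and stability.
Keywords: one-class classification; multimodal data; support vector data description; subspace learning; multimodal graph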
CT and MR medical image fusion method based on Legendre moments  Cited by: 3
4
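Authors: 汪家旺, 舒华忠, 罗立民, 葛云, 翁学军. 《中国图象图形学报(A辑)》, CSCD, Peking University Core, 2001, Issue 4, pp. 369-373 (5 pages)
To improve the accuracy and speed of registration and fusion of multimodal CT and MR medical images, a registration and fusion method based on Legendre moments is proposed. Exploiting the orthogonality and non-redundancy of the Legendre moments of two-dimensional image data, the method finds the centroids of the CT and MR images and computes the scale factor between the two images, which completes the translation and rotation of the two images and accurately registers and fuses them. The fast algorithm for Legendre moments is also optimized, which increases the speed of registering CT and MR images with Legendre moments. Experiments show that using Legendre moments to register and fuse multimodal images such as CT and MR is a fairly direct and concise approach. In addition, Legendre moments in medical imaging diagnosis ...
Keywords: orthogonal moments; multimodal images; data fusion; medicine; CT; MR; image registration; image processing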
A survey of three-dimensional medical image visualization techniques  Cited by: 33
5
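Authors: 李燕, 谭鸥, 段会龙. 《中国图象图形学报(A辑)》, CSCD, Peking University Core, 2001, Issue 2, pp. 103-110 (8 pages)
This paper briefly analyzes and reviews recent developments in three-dimensional medical image visualization. It surveys the techniques by category from three perspectives: segmentation and labeling of three-dimensional medical images, data integration of multimodal medical images, and rendering of volume data. It also introduces the principles and latest progress of the various algorithms. Because the purpose of medical image visualization is to help physicians understand information about internal biological tissue, image rendering is not the only important problem. Accurate automatic segmentation and labeling of tissues and tissue properties, and matching/fusion techniques that combine the complementary information provided by different imaging modalities, must also be solved, among which ...
Keywords: three-dimensional medical images; multimodal medical images; visualization; image segmentation; data integration; image matching; data fusion; imaging diagnosis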
Analysis of multimodality in PPT teaching discourse
6
Authors: 李冬艳, 胥国红. 《Sino-US English Teaching》, 2010, Issue 5, pp. 21-25 (5 pages)
Computer technology-based PPT is usually conceived as a tool for information transmission and presentation rather than as a type of discourse. Previous studies of PPT have focused mainly on its development, design, and application. However, PPT itself may be regarded as a multimodal discourse comprising multiple semiotic resources, such as linguistic signs, images, graphs, sound, color, and their interrelated layouts. The article therefore attempts a multimodal analysis of College English PPT discourse based on the principles of reading images proposed by Kress and van Leeuwen in 1996, aiming to present a different angle for interpreting the meaning of composition anchored in PPT.
Keywords: multimodality; PPT discourse; the meaning of composition
A method based on mutual information and gradient information for medical image registration  Cited by: 3
7
Authors: 陈晓燕, 辜嘉, 李松毅, 舒华忠, 罗立民. 《Journal of Southeast University(English Edition)》, EI, CAS, 2003, Issue 1, pp. 35-39 (5 pages)
Mutual information is widely used in medical image registration because it does not require preprocessing the image. However, the local maximum problem in registration is hard to overcome. We combine mutual information and gradient information to solve this problem and apply the combination to non-rigid deformation image registration. To improve accuracy, we address several implementation issues, for example the Powell search algorithm, gray-level interpolation, and the handling of outlier points. The experimental results show the accuracy of the method and its feasibility in non-rigid medical image registration.
Keywords: medical image registration; gradient information; mutual information; multi-modal images; non-rigid deformation
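The mutual information measure named in the abstract of entry 7 can be sketched as follows. This is a minimal illustration that estimates mutual information from the joint histogram of two equal-length intensity sequences. It covers only the mutual information term, not the gradient term or the Powell search described in the abstract, and the function name `mutual_information` and the toy pixel values are hypothetical.

```python
import math
from collections import Counter

def mutual_information(a, b):
    """Mutual information (in nats) between two equal-length sequences of
    discrete intensity values, estimated from their joint histogram."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(a)
    joint = Counter(zip(a, b))   # counts of co-occurring intensity pairs
    count_a = Counter(a)         # marginal counts for the first image
    count_b = Counter(b)         # marginal counts for the second image
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log(c * n / (count_a[x] * count_b[y]))
    return mi

# Hypothetical flattened pixel values from two modalities.
fixed = [0, 0, 1, 1]
aligned = [5, 5, 9, 9]       # intensities differ, structure matches
misaligned = [5, 9, 5, 9]    # structure no longer matches
mi_aligned = mutual_information(fixed, aligned)
mi_misaligned = mutual_information(fixed, misaligned)
```

In this toy case the aligned pair scores log 2 and the misaligned pair scores 0, even though the two "modalities" use different intensity values. That property is why the measure suits multi-modal registration without preprocessing.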
An analysis of data journalism
8
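Authors: 刘鹏. 《牡丹江大学学报》, 2016, Issue 7, pp. 32-35 (4 pages)
A new form of news reporting, data journalism, began to emerge in 2010. It takes data as its core, uses Internet technology, big data processing technology, and visual technology to uncover deep correlations among news facts, and employs multiple modal symbols such as text, images, and color to construct news discourse and convey news information. From a linguistic perspective, this paper sets out the definition and classification of data journalism, applies relevant theories to analyze its stylistic features, and reveals why those features formed, in order to deepen the understanding of data journalism and provide a reference for research on news register.
Keywords: data journalism; multimodal information; register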
Multimodality image registration and fusion using neural network
9
Authors: Mostafa G Mostafa, Aly A Farag, Edward Essock. 《Journal of Harbin Institute of Technology(New Series)》, EI, CAS, 2003, Issue 3, pp. 235-240 (6 pages)
Multimodality image registration and fusion are essential steps in building 3-D models from remote sensing data. We present in this paper a neural network technique for the registration and fusion of multimodality remote sensing data for the reconstruction of 3-D models of terrain regions. A feedforward neural network is used to fuse the intensity data sets with the spatial data set after learning its geometry. Results on real data are presented. Human performance is assessed on several perceptual tests in order to evaluate the fusion results.
Keywords: data fusion; image registration; image interpolation; neural network; 3-D model building
Test method of laser paint removal based on multi-modal feature fusion
10
Authors: HUANG Hai-peng, HAO Ben-tian, YE De-jun, GAO Hao, LI Liang. 《Journal of Central South University》, SCIE, EI, CAS, CSCD, 2022, Issue 10, pp. 3385-3398 (14 pages)
Laser cleaning is a highly nonlinear physical process for solving poor single-modal (e.g., acoustic or vision) detection performance and low inter-information utilization. In this study, a multi-modal feature fusion network model was constructed based on a laser paint removal experiment. The alignment of heterogeneous data across different modalities was solved by combining the piecewise aggregate approximation and the Gramian angular field. Moreover, the attention mechanism was introduced to optimize the dual-path network and dense connection network, enabling the sampling characteristics to be extracted and integrated. Consequently, the multi-modal discriminant detection of laser paint removal was realized. According to the experimental results, the verification accuracy of the constructed model on the experimental dataset was 99.17%, which is 5.77% higher than the optimal single-modal detection results of the laser paint removal. The feature extraction network was optimized by the attention mechanism, and the model accuracy was increased by 3.3%. The results verify the improved classification performance of the constructed multi-modal feature fusion model in detecting laser paint removal, the effective integration of acoustic data and visual image data, and the accurate detection of laser paint removal.
Keywords: laser cleaning; multi-modal fusion; image processing; deep learning
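The two preprocessing steps named in the abstract of entry 10, piecewise aggregate approximation (PAA) and the Gramian angular field (GAF), can be sketched as below. This is only an illustration under stated assumptions: it uses the summation variant of the GAF and requires the series length to be divisible by the segment count, while the abstract specifies neither. The function names `paa` and `gramian_angular_field` and the toy signal are hypothetical.

```python
import math

def paa(series, n_segments):
    """Piecewise aggregate approximation: the mean of each equal-length segment.
    Assumption: len(series) is divisible by n_segments."""
    if n_segments <= 0 or len(series) % n_segments != 0:
        raise ValueError("series length must be a positive multiple of n_segments")
    size = len(series) // n_segments
    return [sum(series[i * size:(i + 1) * size]) / size for i in range(n_segments)]

def gramian_angular_field(series):
    """Gramian angular summation field: G[i][j] = cos(phi_i + phi_j),
    where phi is the arccosine of the series rescaled to [-1, 1]."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("series must not be constant")
    scaled = [(2 * x - hi - lo) / (hi - lo) for x in series]
    # Clamp before acos to guard against floating-point overshoot.
    phi = [math.acos(max(-1.0, min(1.0, s))) for s in scaled]
    return [[math.cos(a + b) for b in phi] for a in phi]

# Hypothetical 1-D acoustic samples, reduced to 4 points and turned into a 4x4 image.
signal = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0, 6.0, 2.0]
reduced = paa(signal, 4)
image = gramian_angular_field(reduced)
```

The result is a square, symmetric matrix that can be treated as an image, which is one way a one-dimensional acoustic signal could be brought into the same form as visual data before fusion.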
Multi-modal face parts fusion based on Gabor feature for face recognition  Cited by: 1
11
Authors: 相燕. 《High Technology Letters》, EI, CAS, 2009, Issue 1, pp. 70-74 (5 pages)
A novel face recognition method, which is a fusion of multi-modal face parts based on Gabor feature (MMP-GF), is proposed in this paper. Firstly, the bare face image detached from the normalized image was convolved with a family of Gabor kernels, and then, according to the face structure and the key-point locations, the calculated Gabor images were divided into five parts: Gabor face, Gabor eyebrow, Gabor eye, Gabor nose, and Gabor mouth. After that, the multi-modal Gabor features were spatially partitioned into non-overlapping regions, and the averages of the regions were concatenated into a low-dimensional feature vector, whose dimension was further reduced by principal component analysis (PCA). In the decision-level fusion, the match results calculated separately for the five parts were combined according to linear discriminant analysis (LDA), and a normalized matching algorithm was used to improve the performance. Experiments on the FERET database show that the proposed MMP-GF method achieves good robustness to expression and age variations.
Keywords: Gabor filter; multi-modal Gabor features; principal component analysis (PCA); linear discriminant analysis (LDA); normalized matching algorithm
Smart Human Computer Interface with EMG and Vision Based on Multi-modal Information Fusion  Cited by: 1
12
Authors: Hee-su KANG, Hyun-chool SHIN. 《Journal of Measurement Science and Instrumentation》, CAS, 2011, Issue 2, pp. 152-156 (5 pages)
A smart Human Computer Interface (HCI) replacing the conventional mouse interface is proposed. The interface can control and issue commands using only the hand. Four finger motions (left click, right click, hold, drag) are used to command the interface. The authors also implement cursor movement control using image processing. The measures they use for inference are the entropy of the electromyogram (EMG) signal, Gaussian modeling, and likelihood estimation. In the image processing for cursor control, they use color recognition to get the center point of the fingertip from a marker and map the point onto the cursor. The accuracy of finger movement inference is over 95%, and cursor control works naturally without delay. They implement the whole system to check its performance and utility.
Keywords: vision; HCI; interface; mouse