针对基于EMD(Earth Mover's Distance)的文档语义相似性算法不满足度量公理因而难以在信息检索与数据挖掘中推广应用的问题,该文提出了一种新的基于EMD的文档语义相似性度量——..Mdss_EMD(Metric for document semantic similarity...针对基于EMD(Earth Mover's Distance)的文档语义相似性算法不满足度量公理因而难以在信息检索与数据挖掘中推广应用的问题,该文提出了一种新的基于EMD的文档语义相似性度量——..Mdss_EMD(Metric for document semantic similarity based EMD)。首先在分析EMD及现有改进方法缺陷的基础上,给出了文档宽度、虚拟项的概念;随后通过增加虚拟项来对齐文档矢量的总权值,使所有度量公理得到满足;最后,为提高该度量的适应能力及处理速度,还实现了虚拟项相似距离的弹性设计并对EMD算法进行了简化。该方法把EMD扩展到度量空间中来,很大程度上提高了EMD的索引能力与精度,初步实验表明,Mdss_EMD的整体性能优于原EMD及现有其它类似方法。展开更多
The technique of image retrieval is widely used in science experiment, military affairs, public security, advertisement, family entertainment, library and so on. The existing algorithms are mostly based on the charact...The technique of image retrieval is widely used in science experiment, military affairs, public security, advertisement, family entertainment, library and so on. The existing algorithms are mostly based on the characteristics of color, texture, shape and space relationship. This paper introduced an image retrieval algorithm, which is based on the matching of weighted EMD(Earth Mover’s Distance) distance and texture distance. EMD distance is the distance between the histograms of two images in HSV(Hue, Saturation, Value) color space, and texture distance is the L1 distance between the texture spectra of two images. The experimental results show that the retrieval rate can be increased obviously by using the proposed algorithm.展开更多
文摘针对基于EMD(Earth Mover's Distance)的文档语义相似性算法不满足度量公理因而难以在信息检索与数据挖掘中推广应用的问题,该文提出了一种新的基于EMD的文档语义相似性度量——..Mdss_EMD(Metric for document semantic similarity based EMD)。首先在分析EMD及现有改进方法缺陷的基础上,给出了文档宽度、虚拟项的概念;随后通过增加虚拟项来对齐文档矢量的总权值,使所有度量公理得到满足;最后,为提高该度量的适应能力及处理速度,还实现了虚拟项相似距离的弹性设计并对EMD算法进行了简化。该方法把EMD扩展到度量空间中来,很大程度上提高了EMD的索引能力与精度,初步实验表明,Mdss_EMD的整体性能优于原EMD及现有其它类似方法。
文摘The technique of image retrieval is widely used in science experiment, military affairs, public security, advertisement, family entertainment, library and so on. The existing algorithms are mostly based on the characteristics of color, texture, shape and space relationship. This paper introduced an image retrieval algorithm, which is based on the matching of weighted EMD(Earth Mover’s Distance) distance and texture distance. EMD distance is the distance between the histograms of two images in HSV(Hue, Saturation, Value) color space, and texture distance is the L1 distance between the texture spectra of two images. The experimental results show that the retrieval rate can be increased obviously by using the proposed algorithm.