期刊文献+

基于余弦相似度的文本空间索引方法研究 被引量:49

An Approach for Spatial Index of Text Information Based on Cosine Similarity
下载PDF
导出
摘要 基于相似度的数据空间索引在数据挖掘及数据可视化等方面有着重要的应用。本文以新闻的标题为研究对象,提出了以 CrossAVL为基础的文本对象层次式聚类方法以及文本信息空间索引算法 FastMap-MDS,有效地保持了文本对象间的相似信息。实验表明,该方法具有较高的效率和精度。 Spatial index for data based on similarity can be employed by applications on data mining and data visualization widely. To build spatial index of news title, this paper implements hierarchical cluster algorithm for news titles with CrossAVL as data structure for the similarity matrix storing and presents an available and efficiency method named as FastMap-MDS. Experiment results show that this method can work efficiently while the similarity information are kept well.
出处 《计算机科学》 CSCD 北大核心 2005年第9期160-163,共4页 Computer Science
基金 中国博士后科学基金(2004036463)
关键词 余弦相似度 数据空间 索引方法 数据挖掘 数据可视化 数据库 Similarity, Spatial index, Hierarchical cluster
  • 相关文献

参考文献6

  • 1陈恩红,塔建庆,张振亚,王煦法.基于神经网络的增量式数据索引机制研究[J].小型微型计算机系统,2003,24(10):1783-1786. 被引量:1
  • 2Faloutsos C. FastMap: A Fast Algorithm for indexing, Data-Min ing and Visualization of Traditional and Multimedia Datasets. In:Proc. of ACM SIGMOD, 1995. 163~174
  • 3Jagadish H V. A retrieval technique for similar shapes. In:Proc. ACM SIGMOD Conf, May 1990. 208~217
  • 4Torgerson S. Multidimensional scaling: I. theory and method. Psychometrika, 1952,17: 401~419
  • 5Kruskal J B, Wish M. Multidimensional scaling. SAGE publications, Beverly Hills, 1978
  • 6Ding C. Cluster merging and splitting in hierarchical clustering al gorithms. In:IEEE Intl. Conf. on Data Mining (ICDM'02), Dec. 2002. 139~146

二级参考文献5

  • 1Jagadish H V. A retrieval technique {or similar shapes[C]. Proc.ACM SIGMOD Conf, May 1990, 208-217.
  • 2Hristescu G and Farach-Colton M; CoFE:A scalable method for feature extraction from complex objects[C]. Proceedings of Data Warehousing and Knowledge Discovery 2000,Sep. 2000, 358-371.
  • 3Joseph B. Kruskal and Myron Wish, Multidimensional scaling[M]. SAGE Publications, Beverly hills, 1978.
  • 4Christos Faloutsos, FastMap: A fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets[C]. Proc. of ACM SIGMOD 1995, 163-174.
  • 5Shu B,Kak S. A neural network-based intelligent meta search engine[J]. Information Sciences, 1999,10, 1 - 11.

同被引文献450

引证文献49

二级引证文献145

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部