
Design and Implementation of a Uighur Generalized Suffix Tree Construction Algorithm (维吾尔文后缀树构造算法的设计与实现)
Abstract Suffix Tree Clustering (STC) has been applied to web page clustering. To apply the STC algorithm to Uighur web pages, this paper analyzes the characteristics of the generalized suffix tree and of the Uighur language, and designs a Uighur generalized suffix tree construction algorithm. Experimental results show that the method constructs the Uighur suffix tree in linear time and that the resulting tree can be used to cluster Uighur web pages.
Source Computer Engineering and Applications (《计算机工程与应用》), CSCD, 2013, No. 8, pp. 9-11, 16 (4 pages)
Funding National Natural Science Foundation of China (No. 61262063, No. 61142004); Open Project of the Xinjiang Multilingual Key Laboratory (No. 049807)
Keywords suffix; suffix tree; generalized suffix tree; node; common prefix

References (9)

  • 1 Chim H, Deng Xiaotie. Efficient phrase-based document similarity for clustering[J]. IEEE Transactions on Knowledge and Data Engineering, 2008, 20(9).
  • 2 Nguyen H. Mobile search engine using clustering and query expansion[D]. San Jose: Department of Computer Science, San Jose State University, 2010.
  • 3 Grossi R, Italiano G F. Suffix trees and their applications in string algorithms[R]. Venice: University of Venice, 1995.
  • 4 Apostolico A. The myriad virtues of subword trees[M]//Combinatorial algorithms on words. Berlin, Germany: Springer-Verlag, 1985.
  • 5 Zamir O, Etzioni O, Karp R M. Fast and intuitive clustering of web documents[C]//Proc 3rd Int'l Conf Knowledge Discovery and Data Mining. [S.l.]: AAAI Press, 1997.
  • 6 Zamir O, Etzioni O. Web document clustering: a feasibility demonstration[C]//SIGIR'98. New York: ACM, 1998: 46-54.
  • 7 Carpineto C, Osinski S, Romano G, et al. A survey of web clustering engines[J]. ACM Computing Surveys, 2009, 41(3).
  • 8 Cao Guihong, Song Dawei, Bruza P. Suffix tree clustering on post-retrieval documents[Z]. Queensland: Information Ecology, Distributed Systems Technology Centre, The University of Queensland, 2003.
  • 9 Zamir O E. Clustering web documents: a phrase-based method for grouping search engine research results[D]. Washington: University of Washington, 1999.
