Study on the Construction and Expansion of Industry Label Taxonomy Based on Search Engine
Abstract: Different industries have different needs, which calls for an industry label taxonomy that can be continuously refined and can keep learning. A taxonomy uses classification labels to group entities that share the same attributes. In the industry label taxonomy, each classification label covers two kinds of entities: URLs and keywords. Search engines play an important role in both the construction and the expansion of the taxonomy. So far, a preliminary industry label taxonomy has been built, and a system that automatically expands its URLs and keywords has been implemented.
Source: 《信息技术与信息化》 (Information Technology and Informatization), 2015, No. 6, pp. 161-165 (5 pages)
Keywords: search engine; taxonomy; classification labels
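
The structure described in the abstract (classification labels that each hold URLs and keywords, expanded with the help of a search engine) can be illustrated with a minimal sketch. The abstract does not say how the expansion system works, so everything below is an assumption made for illustration: the names `Label`, `Taxonomy`, `expand_urls`, and `fake_search` are hypothetical, the keyword-to-URL expansion rule is only one plausible reading, and the search engine is replaced by a stub rather than a real API.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Set


@dataclass
class Label:
    """One classification label with the two entity kinds named in the abstract."""
    name: str
    urls: Set[str] = field(default_factory=set)
    keywords: Set[str] = field(default_factory=set)


@dataclass
class Taxonomy:
    """A flat collection of labels keyed by name (hierarchy omitted for brevity)."""
    labels: Dict[str, Label] = field(default_factory=dict)

    def add_label(self, name: str) -> Label:
        # Return the existing label if present, otherwise create it.
        return self.labels.setdefault(name, Label(name))


# Hypothetical search interface: maps a query string to result URLs.
SearchFn = Callable[[str], Iterable[str]]


def expand_urls(label: Label, search: SearchFn, limit: int = 10) -> List[str]:
    """Query `search` with each keyword of the label and add unseen URLs.

    Assumed expansion rule, not the paper's: at most `limit` new URLs are
    added per call. Returns the newly added URLs in discovery order.
    """
    added: List[str] = []
    for keyword in sorted(label.keywords):
        for url in search(keyword):
            if len(added) >= limit:
                return added
            if url not in label.urls:
                label.urls.add(url)
                added.append(url)
    return added


def fake_search(query: str) -> List[str]:
    """Stub standing in for a real search engine; the index is made up."""
    index = {
        "car loan": ["https://example.com/auto", "https://example.com/loans"],
    }
    return index.get(query, [])


taxonomy = Taxonomy()
finance = taxonomy.add_label("finance")
finance.keywords.add("car loan")
finance.urls.add("https://example.com/loans")
new_urls = expand_urls(finance, fake_search)
```

In this sketch, URLs already filed under a label are skipped, so repeated expansion runs only add entities the label has not seen. A keyword-expansion step could follow the same pattern in the opposite direction, but the abstract gives no detail on it, so it is left out.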

