Study on Focused Crawler for Uyghur Language
Abstract: As online resources continue to grow, the Internet has increasingly replaced traditional ways of obtaining information. Work on Uyghur has developed rapidly in language information processing, Web applications, and related fields. This paper describes the working principle of web crawlers and the strategies used by focused crawlers. On this basis, and drawing on related research on Uyghur information extraction, it studies the structure and strategy of a web crawler for Uyghur text. The aim is to supply large-scale corpora for building the web-page database of a Uyghur search engine and for research on Uyghur online public-opinion analysis.
Source: Journal of Xinjiang Normal University (Natural Sciences Edition), 2014, No. 4, pp. 75-78 (4 pages)
Keywords: web crawler; Uyghur focused-crawling strategy; Uyghur web crawler; Uyghur search engine
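
The record above gives no implementation details, so the following is only a minimal sketch of what a focused crawling strategy for Uyghur text might look like, not the authors' method. It assumes a crude relevance signal (the share of Arabic-script letters on a page, gated on the presence of letters typical of Uyghur orthography) and a best-first frontier in which links inherit the score of the page that linked to them. The names `uyghur_score`, `focused_crawl`, `UYGHUR_HINT_LETTERS`, the `fetch` callback, and the 0.5 threshold are all hypothetical choices made for illustration.

```python
import heapq

# Assumption: Uyghur web text is written in Arabic script (U+0600..U+06FF).
ARABIC_START, ARABIC_END = 0x0600, 0x06FF
# Letters used in Uyghur orthography but not in standard Arabic.
# Illustrative and not exhaustive: AE, OE, U, YU, E, NG, VE.
UYGHUR_HINT_LETTERS = set("\u06d5\u06c6\u06c7\u06c8\u06d0\u06ad\u06cb")


def uyghur_score(text):
    """Return the share of Arabic-script letters, or 0.0 if no hint letter occurs."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    if not any(ch in UYGHUR_HINT_LETTERS for ch in letters):
        return 0.0  # Arabic script without Uyghur-specific letters is ignored
    arabic = sum(1 for ch in letters if ARABIC_START <= ord(ch) <= ARABIC_END)
    return arabic / len(letters)


def focused_crawl(seeds, fetch, max_pages=100, threshold=0.5):
    """Best-first crawl: URLs are visited in order of their parent page's score.

    `fetch(url)` is a caller-supplied function (hypothetical) that returns
    (text, links) for a page, or None if the page cannot be retrieved.
    """
    frontier = [(-1.0, url) for url in seeds]  # negated score gives a max-heap
    heapq.heapify(frontier)
    seen = set(seeds)
    corpus = []
    visited = 0
    while frontier and visited < max_pages:
        _, url = heapq.heappop(frontier)
        visited += 1
        page = fetch(url)
        if page is None:
            continue
        text, links = page
        score = uyghur_score(text)
        if score < threshold:
            continue  # off-topic page: discard it and do not follow its links
        corpus.append((url, text))
        for link in links:
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score, link))
    return corpus
```

Passing `fetch` as a parameter keeps the sketch free of network code, so the strategy can be tried against an in-memory dictionary of pages before any real downloader, politeness policy, or robots.txt handling is added.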