基于遗传算法的主题信息搜索系统研究被引量：1

Study on Subject Information Search System Based on Genetic Algorithm

下载PDF

导出

摘要针对网络信息资源"迷向"与"过载"的现象,本文通过对遗传算法的分析应用,构建了由基于遗传算法的主题爬虫、信息处理和查询服务三部分组成的主题信息搜索系统。实验结果表明,应用该系统可以获取与主题相关度高的网页信息。 The subject information acquisition system is established by applying the genetic algorithm, according to web information overload and resource puzzle. The testing results showed that the web pages which are strong correlation in subject can be catched, and the accuracy of capturing the subject web pages was improved by using the system.

作者罗长寿康丽刘国靖

机构地区北京市农林科学院农业科技信息研究所中国农业大学信息与电气工程学院

出处《现代情报》 2009年第3期176-178,181,共4页 Journal of Modern Information

基金北京市自然科学基金资助项目(4062013):遗传算法在网页信息搜索技术中的应用研究

关键词主题遗传算法爬虫搜索系统 subject genetic algorithm crawler search system

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献4

1朱炜,王超,李俊,潘金贵.Web超链分析算法研究[J].计算机科学,2003,30(9):89-93. 被引量：20
2DeBra P, Houben G, Komatzky Y, et al. Information Retrieval in Distributed Hypertexts. Proc 4th RIAO Conference. New York: Computer- assisted Information Retrieval, 1994:481 - 491.
3Herseoviei M, Jacov M, Yoelle S Marek. The Shark - Search Algorithm- An Application: Tailored Web Site Mapping. Ccmputer Networks and ISDN Systems, 1998, 30: 317- 326.
4宋聚平,王永成,尹中航,滕伟.面向主题的网页搜索系统[J].上海交通大学学报,2003,37(3):401-403. 被引量：12

二级参考文献27

1Page L, Brin S, Motwani R, Winograd T. The PageRank Citation Ranking : Bringing Order to the WEB. Jan 1998 and July 2001 at http://www. db. stanford. edu/-backub/PageRanksub. ps.
2Brin S,Page L. The anatomy of a large-scale hypertextual WEB search engine, In: Proc of the Seventh Intl World Wide WEB Conf. 1998.
3Richardson M,Domingos P. The Intelligent Surfer: Probabilistic Combination of Link and Content Information in PageRank, volume 14. MIT Press, Cambridge, MA, 2002.
4Haveliwala T H. Topic-Sensitive PageRank. In:Proc of the Eleventh Intl World Wide WEB Conf. 2002.
5Kleinberg J. Authoritative sources in a hyperlinked environmerit. In.. Proc 9th ACM-SIAM Symposium on Discrete Algorithms, 1998. Extended version in Journal of the ACM 46(1999). Also appears as IBM Research Report RJ 10076, May 1997.
6Chakrabarti S,et al. Hypersearching the WEB. Scientific American. June 1999.
7Henzinger M R,Bharat K. Improved algorithms for topic distillation in a hyperlinked environment. In:Proc of the 21'st Intl ACMSIGIR Conf on Research and Development in IR, Aug. 1998.
8Lempel R,Moran S. The Stochastic Approach for Link-Structure Analysis (SALSA) and the TKC Effect. In:Porc 9 th Intl WorldWide WEB Conf. 2000.
9Chakrabarti S, et al. Mining the WEB's link structure. IEEE Computer, Aug. 1999.
10Chakrabarti S,et al. Automatic resource compilation by analyzing hyperlink structure and associated text. In:Proc 7th Intl WWW Conf. 1998.

共引文献29

1周晓滨.基于神经网络的Web信息检索研究与实现[J].情报杂志,2004,23(11):52-53.
2赵仲孟,何世丽,袁薇,沈钧毅.主题搜索引擎中专业网页索引集构造算法的研究[J].微电子学与计算机,2005,22(1):6-9. 被引量：3
3杨沅钊,吴薇,喻晓莉,杨国才.搜索引擎排名改进算法分析[J].农业网络信息,2005(2):41-43. 被引量：2
4单爱民.一种统一开放的互联网信息搜索排序公式的研究[J].现代计算机,2005,11(3):15-18.
5耿桦,李媛,朱炜,潘金贵.Web搜索中的数据挖掘技术研究[J].计算机科学,2005,32(4):37-41. 被引量：4
6赵新慧.搜索引擎中基于Bayes分类的网页更新研究[J].交通与计算机,2005,23(5):63-65.
7吴安清,张颖江,涂军.主题搜索ROBOT综合爬行策略的研究[J].武汉理工大学学报,2006,28(2):74-76. 被引量：6
8李彦刚,魏海平,侯兴华.基于HTMLParser的Web信息抽取系统的设计与实现[J].辽宁石油化工大学学报,2006,26(2):83-86. 被引量：8
9苑丽红.在C语言教学中启发和训练学生的编程思维[J].福建电脑,2006,22(12):207-208. 被引量：5
10李伟,黄颖.基于HtmlParser的网页信息提取[J].兵工自动化,2007,26(7):41-41. 被引量：4

同被引文献11

1凌波,周水庚,周傲英.P2P信息检索系统的查询结果排序与合并策略[J].计算机学报,2007,30(3):405-414. 被引量：13
2Broder A. A taxonomy of Web search[C]//SIGIR Forum. New York, N Y, USA: ACM Press, 2002 : 3-10.
3Rose D E, Levinson D. Understanding user goals in web search [C] //WWW ' 04 : Proceedings of the 13the international confe- rence on World Wide Web. New York, N Y, USA: ACM Press, 2004: 13-19.
4Jansen B J,Booth D L,Spink A. Determining the user intent of Web search engine queries[C] // Williamson CL, Zurko ME, Patel-Schneider PF,et al. , eds. Proc. of the 16th Int'l Conf. on World Wide Web. New York: ACM Press, 2007:1149-1150.
5Ricardo A, Liliana C B, Cristina N. The intention behind Webqueries[C]//Crestani F, Ferragina P, Sanderson M, eds. Proc. of the 13th Int'l Conf. on String Processing and Information Re- trieval (SPIRE 2006 ). Berlin, Heidelberg: Springer-Verlag, 2006 :98-109.
6Qi G, Eugene A. Exploring mouse movements for inferring que- ry intent[-C]//Myaeng SH, Oard DW, Sebastianj F, et al. , eds. Proc. of the 31st Annual Int' 1 ACM SIGIR Conf. on Research and Development in Information Retrieval. 2008:707-708.
7Liu YQ, Fu Y P, Zhang M, et al. Automatic search engine per- formance evaluation with click-through data analysis[C] ffWil- liamson CL, Zurko ME, Patel-Schneider PF, et al. , eds. Proc. of the 16th Int'l Conf. on World Wide Web. New York: ACM Press, 2007 : 1133-1134.
8吴晓晖,宋萍萍,张荣欣.有无查询意图的分类与实现架构模型研究[J].情报科学,2009,27(12):1829-1833. 被引量：6
9王大玲,于戈,鲍玉斌,张沫,沈洲.基于用户搜索意图的Web网页动态泛化[J].软件学报,2010,21(5):1083-1097. 被引量：14
10袁鼎荣,钟宁,张师超.文本信息处理研究述评[J].计算机科学,2011,38(2):9-13. 被引量：11

引证文献1

1杨艺,周元.基于用户查询意图识别的Web搜索优化模型[J].计算机科学,2012,39(1):264-267. 被引量：16

二级引证文献16

1陆伟,周红霞,张晓娟.查询意图研究综述[J].中国图书馆学报,2013,39(1):100-111. 被引量：27
2尤川川,张桂刚.一种基于大数据的有效搜索方法[J].计算机科学,2013,40(6):183-186. 被引量：12
3李敏,罗惠琼,唐春玲,王强.Web交互模型的形式化验证研究[J].计算机科学,2014,41(2):219-221. 被引量：1
4陈臣,陈双飞.一种基于大数据的数字图书馆高效搜索引擎[J].现代情报,2014,34(1):49-51. 被引量：14
5郑炜,梁战平,梁建.面向用户意图的智能搜索引擎框架研究[J].现代图书情报技术,2014(3):65-72. 被引量：8
6张萍,王建忠.一种基于大数据的有效搜索方法的改进[J].计算机应用研究,2014,31(8):2331-2333. 被引量：4
7李爱明.基于本体和用户查询意图的查询扩展方法研究[J].情报科学,2015,33(5):68-71. 被引量：4
8位通,贾仰理,张振领,Julien.一种新的语义相似度计算方法[J].聊城大学学报（自然科学版）,2015,28(2):88-92.
9曲朝阳,孙立擎,潘峰,曲楠,颜佳,张率.基于流形排序的电网截面数据检索[J].科学技术与工程,2016,16(15):239-244. 被引量：4
10胡伶霞.图书馆OPAC检索中基于词典的查询意图自动识别[J].图书馆学研究,2016(23):72-76. 被引量：8

1张小琴,王晓辉.主题信息搜索系统中的搜索策略研究[J].软件导刊,2014,13(1):89-92. 被引量：2
2邵雄凯,梁云静,刘建舟.基于遗传算法的主题信息搜索研究[J].网络安全技术与应用,2009(11):57-60. 被引量：1
3张小琴.联合贝叶斯推理与遗传算法的主题信息搜索策略[J].中南民族大学学报（自然科学版）,2014,33(2):89-92. 被引量：1
4肖立英,李建华,谭立球.基于Agent的用户个性化兴趣模型的研究[J].计算机科学,2002,29(z1):123-124.
5薛联凤.CORBA在电子商务智能代理中的作用[J].南京林业大学学报（自然科学版）,2002,26(3):75-77. 被引量：1
6杨山豹,张晓凌.基于知识库的智能搜索引擎研究[J].电脑与信息技术,2010,18(2):41-44. 被引量：3
7谢能付,王文生,段延娥.基于概念空间的领域信息爬虫设计研究[J].江西师范大学学报（自然科学版）,2008,32(2):192-196.
8杨桂芝.一种基于信息推送的搜索引擎模型[J].现代电子技术,2007,30(8):81-83. 被引量：2
9张华伟.个性化网络教学资源管理系统的设计与实现[J].科技广场,2009(5):174-176. 被引量：4
10王娟,方逵.一种优化的基于协同过滤的农业信息推荐系统研究[J].农机化研究,2011,33(7):194-197. 被引量：1

现代情报

2009年第3期

浏览历史

内容加载中请稍等...

基于遗传算法的主题信息搜索系统研究被引量：1

参考文献4

二级参考文献27

共引文献29

同被引文献11

引证文献1

二级引证文献16

相关作者

相关机构

相关主题

浏览历史

基于遗传算法的主题信息搜索系统研究 被引量：1

参考文献4

二级参考文献27

共引文献29

同被引文献11

引证文献1

二级引证文献16

相关作者

相关机构

相关主题

浏览历史

基于遗传算法的主题信息搜索系统研究被引量：1