期刊文献+

基于XML和ANN的Web文本智能检索研究 被引量:1

Research of web text information retrieval based on XML and ANN
下载PDF
导出
摘要 传统的网络信息检索技术如搜索引擎存在一些不足,一方面它只是将信息搜寻出来,不能发现隐藏在数据背后的知识;另一方面其采集软件在采集数据时缺乏人工干预,智能性不强,导致信息利用率不高。针对传统的Web搜索引擎存在的上述问题,结合Web文本挖掘、XML、BP神经网络在数据处理方面的长处,提出了一个具有一定智能的Web文本信息检索模型,以使其具有较高的信息利用率。 There are some shortages to the traditional technical of web information retrieval such as search engine. On one hand, when it works, all that it does is only responsible for searching the information in the Web, and then shows all the results to users in some order without some necessary work of filter, so it can't find the hidden knowledge behind the data; on the other hand, due to a lack of necessary manual intervention, its module of gathering data is with poor intelligence. Hence on this condition the utilization ratio of information is not high. To the above questions that the traditional web search engine exists, the advantages of web text mining, XML, BP neural networks in processing data are combined to propose a model of web text information retrieval and through it, a higher utilization ratio of information is achieved.
作者 张标 何国辉
出处 《计算机工程与设计》 CSCD 北大核心 2006年第16期2973-2975,共3页 Computer Engineering and Design
基金 广东省自然科学基金项目(032356)
关键词 WEB文本挖掘 WEB信息检索 可扩展标记语言 人工神经网络 向后传播误差算法 web text mining web information retrieval XML artificial neural network back-propagation algorithm
  • 相关文献

参考文献6

二级参考文献18

  • 1毛国君.数据挖掘的概念、系统结构和方法[J].计算机工程与设计,2002,23(8):13-17. 被引量:28
  • 2HANJia-wei KamberMicheline 范明.数据挖掘:概念与技术[M].北京:机械工业出版社,2001..
  • 3ZHOU Hao-feng, LOU Yu-bo, YUAN Qing-qing,et al. Refining Web authoritative resource by frequent structures [C]. Proceedings of the Seventh International Database Engineering and Applications Symposium(IDEAS'03), 2003.250 -255.
  • 4Vaisman A A, Dandretta G, Sapia M. Enhancing Web access using data mining techniques[C]. Proceedings of the 14th International Database and Expert Systems Applications Workshop,2003.327 -331.
  • 5Fayyad U, Piatesky-Shapiro G, Smyth P. The KDD process for extracting useful knowledge from volumes ofdata[J].Communications of the ACM, 1996, 39(11).
  • 6Hand D J. Data mining: Statistics and more[J]. The American Statistician, 1998,52 (2).
  • 7张善节,唐汉,高瑞章.实用计算方法[M].南京:南京大学出版社,1998.
  • 8加罗什金(俄).神经网络理论[M].北京:清华大学出版社,2002.
  • 9Han J. Towards on-line analytical mining in lare database systems[C]. ACM-SIGMOD, 1993.97-107.
  • 10Washington D C.International conference on management of data[C]. ACM Press,1993.207-216.

共引文献78

同被引文献8

  • 1欧启忠.ASP安全编译及其应用程序开发[J].广西师范学院学报(自然科学版),2005,22(2):90-93. 被引量:1
  • 2倪现君.文本挖掘在Web中的技术分析[J].中国科技信息,2006(03A):23-23. 被引量:1
  • 3胡骏,李星.校园网信息资源搜索引擎的研究与实现[J].计算机工程与设计,2006,27(24):4629-4631. 被引量:14
  • 4Sriam Raghavan, Hector Garcia-Molina. Crawling the hidden web[C]. Roma,Italy: Proceedings of the 27th VLDB Conference,2001.
  • 5孙斌.文本信息提取技术[Z].北京大学计算机系计算语言所,2000.
  • 6Steven Bowman.Active server pages[EB/OL].http://www.tessella.com,2001.
  • 7Dayne Freitag,Nicholas Kushmeric.Boosted wrapper induction [C].Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence.Austin,TX:AAAI Press/The MIT Press,2000:577-583.
  • 8William W Cohen, Matthew Hurst, Lee S Jenson. A flexible learning system for wrapping tables and lists in html documents [C].Proceedings of the 11th International World Wide Web Conference on WWW-2002. Honolulu, Hawaii, USA: ACM Press, 2002:232-241.

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部