

Design and Implementation of a Scientific Literature Statistical Analysis System Based on the Three Retrieval Systems
Abstract: This paper presents the design idea and architecture of a scientific literature statistical analysis system and explains in detail the design and implementation of its main algorithm, IESS. Experiments show that the system helps users quickly understand the distribution of papers in their research field and identify its key topics.
Source: Journal of The Hebei Academy of Sciences (《河北省科学院学报》), CAS, 2009, No. 2, pp. 14-18 (5 pages)
Funding: Hebei Province Science and Technology Research and Development Program project (07213597)
Keywords: three retrieval systems; document statistics; web-page analysis










