
Research on Extracting Web Page Hyperlinks with HTMLParser (cited by: 1)
Abstract: Every Web page contains many hyperlinks, and much of a page's useful information is carried in those hyperlinks, so retrieving them effectively is an important step in Web mining. This paper proposes using the open-source tool HTMLParser to parse Web pages and extract their hyperlinks, thereby obtaining useful information in preparation for the subsequent development of a search engine.
Author: Lang Fengju (郎凤举)
Source: 《电脑编程技巧与维护》 (Computer Programming Skills & Maintenance), 2010, No. 2, pp. 74-75 (2 pages)
Keywords: HTMLParser; page parsing; information extraction
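
The abstract describes parsing a Web page and pulling out its hyperlinks, but this record does not include the paper's code. As a rough illustration only, the sketch below shows the general idea using Python's standard-library `html.parser.HTMLParser` class, which is a different tool from the open-source HTMLParser named in the abstract. The names `LinkExtractor` and `extract_links`, and the choice to resolve relative links against a base URL, are assumptions made for this example and are not taken from the paper.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Hypothetical helper: collects (URL, anchor text) pairs from <a href> tags."""

    def __init__(self, base_url=""):
        super().__init__()
        self.base_url = base_url  # used to resolve relative links (assumption)
        self.links = []           # finished (url, text) pairs
        self._href = None         # URL of the <a> tag currently open, if any
        self._text = []           # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:  # skip anchors without an href, e.g. <a name="x">
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an open hyperlink.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html, base_url=""):
    """Parse an HTML string and return its hyperlinks as (url, text) pairs."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling `extract_links(page_source, "http://example.com/index.html")` on a downloaded page would return a list of link and anchor-text pairs, which a crawler or indexer could then filter or follow. Fetching the page and handling its character encoding are left out of this sketch.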