摘要
针对当前互联网发展的高度动态性和复杂性,阐明并分析了W eb数据挖掘中存在的问题,并在此基础上主要探讨了XML在W eb数据挖掘中所起的作用.结合XML的可扩展性以及可被结构化等特点,从几个方面对基于HTML网页挖掘所遇到的困难,诸如链接信息分析、数据信息集成等,都提出了相应基于XML的解决方案,并给出了简要的示例说明.
The problems in WEB data mining in regard to the high dynamism and complicacy are analyzed in this paper. On the basis of the analysis, the function of XML in web data mining is mainly discussed. As for the difficulties encountered in HTML mining, such as linked information analysis and data information integration, solutions to XML are given respectively in this paper, as well as demonstration for simple examples combining with expandable and structuralized characteristics.
出处
《大连大学学报》
2006年第2期56-58,67,共4页
Journal of Dalian University