Abstract
This paper discusses the problem of data heterogeneity in Web data mining. Using XML technology, it establishes a semi-structured data model and an automatic extraction model to address most of the difficulties in Web data mining that are caused by the problems of integrating heterogeneous, unstructured, and dynamic data on the Internet.
Source
《计算机时代》
2010, No. 9, pp. 4-6 (3 pages)
Computer Era