摘要
Web作为世界上最大的信息源,为数据挖掘技术提供了大量的原始数据,然而Web数据半结构化的特征使得在数据挖掘过程中必须选择合适的算法;研究Web信息提取的过程,并利用粗集方法实现对于来自Web的大批量农产品价格数据的挖掘过程。
As the most information resource in the world, Web offers a lot of primitive data for data mining, but that data is semi-structural, so it is an important job to choose a proper algorithm. It studies the process of Web information retrieval, and by rough set theory, the data mining was completed about the agriculture product price from Web.
出处
《安徽工业大学学报(自然科学版)》
CAS
2005年第4期379-382,共4页
Journal of Anhui University of Technology(Natural Science)
关键词
粗集
WEB信息提取
差别矩阵
约简
rough set
Web information retrieval
discernibility matrix
reduction