摘要
以网络上数据量最大、更新最快的网络新闻作为主要的数据源,对其检索与挖掘技术展开研究,构建一个大规模网络数据在线挖掘分析服务系统,以该系统作为平台,为用户提供廉价的、高可靠的、具备交互能力、实时的在线搜索和数据挖掘服务。对于该系统,主要研究其整体设计思路和各模块的算法思想。
Uses the online news which is of large amount and updated quickly as the main data source, studies the retrieval and data mining technology, and creates a platform based on large-scale network data analysis. This platform afford users cheap, reliable, interactive, and real-time data mining services. Mainly studis the overall design idea of this system and the main algorithm of the modules.
出处
《现代计算机》
2012年第15期8-12,共5页
Modern Computer
关键词
大规模新闻数据
分类
聚类
检索
Large-Scale Online News Data
Sorting
Clustering
Retrieving