摘要
数据挖掘是20世纪末逐渐形成的一个多学科交叉领域,目前已经广泛成功地应用在金融、零售、医药、通讯、电子工程、航空、旅馆等有大量数据和深度分析需求的领域。文中对数据挖掘的建模、算法、应用和软件工具进行了综述,给出了数据挖掘的定义、范畴和特点,以及数据挖掘的数据集的各种实际情况;总结了数据挖掘在实际应用时的基本步骤和过程;对数据挖掘在各种应用问题上的任务和建模进行了讨论;列举了目前数据挖掘领域中主要流行的算法,并对算法设计需要考虑的问题进行了简要的分析;综述了目前数据挖掘算法在一些领域的应用;较全面地叙述了目前数据挖掘软件工具性能及其开发商情况;最后,对数据挖掘的发展前景和方向进行了展望。
Data mining is an interdisciplinary area formulated in the end of the 20th century. Up to now, it has been widely and successfully used in the applications of banks, stock markets, retails, medicine, telecommunication, electronic engineering, aviation industry and travel industry, where huge amounts of data are available and await indepth analysis. Provides a survey for data mining from the points of modeling, algorithms, applications and software systems. First, it outlines the concept and characteristics Of data mining, and discusses the data sets. Second, the stePs and procedure are summarized. Third, the tasks and models are explored in the scenarios of applications. Fourth, the popularly used data mining algorithms are briefly analyzed with the practical considerations. Fifth, the applications of data mining are illustrated. Sixth, the data mining software tools, their features and vendors are listed and commented. Finally, the prospect and the issues to be solved by data mining are addressed.
出处
《计算机技术与发展》
2006年第1期1-4,65,共5页
Computer Technology and Development
基金
留学回国启动基金(4131522)
关键词
数据挖掘
算法
应用
软件系统
data mining
algofithms
applications
software systems