Abstract
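Data mining is not yet widely applied in China, and one important reason is that business data is poorly standardized. Data preprocessing techniques can make business data more regular, so that data mining algorithms achieve good results. Taking insurance claim prediction as the application background, the paper describes how to combine domain knowledge with data cleaning, and also proposes a data reduction algorithm for compressing large data sets.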
One reason data mining is not widely used in China is the irregularity of the data. Data preprocessing can make the data more regular and make data mining algorithms more effective. With claim fraud detection as the research background, the paper discusses a data cleaning method that draws on professional knowledge and puts forward a data reduction method for large databases. The method is reported to work well in practice.
Source
《计算机工程与设计》
CSCD
PKU Core Journal (北大核心)
2005, Issue 9, pp. 2537-2539, 2564 (4 pages)
Computer Engineering and Design