摘要
提出一种灰度散布分类指导挖掘的高类似度文本分类方法,采用灰度散布分类的思想对海量文本下的数据挖掘进行指导,采用灰度散布分类思想对特征更细分类能力,将相似度高的文本更细的分开,最后采用一组特征接近的随机词汇进行特征分类实验。结果显示,采用基于灰度散布分类指导挖掘方法,使特征类似度很高的文本可以被很好的区分开来,具有广泛的分类应用价值。
A classification of high likeness text with guidance under distribution of gray level was proposed, the distributionof gray level was used to do data mining in mass text, the detailed characteristic was extracted, and the high likeness textwas separated. Finally, a team of high likeness text was used to test the effect, and the experiment result shows that the highlikeness text can be separated well with guidance under distribution of gray level, so it has good value for separation application.
出处
《科技通报》
北大核心
2014年第4期179-181,共3页
Bulletin of Science and Technology
关键词
灰度散布
分类指导
高类似度文本分类
distribution of gray level
guidance of classification
high likeness text