摘要
提出了一种新的文档特征提取方法,将关键字通过文档的题名、摘要进行映射扩展,并对关键字的出现位置进行加权,不仅解决了维度偏高的问题,而且突出了重点特征词,提高了聚类的速度和精度。
This paper presents a new method of document feature extraction, which maps key word , tide , summary of the text file to the key words set, and adds the word weight based on the position of the word. It decreases the number of vector, gives prominence to important words and improves the speed and precision of text clustering.
出处
《盐城工学院学报(自然科学版)》
CAS
2006年第4期68-70,共3页
Journal of Yancheng Institute of Technology:Natural Science Edition
关键词
特征提取
模糊聚类
学术期刊
feature extraction
fuzzy clustering
academic journal