摘要
挖掘的理论和应用研究是数据挖掘领域一个新的重要分支 ,介绍了一种文本数据挖掘方法 .首先 ,论述了文本挖掘的意义和重要性 ,探讨了文本挖掘的定义和文本分类的一些形式 ,然后讨论了一个以数据预处理、特征提取、特征表示和特征匹配等文本分类的一些关键理论问题 ,并给出了一个基于该方法的文本分类系统的实验结果 。
Study and application of text data mining is one of the most important problems in the data mining. In this paper, we firstly study a method of text data mining. We first discuss the signification and importance of text data mining, and present the definition of text mining and some types of text classification. Then we give the key theory on text classification in detail, such as data processing, character mining, character denoting and character matching. Finally, we get some results of experiment by using a simple system based on the text classification method. These results of experiment mean that the method is feasible.
出处
《湘潭大学自然科学学报》
CAS
CSCD
2001年第4期34-37,共4页
Natural Science Journal of Xiangtan University
基金
湖南省教育厅资助项目 (0 0C85 )