Abstract
The web is a vast virtual library, yet text data, which makes up the largest share of its information, lacks structure and organization. This greatly reduces how efficiently online text information can be used. Automatic text classification can shorten query time and improve the quality of web search. This paper proposes a text classification method based on rough set theory.
Source
《自动化与信息工程》
2006, No. 3, pp. 1-3 (3 pages)
Automation & Information Engineering
Keywords
Text Categorization
Rough Set
Decision Table
Attribute Reduction
Rule Extraction