摘要
为了浏览因特网上日益增多的在线中文文本 ,本文给出了基于概念的中文文本可视化表示机制 ,以直观的方式组织和表示文本及文本集 .其基本思想是 :首先在概念扩充的基础上 ,进行文本分类 .然后 ,利用本文提出的文本特征抽取方法和摘要方法 ,获取文本类别、文本、文本正文的标记信息 ,通过类别、文本、正文的超文本连接 ,帮助用户有目的、有选择地浏览文本 .
The Chinese text visualization based on concept is put forward in this paper, and it can help users browse the more and more internet online text resources. Its main idea is showed as follows: Based on the concept expansion, text collection is divided into several categories. The approach of text feature extraction and the approach of text summary are presented in the paper, and they are applied to the text categories and texts in order to obtain the logic representation of text categories and texts. Users can make use of the hypertext links between text categories and texts to select the texts which they are interested in and read the texts .
出处
《小型微型计算机系统》
EI
CSCD
北大核心
2000年第10期1042-1045,共4页
Journal of Chinese Computer Systems
基金
国家自然科学基金资助项目 !(编号 :6 96 75 0 19)
国家教委博士点基金资助项目
关键词
中文文本可视化
概念
信息处理
文本分类
Text visualization
Text feature extraction
Text summary
Text category
Text browsing
Concept expansion