Abstract
Text structure analysis plays an important role in text processing: it can improve the precision and efficiency of text retrieval, text filtering, and text summarization. This paper briefly describes the physical and logical structure of text and the background of text structure analysis, introduces Latent Semantic Indexing (LSI) as a basis for the analysis, and proposes an LSI-based hierarchical analysis approach. The approach keeps the resulting hierarchy in natural order with high cohesion, is practical to apply, and is easy to interpret. Its applications to text retrieval, text filtering, and text summarization are given as examples.
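The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea of using LSI to find structure in a text, not the paper's method. It assumes the text is already split into segments (for example paragraphs), builds a raw term-by-segment count matrix, projects segments into a low-rank space with a truncated SVD, and treats the pair of neighbouring segments with the lowest cosine similarity as the most likely boundary. The function names, the whitespace tokenizer, and the choice of `k` are all hypothetical; Chinese text would need a word segmenter first.

```python
import numpy as np

def lsi_segment_vectors(segments, k=2):
    """Project each text segment into a k-dimensional latent space (assumed setup)."""
    vocab = sorted({w for s in segments for w in s.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    # Term-by-segment count matrix: rows are terms, columns are segments.
    A = np.zeros((len(vocab), len(segments)))
    for j, s in enumerate(segments):
        for w in s.lower().split():
            A[index[w], j] += 1.0
    # Truncated SVD, A ~ U_k S_k V_k^T; segment j is represented by row j of V_k S_k.
    U, S, Vt = np.linalg.svd(A, full_matrices=False)
    k = min(k, len(S))
    return (Vt[:k, :] * S[:k, None]).T

def adjacent_similarities(vectors):
    """Cosine similarity between each pair of neighbouring segments."""
    sims = []
    for a, b in zip(vectors[:-1], vectors[1:]):
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        sims.append(float(a @ b / denom) if denom > 0 else 0.0)
    return sims

def weakest_boundary(segments, k=2):
    """Return i such that the least cohesive split lies between segments i and i+1."""
    sims = adjacent_similarities(lsi_segment_vectors(segments, k))
    return int(np.argmin(sims))
```

Applying the same split recursively to each side would give one possible hierarchy in which segments keep their original order; whether the paper proceeds this way is not stated in the abstract.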
Source
Pattern Recognition and Artificial Intelligence (模式识别与人工智能)
Indexed in: EI, CSCD, Peking University Core Journals (北大核心)
2000, No. 1, pp. 47-51 (5 pages)
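Funding
National Natural Science Foundation of China
Doctoral Program Foundation of the State Education Commission of China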
Keywords
Hierarchical Analysis
Latent Semantic Indexing
Text Analysis Method
Text Processing
Vector Space Model
Text Structure Analysis
Text Hierarchical Analysis