摘要
在简要介绍篇章的向量空间模型表示的基础上,讨论了基于段间相似度和关系图进行篇章分析的方法,包括:结构分析,主题分析和聚类,浏览与跳段阅读,最后讨论所存在的主要问题及进一步改进的意见.
The vector space model (VSM) of text in arbitrary subject area is outlined, and text relation maps among paragraphs are obtained by similarity analysis. Based on the relation maps, several approaches for text analysis are described, which include structure analysis, theme analysis and clustering, theme browsing and text traversal. At last, some of the main problems are discussed and further improvements are suggested.
出处
《中南林学院学报》
CSCD
2004年第5期93-97,共5页
Journal of Central South Forestry University
关键词
分布式计算
软件设计
向量空间模型
信息检索
篇章理解
自然语言处理
distributed computation
software design
vector space model
information retrieval
text understanding
natural language processing