摘要
潜在语义分析是一种关于自然语言信息提取和再现的理论方法,它通过代数的方法提取语义空间中潜在结构。论文叙述了潜在语义分析的基本理论方法,概述了这种方法所建立的潜在语义空间的数学意义;然后通过一个简单示例说明LSA在中文信息处理中的分析方法,并通过分析结果中文本间、词汇间关联度的变化来说明LSA在中文信息处理中的重要意义。
Latent Semantic Analysis is a theory and method about extracting and representing information of nature language.LSA retrievals the latent semantic structure from semantic space by mathematical method.Firstly,this paper presents the underlying idea of LSA and introduces the mathematical means of the Latent Semantic Space which is built by LSA.In the following subsection,the paper introduces the application of LSA in the field of Chinese information processing though a sample example.The variation of the similarities between documents,or between terms ,in the example analysis result,shows the important meaning of LSA.
出处
《计算机工程与应用》
CSCD
北大核心
2005年第3期91-93,共3页
Computer Engineering and Applications
关键词
潜在语义分析
潜在语义空间
中文信息处理
奇异值分解
Latent Semantic Analysis,Latent Semantic Space,Chinese information processing,Singular Value Decomposition