摘要
为适应Internet时代和大规模文献处理的需要 ,以中文文本为处理对象 ,研究了从主题词、主题概念和主题句三个不同层面自动抽取文本主题的方法 ,着重讨论了加权体系和一些经验值的获取方法。对新闻类文献做了实验 。
To meet the requirement of Internet and large scale text processing,this paper introduces how to automatically extract subject from Chinese texts. We extract the subject from three different levels: subject word,subject concept and subject sentence. We put the emphasis on how to form the weighting system and acquire the experience coefficient values. Based on the experimental results of news articles,we briefly analyze the performance.
出处
《中文信息学报》
CSCD
北大核心
2001年第4期20-27,共8页
Journal of Chinese Information Processing
基金
8 6 3计划资助项目!(86 3 - 30 6 -ZD0 3- 0 4- 1)