
Sentence-Clustering-Based Automatic Summarization
Abstract: This paper proposes an automatic summarization algorithm based on sentence clustering, which summarizes a document by extracting subtopics from its sentences. The algorithm combines statistical methods with partial knowledge understanding, which removes the restriction to a particular information domain and makes the resulting summaries more accurate. In addition, so that a summary covers the main content of a document without redundancy, and because summary length is difficult to set manually, the algorithm is designed to adapt to different documents and determine the summary length dynamically. To this end, the paper first constructs a new mutual dependence model, which is used to select more accurate features for the summarization algorithm. It then derives new rules for evaluating sentence importance, which give the algorithm a criterion for selecting important sentences. Finally, it proposes a relatively objective, task-based algorithm for evaluating summarization performance.
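Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), 2004, No. 3, pp. 291-298 (8 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (No. 60173027)
Keywords: Automatic Summarization; Mutual Dependence; Clustering; Information Retrieval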
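
The abstract describes the approach only at a high level, so the following is a minimal illustrative sketch of clustering-based extractive summarization in general, not the authors' algorithm. It assumes plain term-count vectors and cosine similarity in place of the paper's mutual dependence model and sentence-importance rules, which the abstract does not specify. All function names and the `threshold` parameter are hypothetical. Because a sentence opens a new cluster whenever it is not similar enough to an existing one, the number of clusters, and hence the summary length, is determined by the text rather than fixed in advance.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    """Split text into sentences after ., ! or ? (a deliberately naive rule)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def vectorize(sentence):
    """Lowercased bag-of-words term counts (assumption: no stemming or stop words)."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_sentences(vectors, threshold=0.3):
    """Single-pass clustering: a sentence joins the most similar cluster whose
    centroid similarity is at least `threshold`, otherwise it opens a new one."""
    clusters = []  # each cluster: {"members": [sentence index], "centroid": Counter}
    for i, vec in enumerate(vectors):
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append({"members": [i], "centroid": Counter(vec)})
        else:
            best["members"].append(i)
            best["centroid"].update(vec)  # summed counts; cosine ignores scale
    return clusters


def summarize(text, threshold=0.3):
    """Return one representative sentence per cluster, in document order."""
    sentences = split_sentences(text)
    vectors = [vectorize(s) for s in sentences]
    chosen = []
    for cluster in cluster_sentences(vectors, threshold):
        # Representative: the member closest to its cluster centroid.
        rep = max(cluster["members"],
                  key=lambda i: cosine(vectors[i], cluster["centroid"]))
        chosen.append(rep)
    return [sentences[i] for i in sorted(chosen)]


if __name__ == "__main__":
    demo = ("Cats chase mice. Cats chase birds. "
            "Stocks fell sharply today. Stocks fell again today.")
    for line in summarize(demo):
        print(line)
```

Raising `threshold` yields more, smaller clusters and a longer summary; lowering it merges subtopics and shortens the summary. A feature-selection step, such as the one the paper attributes to its mutual dependence model, would slot in where `vectorize` builds the term vectors.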


