
Sub-topic segmentation based on the maximum tree algorithm in multi-document summarization (cited by: 1)

Original title: 基于最大树法的多文档文摘子主题划分
Abstract: A sub-topic segmentation method for multi-document summarization based on the maximum tree algorithm is proposed. Similarity between the sentences of the multi-document set is computed with a semantic dictionary, producing a sentence similarity matrix. Identical or similar sentences are merged into one class by fuzzy clustering, and each class represents one sub-topic: a maximum tree is built from the similarity matrix, and the sub-topics are separated by analyzing the cluster structure of that tree. Experimental results show that the generated multi-document summaries have strong coverage and little redundant information, and that the method has some practical value.
Source: Journal of University of Science and Technology Liaoning (《辽宁科技大学学报》), CAS, 2009, No. 6, pp. 575-580 (6 pages)
Keywords: multi-document summarization; sub-topic segmentation; maximum tree algorithm
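
The clustering step named in the abstract can be illustrated with a minimal sketch of the textbook maximum tree method for fuzzy clustering. It is not the paper's implementation: the record gives no algorithmic detail, so the use of Kruskal's algorithm, the function names `maximum_tree` and `cut_tree`, the threshold parameter `lam`, and the similarity values in the usage note are all assumptions made for illustration. The sketch takes a symmetric sentence similarity matrix as given, builds a maximum spanning tree over the sentences, then removes tree edges weaker than `lam`; each remaining connected component is treated as one sub-topic.

```python
from typing import Dict, List, Tuple

Edge = Tuple[float, int, int]  # (similarity, sentence i, sentence j)


class DisjointSet:
    """Union-find structure used to detect cycles and collect components."""

    def __init__(self, n: int) -> None:
        self.parent = list(range(n))

    def find(self, x: int) -> int:
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, a: int, b: int) -> bool:
        ra, rb = self.find(a), self.find(b)
        if ra == rb:
            return False  # already connected; adding the edge would form a cycle
        self.parent[ra] = rb
        return True


def maximum_tree(sim: List[List[float]]) -> List[Edge]:
    """Build a maximum spanning tree with Kruskal's algorithm (assumed choice)."""
    n = len(sim)
    edges = [(sim[i][j], i, j) for i in range(n) for j in range(i + 1, n)]
    edges.sort(key=lambda e: -e[0])  # strongest similarities first
    ds = DisjointSet(n)
    return [(w, i, j) for w, i, j in edges if ds.union(i, j)]


def cut_tree(n: int, tree: List[Edge], lam: float) -> List[List[int]]:
    """Drop tree edges with similarity below lam; return the components."""
    ds = DisjointSet(n)
    for w, i, j in tree:
        if w >= lam:
            ds.union(i, j)
    groups: Dict[int, List[int]] = {}
    for s in range(n):
        groups.setdefault(ds.find(s), []).append(s)
    return sorted(groups.values())
```

As a usage example with made-up similarities for five sentences, `cut_tree(5, maximum_tree(sim), 0.5)` groups sentences 0 and 1 into one sub-topic and sentences 2, 3 and 4 into another when `sim` is the matrix used in the check below. Raising `lam` yields more, smaller sub-topics; lowering it merges them.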


