基于聚类的自动摘要被引量：1

SENTENCES CLUSTERING BASED AUTOMATIC SUMMARIZATION

导出

摘要提出了一种基于题聚类的自动摘要算法.该算法在采用统计方法的同时.又适当结合知识理解,既摆脱了领域限制,也使摘要的结果更为准确.此外,为了能够全面反映信息样本的主要内容,而又不产生信息(?)余,本文提出的摘要算法还力图适应于不同的样本、动态确定摘要长度.为此.本文首先构造出新的互依赖模型,为摘要算法选择较为准确的属性.接着,挖掘出评估语句重要性的新规则.为摘要算法提供选择为重要语句的尺度.最后,提出了一种较为客观的、基于任务的摘要性能评估算法. In this paper, an algorithm which automatically summarizes a document by extracting subtopics from the sentences is based on statistics and partially understanding knowledge, in order to get better summarization and get rid of the restriction of information domain. Resides, since it is diffcult to determine the length of summaries manually, the algorithm also strives to obtain a better summary with proper length. To this end, a new module of mutual dependence is put forward too and used to select features, which can selects accuracy features for the summarizing algorithm. And then new rules to evaluate sentences are brought forward. Furthermore, a new task-based algorithm to evaluate summarization impersonally is offered.

作者王建会周水庚胡运发

机构地区复旦大学计算机与信息技术系

出处《模式识别与人工智能》 EI CSCD 北大核心 2004年第3期291-298,共8页 Pattern Recognition and Artificial Intelligence

基金国家自然科学基金(No.60173027)

关键词自动摘要互依赖聚类信息检索 Automatic Summarization Mutual Dependence Clustering Information Retrieval

分类号 TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1Peng Fuchun, Schuurmans D. Self-Supervised Chinese Word Segmentation. In: Proc of the 4th International Symposium on Intelligent Data Analysis. Lisbon, Portugal, 2001, 238 - 247
2Mareu D. The Rhetorical Parsing of Natural Language Texts. In: Proc of the 35th Annual Meeting of the Association for Computational Linguistics. Madrid, Spain, 1997, 96 - 103
3Barzilay R, Elhadad M. Using Lexical Chains for Text Summarization. In: Proc of the Intelligent Scalable Text Summarization Workshop. Madrid, Spain, 1997, 2- 9
4Salton G, Singhal A, Mitra M, Buckley C. Automatic Text Structuring and Summarization. Information Processing and Management, 1997, 33(2): 193-208
5Hand T F. A Proposal for Task-Based Evaluation of Text Summarization Systems. In: Proc of the Association for Computational Linguistics and the European Association for Computational Linguistics on Summarization Workshop. Madrid, Spain, 1997, 31 -36
6Hand T F, Sundheim B. TIPSTER - SUMMAC Summarization Evaluation. In: Proc of the Workshop on TIPSTER Text Phase III. Washington DC, USA, 1998, 353- 340
7Yang K C, Ho T H, Chien L F, Lee L S. Statisties-Based Segment Pattern Lexicon - A New Direction for Chinese Language Modeling. In: Proe of IEEE International Conference on Acoustics,Speech, and Signal Processing. Seattle, WA, 1998, 169- 172
8Marcu D. Improving Summarization through Rhetorical Parsing Tuning. In: Proc of the 6th International Conference on Computational Linguistics and the Association for Computational Linguistics Workshop on Very Large Corpora. Montreal, Canada, 1998, 206-215. http://citeseer.nj.nee.eom/artiele/mareu98improving.html
9边肇祺张学工.模式识别[M].北京:清华大学出版社,2001..

共引文献28

1王永红,戴理昱,纪伟.基于神经网络的战术C^3I系统效能分析[J].火力与指挥控制,2003,28(z1):20-22. 被引量：5
2于胜学.基于模糊识别的柴油机工况判断[J].中国修船,2007(z1):39-40.
3张明,龙鹏飞.基于聚类、粗糙集和支持向量机的故障诊断[J].微机发展,2004,14(8):38-40. 被引量：1
4周建频,杜文.在动态供应链重构中应用数据挖掘识别企业模式[J].物流技术,2004,23(11):83-85. 被引量：2
5张燕平,张铃,段震.构造性核覆盖算法在图像识别中的应用[J].中国图象图形学报（A辑）,2004,9(11):1304-1308. 被引量：17
6王建会,王洪伟,申展,胡运发.一种实用高效的文本分类算法[J].计算机研究与发展,2005,42(1):85-93. 被引量：20
7西宝,李一军.复杂工程风险管理的信息密度演化计算方法[J].哈尔滨工业大学学报,2005,37(1):56-59. 被引量：4
8张燕平,张铃,吴涛.机器学习中的多侧面递进算法MIDA[J].电子学报,2005,33(2):327-331. 被引量：26
9魏传锋,庞彧,李运泽,王浚,于涛.改进的最近邻法在基于事例推理中的应用[J].系统仿真学报,2005,17(5):1045-1047. 被引量：13
10刘欣,章显,陶卿.SVM在物理实验中的应用[J].大学物理,2005,24(6):40-43. 被引量：1

同被引文献7

1孙茂松,邹嘉彦.汉语自动分词研究评述[J].当代语言学,2001,3(1):22-32. 被引量：101
2金博,史彦军,滕弘飞,艾景波.自动文摘技术及应用[J].计算机应用研究,2004,21(12):13-15. 被引量：4
3于海滨,秦兵,刘挺,郎君.命名实体识别和指代消解在文摘系统中的应用[J].计算机应用研究,2006,23(4):180-182. 被引量：7
4[1]M Blaze,J Feigenbaum,J Lacy.Decentralized trust management.In:Proc of the 1996 Symp on Security and Privacy.Los Alamitos:IEEE Computer Society Press,1996.164-173
5[2]A Abdul-Rahman,S Hailes.A distributed trust model.In:Proc of the 1997 Workshop on New Security Paradigms.New York:ACM Press,1997.48-60
6[3]W Wang,G S Zeng,L L Yuan.A semantic reputation mechanism in P2P semantic Web.In:Proc of the 1st Asian Semantic Web Conference (ASWC).LNCS 4185.Berlin:Springer,2006.682-688
7[4]Y Gil,D Artz.Towards content trust of Web resources.The 15th Int'l World Wide Web Conference (WWW-06),Edinburgh,Scotland,2006

引证文献1

1张泉,曾国荪,王伟,孙明军,谷华楠.基于改进的模糊C-均值聚类的信任文摘[J].计算机研究与发展,2008,45(z1):268-273. 被引量：2

二级引证文献2

1张少中,方朝曦,陈军敢,施炯.基于社会网络的电子商务信任社区聚类模型[J].浙江大学学报（工学版）,2013,47(4):656-661. 被引量：10
2刘德喜,万常选.社会化短文本自动摘要研究综述[J].小型微型计算机系统,2013,34(12):2764-2771. 被引量：12

1王建会,胡运发,李荣陆.自适应确定摘要长度[J].计算机研究与发展,2004,41(3):399-406. 被引量：3
2罗艳芬,万国金.基于BP神经网络模型的信息处理系统的应用分析[J].计算机与现代化,2004(11):7-8. 被引量：1
3潘军,刘丽.工作流模型时间与费用性能评估算法[J].北京航空航天大学学报,2013,39(5):650-654. 被引量：1
4李栋娜,曹阳,张奇,郑刚.SOC软硬件协同设计中多任务性能评估算法[J].计算机应用研究,2005,22(6):52-55. 被引量：1
5张双斌.MD5优化算法及安全性分析[J].电脑编程技巧与维护,2009(22):112-114.
6黄武锋.一种基于神经网络的数据挖掘算法[J].电脑编程技巧与维护,2017(3):57-57.
7刘学彦,王昕,王振雷.带遗忘因子的线性回归性能评估算法及应用[J].控制工程,2014,21(6):867-872. 被引量：12
8梅蓉.基于彩色图像的目标识别方法研究[J].计算机与数字工程,2007,35(9):119-122. 被引量：1
9梁冰,刘群.基于自动机模型数据关联性能评估算法[J].电子科技大学学报,2008,37(4):606-609. 被引量：1
10邓擘,郑彦宁,傅继彬.汉语实体关系模式的自动获取研究[J].计算机科学,2010,37(2):183-185. 被引量：3

模式识别与人工智能

2004年第3期

浏览历史

内容加载中请稍等...

基于聚类的自动摘要被引量：1

参考文献9

共引文献28

同被引文献7

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于聚类的自动摘要 被引量：1

参考文献9

共引文献28

同被引文献7

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于聚类的自动摘要被引量：1