期刊文献+

基于加权复杂网络的文本关键词提取 被引量:14

KEYWORDS EXTRACTION BASED ON WEIGHTED COMPLEX NETWORK
原文传递
导出
摘要 通过分析基于复杂网络的关键词提取算法的特点和不足,提出了一种基于加权复杂网络提取的文本关键词新算法.首先根据文本特征词之间的关系构建文本的加权复杂网络模型,其次通过节点的加权聚类系数和节点的介数计算节点的综合特征值,最后根据综合特征值提取出文本关键词.实验结果表明,该算法提取的关键词能够较好地体现文本主题,提取关键词的准确率比已有算法有明显提高. By analyzing the characteristics and disadvantages of the existing keywords extraction algorithms based on complex network,a new keywords extraction algorithm is proposed by using of weighted complex network.First of all,a weighted complex network model is constructed according to the relationship between the feature words of text.Secondly,the weighted clustering coefficient and betweeness are introduced to calculate the node's multi-feature value.Finally,the keywords are extracted by the multi-feature value.The experiment results show that the keywords extracted by this algorithm have great contribution to the text subject,and the accuracy of keywords extraction is better than the existing algorithms.
出处 《系统科学与数学》 CSCD 北大核心 2010年第11期1592-1596,共5页 Journal of Systems Science and Mathematical Sciences
基金 国家自然科学基金(10771092)资助课题
关键词 关键词提取 加权复杂网络 综合特征值 extraction weighted complex network multi-feature value
  • 相关文献

参考文献9

二级参考文献33

  • 1李素建,王厚峰,俞士汶,辛乘胜.关键词自动标引的最大熵模型应用研究[J].计算机学报,2004,27(9):1192-1197. 被引量:92
  • 2韦洛霞,李勇,李伟,邵明珠,罗诗裕.汉字网络的3度分隔与小世界效应[J].科学通报,2004,49(24):2615-2616. 被引量:16
  • 3耿焕同,蔡庆生,于琨,赵鹏.一种基于词共现图的文档主题词自动抽取方法[J].南京大学学报(自然科学版),2006,42(2):156-162. 被引量:30
  • 4张敏,耿焕同,王煦法.一种利用BC方法的关键词自动提取算法研究[J].小型微型计算机系统,2007,28(1):189-192. 被引量:19
  • 5Coben J D.Highlights:language and domain-independent automatic indexing terms for abstracting[J].Journal of American Society for Information Science, 1995,46(3 ) : 162-174.
  • 6Tzeras K,Hartman S.Automatic indexing based on Bayesian inference networks[C]//Proc 16th Ann Int ACM SIGIR Conference on Research and Development in Information Retrieval,Inference Networks, 1993 : 22-34.
  • 7Matsuo Y,Ohsawa Y,Ishizuka M.KeyWorld:extracting keywords from a document as a small world[C]//Discovery Science,the 4th International Conference,2001:271-281.
  • 8Cancho R F I,Sole R V.The small world of human language[D]. Santa Fe Institute Working Paper,2001.
  • 9李晓明,闫宏飞,王继民.搜索引擎——原理、技术与系统[M].北京:科学出版社,2006.173.
  • 10Hyvarinen A. Fast and Robust Fixed-point Algorithms for Independent Component Analysis[J]. IEEE Transactions on Neural Networks, 1999, 10(3): 626-634.

共引文献100

同被引文献128

引证文献14

二级引证文献85

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部