期刊文献+

结合拓扑势与TextRank算法的关键词提取方法 被引量:9

KEYWORDS EXTRACTION METHOD COMBINING TOPOLOGICAL POTENTIAL AND TEXTRANK ALGORITHM
下载PDF
导出
摘要 传统的TextRank算法进行关键词提取时词语之间的连接边采用权值均分的形式进行加权,未考虑词语的语义信息。针对这种情况,提出结合拓扑势与TextRank算法的关键词提取方法。方法使用词频和词语在文中的分布情况对词语加权作为词语的全局影响;使用拓扑势的思想结合词语全局影响计算词语间的转移概率作为词语的局部影响;将转移概率矩阵应用于传统TextRank算法中。实验表明,考虑词语全局及局部重要性等语义信息可有效提升TextRank算法的准确率和召回率。 When the traditional TextRank algorithm extracts keywords,the connection edges between words are weighted in the form of equipartition without considering the semantic information of words.In this case,this paper proposes a keyword extraction method combining topological potential and TextRank algorithm.It used the word frequency and the distribution of words in the text to weight words as the global influence;it combined the topological potential with the global influence to calculate the transfer probability between words as the local influence of words;the transition probability matrix was applied to the traditional TextRank algorithm.The experiments show that considering the semantic information such as global and local importance of words can effectively improve the accuracy and recall rate of the TextRank algorithm.
作者 罗婉丽 张磊 Luo Wanli;Zhang Lei(College of Information and Engineering,Sichuan Tourism University,Chengdu 610199,Sichuan,China;Sichuan Rural Credit Union,Chengdu 610041,Sichuan,China)
出处 《计算机应用与软件》 北大核心 2022年第1期334-338,共5页 Computer Applications and Software
基金 四川省教育厅项目(14ZB0315)。
关键词 TextRank算法 关键词提取 语义信息 拓扑势 TextRank algorithm Keywords extraction Semantic information Topological potential
  • 相关文献

参考文献8

二级参考文献47

共引文献206

同被引文献100

引证文献9

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部