摘要
通过分析基于复杂网络的关键词提取算法的特点和不足,提出了一种基于加权复杂网络提取的文本关键词新算法.首先根据文本特征词之间的关系构建文本的加权复杂网络模型,其次通过节点的加权聚类系数和节点的介数计算节点的综合特征值,最后根据综合特征值提取出文本关键词.实验结果表明,该算法提取的关键词能够较好地体现文本主题,提取关键词的准确率比已有算法有明显提高.
By analyzing the characteristics and disadvantages of the existing keywords extraction algorithms based on complex network,a new keywords extraction algorithm is proposed by using of weighted complex network.First of all,a weighted complex network model is constructed according to the relationship between the feature words of text.Secondly,the weighted clustering coefficient and betweeness are introduced to calculate the node's multi-feature value.Finally,the keywords are extracted by the multi-feature value.The experiment results show that the keywords extracted by this algorithm have great contribution to the text subject,and the accuracy of keywords extraction is better than the existing algorithms.
出处
《系统科学与数学》
CSCD
北大核心
2010年第11期1592-1596,共5页
Journal of Systems Science and Mathematical Sciences
基金
国家自然科学基金(10771092)资助课题