期刊文献+

考虑全文本内容的算法学术影响力分析研究 被引量:20

Using Full-text to Analyse Academic Impact of Algorithms
原文传递
导出
摘要 [目的/意义]从全文本内容分析的角度对算法的学术影响力进行分析。[方法 /过程]以自然语言处理领域十大数据挖掘算法使用为例,分析不同算法在特定领域的影响力。通过对1965年-2006年间发表的自然语言处理领域10 922篇学术论文的调研,从其全文内容中抽取6 001条包含十大数据挖掘算法的句子(简称算法句);针对算法句从提及论文数、总提及次数、提及位置等3个方面,对不同算法的影响力进行比较分析。[结果/结论]以不同特征作为影响力衡量标准,十大数据挖掘算法在自然语言处理领域学术论文中的影响力有明显区别,在基于论文数、提及数和提及位置的评估标准中,SVM算法表现出较高的影响力,Apriori算法的影响力则明显低于其他算法。本研究为量化评估算法的影响力提供了新思路。 [ Purpose/significancel This paper analyses the influence of different algorithms in specific fields based on full-text analysis. [ Method/process] This paper analyzes the usage of the top 10 data mining algorithms in the domain of natural language processing. Firstly, we use 10922 academic papers published in the field of natural language processing fiom 1965 to 2006, and 6001 sentences containing Top - 10 data mining algorithms are extracted from its full text. We evaluate the impact of the Top - 10 algorithms according to three aspects: number of papers, mention number of algorithms, location of algorithms, and compare the results of different evaluation criterion. [ Result/conclusion] With different standard of assessment, the influence of ten data mining algorithms in conference papers of NLP is obviously different. SVM algorithm has higher influence on the evaluation criteria based on the number of papers and number of mention, and impact of Apriori algorithm is significantly lower than other algorithms. Our result of this paper provides a new way to quantify the influence of algorithm.
出处 《图书情报工作》 CSSCI 北大核心 2017年第23期6-14,共9页 Library and Information Service
关键词 算法影响力 影响力评估 全文本内容 文本内容分析 impact of algorithms impact assessment full-text text content analysis
  • 相关文献

参考文献8

二级参考文献58

共引文献116

同被引文献298

引证文献20

二级引证文献142

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部