摘要
人工智能领域的研究成果在学术研究与工业实践之间转移流动。为揭示自然语言处理领域论文与专利中的技术方法的分布及其演化趋势,基于知识演化视角对比分析论文与专利技术方法的分布,将2013年1月到2022年12月arXiv数据集中自然语言处理领域的39025篇论文,以及按照“关键词+IPC分类号”的检索方法在incoPat专利数据库得到的14654件专利数作为样本,利用维基百科分类树构建方法本体,将论文和专利映射到领域本体中,并采用配对样本T检验方法验证两者知识演化方面的滞后性。结果表明:样本论文与专利存在共同关注的技术方法,包括卷积神经网络、循环神经网络、长短期记忆网络、注意力机制和预训练语言模型,其中前三者属于比较成熟的神经网络架构,而注意力机制随着上述神经网络架构的发展而迅速发展;专利技术方法相对论文滞后约1~2年。
The research outcomes in the field of artificial intelligence transition and circulate between academic research and industrial practice.To uncover the distribution and evolutionary trends of technical methods in natural language processing(NLP)within papers and patents,a comparative analysis is conducted based on the perspective of knowledge evolution.This paper analyzes 39025 NLP papers from the arXiv dataset spanning January 2013 to December 2022,and 14654 patents'information retrieved from the incoPat patent database by using the search method of"keywords+IPC classification number".Using the Wikipedia category tree,an ontology of methods is constructed to map the papers and patents to the domain ontology.A paired sample t-test is employed to verify the lag in knowledge evolution between the two sources.The findings indicate that the sampled papers and patents share common technical methods,including convolutional neural networks(CNN),recurrent neural networks(RNN),long short-term memory(LSTM),attention mechanisms,and pretrained language models.Among these,the first three represent relatively mature neural network architectures,while the attention mechanism has rapidly developed alongside these neural network architectures.Patent technical methods tend to lag behind papers by approximately 1 to 2 years.
作者
唐露源
谢士尧
徐源
Tang Luyuan;Xie Shiyao;Xu Yuan(School of Economics,Beijing Institute of Technology,Beijing 102488,China;National Institute of Health Data Science,Peking University,Beijing 100191,China;College of Education,Beijing Institute of Technology,Beijing 102488,China)
出处
《科技管理研究》
2024年第10期153-160,共8页
Science and Technology Management Research
关键词
人工智能
自然语言处理
知识演化
神经网络
技术方法
文献计量
专利分析
artificial intelligence
natural language processing(NLP)
knowledge evolution
neural networks
technical methods
bibliometrics
patents analysis