摘要
[目的/意义]基于大量专利文献数据的核心技术主题识别有助于识别某技术领域的关键技术、分析关键技术的发展方向,是进行技术创新的基础情报工作,对于研究人员、企业乃至国家层面都具有一定的意义。[方法/过程]提出基于Chunk-LDAvis的核心技术主题识别方法,首先基于经典LDA模型进行主题识别,然后利用名词组块对初始LDA主题识别结果进行标注,构建Chunk-LDA主题识别结果,提高其可解读性;然后基于社会网络分析方法构建主题网络,识别核心技术主题;基于R语言的LDAvis工具包绘制可交互的Chunk-LDAvis核心技术主题关联分析图谱,发现核心技术主题的隐含联系,辅助进行核心技术主题识别。[结果/结论]通过对纳米农业领域进行实证研究,验证了本文提出方法的准确性和可行性。
[Purpose/significance]Core technology topic identification based on a large number of patent documents is helpful to detect key tech no logies in a technical field and to analyze the direction of the development of key technologies.It is the basic information work for technological innovation and has certain significance for researchers,enterprises and even the national level.[Method/process]This paper proposes a core technology topic identification method based on Chunk-LDAvis.Firstly,it is based on the classic LDA model to identify the topics.Then,the noun chunk is used to mark the results of the initial LDA topic identification,and the result of the Chunk-LDA topic recognition is constructed to improve its interpretability.Then based on the social network analysis method,the topic network is constracted to identify the core technical topics;based on the LDAvis toolkit,the interactive Chunk-LDAvis core technology topic association analysis map is plotted,and the hidden links of the core technical topics are found,and the core technology topic detection is assisted.[Result/conclusion]Through the empirical study on the field of nanoscale agriculture,the accuracy and feasibility of the proposed method are verified.
作者
刘自强
许海云
岳丽欣
方曙
Liu Ziqiang;Xu Haiyun;Yue Lixin;Fang Shu(Chengdu Library of Chinese Academy of Sciences,Chengdu 610041;Department of Library,Information and Archives Management,School of Economics and Management,University of Chinese Academy of Sciences,Beijing 100190;Institute of Scientific and Technical Information of China (ISTIC),Beijing 100038;School of Information Resource Management,Renmin University of China,Beijing 100872)
出处
《图书情报工作》
CSSCI
北大核心
2019年第9期73-84,共12页
Library and Information Service
基金
国家自然科学基金项目“基于科学—技术主题关联分析的创新演化路径识别方法研究”(项目编号:71704170)
中国科学院成都文献情报中心青年人才创新项目(项目编号:Y7Z0581002)研究成果之一