期刊文献+

基于HDP的汽车专利主题演化研究 被引量:10

HDP-based Vehicle Patent Topic Evolution
下载PDF
导出
摘要 近年来专利数据呈爆炸式增长,从专利文本信息中准确地获取主题信息并将其可视化逐渐成为一个重要的研究方向。专利主题演化研究能够挖掘出专利中潜在的发展模式,对相关研究具有重要参考价值。本文将分层的狄利克雷过程(HDP)应用到专利主题聚类中,通过当前主题与加入历史数据之后的主题变化来挖掘主题的分流与合流,最后对主题信息利用叠式图进行可视化展示。实验结合实际的汽车专利数据进行分析研究,发现汽车专利主要分为三个大主题,而且各个主题之间有分流、合流,有逐年递增也有逐年递减,有新生主题也有消亡主题等各种形式,并发现从2006年开始汽车安全领域和汽车新能源领域分别独立成为一个主题并呈逐年增长的趋势。 In recent years, the patent data is in cxplosive growth. Accurately extracting topic information from patent data and visualizing it is becoming an important research direction. The research of the topic evolution of vehicle patent can dig out the potential development model, which has great importance to the related study. Here, we used the Hierarchical Dirichlet process (HDP) to cluster the patent data and mine splitting and merging of the topics by comparing the topics of each year and the topics with history data clustered by HDP. At last, we visualized the relationship of the topic information using stacked graph. We used the actual vehicle patent data in the experiment and discovered that there are three major topics of the vehicle patent data. There are splitting and merging among different topics, shrinking of the topic, expanding of the topic, newborn of the topic and perishing of the topic. We also found that after 2006 the field of vehicle safety and new energy sources for vehicle became to individual topics and showed increasing trend year by year.
出处 《情报学报》 CSSCI 北大核心 2014年第9期944-951,共8页 Journal of the China Society for Scientific and Technical Information
基金 国家自然科学基金资助项目(编号:61277370) 辽宁省自然科学基金(编号:201202031)
关键词 HDP 主题聚类 主题演化 汽车专利 hierarchical dirichlet processes, topic clustering, topic evolution, vehicle patent
  • 相关文献

参考文献13

  • 1Blei D M,Ng A Y,Jordan M I. Latent dirichlet allocation [ J]. Journal of Machine Learning Research, 2003, 3 : 993 -1022.
  • 2Wang X, McCallum A. Topics over time: a non-Markov continuous-time model of topical trends[ C ]//Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2006 : 424- 433.
  • 3Teh Y W, Jordan M I, Beal M J, et al. Hierarchical Dirichlet processes [ J ]. Journal of the American Statistical Association, 2006, 101 (476).
  • 4Lancichinetti A, Fortunato S. Consensus clustering in complex networks[ J]. Scientific Reports, 2012, 2.
  • 5Havre S,Hetzler E, Whitney P, et al. Themeriver: Visualizing thematic changes in large document collections [ J ].Visualization and Computer Graphics, IEEE Transactions on, 2002, 8 ( 1 ) : 9-20.
  • 6Wei F, Liu S, Song Y, et al. Tiara: a visual exploratory text analytic system [ C ]//Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2010: 153-162.
  • 7Cui W, Liu S, Tan L, et al. Textflow: Towards better understanding of evolving topics in text[ J]. Visualization and Computer Graphics, IEEE Transactions on, 2011, 17 (12) : 2412-2421.
  • 8范宇,符红光,文奕.基于LDA模型的专利信息聚类技术[J].计算机应用,2013,33(A01):87-89. 被引量:21
  • 9郝智勇,贺明科,谭文堂,张健东.基于多维标度法的专利文本可视化聚类研究[J].计算机应用研究,2010,27(12):4608-4611. 被引量:13
  • 10Salton G, Wong A, Yang C S. A vector space model for automatic indexing [ J]. Communications of the ACM, 1975, 18(11) : 613-620.

二级参考文献24

  • 1方曙,张娴,肖国华.专利情报分析方法及应用研究[J].图书情报知识,2007,24(4):64-69. 被引量:110
  • 2ORAZIA D D.Corporate strategic technological partnership in the European information and communications technology industry[J].Research Policy,2000,29(9):1015-1031.
  • 3REITZIG M.Strategic management of intellectual property[J].MIT Sloan Management Review,2004,45(3):35-40.
  • 4WILLIAM K.Innovation needs patents reform[J].Research Policy,2001,30(3):403-423.
  • 5HEINRICH G.Parameter estimation for text analysis[R].[S.l.] :University of Leipzig,2008.
  • 6ZHOU Y G.XIA L X.Non-quantitative data analysis and applications theory[M].Beijing:Scientific Press,1993.
  • 7TAO Li,SHENG Ma,OGIHARA M.Document clustering via adaptive subspace iteration[C] //Proc of the 12th ACM International Conference on Multimedia.New York:ACM Publisher,2004:364-367.
  • 8HanJ CamberM 数据挖掘 范明 孟小峰 译.概念与技术[M].北京:机械工业出版社,2001..
  • 9许洪波.文本挖掘与机器学习.信息技术快报,2005,(2):1-14.
  • 10中国国家知识产权局.国内外三种专利授权状况总累计表[EB/OL].[2006-12-14].http://www.sipo.gov.cn/sipo/ghfzs/zltj/gnwszzlsqzkztjb/200612/t20061214_124636.htm.

共引文献41

同被引文献151

引证文献10

二级引证文献37

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部