期刊文献+

融合时序分类的科技领域实体增长预测研究

Research on Entity Growth Prediction in Science and Technology Field Fusing Time Series Classification
下载PDF
导出
摘要 [目的/意义]科技领域实体增长机制是进行预测型科技情报分析的核心,要有效地进行科技发展态势感知和预测,必须深入了解科技领域细粒度知识增长机制。文章提出并验证了融合时序分类的科技领域实体增长预测方案。[方法/过程]首先,根据实体词频时间序列的增长模式,将实体分为可拟合、有趋势和无规律3种类别;其次,利用曲线拟合、局部加权回归方法抽取特征,构建特征向量,再用MLP模型实现了高精度的实体分类;最后,构建融合时序分类的集成模型与基线模型进行对比验证效果。[结果/结论]根据实验结果,验证显示集成预测模型相对于基线模型,误差减少了13%以上,进一步证实了预测结果在科学性和准确性方面的优势。综合考虑,所提出的融合时间序列分类的实体增长预测方案在可行性和应用价值方面具备潜力。 [Purpose/significance]The growth mechanism of entities in the field of science and technology is integral to the predictive analysis of scientific and technological information.To effectively anticipate advancements in science and technology,it is vital to deeply explore the nuanced knowledge growth mechanisms within the field.This paper proposes and validates a scheme for predicting the growth of entities within the science and technology domain using time-series classification.[Method/process]Firstly,based on the growth pattern of entity word frequency time series,entities are categorized into three classes:fitable,trend,and irregular.Then,curve fitting and local weighted regression methods are employed to extract features and construct feature vectors,while a Multilayer Perceptron(MLP)model is used to achieve high-precision entity classification.Subsequently,an integrated model fusing time series classification is constructed for comparison with the baseline model to verify its effectiveness.[Result/conclusion]According to our experimental results,the integrated prediction model reduces error by over 13%compared to the baseline model,further attesting to its advantages in terms of scientific robustness and accuracy.In conclusion,the proposed scheme of predicting entity growth in the science and technology field,which incorporates time series classification,holds significant potential for feasibility and application value.
作者 陈果 陈智力 陈霜澜 Chen Guo
出处 《情报理论与实践》 北大核心 2024年第2期116-123,共8页 Information Studies:Theory & Application
基金 教育部人文社会科学研究青年基金项目“基于领域实体的学科研究前沿识别体系构建研究”(项目编号:21YJC870003) 江苏省社会科学基金青年项目“面向前沿技术监测的领域知识分析模式研究”(项目编号:21TQC002)的成果之一。
关键词 时间序列分类 领域实体 实体增长预测 集成模型 time series classification domain entities entity growth forecast integrating model
  • 相关文献

参考文献6

二级参考文献96

  • 1Keogh E,Kasetty S.On the need for time series data mining benchmarks:A survey and empirical demonstration//Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Edmonton,Alberta,Canada,2002:102-111
  • 2Yang Yi-Ming,Yang Qiang,Lu Wei,Pan Jia-Lin,Pan Rong,Lu Chen-Hui,Li Lei,Qin Zhen-Xing.Preprocessing time series data for classification with application to CRM//Proceedings of the 18th Australian Joint Conference on Artificial Intelligence (AI 2005).Sydney,Australia,2005:133-142
  • 3Ling C X,Huang J,Zhang H.AUC:A statistically consistent and more discriminating measure than accuracy//Proceedings of the 18th International Joint Conference on Artificial Intelligence.Acapulco,Mexico,2003:329-341
  • 4Faloutsos Christos,Ranganathan M,Manolopoulos Yannis.Fast subsequence matching in time-series databases//Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data.Minneapolis,Minnesota,United States,1994:419-429
  • 5Chan Franky Kin-Pong,Fu Ada Wai-Chee,Yu Clement.Haar wavelets for efficient similarity search of time-series:With and without time warping.IEEE Transactions on Knowledge and Data Engineering,2003,15(3):686-705
  • 6Yi Byoung-Kee,Jagadish H V,Faloutsos Christos.Efficient retrieval of similar time sequences under time warping//Proceedings of the 14th International Conference on Data Engineering.Orlando,Florida,1998:201-208
  • 7Kim Sang-Wook,Park Sanghyun,Chu Wesley W.An indexbased approach for similarity search supporting time warping in large sequence databases//Proceedings of the 17th International Conference on Data Engineering.Heidelberg,Germany,2001:607-614
  • 8Quinlan J R.C4.5:Programs for Machine Learning.San Francisco:Morgan Kaufmann Publishers,1993
  • 9Kay Steven M.Fundamentals of Statistical Signal Processing:Estimation Theory.Upper Saddle River,NJ,USA,Prentice Hall,1993
  • 10MacQueen J B.Some methods for classification and analysis of multivariate observations//Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability.Berkeley,1967,1:281-297

共引文献123

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部