期刊文献+

基于主成分分析和Eros的邻近传播算法在金融数据集中的应用

Application of PCA and Eros Affinity Propagation Clustering in Financial Data Sets
下载PDF
导出
摘要 金融数据集的多维特性和高噪声特性使得对金融时间序列数据的分析难上加难,本文提出一种基于主成分分析和Eros的近邻传播的聚类算法。首先利用主成分分析方法对多变量的金融时间序列数据进行降维处理,提取出主要特征值;然后使用基于Eros的近邻传播算法聚类对提取出的特征值进行分析。该聚类方法可以把数据集中的个体当作是原始数据的一个属性,通过迭代竞争达到最优,不需要事先确定聚类数目。研究结果表明,这种集成算法大大降低了时间序列数据的维度,有很高的分类正确率,表明该聚类方法用于金融时间序列数据处理是有效可行的。 The muhi-dimensional characteristics and high noise characteristics of the financial data set make it hard to analyze the time series. This paper puts forward an algorithm based on the principal component analysis and the Eros affinity propagation clustering. First it uses the principal component analysis method to extract the main eigenvalues of the muhivariate financial time series data; then uses the Eros affinity propagation clustering to analyze the extracted eigenvalues. This kind of clustering algorithm can make the individual data as an attribute of the original data, through iterate competition to achieve optimal, do not need to find the number of clusters. The research results show that, this integrated method greatly reduces the dimension of the time seties, and has a highly correct classification rate. It proves that this algorithm is very effective.
作者 廖洪一 王欣
出处 《计算机与现代化》 2015年第8期24-28,共5页 Computer and Modernization
基金 国家自然科学基金民航联合基金资助项目(U1233105) 中国民用航空飞行学院大学生创新项目(201410624043)
关键词 近邻传播聚类 弗罗贝尼乌斯范数 多元时间序列 主成分分析 聚类模型 affinity propagation clustering Frohenius norm multivariate time series principal component analysis clustering model
  • 相关文献

参考文献16

  • 1Buyya R, Abramson D, Venugopal S. The grid economy [J]. Proceedings of the IEEE, 2005,93(3) :698-714.
  • 2谢娟英,蒋帅,王春霞,张琰,谢维信.一种改进的全局K-均值聚类算法[J].陕西师范大学学报(自然科学版),2010,38(2):18-22. 被引量:46
  • 3郭崇慧,贾宏峰,张娜.基于ICA的时间序列聚类方法及其在股票数据分析中的应用[J].运筹与管理,2008,17(5):120-124. 被引量:13
  • 4Maharaj E A, D' Urso P. A coherence-based approach for the pattern recognition of time series[ J ]. Physica A: Sta- tistical Mechanics and its Applications, 2010,389 ( 17 ) : 3516-3537.
  • 5Caiado J, Crato N, Pefiad D. A periodogram-based metric for time series classification [ J]. Computational Statist & Data Analysis, 2006,50(10) :2668-2684.
  • 6王骏,王士同,邓赵红.聚类分析研究中的若干问题[J].控制与决策,2012,27(3):321-328. 被引量:194
  • 7Frey B J, Dueck D. Clustering by passing messages between data points[J]. Science, 2007,315(5814):972-976.
  • 8Sambasivam S, Theodosopoulos N. Advanced data cluste- ring methods of mining Web documents [ J ]. Issues in In- forming Science and Information Technology, 2006 (3): 563-579.
  • 9Bohm C, Berchtold S, Keim D A. Searching in high-di- mensional spaces-index structures for improving the per- formance of multimedia databases [J].ACM Computing Surveys, 2001,33 ( 3 ) :322-373.
  • 10Fang S H, Lin T N. Principal component localization in in- door WLAN environments [ J]. IEEE Transactions on Mo- bile Computing, 2012,11 ( 1 ) : 100-110.

二级参考文献127

共引文献303

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部