期刊文献+

区间型符号数据回归分析及其应用 被引量:13

Methodology and application of regression analysis of interval-type symbolic data
下载PDF
导出
摘要 介绍了符号数据分析方法的基本理论.针对一种最常用的符号数据——区间型符号数据,基于误差传递的理论,提出了区间回归分析的方法.方法包括了线性回归分析和可线性化的非线性回归分析两种情形.讨论了基于Hausdorff距离的区间数距离,基于此定义了回归模型的评价指标.进行了方法的应用研究,选取沪深300指数与中信规模风格指数,从时间维上对其日内数据进行"数据打包",形成区间型符号数据;建立了区间线性回归分析模型,从全局上揭示了两类指数间的相关性.结论表明,与针对点数据的传统回归分析相比,区间型符号数据的回归分析方法不仅实现了样本空间的降维,而且有利于从整体上把握变量之间的内在关系. The theory of symbolic data analysis (SDA) is introduced first. One type of the most important symbolic data is interval data. The methods of regression analysis of interval-type symbolic data are proposed via error transferring theory. Linear regression analysis and non-linear regression which can be linearized are studied separately. Furthermore, the distance between interval numbers is defined based on Hausdorff distance. And, based upon this, evaluation indices of the regression analysis models are proposed. Finally, appli.eation of the correlation between CSI 300 and style indices of Chinese international trust & investment company (CITIC) is carried out. In this application study, the interval-type symbolic data are obtained by data package of daily indices data. The regression analysis models of the interval-type symbolic data are set up, revealing the overall correlation between the CSI 300 and style indices of CITIC. It is concluded that, comparing with the traditional regression analysis of point data, the regression analysis of interval-type symbolic data can not only reduce the dimension of the sample space, but also grasp the inner relations of the variables.
出处 《管理科学学报》 CSSCI 北大核心 2010年第4期38-43,共6页 Journal of Management Sciences in China
基金 国家自然科学基金资助项目(70701026)
关键词 符号数据 回归分析 区间数 误差传递 指数 symbolic data regression analysis interval numbers error transferring index
  • 相关文献

参考文献12

  • 1胡艳,王惠文.一种海量数据的分析技术——符号数据分析及应用[J].北京航空航天大学学报(社会科学版),2004,17(2):40-44. 被引量:19
  • 2Bock H H,Diday E.Analysis of Symbolic Data[M].New York:Springer-Verlag,2000.
  • 3Billard L,Diday E.From the statistics of data to the statistics of knowledge:Symbolic data analysis[J].Journal of the American Statistical Association,2003,98(462),470-487.
  • 4Billard L,Diday E.Symbolic Data Analysis:Conceptual Statistics and Data Mining[M].Chichester:John Viley & Sons,2006.
  • 5Guo J P,Li W H.Integrated Evaluation of Listed Companies by Factor Analysis for Symbolic Data[C].AMIGE 2008 Proceedings,2008:196-199.
  • 6Carvalho F D A T,Lechevallier Y.Partitional clustering algorithms for symbolic interval data based on single adaptive distances[J].Pattern Recognition,2009,42(7):1223-1236.
  • 7郭均鹏,李汶华.基于经验相关矩阵的区间主成分分析[J].管理科学学报,2008,11(3):49-52. 被引量:9
  • 8Billard L,Diday E.Regression analysis for interval-valued data[C]// In Kiers H A L,Rassoon J P,Groenen P J F,etc.Data Analysis,Classification,and Related Methods[C].Berlin:Springer-Verlag,2000:369-374.
  • 9Moore R E.Interval Analysis[M].Englewood Cliffs,New Jersey:Prentice-Hall,1966.
  • 10郭均鹏,李汶华.基于误差理论的区间主成分分析及其应用[J].数理统计与管理,2007,26(4):636-640. 被引量:6

二级参考文献41

共引文献51

同被引文献123

引证文献13

二级引证文献51

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部