期刊文献+

一种区间PCA的效度分析方法 被引量:6

Analysis of validity of the PCA for interval data
下载PDF
导出
摘要 关于区间数据的主成分分析(PCA)方法已取得了较丰富的研究成果,但少见对这些方法效度的评价研究.针对该问题,基于Hausdorff距离用于定义两个紧集之间距离的考虑,将区间数视为一个紧集,定义了区间数之间的距离,并研究了区间矩阵的距离.在此基础上,根据PCA方法的原理,建立了一个区间PCA方法的效度评价指标.该指标取值在0与1之间,其取值越大,说明区间PCA方法效度越高,反之则效度越小.最后,采用模拟的方法,分别选取均匀分布和正态分布两种类型的区间数据样本,对目前最常用的两种区间PCA方法——顶点法和中点法进行了效度分析,验证了文中所提的效度指标的正确性. Many achievements about the methods of principal component analysis (PCA) for interval data have been got. While the evaluation of the methods' validity is seldom done. Aimed at this problem, seeing an interval datum as a compact set, the distance between two interval numbers is defined based on Hausdorff distance. Furthermore, the distance between two interval matrices was studied. Based on this, according to the theory of PCA,a validity index of PCA for interval data was constructed. The index is obverse whose value is between 0 and 1. The bigger the index is, the higher the validity of the interval PCA method is. Vertices-PCA and eenters-PCA are the two main methods of PCA for interval data. Validity study of the two methods was made by means of simulation, where uniform and normal data were generated and used. The correctness of the validity index defined in the paper is demonstrated by the simulation.
出处 《系统工程学报》 CSCD 北大核心 2009年第2期226-230,共5页 Journal of Systems Engineering
基金 国家自然科学基金资助项目(70701026)
关键词 区间数 主成分分析 效度 HAUSDORFF距离 interval number principal component analysis validity Hausdorff distance
  • 相关文献

参考文献12

  • 1Moore R E. Interval Analysis[ M]. New Jersey: Prentice-Hall, 1966.
  • 2Bock H H, Diday E. Analysis of Symbolic Data[M]. New York: Springer Verlag, 2000.
  • 3胡艳,王惠文.一种海量数据的分析技术——符号数据分析及应用[J].北京航空航天大学学报(社会科学版),2004,17(2):40-44. 被引量:19
  • 4Cazes P, Chouakria A, Diday E, et al. Extension de l' analyse en composantes principales a des donnees de type intervalle [J]. Revue de Statistique Appliquee, 1997, 45 (3) : 5-24.
  • 5Lauro N C , Palumbo F. Principal components analysis of interval data : A symbolic data analysis approach [J]. Computational Statistics, 2000, 15 ( 1 ) : 73-87.
  • 6Pierpaolo D, Paolo G. A least squares approach to principal component analysis for interval valued data[ J]. Chemometrics and Intelligent Laboratory Systems, 2004, 70(2) : 179-192.
  • 7Palumbo F, Lauro N C. A PCA for interval valued data based on midpoints and radii[ A]. In: Yanai H, Okada A , Shigemasu K, et al.. New Developments in Psychometrics[M]. Tokyo: Springer Verlag, 2003.
  • 8Denoeux T, Masson M H. Principal component analysis of fuzzy data using autoassociative neural networks [ J ]. IEEE Transactions on Fuzzy Systems, 2004,12(3) : 336-349.
  • 9Giordani P, Kiers H A L. Three-way principal component analysis of interval-valued data [J]. Journal of Chemometrics, 2004, 18(5) : 253-264.
  • 10D' Urso P, Giordani P. A possibilistic approach to latent component analysis for fuzzy data [ J ]. Fuzzy Sets and Systems, 2005, 150(2) : 285-305.

共引文献18

同被引文献71

引证文献6

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部