基于MLE与流形学习的数据可视化方法

Data Visualization Method Based on MLE and Manifold Learning

下载PDF

导出

摘要在一个给定的样本空间划分下,每个数据集是一个潜在的多项分布的抽样假设。通过对模型参数的最大似然估计,数据集的潜在分布近似于一个离散化的经验分布。根据推广的多项分布族的Fisher度量,潜在分布的信息差异可近似为经验分布间的差异,为基于MLE嵌入得到的信息流形上非监督学习创造了条件。当约简空间的维数为2或3时,原数据集之间的自然可分性可通过降维数据展现出来。实验结果表明,该方法能应用到大样本数据集或彩色图像等高维结构化数据的可视化。 The method is stemmed from the assumption that each data set is a probabilistic realization of an underlying multinomial distribution under a partition on sample space. With the MLE of model parameters, the underlying distribution of a data set can be approximated by a discretized probability distribution. With the generalized Fisher metric on multinomial manifold with boundary, the information divergence between underlying models can be approximated by the corresponding divergence between estimated distributions, it provides the necessary element for unsupervised learning on information manifold. The natural separation of original data sets can be visualized when the dimension of reduced space is two or three. Experimental result shows that the method can be applied to visualization of big sample data sets or color image data sets.

作者邹健刘传才

机构地区南京理工大学计算机学院安徽工程大学应用数理学院

出处《计算机工程》 CAS CSCD 北大核心 2011年第1期4-6,共3页 Computer Engineering

基金国家自然科学基金资助项目(9082004) 国家"863"计划基金资助项目(2006AA04Z238) 安徽自然科学基金资助项目(KJ2007B056)

关键词多项分布最大似然估计流形学习数据可视化 multinomial distribution maximum likelihood estimation manifold learning data visualization

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献7

1van der M L J P, Postma E O, van den H H J. Dimensionality Reduction: A Comparative Review[R]. Tilburg, Netherlands: Tilburg Centre for Creative Computing, Tilburg University, Technical Report: 2009-005, 2009.
2Ferreira de Oliveira M C, Levkowitz H. From Visual Data Exploration to Visual Data Mining: A Survey[J]. IEEE Transactions on Visualization and Computer Graphics, 2003, 9(3): 378-394.
3蒋润,周激流,雷刚,李晓华.基于有监督流形学习的正交投影降维[J].计算机工程,2009,35(23):207-208. 被引量：4
4Kondor R. The Information Geometry of the Multinomial Distribution[EB/OL]. (2003-08-10). http://www.its.caltech.edu/-risi/notes/ multinomial.pdf.
5Lee S M, Abbott A L, Araman P A. Dimensionality Reduction and Clustering on Statistical Manifolds[C]//Proc. of IEEE Conf. on Computer Vision and Pattern .Recognition. Minneapolis, USA: [s. n.], 2007: 1-7.
6Lebanon G. Information Geometry, the Embedding Principle, and Document Classification[C]//Proc. of the 2nd International Symposium on Information Geometry and Its Applications. Tokyo Japan: [s. n.], 2005.
7COIL-100 Database[EB/OL]. (2009-10-30). http://www.cs.columbia. edulCAVE/software/softlib/coil- 100.php.

二级参考文献5

1Jolliffe I T. Principal Component Analysis[M]. New York, USA: Springer-Verlag, 1989.
2Roweis S T, Saul L K. Nonlinear Dimensionality Reduction by Locally Linear Embedding[J]. Science, 2000, (290): 2323-2326.
3Kokiopoulou E, Saad Y. Face Recognition Using OPRA- faces[C]//Proc, of IEEE Int'l. Conf. on Machine Learning and Application. NY, USA: [s. n.], 2005:15-17.
4Qing Xiangyun, Wang Xingyu. Face Recognition Using Laplacian+ OPRA-faces[C]//Proc. of IEEE Conf. on Intelligent Control and Automation. Dalian, China: [s. n.], 2006:10013-10016.
5Zhao Qijun, Zhang David, Lu Hongtao. Supervised LLE in ICA Space for Facial Expression Recognition[C]//Proc. of International Conference on Neural Networks and Brain. Shanghai, China: Is. n.], 2005: 1970-1975.

共引文献3

1钟明,薛惠锋,梅觅.基于局部线性嵌入的最大散度矩阵算法[J].计算机工程,2011,37(12):176-178. 被引量：1
2靳丽丽,陈秀宏.有监督的二维分块局部相似与差异算法[J].计算机工程,2011,37(21):117-119.
3王伟,毕笃彦,孙恒义.基于改进ISOMAP的飞机识别算法[J].计算机工程,2011,37(21):144-145. 被引量：1

1吴新玲.一种基于混合概率模型的文本分类方法[J].微电子学与计算机,2011,28(11):133-136.
2张玥,刘传才,邹健,卢桂馥.颜色共生矩阵的Fisher信息度量及识别应用[J].计算机工程与应用,2015,51(5):19-22.
3杨赛,赵春霞.图像分类中的概率乘积核函数[J].中国图象图形学报,2013,18(8):961-967. 被引量：2
4蒋丹妮,赵向文,方青.基于Pearson分布族的SAR影像分布模型确定与对比度增强[J].测绘与空间地理信息,2013,36(12):182-184.
5曾利军,刘卉.一种改进RSSI的无线传感网络定位算法研究[J].现代计算机（中旬刊）,2016(6):16-19. 被引量：1
6史庆伟,李艳妮,郭朋亮.科技文献中作者研究兴趣动态发现[J].计算机应用,2013,33(11):3080-3083. 被引量：13
7张贝贝.荣联云盘系统打造时尚办公新平台[J].软件和信息服务,2015(2).
8赵慧,王丽芳,介婧.柯西分布概率模型的copula分布估计算法[J].太原科技大学学报,2013,34(4):266-271. 被引量：2
9张翠霞,武新乾,轩凤霞.基于MATLAB GUI的概率分布程序设计[J].洛阳师范学院学报,2013,32(11):73-77. 被引量：3
10陈千,郭鑫,王素格,张虎.文本流多粒度主题结构建模研究[J].中文信息学报,2015,29(1):118-125. 被引量：2

计算机工程

2011年第1期

浏览历史

内容加载中请稍等...

基于MLE与流形学习的数据可视化方法

参考文献7

二级参考文献5

共引文献3

相关作者

相关机构

相关主题

浏览历史