We present a new algorithm for manifold learning and nonlinear dimensionality reduction. Based on a set of unorganized data points sampled with noise from a parameterized manifold, the local geometry of the manifold i...We present a new algorithm for manifold learning and nonlinear dimensionality reduction. Based on a set of unorganized data points sampled with noise from a parameterized manifold, the local geometry of the manifold is learned by constructing an approximation for the tangent space at each point, and those tangent spaces are then aligned to give the global coordinates of the data points with respect to the underlying manifold. We also present an error analysis of our algorithm showing that reconstruction errors can be quite small in some cases. We illustrate our algorithm using curves and surfaces both in 2D/3D Euclidean spaces and higher dimensional Euclidean spaces. We also address several theoretical and algorithmic issues for further research and improvements.展开更多
Image feature optimization is an important means to deal with high-dimensional image data in image semantic understanding and its applications. We formulate image feature optimization as the establishment of a mapping...Image feature optimization is an important means to deal with high-dimensional image data in image semantic understanding and its applications. We formulate image feature optimization as the establishment of a mapping between highand low-dimensional space via a five-tuple model. Nonlinear dimensionality reduction based on manifold learning provides a feasible way for solving such a problem. We propose a novel globular neighborhood based locally linear embedding (GNLLE) algorithm using neighborhood update and an incremental neighbor search scheme, which not only can handle sparse datasets but also has strong anti-noise capability and good topological stability. Given that the distance measure adopted in nonlinear dimensionality reduction is usually based on pairwise similarity calculation, we also present a globular neighborhood and path clustering based locally linear embedding (GNPCLLE) algorithm based on path-based clustering. Due to its full consideration of correlations between image data, GNPCLLE can eliminate the distortion of the overall topological structure within the dataset on the manifold. Experimental results on two image sets show the effectiveness and efficiency of the proposed algorithms.展开更多
We present our recent work on both linear and nonlinear data reduction methods and algorithms: for the linear case we discuss results on structure analysis of SVD of columnpartitioned matrices and sparse low-rank appr...We present our recent work on both linear and nonlinear data reduction methods and algorithms: for the linear case we discuss results on structure analysis of SVD of columnpartitioned matrices and sparse low-rank approximation; for the nonlinear case we investigate methods for nonlinear dimensionality reduction and manifold learning. The problems we address have attracted great deal of interest in data mining and machine learning.展开更多
The concise and informative representation of hyperspectral imagery is achieved via the introduced diffusion geometric coordinates derived from nonlinear dimension reduction maps - diffusion maps. The huge-volume high...The concise and informative representation of hyperspectral imagery is achieved via the introduced diffusion geometric coordinates derived from nonlinear dimension reduction maps - diffusion maps. The huge-volume high- dimensional spectral measurements are organized by the affinity graph where each node in this graph only connects to its local neighbors and each edge in this graph represents local similarity information. By normalizing the affinity graph appropriately, the diffusion operator of the underlying hyperspectral imagery is well-defined, which means that the Markov random walk can be simulated on the hyperspectral imagery. Therefore, the diffusion geometric coordinates, derived from the eigenfunctions and the associated eigenvalues of the diffusion operator, can capture the intrinsic geometric information of the hyperspectral imagery well, which gives more enhanced representation results than traditional linear methods, such as principal component analysis based methods. For large-scale full scene hyperspectral imagery, by exploiting the backbone approach, the computation complexity and the memory requirements are acceptable. Experiments also show that selecting suitable symmetrization normalization techniques while forming the diffusion operator is important to hyperspectral imagery representation.展开更多
As modern weapons and equipment undergo increasing levels of informatization,intelligence,and networking,the topology and traffic characteristics of battlefield data networks built with tactical data links are becomin...As modern weapons and equipment undergo increasing levels of informatization,intelligence,and networking,the topology and traffic characteristics of battlefield data networks built with tactical data links are becoming progressively complex.In this paper,we employ a traffic matrix to model the tactical data link network.We propose a method that utilizes the Maximum Variance Unfolding(MVU)algorithm to conduct nonlinear dimensionality reduction analysis on high-dimensional open network traffic matrix datasets.This approach introduces novel ideas and methods for future applications,including traffic prediction and anomaly analysis in real battlefield network environments.展开更多
In this paper, a new nonlinear fault detection technique based on locally linear embedding (LLE) is developed. LLE can efficiently compute the low-dimensional embedding of the data with the local neighborhood struct...In this paper, a new nonlinear fault detection technique based on locally linear embedding (LLE) is developed. LLE can efficiently compute the low-dimensional embedding of the data with the local neighborhood structure information preserved. In this method, a data-dependent kernel matrix which can reflect the nonlinear data structure is defined. Based on the kernel matrix, the Nystrrm formula makes the mapping extended to the testing data possible. With the kernel view of the LLE, two monitoring statistics are constructed. Together with the out of sample extensions, LLE is used for nonlinear fault detection. Simulation cases were studied to demonstrate the performance of the proposed method.展开更多
A new manifold learning method, called incremental alignment method (IAM), is proposed for nonlinear dimensionality reduction of high dimensional data with intrinsic low dimensionality. The main idea is to increment...A new manifold learning method, called incremental alignment method (IAM), is proposed for nonlinear dimensionality reduction of high dimensional data with intrinsic low dimensionality. The main idea is to incrementally align low-dimensional coordinates of input data patch-by-patch to iteratively generate the representation of the entire data.set. The method consists of two major steps, the incremental step and the alignment step. The incremental step incrementally searches neighborhood patch to be aligned in the next step, and the alignment step iteratively aligns the low-dimensional coordinates of the neighborhood patch searched to generate the embeddings of the entire dataset. Compared with the existing manifold learning methods, the proposed method dominates in several aspects: high efficiency, easy out-of-sample extension, well metric-preserving, and averting of the local minima issue. All these properties are supported by a series of experiments performed on the synthetic and real-life datasets. In addition, the computational complexity of the proposed method is analyzed, and its efficiency is theoretically argued and experimentally demonstrated.展开更多
Over the past few years,nonlinear manifold learning has been widely exploited in data analysis and machine learning.This paper presents a novel manifold learning algorithm,named atlas compatibility transformation(ACT)...Over the past few years,nonlinear manifold learning has been widely exploited in data analysis and machine learning.This paper presents a novel manifold learning algorithm,named atlas compatibility transformation(ACT),It solves two problems which correspond to two key points in the manifold definition:how to chart a given manifold and how to align the patches to a global coordinate space based on compatibility.For the first problem,we divide the manifold into maximal linear patch(MLP) based on normal vector field of the manifold.For the second problem,we align patches into an optimal global system by solving a generalized eigenvalue problem.Compared with the traditional method,the ACT could deal with noise datasets and fragment datasets.Moreover,the mappings between high dimensional space and low dimensional space are given.Experiments on both synthetic data and real-world data indicate the effection of the proposed algorithm.展开更多
文摘We present a new algorithm for manifold learning and nonlinear dimensionality reduction. Based on a set of unorganized data points sampled with noise from a parameterized manifold, the local geometry of the manifold is learned by constructing an approximation for the tangent space at each point, and those tangent spaces are then aligned to give the global coordinates of the data points with respect to the underlying manifold. We also present an error analysis of our algorithm showing that reconstruction errors can be quite small in some cases. We illustrate our algorithm using curves and surfaces both in 2D/3D Euclidean spaces and higher dimensional Euclidean spaces. We also address several theoretical and algorithmic issues for further research and improvements.
基金Project (No 2008AA01Z132) supported by the National High-Tech Research and Development Program of China
文摘Image feature optimization is an important means to deal with high-dimensional image data in image semantic understanding and its applications. We formulate image feature optimization as the establishment of a mapping between highand low-dimensional space via a five-tuple model. Nonlinear dimensionality reduction based on manifold learning provides a feasible way for solving such a problem. We propose a novel globular neighborhood based locally linear embedding (GNLLE) algorithm using neighborhood update and an incremental neighbor search scheme, which not only can handle sparse datasets but also has strong anti-noise capability and good topological stability. Given that the distance measure adopted in nonlinear dimensionality reduction is usually based on pairwise similarity calculation, we also present a globular neighborhood and path clustering based locally linear embedding (GNPCLLE) algorithm based on path-based clustering. Due to its full consideration of correlations between image data, GNPCLLE can eliminate the distortion of the overall topological structure within the dataset on the manifold. Experimental results on two image sets show the effectiveness and efficiency of the proposed algorithms.
基金This work was supported in part by the Special Funds for Major State Basic Research Projectsthe National Natural Science Foundation of China(Grants No.60372033 and 9901936)NSF CCR9901986,DMS 0311800.
文摘We present our recent work on both linear and nonlinear data reduction methods and algorithms: for the linear case we discuss results on structure analysis of SVD of columnpartitioned matrices and sparse low-rank approximation; for the nonlinear case we investigate methods for nonlinear dimensionality reduction and manifold learning. The problems we address have attracted great deal of interest in data mining and machine learning.
基金The National Key Technologies R & D Program during the 11th Five-Year Plan Period (No.2006BAB15B01)
文摘The concise and informative representation of hyperspectral imagery is achieved via the introduced diffusion geometric coordinates derived from nonlinear dimension reduction maps - diffusion maps. The huge-volume high- dimensional spectral measurements are organized by the affinity graph where each node in this graph only connects to its local neighbors and each edge in this graph represents local similarity information. By normalizing the affinity graph appropriately, the diffusion operator of the underlying hyperspectral imagery is well-defined, which means that the Markov random walk can be simulated on the hyperspectral imagery. Therefore, the diffusion geometric coordinates, derived from the eigenfunctions and the associated eigenvalues of the diffusion operator, can capture the intrinsic geometric information of the hyperspectral imagery well, which gives more enhanced representation results than traditional linear methods, such as principal component analysis based methods. For large-scale full scene hyperspectral imagery, by exploiting the backbone approach, the computation complexity and the memory requirements are acceptable. Experiments also show that selecting suitable symmetrization normalization techniques while forming the diffusion operator is important to hyperspectral imagery representation.
文摘As modern weapons and equipment undergo increasing levels of informatization,intelligence,and networking,the topology and traffic characteristics of battlefield data networks built with tactical data links are becoming progressively complex.In this paper,we employ a traffic matrix to model the tactical data link network.We propose a method that utilizes the Maximum Variance Unfolding(MVU)algorithm to conduct nonlinear dimensionality reduction analysis on high-dimensional open network traffic matrix datasets.This approach introduces novel ideas and methods for future applications,including traffic prediction and anomaly analysis in real battlefield network environments.
基金supported in part by the National Basic Research Program of China(973 Program)(No.2012CB720505)the National Natural Science Foundation of China(No.61273167)
文摘In this paper, a new nonlinear fault detection technique based on locally linear embedding (LLE) is developed. LLE can efficiently compute the low-dimensional embedding of the data with the local neighborhood structure information preserved. In this method, a data-dependent kernel matrix which can reflect the nonlinear data structure is defined. Based on the kernel matrix, the Nystrrm formula makes the mapping extended to the testing data possible. With the kernel view of the LLE, two monitoring statistics are constructed. Together with the out of sample extensions, LLE is used for nonlinear fault detection. Simulation cases were studied to demonstrate the performance of the proposed method.
基金supported by the National Basic Research 973 Program of China under Grant No.2007CB311002the National Natural Science Foundation of China under Grant No.60905003
文摘A new manifold learning method, called incremental alignment method (IAM), is proposed for nonlinear dimensionality reduction of high dimensional data with intrinsic low dimensionality. The main idea is to incrementally align low-dimensional coordinates of input data patch-by-patch to iteratively generate the representation of the entire data.set. The method consists of two major steps, the incremental step and the alignment step. The incremental step incrementally searches neighborhood patch to be aligned in the next step, and the alignment step iteratively aligns the low-dimensional coordinates of the neighborhood patch searched to generate the embeddings of the entire dataset. Compared with the existing manifold learning methods, the proposed method dominates in several aspects: high efficiency, easy out-of-sample extension, well metric-preserving, and averting of the local minima issue. All these properties are supported by a series of experiments performed on the synthetic and real-life datasets. In addition, the computational complexity of the proposed method is analyzed, and its efficiency is theoretically argued and experimentally demonstrated.
基金supported by National Natural Science Foundation of China(No.61171145)Shanghai Educational Development Fundation(No.12ZZ083)
文摘Over the past few years,nonlinear manifold learning has been widely exploited in data analysis and machine learning.This paper presents a novel manifold learning algorithm,named atlas compatibility transformation(ACT),It solves two problems which correspond to two key points in the manifold definition:how to chart a given manifold and how to align the patches to a global coordinate space based on compatibility.For the first problem,we divide the manifold into maximal linear patch(MLP) based on normal vector field of the manifold.For the second problem,we align patches into an optimal global system by solving a generalized eigenvalue problem.Compared with the traditional method,the ACT could deal with noise datasets and fragment datasets.Moreover,the mappings between high dimensional space and low dimensional space are given.Experiments on both synthetic data and real-world data indicate the effection of the proposed algorithm.