Aiming at the defects of the nodes in the low energy adaptive clustering hierarchy (LEACH) protocol, such as high energy consumption and uneven energy consumption, a two-level linear clustering protocol is built. Th...Aiming at the defects of the nodes in the low energy adaptive clustering hierarchy (LEACH) protocol, such as high energy consumption and uneven energy consumption, a two-level linear clustering protocol is built. The protocol improves the way of the nodes distribution at random. The terminal nodes which have not been a two-level cluster head in the cluster can compete with the principle of equivalent possibility, and on the basis of the rest energy of nodes the two-level cluster head is selected at last. The single hop within the cluster and single hop or multiple hops between clusters are used. Simulation experiment results show that the performance of the two-level linear clustering protocol applied to the Hexi corridor agricultural field is superior to that of the LEACH protocol in the survival time of network nodes, the ratio of success, and the remaining energy of network nodes.展开更多
In this paper, a data-driven linear clustering(DLC) method is proposed to solve the long-term system load forecasting problem caused by load fluctuation in some developed cities. A large substation load dataset with a...In this paper, a data-driven linear clustering(DLC) method is proposed to solve the long-term system load forecasting problem caused by load fluctuation in some developed cities. A large substation load dataset with annual interval is utilized and firstly preprocessed by the proposed linear clustering method to prepare for modelling.Then optimal autoregressive integrated moving average(ARIMA) models are constructed for the sum series of each obtained cluster to forecast their respective future load. Finally, the system load forecasting result is obtained by summing up all the ARIMA forecasts. From error analysis and application results, it is both theoretically and practically proved that the proposed DLC method can reduce random forecasting errors while guaranteeing modelling accuracy, so that a more stable and precise system load forecasting result can be obtained.展开更多
Two mixed linear models are proposed for grouping populations by a dissimilarity coefficent which has two parameters for squared difference of marginal mean and variance component of interaction.Cluster trees can be c...Two mixed linear models are proposed for grouping populations by a dissimilarity coefficent which has two parameters for squared difference of marginal mean and variance component of interaction.Cluster trees can be constructed by the mixed linear model approaches for experimental data with sampling errors within populations or with some missing values.Unweighted pair-group method ( UPGM ) is suggested as fusion method. Sampling variances of estimated dissimilarity coefficient can be obtained by the jackknife procedure.A one-tail t-test is applicable for detecting significance of dissimilarity of populaions within specific group.Unbiasedness and efficiency for estimation of dissimilarity coefficients are proved by Monte Carolo simulations.Worked example from cotton yield data is given for demonstration of the use of these cluster methods.展开更多
We propose a practical scheme to generate cluster states by simultaneously accomplishing two-qubit conditional gating on an array of equidistant ions by using transverse modes. Our operation is robust to heating and i...We propose a practical scheme to generate cluster states by simultaneously accomplishing two-qubit conditional gating on an array of equidistant ions by using transverse modes. Our operation is robust to heating and insensitive to Lamb-Dicke parameter. Meanwhile, as it is carried out in a geometric quantum computing fashion, our scheme enables the fast and high-fidelity generation of cluster states. The experimental feasibility is discussed with sophisticated ion trap techniques.展开更多
The evolution of configurations of Aln (n=3,4,6,13,19) clusters were investigated using linear synchronous transit (LST) method. The stable structures of Al3, Al4, Al6, Al13, Al19 clusters were confirmed to be triangl...The evolution of configurations of Aln (n=3,4,6,13,19) clusters were investigated using linear synchronous transit (LST) method. The stable structures of Al3, Al4, Al6, Al13, Al19 clusters were confirmed to be triangle, rhombus, octahedron, icosahedron and double icosahedron, respectively. For Al6 and Al19 clusters there are metastable structures of parallelogram and octahedron, respectively, whereas in the Al3, Al4 and Al13 clusters, no metastable configuration are validated. A large energy gap and a low energy barrier between the parallelogram and the octahedron of the Al6 cluster indicate the transformation from its metastable configuration to stable octahedron to be rather easy. By contrast, a small energy gap and a high energy barrier between the stable and metastable structures of Al19 cluster mean its configuration evolution from the octahedron to the double icosahedron occurs hardly, therefore the metastable octahedron configuration of Al19 cluster can be extensively detected in experiments and simulations.展开更多
This paper deals with the cluster exponential synchronization of a class ot complex networks wlm nyorm coupm^g and time-varying delay. Through constructing an appropriate Lyapunov-Krasovskii functional and applying th...This paper deals with the cluster exponential synchronization of a class ot complex networks wlm nyorm coupm^g and time-varying delay. Through constructing an appropriate Lyapunov-Krasovskii functional and applying the theory of the Kronecker product of matrices and the linear matrix inequality (LMI) technique, several novel sufficient conditions for cluster exponential synchronization are obtained. These cluster exponential synchronization conditions adopt the bounds of both time delay and its derivative, which are less conservative. Finally, the numerical simulations are performed to show the effectiveness of the theoretical results.展开更多
Aiming to provide an appropriate number K of clusters, in this paper, we propose a new criterion function - H criterion function, whose three properties have also been proved. We validate the performance of the H crit...Aiming to provide an appropriate number K of clusters, in this paper, we propose a new criterion function - H criterion function, whose three properties have also been proved. We validate the performance of the H criterion function on one artificial dataset and three real-world datasets, and the results are almostly consistent with a previous method. The nonparametric criterion we proposed is intuitive, simple and the computational cost is acceptable.展开更多
The present paper discusses the application of localized linear models for the prediction of hourly PM10 concentration values. The advantages of the proposed approach lies in the clustering of the data based on a comm...The present paper discusses the application of localized linear models for the prediction of hourly PM10 concentration values. The advantages of the proposed approach lies in the clustering of the data based on a common property and the utilization of the target variable during this process, which enables the development of more coherent models. Two alternative localized linear modelling approaches are developed and compared against benchmark models, one in which data are clustered based on their spatial proximity on the embedding space and one novel approach in which grouped data are described by the same linear model. Since the target variable is unknown during the prediction stage, a complimentary pattern recognition approach is developed to account for this lack of information. The application of the developed approach on several PM10 data sets from the Greater Athens Area, Helsinki and London monitoring networks returned a significant reduction of the prediction error under all examined metrics against conventional forecasting schemes such as the linear regression and the neural networks.展开更多
Optical responses in dilute composites are controlled through the local dielectric resonance of metallic clusters. We consider two located metallic clusters close to each other with admittances ε1 and ε2. Through va...Optical responses in dilute composites are controlled through the local dielectric resonance of metallic clusters. We consider two located metallic clusters close to each other with admittances ε1 and ε2. Through varying the difference admittance ratio η[= (ε2 - ε0)/(ε1 - ε0)], we find that their optical responses are determined by the local resonance. There is a blueshift of absorption peaks with the increase of η- Simultaneously, it is known that the absorption peaks will be redshifted by enlarging the cluster size. By adjusting the nano-metallic cluster geometry, size and admittances, we can control the positions and intensities of absorption peaks effectively. We have also deduced the effective linear optical responses of three-component composites εe=ε0 (1+∑^n n=1[(γn1+ηγn2)/(ε0(s-sn))]) and the sum rule of cross sections:∑^n n=1(γn1+ηγn2)=Nh1+Nh2,, where Nh1and Nh2 are the numbers of εl and ε2 bonds along the electric field, respectively. These results may be beneficial to the study of surface plasmon resonances on a nanometre scale.展开更多
函数型聚类分析在统计学领域被广泛关注,其分析过程通常在降维目标实现后进行。为了有效解决函数型主成分聚类问题,文章结合局部线性嵌入算法(Locally Linear Embedding,LLE)在非线性空间下的适用性,提出了一种局部线性下的函数型主成...函数型聚类分析在统计学领域被广泛关注,其分析过程通常在降维目标实现后进行。为了有效解决函数型主成分聚类问题,文章结合局部线性嵌入算法(Locally Linear Embedding,LLE)在非线性空间下的适用性,提出了一种局部线性下的函数型主成分分析模型(LLE Function Principle Component Analysis,LFPCA)。首先,采用函数型主成分分析法作为降维目标方法,改进了FPCA的算法模型,通过将LLE算法的权重系数矩阵与函数型主成分定义相结合,构建出一个适用于非线性空间下的聚类算法;其次,在求解算法的过程中定义了函数型主成分得分,并结合EM算法构建出GMM模型来近似函数型算法的概率密度函数,使模型更高效且适用性更强;最后,通过随机模拟实验及应用分析验证了LFPCA算法模型在真实数据集上具有良好的聚类效能。展开更多
Today, Linear Mixed Models (LMMs) are fitted, mostly, by assuming that random effects and errors have Gaussian distributions, therefore using Maximum Likelihood (ML) or REML estimation. However, for many data sets, th...Today, Linear Mixed Models (LMMs) are fitted, mostly, by assuming that random effects and errors have Gaussian distributions, therefore using Maximum Likelihood (ML) or REML estimation. However, for many data sets, that double assumption is unlikely to hold, particularly for the random effects, a crucial component </span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">in </span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">which assessment of magnitude is key in such modeling. Alternative fitting methods not relying on that assumption (as ANOVA ones and Rao</span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">’</span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">s MINQUE) apply, quite often, only to the very constrained class of variance components models. In this paper, a new computationally feasible estimation methodology is designed, first for the widely used class of 2-level (or longitudinal) LMMs with only assumption (beyond the usual basic ones) that residual errors are uncorrelated and homoscedastic, with no distributional assumption imposed on the random effects. A major asset of this new approach is that it yields nonnegative variance estimates and covariance matrices estimates which are symmetric and, at least, positive semi-definite. Furthermore, it is shown that when the LMM is, indeed, Gaussian, this new methodology differs from ML just through a slight variation in the denominator of the residual variance estimate. The new methodology actually generalizes to LMMs a well known nonparametric fitting procedure for standard Linear Models. Finally, the methodology is also extended to ANOVA LMMs, generalizing an old method by Henderson for ML estimation in such models under normality.展开更多
The syntheses and structures of a novel series of polynuclear coinage metal cluster compounds are discussed. The most fascinating structural characteristic of the Au-Ag alloy clusters with Au_18Ag_19 and Au_18Ag_19 co...The syntheses and structures of a novel series of polynuclear coinage metal cluster compounds are discussed. The most fascinating structural characteristic of the Au-Ag alloy clusters with Au_18Ag_19 and Au_18Ag_19 cores and phosphine ligands, which were prepared by the reduction of mononuclear coinage metal complexes R3PAuX and R3PAgX with NaBH4 in organic solution, is their construction from 13-atom gold-centred icosahedral Au_7Ag_6 building blocks. The structures of the polynuclear coinage metal clusters with R2dtc ligands are variable with either unlimited linear chain or triangular M3 units.展开更多
This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is...This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is well known, commonly used approach to deal with missing data is complete-case data. Combined the idea of complete-case data with a discussion of shrinkage estimation is made on different cluster. In order to avoid the biased results as well as improve the estimation efficiency, this article introduces Group Least Absolute Shrinkage and Selection Operator (Group Lasso) to semiparametric model. That is to say, the method combines the approach of local polynomial smoothing and the Least Absolute Shrinkage and Selection Operator. In that case, it can conduct nonparametric estimation and variable selection in a computationally efficient manner. According to the same criterion, the parametric estimators are also obtained. Additionally, for each cluster, the nonparametric and parametric estimators are derived, and then compute the weighted average per cluster as finally estimators. Moreover, the large sample properties of estimators are also derived respectively.展开更多
Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. On...Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. One approach is to summarize large datasets in such a way that the resulting summary dataset is of manageable size. Histogram has received significant attention as summarization/representative object for large database. But, it suffers from computational and space complexity. In this paper, we propose an idea to transform the histogram object into a Piecewise Linear Regression (PLR) line object and suggest that PLR objects can be less computational and storage intensive while compared to those of histograms. On the other hand to carry out a cluster analysis, we propose a distance measure for computing the distance between the PLR lines. Case study is presented based on the real data of online education system LMS. This demonstrates that PLR is a powerful knowledge representative for very large database.展开更多
彩色点画是一种从视觉上由大量小像素点构建图像的艺术技术,像素个数的多少直接影响着构图的成本。其优化选点构图方法为实现低成本打印提供了一个重要的方式。目前,点画生成存在着多通道采样点难以均匀分布,颜色层次难以兼顾等难点,并...彩色点画是一种从视觉上由大量小像素点构建图像的艺术技术,像素个数的多少直接影响着构图的成本。其优化选点构图方法为实现低成本打印提供了一个重要的方式。目前,点画生成存在着多通道采样点难以均匀分布,颜色层次难以兼顾等难点,并耗费大量的计算成本。对此,提出了一种基于超像素自适应聚类和线性规划最优选点的彩色点画生成方法,该方法在初步超像素划分图像的基础上,使用基于颜色密度峰值的自适应聚类方法得到最佳聚类个数,并进一步划分子聚类,然后根据每个子聚类的颜色均值作为子聚类内部选点的最佳间隔距离,在选点的同时依据SSIM指标,建立目标优化模型,通过数学优化器Gurobi实现模型选点,使点保留最少个数的目标基础上,同时保持聚类内部分布均匀和颜色渐变层次,以提高所生成的点画图像的可视化效果。实验结果表明,本文算法极大地降低了像素个数并在生成的点画的平均结构相似性(mean structural similarity index measure,SSIM)、峰值信噪比(peak signal to noise ratio,PSNR)等评价指标方面均优于当前方法。展开更多
A novel adaptive non-linear mapping (ANLM), integrating an adaptive mapping error (AME) with a chaosgenetic algorithm (CGA) including chaotic variable, was proposed to overcome the deficiencies of non-linear map...A novel adaptive non-linear mapping (ANLM), integrating an adaptive mapping error (AME) with a chaosgenetic algorithm (CGA) including chaotic variable, was proposed to overcome the deficiencies of non-linear mapping (NLM). The value of AME weight factor is determined according to the relative deviation square of distance between the two mapping points and the corresponding original objects distance. The larger the relative deviation square between two distances is, the larger the value of the corresponding weight factor is. Due to chaotic mapping operator, the evolutional process of CGA makes the individuals of subgenerations distributed ergodieally in the defined space and circumvents the premature of the individuals of subgenerations. The comparison results demonstrated that the whole performance of CGA is better than that of traditional genetic algorithm. Furthermore, a typical example of mapping eight-dimenslonal olive oil samples onto two-dimensional plane was employed to verify the effectiveness of ANLM. The results showed that the topology-preserving map obtained by ANLM can well represent the classification of original objects and is much better than that obtained by NLM.展开更多
基金supported by the Foundation Projects in Gansu Province Department of Education under Grant No.2015A-163
文摘Aiming at the defects of the nodes in the low energy adaptive clustering hierarchy (LEACH) protocol, such as high energy consumption and uneven energy consumption, a two-level linear clustering protocol is built. The protocol improves the way of the nodes distribution at random. The terminal nodes which have not been a two-level cluster head in the cluster can compete with the principle of equivalent possibility, and on the basis of the rest energy of nodes the two-level cluster head is selected at last. The single hop within the cluster and single hop or multiple hops between clusters are used. Simulation experiment results show that the performance of the two-level linear clustering protocol applied to the Hexi corridor agricultural field is superior to that of the LEACH protocol in the survival time of network nodes, the ratio of success, and the remaining energy of network nodes.
基金supported by the National Energy(Shanghai)Smart Grid Research Centerthe National Natural Science Foundation of China(No.51377103)
文摘In this paper, a data-driven linear clustering(DLC) method is proposed to solve the long-term system load forecasting problem caused by load fluctuation in some developed cities. A large substation load dataset with annual interval is utilized and firstly preprocessed by the proposed linear clustering method to prepare for modelling.Then optimal autoregressive integrated moving average(ARIMA) models are constructed for the sum series of each obtained cluster to forecast their respective future load. Finally, the system load forecasting result is obtained by summing up all the ARIMA forecasts. From error analysis and application results, it is both theoretically and practically proved that the proposed DLC method can reduce random forecasting errors while guaranteeing modelling accuracy, so that a more stable and precise system load forecasting result can be obtained.
文摘Two mixed linear models are proposed for grouping populations by a dissimilarity coefficent which has two parameters for squared difference of marginal mean and variance component of interaction.Cluster trees can be constructed by the mixed linear model approaches for experimental data with sampling errors within populations or with some missing values.Unweighted pair-group method ( UPGM ) is suggested as fusion method. Sampling variances of estimated dissimilarity coefficient can be obtained by the jackknife procedure.A one-tail t-test is applicable for detecting significance of dissimilarity of populaions within specific group.Unbiasedness and efficiency for estimation of dissimilarity coefficients are proved by Monte Carolo simulations.Worked example from cotton yield data is given for demonstration of the use of these cluster methods.
基金supported by the National Natural Science Foundation of China (Grant Nos.10774163 and 10804132)
文摘We propose a practical scheme to generate cluster states by simultaneously accomplishing two-qubit conditional gating on an array of equidistant ions by using transverse modes. Our operation is robust to heating and insensitive to Lamb-Dicke parameter. Meanwhile, as it is carried out in a geometric quantum computing fashion, our scheme enables the fast and high-fidelity generation of cluster states. The experimental feasibility is discussed with sophisticated ion trap techniques.
基金Prqject(104139) supported by the Ministry of Education of China Project(03-Y3069) supported by the Hunan Province Natural Science Fund
文摘The evolution of configurations of Aln (n=3,4,6,13,19) clusters were investigated using linear synchronous transit (LST) method. The stable structures of Al3, Al4, Al6, Al13, Al19 clusters were confirmed to be triangle, rhombus, octahedron, icosahedron and double icosahedron, respectively. For Al6 and Al19 clusters there are metastable structures of parallelogram and octahedron, respectively, whereas in the Al3, Al4 and Al13 clusters, no metastable configuration are validated. A large energy gap and a low energy barrier between the parallelogram and the octahedron of the Al6 cluster indicate the transformation from its metastable configuration to stable octahedron to be rather easy. By contrast, a small energy gap and a high energy barrier between the stable and metastable structures of Al19 cluster mean its configuration evolution from the octahedron to the double icosahedron occurs hardly, therefore the metastable octahedron configuration of Al19 cluster can be extensively detected in experiments and simulations.
基金supported by the National Natural Science Foundation of China (Grant Nos. 61074073 and 61034005)the Fundamental Research Funds for the Central Universities of China (Grant No. N110504001)the Open Project of the State Key Laboratory of Management and Control for Complex Systems, China (Grant No. 20110107)
文摘This paper deals with the cluster exponential synchronization of a class ot complex networks wlm nyorm coupm^g and time-varying delay. Through constructing an appropriate Lyapunov-Krasovskii functional and applying the theory of the Kronecker product of matrices and the linear matrix inequality (LMI) technique, several novel sufficient conditions for cluster exponential synchronization are obtained. These cluster exponential synchronization conditions adopt the bounds of both time delay and its derivative, which are less conservative. Finally, the numerical simulations are performed to show the effectiveness of the theoretical results.
文摘Aiming to provide an appropriate number K of clusters, in this paper, we propose a new criterion function - H criterion function, whose three properties have also been proved. We validate the performance of the H criterion function on one artificial dataset and three real-world datasets, and the results are almostly consistent with a previous method. The nonparametric criterion we proposed is intuitive, simple and the computational cost is acceptable.
文摘The present paper discusses the application of localized linear models for the prediction of hourly PM10 concentration values. The advantages of the proposed approach lies in the clustering of the data based on a common property and the utilization of the target variable during this process, which enables the development of more coherent models. Two alternative localized linear modelling approaches are developed and compared against benchmark models, one in which data are clustered based on their spatial proximity on the embedding space and one novel approach in which grouped data are described by the same linear model. Since the target variable is unknown during the prediction stage, a complimentary pattern recognition approach is developed to account for this lack of information. The application of the developed approach on several PM10 data sets from the Greater Athens Area, Helsinki and London monitoring networks returned a significant reduction of the prediction error under all examined metrics against conventional forecasting schemes such as the linear regression and the neural networks.
基金Project supported by the National Natural Science Foundation of China(Grant Nos 10304001, 10334010, 10521002, 10434020, 10328407 and 90501007).
文摘Optical responses in dilute composites are controlled through the local dielectric resonance of metallic clusters. We consider two located metallic clusters close to each other with admittances ε1 and ε2. Through varying the difference admittance ratio η[= (ε2 - ε0)/(ε1 - ε0)], we find that their optical responses are determined by the local resonance. There is a blueshift of absorption peaks with the increase of η- Simultaneously, it is known that the absorption peaks will be redshifted by enlarging the cluster size. By adjusting the nano-metallic cluster geometry, size and admittances, we can control the positions and intensities of absorption peaks effectively. We have also deduced the effective linear optical responses of three-component composites εe=ε0 (1+∑^n n=1[(γn1+ηγn2)/(ε0(s-sn))]) and the sum rule of cross sections:∑^n n=1(γn1+ηγn2)=Nh1+Nh2,, where Nh1and Nh2 are the numbers of εl and ε2 bonds along the electric field, respectively. These results may be beneficial to the study of surface plasmon resonances on a nanometre scale.
文摘函数型聚类分析在统计学领域被广泛关注,其分析过程通常在降维目标实现后进行。为了有效解决函数型主成分聚类问题,文章结合局部线性嵌入算法(Locally Linear Embedding,LLE)在非线性空间下的适用性,提出了一种局部线性下的函数型主成分分析模型(LLE Function Principle Component Analysis,LFPCA)。首先,采用函数型主成分分析法作为降维目标方法,改进了FPCA的算法模型,通过将LLE算法的权重系数矩阵与函数型主成分定义相结合,构建出一个适用于非线性空间下的聚类算法;其次,在求解算法的过程中定义了函数型主成分得分,并结合EM算法构建出GMM模型来近似函数型算法的概率密度函数,使模型更高效且适用性更强;最后,通过随机模拟实验及应用分析验证了LFPCA算法模型在真实数据集上具有良好的聚类效能。
文摘Today, Linear Mixed Models (LMMs) are fitted, mostly, by assuming that random effects and errors have Gaussian distributions, therefore using Maximum Likelihood (ML) or REML estimation. However, for many data sets, that double assumption is unlikely to hold, particularly for the random effects, a crucial component </span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">in </span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">which assessment of magnitude is key in such modeling. Alternative fitting methods not relying on that assumption (as ANOVA ones and Rao</span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">’</span></span></span><span style="font-family:Verdana;"><span style="font-family:Verdana;"><span style="font-family:Verdana;">s MINQUE) apply, quite often, only to the very constrained class of variance components models. In this paper, a new computationally feasible estimation methodology is designed, first for the widely used class of 2-level (or longitudinal) LMMs with only assumption (beyond the usual basic ones) that residual errors are uncorrelated and homoscedastic, with no distributional assumption imposed on the random effects. A major asset of this new approach is that it yields nonnegative variance estimates and covariance matrices estimates which are symmetric and, at least, positive semi-definite. Furthermore, it is shown that when the LMM is, indeed, Gaussian, this new methodology differs from ML just through a slight variation in the denominator of the residual variance estimate. The new methodology actually generalizes to LMMs a well known nonparametric fitting procedure for standard Linear Models. Finally, the methodology is also extended to ANOVA LMMs, generalizing an old method by Henderson for ML estimation in such models under normality.
文摘The syntheses and structures of a novel series of polynuclear coinage metal cluster compounds are discussed. The most fascinating structural characteristic of the Au-Ag alloy clusters with Au_18Ag_19 and Au_18Ag_19 cores and phosphine ligands, which were prepared by the reduction of mononuclear coinage metal complexes R3PAuX and R3PAgX with NaBH4 in organic solution, is their construction from 13-atom gold-centred icosahedral Au_7Ag_6 building blocks. The structures of the polynuclear coinage metal clusters with R2dtc ligands are variable with either unlimited linear chain or triangular M3 units.
文摘This paper simultaneously investigates variable selection and imputation estimation of semiparametric partially linear varying-coefficient model in that case where there exist missing responses for cluster data. As is well known, commonly used approach to deal with missing data is complete-case data. Combined the idea of complete-case data with a discussion of shrinkage estimation is made on different cluster. In order to avoid the biased results as well as improve the estimation efficiency, this article introduces Group Least Absolute Shrinkage and Selection Operator (Group Lasso) to semiparametric model. That is to say, the method combines the approach of local polynomial smoothing and the Least Absolute Shrinkage and Selection Operator. In that case, it can conduct nonparametric estimation and variable selection in a computationally efficient manner. According to the same criterion, the parametric estimators are also obtained. Additionally, for each cluster, the nonparametric and parametric estimators are derived, and then compute the weighted average per cluster as finally estimators. Moreover, the large sample properties of estimators are also derived respectively.
文摘Recent developments in database technology have seen a wide variety of data being stored in huge collections. The wide variety makes the analysis tasks of a generic database a strenuous task in knowledge discovery. One approach is to summarize large datasets in such a way that the resulting summary dataset is of manageable size. Histogram has received significant attention as summarization/representative object for large database. But, it suffers from computational and space complexity. In this paper, we propose an idea to transform the histogram object into a Piecewise Linear Regression (PLR) line object and suggest that PLR objects can be less computational and storage intensive while compared to those of histograms. On the other hand to carry out a cluster analysis, we propose a distance measure for computing the distance between the PLR lines. Case study is presented based on the real data of online education system LMS. This demonstrates that PLR is a powerful knowledge representative for very large database.
文摘彩色点画是一种从视觉上由大量小像素点构建图像的艺术技术,像素个数的多少直接影响着构图的成本。其优化选点构图方法为实现低成本打印提供了一个重要的方式。目前,点画生成存在着多通道采样点难以均匀分布,颜色层次难以兼顾等难点,并耗费大量的计算成本。对此,提出了一种基于超像素自适应聚类和线性规划最优选点的彩色点画生成方法,该方法在初步超像素划分图像的基础上,使用基于颜色密度峰值的自适应聚类方法得到最佳聚类个数,并进一步划分子聚类,然后根据每个子聚类的颜色均值作为子聚类内部选点的最佳间隔距离,在选点的同时依据SSIM指标,建立目标优化模型,通过数学优化器Gurobi实现模型选点,使点保留最少个数的目标基础上,同时保持聚类内部分布均匀和颜色渐变层次,以提高所生成的点画图像的可视化效果。实验结果表明,本文算法极大地降低了像素个数并在生成的点画的平均结构相似性(mean structural similarity index measure,SSIM)、峰值信噪比(peak signal to noise ratio,PSNR)等评价指标方面均优于当前方法。
基金Supported by the National Natural Science Foun-dation of China (20506003) the National Basic Research ProgramofChina (973 Program2002CB312200) the ShangHai Science andTechnology of Phosphor of China (04QMX1433)
文摘A novel adaptive non-linear mapping (ANLM), integrating an adaptive mapping error (AME) with a chaosgenetic algorithm (CGA) including chaotic variable, was proposed to overcome the deficiencies of non-linear mapping (NLM). The value of AME weight factor is determined according to the relative deviation square of distance between the two mapping points and the corresponding original objects distance. The larger the relative deviation square between two distances is, the larger the value of the corresponding weight factor is. Due to chaotic mapping operator, the evolutional process of CGA makes the individuals of subgenerations distributed ergodieally in the defined space and circumvents the premature of the individuals of subgenerations. The comparison results demonstrated that the whole performance of CGA is better than that of traditional genetic algorithm. Furthermore, a typical example of mapping eight-dimenslonal olive oil samples onto two-dimensional plane was employed to verify the effectiveness of ANLM. The results showed that the topology-preserving map obtained by ANLM can well represent the classification of original objects and is much better than that obtained by NLM.