The metallic softness parameterαr 0 determines the structure of the cluster and governs the rule of magic numbers. Using molecular dynamic method, the stable structures and magic numbers are determined for the cluste...The metallic softness parameterαr 0 determines the structure of the cluster and governs the rule of magic numbers. Using molecular dynamic method, the stable structures and magic numbers are determined for the clusters consisting of 13 up to 147 atoms in medium range Morse potentials, which is suitable for most of metals. As the number of atoms constituting the cluster increases, the stable structures undergo transition from face-centered (FC) to edge-centered (EC) structures. The magic number take ones of FC series before transition and take ones of EC series after that. The transition point from FC to EC structures depends on the value of softness parameter.展开更多
One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this pap...One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this paper we propose a possible and non-automatic solution considering different criteria of clustering and comparing their results. In this way robust structures of an analyzed dataset can be often caught (or established) and an optimal cluster configuration, which presents a meaningful association, may be defined. In particular, we also focus on the variables which may be used in cluster analysis. In fact, variables which contain little clustering information can cause misleading and not-robustness results. Therefore, three algorithms are employed in this study: K-means partitioning methods, Partitioning Around Medoids (PAM) and the Heuristic Identification of Noisy Variables (HINoV). The results are compared with robust methods ones.展开更多
The stable structures and energies of Ni clusters were investigated using particle swarm optimization(PSO)combined with simulated annealing(SA).Sutton-Chen many-body potential was used in describing the interatomic in...The stable structures and energies of Ni clusters were investigated using particle swarm optimization(PSO)combined with simulated annealing(SA).Sutton-Chen many-body potential was used in describing the interatomic interactions.The simulation results indicate that the structures of Ni clusters are icosahedral-like and binding energy per atom tends to approach that of bulk materials when the atoms number increases.The stability of Ni clusters depends not only on size but also on symmetrical characterization.The structure stability of Nin clusters increases with the increase of total atom number n.It is also found that there exists direct correlation between stability and geometrical structures of the clusters,and relatively higher symmetry clusters are more stable.From the results of the second difference in the binding energy,the clusters at n=3 is more stable than others,and the magic numbers effect is also found.展开更多
Clustering is a group of unsupervised statistical techniques commonly used in many disciplines. Considering their applications to fish abundance data, many technical details need to be considered to ensure reasonable ...Clustering is a group of unsupervised statistical techniques commonly used in many disciplines. Considering their applications to fish abundance data, many technical details need to be considered to ensure reasonable interpretation. However, the reliability and stability of the clustering methods have rarely been studied in the contexts of fisheries. This study presents an intensive evaluation of three common clustering methods, including hierarchical clustering(HC), K-means(KM), and expectation-maximization(EM) methods, based on fish community surveys in the coastal waters of Shandong, China. We evaluated the performances of these three methods considering different numbers of clusters, data size, and data transformation approaches, focusing on the consistency validation using the index of average proportion of non-overlap(APN). The results indicate that the three methods tend to be inconsistent in the optimal number of clusters. EM showed relatively better performances to avoid unbalanced classification, whereas HC and KM provided more stable clustering results. Data transformation including scaling, square-root, and log-transformation had substantial influences on the clustering results, especially for KM. Moreover, transformation also influenced clustering stability, wherein scaling tended to provide a stable solution at the same number of clusters. The APN values indicated improved stability with increasing data size, and the effect leveled off over 70 samples in general and most quickly in EM. We conclude that the best clustering method can be chosen depending on the aim of the study and the number of clusters. In general, KM is relatively robust in our tests. We also provide recommendations for future application of clustering analyses. This study is helpful to ensure the credibility of the application and interpretation of clustering methods.展开更多
One of the limitations of using OFDM technique is its higher PAPR in the time domain signal. The higher PAPR OFDM signal would cause the fatal degradation of BER performance and undesirable spectrum regrowth in the no...One of the limitations of using OFDM technique is its higher PAPR in the time domain signal. The higher PAPR OFDM signal would cause the fatal degradation of BER performance and undesirable spectrum regrowth in the nonlinear channel. One of the promising PAPR reduction methods for OFDM signal is the Partial Transmit Sequence (PTS) method which can achieve better PAPR performance with reasonable computation complexity. However the PTS method is required to inform the phase coefficients of PTS as the side information to the receiver for the correct demodulation of data information through the data or separate channels. To simplify the transceiver of OFDM system with the PTS method, the phase coefficients of PTS are usually embedded in the data information. However since the phase coefficients of PTS are obtained after the PTS processing only for the data information at each OFDM symbol, it is hard to embed the phase coefficients of PTS in the data information separately without degradation of PAPR performance. To solve this problem, this paper proposes a new PAPR reduction method based on the packet-switched transmission systems in which all the clusters within the certain number of OFDM symbols have the sequential cluster ID numbers embedded in the header of each cluster. The salient features of the proposed method are to reduce the PAPR performance by re-ordering of clusters (ROC) in the frequency domain at the transmitter and to reconstruct the original ordering of clusters by using the cluster ID number demodulated from each cluster at the receiver. This paper also proposes a reduction technique of computation complexity for the proposed ROC method by using the feature of IFFT processing. This paper presents various computer simulation results to verify the effectiveness of the proposed ROC method with the reduction technique of computation complexity.展开更多
We propose a novel scheme based on clustering analysis in color space to solve text segmentation in complex color images. Text segmentation includes automatic clustering of color space and foreground image generation....We propose a novel scheme based on clustering analysis in color space to solve text segmentation in complex color images. Text segmentation includes automatic clustering of color space and foreground image generation. Two methods are also proposed for automatic clustering: The first one is to determine the optimal number of clusters and the second one is the fuzzy competitively clustering method based on competitively learning techniques. Essential foreground images obtained from any of the color clusters are combined into foreground images. Further performance analysis reveals the advantages of the proposed methods.展开更多
Particle size distribution of 12-500 nm was measured at Mt. Waliguan, China Global Atmosphere Watch Baseline Observatory, from Aug. in 2005 to May in 2007.72-hr back-trajectories at 100-m arrival height above ground l...Particle size distribution of 12-500 nm was measured at Mt. Waliguan, China Global Atmosphere Watch Baseline Observatory, from Aug. in 2005 to May in 2007.72-hr back-trajectories at 100-m arrival height above ground level for the same period were calculated at 6:00, 12:00, and 21:00 (Beijing Time) for each day using the Hybrid Single-Particle Lagrangian Integrated Trajectory (HYSPLIT-4) model developed by NOAA/ARL. It was found that air mass sources significantly impact particle number concentration and size distribution at Mt. Waliguan. Cluster analysis of back-trajectories show that higher Aitken mode particle number concentration was observed when air masses came from or passed by the northeastern section of Mt. Waliguan, with short trajectory length. High number concentration of nucleation mode was associated with air masses from clean regions, with long trajectory length.展开更多
An appropriate optimal number of market segments(ONS)estimation is essential for an enterprise to achieve successful market segmentation,but at present,there is a serious lack of attention to this issue in market segm...An appropriate optimal number of market segments(ONS)estimation is essential for an enterprise to achieve successful market segmentation,but at present,there is a serious lack of attention to this issue in market segmentation.In our study,an independent adaptive ONS estimation method BWCON-NSDK-means++is proposed by integrating a newinternal validity index(IVI)Between-Within-Connectivity(BWCON)and a newstable clustering algorithmNatural-SDK-means++(NSDK-means++)in a novel way.First,to complete the evaluation dimensions of the existing IVIs,we designed a connectivity formula based on the neighbor relationship and proposed the BWCON by integrating the connectivity with other two commonly considered measures of compactness and separation.Then,considering the stability,number of parameters and clustering performance,we proposed the NSDK-means++to participate in the integrationwhere the natural neighbor was used to optimize the initial cluster centers(ICCs)determination strategy in the SDK-means++.At last,to ensure the objectivity of the estimatedONS,we designed a BWCON-based ONS estimation framework that does not require the user to set any parameters in advance and integrated the NSDK-means++into this framework forming a practical ONS estimation tool BWCON-NSDK-means++.The final experimental results showthat the proposed BWCONand NSDK-means++are significantlymore suitable than their respective existing models to participate in the integration for determining theONS,and the proposed BWCON-NSDK-means++is demonstrably superior to the BWCON-KMA,BWCONMBK,BWCON-KM++,BWCON-RKM++,BWCON-SDKM++,BWCON-Single linkage,BWCON-Complete linkage,BWCON-Average linkage and BWCON-Ward linkage in terms of the ONS estimation.Moreover,as an independentmarket segmentation tool,the BWCON-NSDK-means++also outperforms the existing models with respect to the inter-market differentiation and sub-market size.展开更多
This paper presents a new Section Set Adaptive FCM algorithm.The algorithm solved the shortcomings of local optimality,unsure classification and clustering numbers ascertained previously.And it improved on the archite...This paper presents a new Section Set Adaptive FCM algorithm.The algorithm solved the shortcomings of local optimality,unsure classification and clustering numbers ascertained previously.And it improved on the architecture of FCM al- gorithm,enhanced the analysis for effective clustering.During the clustering processing,it may adjust clustering numbers dy- namically.Finally,it used the method of section set decreasing the time of classification.By experiments,the algorithm can im- prove dependability of clustering and correctness of classification.展开更多
Traditional Fuzzy C-Means(FCM)and Possibilistic C-Means(PCM)clustering algorithms are data-driven,and their objective function minimization process is based on the available numeric data.Recently,knowledge hints have ...Traditional Fuzzy C-Means(FCM)and Possibilistic C-Means(PCM)clustering algorithms are data-driven,and their objective function minimization process is based on the available numeric data.Recently,knowledge hints have been introduced to formknowledge-driven clustering algorithms,which reveal a data structure that considers not only the relationships between data but also the compatibility with knowledge hints.However,these algorithms cannot produce the optimal number of clusters by the clustering algorithm itself;they require the assistance of evaluation indices.Moreover,knowledge hints are usually used as part of the data structure(directly replacing some clustering centers),which severely limits the flexibility of the algorithm and can lead to knowledgemisguidance.To solve this problem,this study designs a newknowledge-driven clustering algorithmcalled the PCM clusteringwith High-density Points(HP-PCM),in which domain knowledge is represented in the form of so-called high-density points.First,a newdatadensitycalculation function is proposed.The Density Knowledge Points Extraction(DKPE)method is established to filter out high-density points from the dataset to form knowledge hints.Then,these hints are incorporated into the PCM objective function so that the clustering algorithm is guided by high-density points to discover the natural data structure.Finally,the initial number of clusters is set to be greater than the true one based on the number of knowledge hints.Then,the HP-PCM algorithm automatically determines the final number of clusters during the clustering process by considering the cluster elimination mechanism.Through experimental studies,including some comparative analyses,the results highlight the effectiveness of the proposed algorithm,such as the increased success rate in clustering,the ability to determine the optimal cluster number,and the faster convergence speed.展开更多
In systems in atomic and nano scales such as clusters or agglomerates constituted of particles from a few to less than one hundred of atoms, quantum confinement effects are very important. Their optical and electronic...In systems in atomic and nano scales such as clusters or agglomerates constituted of particles from a few to less than one hundred of atoms, quantum confinement effects are very important. Their optical and electronic properties are often dependent on the size of the systems and the way in which the atoms in these clusters are bonded. Generally, these nano-structures display optical and electronic properties significantly different of those found in corresponding bulk materials. Silicon agglomerates found in Silicon Rich Oxide (SRO) films have optical properties, which have reported as depended directly on nano-crystal size. Furthermore, the room temperature photoluminescence (PL) of Silicon Rich Oxides (SRO) has repeatedly generated a huge interest due to their possible applications in optoelectronic devices. However, a plausible emission mechanism has not yet widespread acceptance of the scientific community. In this research, we employed the Density Functional Theory with a functional B3LYP and a basis set 6 - 31G* to calculate the optical and electronic properties of small (six to ten silicon atoms) and medium size clusters of silicon (constituted of eleven to fourteen silicon atoms). With the theoretical calculation of the structural and optical properties of silicon clusters, it is possible to evaluate the contribution of silicon agglomerates in the luminescent emission mechanism experimentally found in thin SRO films.展开更多
In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level....In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level. Solving the problem of employment for the people is an important prerequisite for their peaceful living and work, as well as a prerequisite and foundation for building a harmonious society. The employment situation of private enterprises has always been of great concern to the outside world, and these two major jobs have always occupied an important position in the employment field of China that cannot be ignored. With the establishment of the market economy system, individual and private enterprises have become important components of the socialist economy, making significant contributions to economic development and social progress. The rapid development of China’s economy, on the one hand, is the embodiment of the superiority of China’s socialist market economic system, and on the other hand, it is the role of the tertiary industry and private enterprises in promoting the national economy. Since the 1990s, China’s private enterprises have become a new economic growth point for local and even national countries, and are one of the important ways to arrange employment and achieve social stability. This paper studies the employment of private enterprises and individuals from the perspective of statistics, extracts relevant data from China statistical Yearbook, uses the relevant knowledge of statistics to process the data, obtains the conclusion and puts forward relevant constructive suggestions.展开更多
Using the fuzzy cluster analysis and the temperature-salinity(T-S) similarity number analysis of cruise conductivity-temperature-depth(CTD) data in the upper layer(0–300 m) of the northern South China Sea(NSCS), we c...Using the fuzzy cluster analysis and the temperature-salinity(T-S) similarity number analysis of cruise conductivity-temperature-depth(CTD) data in the upper layer(0–300 m) of the northern South China Sea(NSCS), we classify the upper layer water of the NSCS into six water masses: diluted water(D), surface water(SS),the SCS subsurface water mass(U_S), the Pacific Ocean subsurface water mass(U_P), surface-subsurface mixed water(SU) and subsurface-intermediate mixed water(UI). A new stacked stereogram is used to illustrate the water mass distribution, and to examine the source and the distribution of U_P, combining with the sea surface height data and geostrophic current field. The results show that water mass U_P exists in all four seasons with the maximum range in spring and the minimum range in summer. In spring and winter, the U_P intrudes into the Luzon Strait and the southwest of Taiwan Island via the northern Luzon Strait in the form of nonlinear Rossby eddies, and forms a high temperature and high salinity zone east of the Dongsha Islands. In summer, the U_P is sporadically distributed in the study area. In autumn, the U_P is located in the upper 200 m layer east of Hainan Island.展开更多
The upper bound of the optimal number of clusters in clustering algorithm is studied in this paper. A new method is proposed to solve this issue. This method shows that the rule cmax≤N^(1/N), which is popular in curr...The upper bound of the optimal number of clusters in clustering algorithm is studied in this paper. A new method is proposed to solve this issue. This method shows that the rule cmax≤N^(1/N), which is popular in current papers, is reasonable in some sense. The above conclusion is tested and analyzed by some typical examples in the literature, which demonstrates the validity of the new method.展开更多
To the problem that it is hard to determine the clustering number and the abnormal points by using the clustering validity function, an effective clustering partition model based on the genetic algorithm is built in t...To the problem that it is hard to determine the clustering number and the abnormal points by using the clustering validity function, an effective clustering partition model based on the genetic algorithm is built in this paper. The solution to the problem is formed by the combination of the clustering partition and the encoding samples, and the fitness function is defined by the distances among and within clusters. The clustering number and the samples in each cluster are determined and the abnormal points are distinguished by implementing the triple random crossover operator and the mutation. Based on the known sample data, the results of the novel method and the clustering validity function are compared. Numerical experiments are given and the results show that the novel method is more effective.展开更多
Investigation of genetic diversity of geographically distant wheat genotypes is </span><span style="font-family:Verdana;">a </span><span style="font-family:Verdana;">useful ...Investigation of genetic diversity of geographically distant wheat genotypes is </span><span style="font-family:Verdana;">a </span><span style="font-family:Verdana;">useful approach in wheat breeding providing efficient crop varieties. This article presents multivariate cluster and principal component analyses (PCA) of some yield traits of wheat, such as thousand-kernel weight (TKW), grain number, grain yield and plant height. Based on the results, an evaluation of economically valuable attributes by eigenvalues made it possible to determine the components that significantly contribute to the yield of common wheat genotypes. Twenty-five genotypes were grouped into four clusters on the basis of average linkage. The PCA showed four principal components (PC) with eigenvalues ></span><span style="font-family:""> </span><span style="font-family:Verdana;">1, explaining approximately 90.8% of the total variability. According to PC analysis, the variance in the eigenvalues was </span><span style="font-family:Verdana;">the </span><span style="font-family:Verdana;">greatest (4.33) for PC-1, PC-2 (1.86) and PC-3 (1.01). The cluster analysis revealed the classification of 25 accessions into four diverse groups. Averages, standard deviations and variances for clusters based on morpho-physiological traits showed that the maximum average values for grain yield (742.2), biomass (1756.7), grains square meter (18</span><span style="font-family:Verdana;">,</span><span style="font-family:Verdana;">373.7), and grains per spike (45.3) were higher in cluster C compared to other clusters. Cluster D exhibited the maximum thousand-kernel weight (TKW) (46.6).展开更多
Lucerne( Medicago sativa L.) is a plant of strict allogamy,and its pollination relies on bees mainly. Genetic variations of lucerne floral properties,including receptacle diameter,coronary length,number of flowers per...Lucerne( Medicago sativa L.) is a plant of strict allogamy,and its pollination relies on bees mainly. Genetic variations of lucerne floral properties,including receptacle diameter,coronary length,number of flowers per raceme,number of racemes per twig,number of flowers per square metre,percentage of tripped flowers,nectar production,sugar concentration in nectar and contents of sucrose,fructose and glucose in nectar,have been studied with morphological markers,and floral properties of ten lucerne cultivars were also investigated to determine their role in number of visiting bees and to provide a basis for the evaluation of mutant flowers for visitation by bees. The results showed CV( coefficient of variation) of floral properties was from 0. 80% to 92. 30%,of which the content of glucose was the most significant one with variation from 0. 01 to 0. 53 μmol /L( P 【 0. 05),and the sugar concentration was the most insignificant one( P 】 0. 05). The significant order of floral properties affecting the number of visiting bees was that the nectar production per square metre( r = 0. 93,P 【 0. 01) was in the first place,followed by the number of flowers per square metre( r = 0. 92,P 【 0. 01),sucrose concentration of nectar sugar( r = 0. 82,P 【 0. 05),coronary length( r = 0. 77,P 【 0. 05) and nectar production per flower( r = 0. 71,P 【 0. 05).展开更多
Fuzzy C-means(FCM)is a clustering method that falls under unsupervised machine learning.The main issues plaguing this clustering algorithm are the number of the unknown clusters within a particular dataset and initial...Fuzzy C-means(FCM)is a clustering method that falls under unsupervised machine learning.The main issues plaguing this clustering algorithm are the number of the unknown clusters within a particular dataset and initialization sensitivity of cluster centres.Artificial Bee Colony(ABC)is a type of swarm algorithm that strives to improve the members’solution quality as an iterative process with the utilization of particular kinds of randomness.However,ABC has some weaknesses,such as balancing exploration and exploitation.To improve the exploration process within the ABC algorithm,the mean artificial bee colony(MeanABC)by its modified search equation that depends on solutions of mean previous and global best is used.Furthermore,to solve the main issues of FCM,Automatic clustering algorithm was proposed based on the mean artificial bee colony called(AC-MeanABC).It uses the MeanABC capability of balancing between exploration and exploitation and its capacity to explore the positive and negative directions in search space to find the best value of clusters number and centroids value.A few benchmark datasets and a set of natural images were used to evaluate the effectiveness of AC-MeanABC.The experimental findings are encouraging and indicate considerable improvements compared to other state-of-the-art approaches in the same domain.展开更多
Clustering data with varying densities and complicated structures is important,while many existing clustering algorithms face difficulties for this problem. The reason is that varying densities and complicated structu...Clustering data with varying densities and complicated structures is important,while many existing clustering algorithms face difficulties for this problem. The reason is that varying densities and complicated structure make single algorithms perform badly for different parts of data. More intensive parts are assumed to have more information probably,an algorithm clustering from high density part is proposed,which begins from a tiny distance to find the highest density-connected partition and form corresponding super cores,then distance is iteratively increased by a global heuristic method to cluster parts with different densities. Mean of silhouette coefficient indicates the cluster performance. Denoising function is implemented to eliminate influence of noise and outliers. Many challenging experiments indicate that the algorithm has good performance on data with widely varying densities and extremely complex structures. It decides the optimal number of clusters automatically.Background knowledge is not needed and parameters tuning is easy. It is robust against noise and outliers.展开更多
基金Supported by the National Natural Science Foundation of China(196 740 42 198340 70 ) Science and Technology Program of Natio
文摘The metallic softness parameterαr 0 determines the structure of the cluster and governs the rule of magic numbers. Using molecular dynamic method, the stable structures and magic numbers are determined for the clusters consisting of 13 up to 147 atoms in medium range Morse potentials, which is suitable for most of metals. As the number of atoms constituting the cluster increases, the stable structures undergo transition from face-centered (FC) to edge-centered (EC) structures. The magic number take ones of FC series before transition and take ones of EC series after that. The transition point from FC to EC structures depends on the value of softness parameter.
文摘One of the most important problems of clustering is to define the number of classes. In fact, it is not easy to find an appropriate method to measure whether the cluster configuration is acceptable or not. In this paper we propose a possible and non-automatic solution considering different criteria of clustering and comparing their results. In this way robust structures of an analyzed dataset can be often caught (or established) and an optimal cluster configuration, which presents a meaningful association, may be defined. In particular, we also focus on the variables which may be used in cluster analysis. In fact, variables which contain little clustering information can cause misleading and not-robustness results. Therefore, three algorithms are employed in this study: K-means partitioning methods, Partitioning Around Medoids (PAM) and the Heuristic Identification of Noisy Variables (HINoV). The results are compared with robust methods ones.
基金Project(60371046)supported by the National Natural Science Foundation of China
文摘The stable structures and energies of Ni clusters were investigated using particle swarm optimization(PSO)combined with simulated annealing(SA).Sutton-Chen many-body potential was used in describing the interatomic interactions.The simulation results indicate that the structures of Ni clusters are icosahedral-like and binding energy per atom tends to approach that of bulk materials when the atoms number increases.The stability of Ni clusters depends not only on size but also on symmetrical characterization.The structure stability of Nin clusters increases with the increase of total atom number n.It is also found that there exists direct correlation between stability and geometrical structures of the clusters,and relatively higher symmetry clusters are more stable.From the results of the second difference in the binding energy,the clusters at n=3 is more stable than others,and the magic numbers effect is also found.
基金provided by the Marine S&T Fund of Shandong Province for Pilot National Laboratory for Marine Science and Technology (Qingdao) (No.2018SDKJ0501-2)。
文摘Clustering is a group of unsupervised statistical techniques commonly used in many disciplines. Considering their applications to fish abundance data, many technical details need to be considered to ensure reasonable interpretation. However, the reliability and stability of the clustering methods have rarely been studied in the contexts of fisheries. This study presents an intensive evaluation of three common clustering methods, including hierarchical clustering(HC), K-means(KM), and expectation-maximization(EM) methods, based on fish community surveys in the coastal waters of Shandong, China. We evaluated the performances of these three methods considering different numbers of clusters, data size, and data transformation approaches, focusing on the consistency validation using the index of average proportion of non-overlap(APN). The results indicate that the three methods tend to be inconsistent in the optimal number of clusters. EM showed relatively better performances to avoid unbalanced classification, whereas HC and KM provided more stable clustering results. Data transformation including scaling, square-root, and log-transformation had substantial influences on the clustering results, especially for KM. Moreover, transformation also influenced clustering stability, wherein scaling tended to provide a stable solution at the same number of clusters. The APN values indicated improved stability with increasing data size, and the effect leveled off over 70 samples in general and most quickly in EM. We conclude that the best clustering method can be chosen depending on the aim of the study and the number of clusters. In general, KM is relatively robust in our tests. We also provide recommendations for future application of clustering analyses. This study is helpful to ensure the credibility of the application and interpretation of clustering methods.
文摘One of the limitations of using OFDM technique is its higher PAPR in the time domain signal. The higher PAPR OFDM signal would cause the fatal degradation of BER performance and undesirable spectrum regrowth in the nonlinear channel. One of the promising PAPR reduction methods for OFDM signal is the Partial Transmit Sequence (PTS) method which can achieve better PAPR performance with reasonable computation complexity. However the PTS method is required to inform the phase coefficients of PTS as the side information to the receiver for the correct demodulation of data information through the data or separate channels. To simplify the transceiver of OFDM system with the PTS method, the phase coefficients of PTS are usually embedded in the data information. However since the phase coefficients of PTS are obtained after the PTS processing only for the data information at each OFDM symbol, it is hard to embed the phase coefficients of PTS in the data information separately without degradation of PAPR performance. To solve this problem, this paper proposes a new PAPR reduction method based on the packet-switched transmission systems in which all the clusters within the certain number of OFDM symbols have the sequential cluster ID numbers embedded in the header of each cluster. The salient features of the proposed method are to reduce the PAPR performance by re-ordering of clusters (ROC) in the frequency domain at the transmitter and to reconstruct the original ordering of clusters by using the cluster ID number demodulated from each cluster at the receiver. This paper also proposes a reduction technique of computation complexity for the proposed ROC method by using the feature of IFFT processing. This paper presents various computer simulation results to verify the effectiveness of the proposed ROC method with the reduction technique of computation complexity.
文摘We propose a novel scheme based on clustering analysis in color space to solve text segmentation in complex color images. Text segmentation includes automatic clustering of color space and foreground image generation. Two methods are also proposed for automatic clustering: The first one is to determine the optimal number of clusters and the second one is the fuzzy competitively clustering method based on competitively learning techniques. Essential foreground images obtained from any of the color clusters are combined into foreground images. Further performance analysis reveals the advantages of the proposed methods.
基金sponsored by National Key Development Program for Fundamental Research (973 Program) Project(Nos.2006CB403703 and 2006CB403701)
文摘Particle size distribution of 12-500 nm was measured at Mt. Waliguan, China Global Atmosphere Watch Baseline Observatory, from Aug. in 2005 to May in 2007.72-hr back-trajectories at 100-m arrival height above ground level for the same period were calculated at 6:00, 12:00, and 21:00 (Beijing Time) for each day using the Hybrid Single-Particle Lagrangian Integrated Trajectory (HYSPLIT-4) model developed by NOAA/ARL. It was found that air mass sources significantly impact particle number concentration and size distribution at Mt. Waliguan. Cluster analysis of back-trajectories show that higher Aitken mode particle number concentration was observed when air masses came from or passed by the northeastern section of Mt. Waliguan, with short trajectory length. High number concentration of nucleation mode was associated with air masses from clean regions, with long trajectory length.
基金supported by the earmarked fund for CARS-29 and the open funds of the Key Laboratory of Viticulture and Enology,Ministry of Agriculture,China.
文摘An appropriate optimal number of market segments(ONS)estimation is essential for an enterprise to achieve successful market segmentation,but at present,there is a serious lack of attention to this issue in market segmentation.In our study,an independent adaptive ONS estimation method BWCON-NSDK-means++is proposed by integrating a newinternal validity index(IVI)Between-Within-Connectivity(BWCON)and a newstable clustering algorithmNatural-SDK-means++(NSDK-means++)in a novel way.First,to complete the evaluation dimensions of the existing IVIs,we designed a connectivity formula based on the neighbor relationship and proposed the BWCON by integrating the connectivity with other two commonly considered measures of compactness and separation.Then,considering the stability,number of parameters and clustering performance,we proposed the NSDK-means++to participate in the integrationwhere the natural neighbor was used to optimize the initial cluster centers(ICCs)determination strategy in the SDK-means++.At last,to ensure the objectivity of the estimatedONS,we designed a BWCON-based ONS estimation framework that does not require the user to set any parameters in advance and integrated the NSDK-means++into this framework forming a practical ONS estimation tool BWCON-NSDK-means++.The final experimental results showthat the proposed BWCONand NSDK-means++are significantlymore suitable than their respective existing models to participate in the integration for determining theONS,and the proposed BWCON-NSDK-means++is demonstrably superior to the BWCON-KMA,BWCONMBK,BWCON-KM++,BWCON-RKM++,BWCON-SDKM++,BWCON-Single linkage,BWCON-Complete linkage,BWCON-Average linkage and BWCON-Ward linkage in terms of the ONS estimation.Moreover,as an independentmarket segmentation tool,the BWCON-NSDK-means++also outperforms the existing models with respect to the inter-market differentiation and sub-market size.
基金Science and Researching Foundation of Jiamusi University(L2006-12)
文摘This paper presents a new Section Set Adaptive FCM algorithm.The algorithm solved the shortcomings of local optimality,unsure classification and clustering numbers ascertained previously.And it improved on the architecture of FCM al- gorithm,enhanced the analysis for effective clustering.During the clustering processing,it may adjust clustering numbers dy- namically.Finally,it used the method of section set decreasing the time of classification.By experiments,the algorithm can im- prove dependability of clustering and correctness of classification.
基金supported by the National Key Research and Development Program of China(No.2022YFB3304400)the National Natural Science Foundation of China(Nos.6230311,62303111,62076060,61932007,and 62176083)the Key Research and Development Program of Jiangsu Province of China(No.BE2022157).
文摘Traditional Fuzzy C-Means(FCM)and Possibilistic C-Means(PCM)clustering algorithms are data-driven,and their objective function minimization process is based on the available numeric data.Recently,knowledge hints have been introduced to formknowledge-driven clustering algorithms,which reveal a data structure that considers not only the relationships between data but also the compatibility with knowledge hints.However,these algorithms cannot produce the optimal number of clusters by the clustering algorithm itself;they require the assistance of evaluation indices.Moreover,knowledge hints are usually used as part of the data structure(directly replacing some clustering centers),which severely limits the flexibility of the algorithm and can lead to knowledgemisguidance.To solve this problem,this study designs a newknowledge-driven clustering algorithmcalled the PCM clusteringwith High-density Points(HP-PCM),in which domain knowledge is represented in the form of so-called high-density points.First,a newdatadensitycalculation function is proposed.The Density Knowledge Points Extraction(DKPE)method is established to filter out high-density points from the dataset to form knowledge hints.Then,these hints are incorporated into the PCM objective function so that the clustering algorithm is guided by high-density points to discover the natural data structure.Finally,the initial number of clusters is set to be greater than the true one based on the number of knowledge hints.Then,the HP-PCM algorithm automatically determines the final number of clusters during the clustering process by considering the cluster elimination mechanism.Through experimental studies,including some comparative analyses,the results highlight the effectiveness of the proposed algorithm,such as the increased success rate in clustering,the ability to determine the optimal cluster number,and the faster convergence speed.
文摘In systems in atomic and nano scales such as clusters or agglomerates constituted of particles from a few to less than one hundred of atoms, quantum confinement effects are very important. Their optical and electronic properties are often dependent on the size of the systems and the way in which the atoms in these clusters are bonded. Generally, these nano-structures display optical and electronic properties significantly different of those found in corresponding bulk materials. Silicon agglomerates found in Silicon Rich Oxide (SRO) films have optical properties, which have reported as depended directly on nano-crystal size. Furthermore, the room temperature photoluminescence (PL) of Silicon Rich Oxides (SRO) has repeatedly generated a huge interest due to their possible applications in optoelectronic devices. However, a plausible emission mechanism has not yet widespread acceptance of the scientific community. In this research, we employed the Density Functional Theory with a functional B3LYP and a basis set 6 - 31G* to calculate the optical and electronic properties of small (six to ten silicon atoms) and medium size clusters of silicon (constituted of eleven to fourteen silicon atoms). With the theoretical calculation of the structural and optical properties of silicon clusters, it is possible to evaluate the contribution of silicon agglomerates in the luminescent emission mechanism experimentally found in thin SRO films.
文摘In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level. Solving the problem of employment for the people is an important prerequisite for their peaceful living and work, as well as a prerequisite and foundation for building a harmonious society. The employment situation of private enterprises has always been of great concern to the outside world, and these two major jobs have always occupied an important position in the employment field of China that cannot be ignored. With the establishment of the market economy system, individual and private enterprises have become important components of the socialist economy, making significant contributions to economic development and social progress. The rapid development of China’s economy, on the one hand, is the embodiment of the superiority of China’s socialist market economic system, and on the other hand, it is the role of the tertiary industry and private enterprises in promoting the national economy. Since the 1990s, China’s private enterprises have become a new economic growth point for local and even national countries, and are one of the important ways to arrange employment and achieve social stability. This paper studies the employment of private enterprises and individuals from the perspective of statistics, extracts relevant data from China statistical Yearbook, uses the relevant knowledge of statistics to process the data, obtains the conclusion and puts forward relevant constructive suggestions.
基金The National Natural Science Foundation of China under contract No.41776027the National Basic Research Program of China under contract Nos 2015CB954004 and 2009CB421208the Open Fund of the Key Laboratory of Ocean Circulation and Waves,Chinese Academy of Sciences under contract No.KLOCW1808
文摘Using the fuzzy cluster analysis and the temperature-salinity(T-S) similarity number analysis of cruise conductivity-temperature-depth(CTD) data in the upper layer(0–300 m) of the northern South China Sea(NSCS), we classify the upper layer water of the NSCS into six water masses: diluted water(D), surface water(SS),the SCS subsurface water mass(U_S), the Pacific Ocean subsurface water mass(U_P), surface-subsurface mixed water(SU) and subsurface-intermediate mixed water(UI). A new stacked stereogram is used to illustrate the water mass distribution, and to examine the source and the distribution of U_P, combining with the sea surface height data and geostrophic current field. The results show that water mass U_P exists in all four seasons with the maximum range in spring and the minimum range in summer. In spring and winter, the U_P intrudes into the Luzon Strait and the southwest of Taiwan Island via the northern Luzon Strait in the form of nonlinear Rossby eddies, and forms a high temperature and high salinity zone east of the Dongsha Islands. In summer, the U_P is sporadically distributed in the study area. In autumn, the U_P is located in the upper 200 m layer east of Hainan Island.
基金This work was supported by the National Natural Science Foundation of China (Grant Nos. 69872003 and 40035010)
文摘The upper bound of the optimal number of clusters in clustering algorithm is studied in this paper. A new method is proposed to solve this issue. This method shows that the rule cmax≤N^(1/N), which is popular in current papers, is reasonable in some sense. The above conclusion is tested and analyzed by some typical examples in the literature, which demonstrates the validity of the new method.
文摘To the problem that it is hard to determine the clustering number and the abnormal points by using the clustering validity function, an effective clustering partition model based on the genetic algorithm is built in this paper. The solution to the problem is formed by the combination of the clustering partition and the encoding samples, and the fitness function is defined by the distances among and within clusters. The clustering number and the samples in each cluster are determined and the abnormal points are distinguished by implementing the triple random crossover operator and the mutation. Based on the known sample data, the results of the novel method and the clustering validity function are compared. Numerical experiments are given and the results show that the novel method is more effective.
文摘Investigation of genetic diversity of geographically distant wheat genotypes is </span><span style="font-family:Verdana;">a </span><span style="font-family:Verdana;">useful approach in wheat breeding providing efficient crop varieties. This article presents multivariate cluster and principal component analyses (PCA) of some yield traits of wheat, such as thousand-kernel weight (TKW), grain number, grain yield and plant height. Based on the results, an evaluation of economically valuable attributes by eigenvalues made it possible to determine the components that significantly contribute to the yield of common wheat genotypes. Twenty-five genotypes were grouped into four clusters on the basis of average linkage. The PCA showed four principal components (PC) with eigenvalues ></span><span style="font-family:""> </span><span style="font-family:Verdana;">1, explaining approximately 90.8% of the total variability. According to PC analysis, the variance in the eigenvalues was </span><span style="font-family:Verdana;">the </span><span style="font-family:Verdana;">greatest (4.33) for PC-1, PC-2 (1.86) and PC-3 (1.01). The cluster analysis revealed the classification of 25 accessions into four diverse groups. Averages, standard deviations and variances for clusters based on morpho-physiological traits showed that the maximum average values for grain yield (742.2), biomass (1756.7), grains square meter (18</span><span style="font-family:Verdana;">,</span><span style="font-family:Verdana;">373.7), and grains per spike (45.3) were higher in cluster C compared to other clusters. Cluster D exhibited the maximum thousand-kernel weight (TKW) (46.6).
基金supported by National Natural Science Foundation of China(Grant No.31060324)Natural Science Foundation of Yunnan Province(Grant No.2012FB148)Natural Science Key Foundation of Yunnan Provincial Education Department(Grant No.2012Z021)
文摘Lucerne( Medicago sativa L.) is a plant of strict allogamy,and its pollination relies on bees mainly. Genetic variations of lucerne floral properties,including receptacle diameter,coronary length,number of flowers per raceme,number of racemes per twig,number of flowers per square metre,percentage of tripped flowers,nectar production,sugar concentration in nectar and contents of sucrose,fructose and glucose in nectar,have been studied with morphological markers,and floral properties of ten lucerne cultivars were also investigated to determine their role in number of visiting bees and to provide a basis for the evaluation of mutant flowers for visitation by bees. The results showed CV( coefficient of variation) of floral properties was from 0. 80% to 92. 30%,of which the content of glucose was the most significant one with variation from 0. 01 to 0. 53 μmol /L( P 【 0. 05),and the sugar concentration was the most insignificant one( P 】 0. 05). The significant order of floral properties affecting the number of visiting bees was that the nectar production per square metre( r = 0. 93,P 【 0. 01) was in the first place,followed by the number of flowers per square metre( r = 0. 92,P 【 0. 01),sucrose concentration of nectar sugar( r = 0. 82,P 【 0. 05),coronary length( r = 0. 77,P 【 0. 05) and nectar production per flower( r = 0. 71,P 【 0. 05).
基金supported by the Research Management Center,Xiamen University Malaysia under XMUM Research Program Cycle 4(Grant No:XMUMRF/2019-C4/IECE/0012).
文摘Fuzzy C-means(FCM)is a clustering method that falls under unsupervised machine learning.The main issues plaguing this clustering algorithm are the number of the unknown clusters within a particular dataset and initialization sensitivity of cluster centres.Artificial Bee Colony(ABC)is a type of swarm algorithm that strives to improve the members’solution quality as an iterative process with the utilization of particular kinds of randomness.However,ABC has some weaknesses,such as balancing exploration and exploitation.To improve the exploration process within the ABC algorithm,the mean artificial bee colony(MeanABC)by its modified search equation that depends on solutions of mean previous and global best is used.Furthermore,to solve the main issues of FCM,Automatic clustering algorithm was proposed based on the mean artificial bee colony called(AC-MeanABC).It uses the MeanABC capability of balancing between exploration and exploitation and its capacity to explore the positive and negative directions in search space to find the best value of clusters number and centroids value.A few benchmark datasets and a set of natural images were used to evaluate the effectiveness of AC-MeanABC.The experimental findings are encouraging and indicate considerable improvements compared to other state-of-the-art approaches in the same domain.
基金Supported by the National Key Research and Development Program of China(No.2016YFB0201305)National Science and Technology Major Project(No.2013ZX0102-8001-001-001)National Natural Science Foundation of China(No.91430218,31327901,61472395,61272134,61432018)
文摘Clustering data with varying densities and complicated structures is important,while many existing clustering algorithms face difficulties for this problem. The reason is that varying densities and complicated structure make single algorithms perform badly for different parts of data. More intensive parts are assumed to have more information probably,an algorithm clustering from high density part is proposed,which begins from a tiny distance to find the highest density-connected partition and form corresponding super cores,then distance is iteratively increased by a global heuristic method to cluster parts with different densities. Mean of silhouette coefficient indicates the cluster performance. Denoising function is implemented to eliminate influence of noise and outliers. Many challenging experiments indicate that the algorithm has good performance on data with widely varying densities and extremely complex structures. It decides the optimal number of clusters automatically.Background knowledge is not needed and parameters tuning is easy. It is robust against noise and outliers.