Recommendation system can greatly alleviate the "information overload" in the big data era. Existing recommendation methods, however, typically focus on predicting missing rating values via analyzing user-it...Recommendation system can greatly alleviate the "information overload" in the big data era. Existing recommendation methods, however, typically focus on predicting missing rating values via analyzing user-item dualistic relationship, which neglect an important fact that the latent interests of users can influence their rating behaviors. Moreover, traditional recommendation methods easily suffer from the high dimensional problem and cold-start problem. To address these challenges, in this paper, we propose a PBUED(PLSA-Based Uniform Euclidean Distance) scheme, which utilizes topic model and uniform Euclidean distance to recommend the suitable items for users. The solution first employs probabilistic latent semantic analysis(PLSA) to extract users' interests, users with different interests are divided into different subgroups. Then, the uniform Euclidean distance is adopted to compute the users' similarity in the same interest subset; finally, the missing rating values of data are predicted via aggregating similar neighbors' ratings. We evaluate PBUED on two datasets and experimental results show PBUED can lead to better predicting performance and ranking performance than other approaches.展开更多
Classification is one of the data mining processes used to predict predetermined target classes with data learning accurately.This study discusses data classification using a fuzzy soft set method to predict target cl...Classification is one of the data mining processes used to predict predetermined target classes with data learning accurately.This study discusses data classification using a fuzzy soft set method to predict target classes accurately.This study aims to form a data classification algorithm using the fuzzy soft set method.In this study,the fuzzy soft set was calculated based on the normalized Hamming distance.Each parameter in this method is mapped to a power set from a subset of the fuzzy set using a fuzzy approximation function.In the classification step,a generalized normalized Euclidean distance is used to determine the similarity between two sets of fuzzy soft sets.The experiments used the University of California(UCI)Machine Learning dataset to assess the accuracy of the proposed data classification method.The dataset samples were divided into training(75%of samples)and test(25%of samples)sets.Experiments were performed in MATLAB R2010a software.The experiments showed that:(1)The fastest sequence is matching function,distance measure,similarity,normalized Euclidean distance,(2)the proposed approach can improve accuracy and recall by up to 10.3436%and 6.9723%,respectively,compared with baseline techniques.Hence,the fuzzy soft set method is appropriate for classifying data.展开更多
In this article,multiple attribute decision-making problems are solved using the vague normal set(VNS).It is possible to generalize the vague set(VS)and q-rung fuzzy set(FS)into the q-rung vague set(VS).A log q-rung n...In this article,multiple attribute decision-making problems are solved using the vague normal set(VNS).It is possible to generalize the vague set(VS)and q-rung fuzzy set(FS)into the q-rung vague set(VS).A log q-rung normal vague weighted averaging(log q-rung NVWA),a log q-rung normal vague weighted geometric(log q-rung NVWG),a log generalized q-rung normal vague weighted averaging(log Gq-rung NVWA),and a log generalized q-rungnormal vagueweightedgeometric(logGq-rungNVWG)operator are discussed in this article.Adescription is provided of the scoring function,accuracy function and operational laws of the log q-rung VS.The algorithms underlying these functions are also described.A numerical example is provided to extend the Euclidean distance and the Humming distance.Additionally,idempotency,boundedness,commutativity,and monotonicity of the log q-rung VS are examined as they facilitate recognizing the optimal alternative more quickly and help clarify conceptualization.We chose five anemia patients with four types of symptoms including seizures,emotional shock or hysteria,brain cause,and high fever,who had either retrograde amnesia,anterograde amnesia,transient global amnesia,post-traumatic amnesia,or infantile amnesia.Natural numbers q are used to express the results of the models.To demonstrate the effectiveness and accuracy of the models we are investigating,we compare several existing models with those that have been developed.展开更多
Through improving the redundant data filtering of unreliable data filter for radio frequency identification(RFID) with sliding-window,a data filter which integrates self-adaptive sliding-window and Euclidean distanc...Through improving the redundant data filtering of unreliable data filter for radio frequency identification(RFID) with sliding-window,a data filter which integrates self-adaptive sliding-window and Euclidean distance is proposed.The input data required being filtered have been shunt by considering a large number of redundant data existing in the unreliable data for RFID and the redundant data in RFID are the main filtering object with utilizing the filter based on Euclidean distance.The comparison between the results from the method proposed in this paper and previous research shows that it can improve the accuracy of the RFID for unreliable data filtering and largely reduce the redundant reading rate.展开更多
When the coordinates of a set of points are known, the pairwise Euclidean distances among the points can be easily computed. Conversely, if the Euclidean distance matrix is given, a set of coordinates for those points...When the coordinates of a set of points are known, the pairwise Euclidean distances among the points can be easily computed. Conversely, if the Euclidean distance matrix is given, a set of coordinates for those points can be computed through the well known classical Multi-Dimensional Scaling (MDS). In this paper, we consider the case where some of the distances are far from being accurate (containing large noises or even missing). In such a situation, the order of the known distances (i.e., some distances are larger than others) is valuable information that often yields far more accurate construction of the points than just using the magnitude of the known distances. The methods making use of the order information is collectively known as nonmetric MDS. A challenging computational issue among all existing nonmetric MDS methods is that there are often a large number of ordinal constraints. In this paper, we cast this problem as a matrix optimization problem with ordinal constraints. We then adapt an existing smoothing Newton method to our matrix problem. Extensive numerical results demonstrate the efficiency of the algorithm, which can potentially handle a very large number of ordinal constraints.展开更多
Hierarchical clustering algorithms, such as Pearson's correlation, Euclidean distance, Euclidean distance harmonic, Spearman rank correlation, Kendall's tau, and City-block distance, were used to find the best way t...Hierarchical clustering algorithms, such as Pearson's correlation, Euclidean distance, Euclidean distance harmonic, Spearman rank correlation, Kendall's tau, and City-block distance, were used to find the best way to establish theoretical MAPK/Erk signaling pathway on the basis of breast cancer line MCF-7 gene expressions. The algorithm constructs a hierarchy from top to bottom on the basis of a self-organizing tree. It dynamically finds the number of clusters at each level. It was found that only Euclidean distance harmonic is fit for the analysis of the cascade composed from a RAF1 (c-Raf), a MKNK1, a MAPKK (MEK1/2) to MAPK (Erk) in breast cancer line MCF-7. The result is consistent with the biological experimental MAP/Erk signaling pathway, and the theoretical MAPK/Erk signaling pathway on breast cancer line MCF-7 is set up.展开更多
In this paper, we propose a simpleyet-effective method for isotropic meshing relying on Euclidean distance transformation based centroidal Voronoi tessellation(CVT). Our approach improves the performance and robustnes...In this paper, we propose a simpleyet-effective method for isotropic meshing relying on Euclidean distance transformation based centroidal Voronoi tessellation(CVT). Our approach improves the performance and robustness of computing CVT on curved domains while simultaneously providing highquality output meshes. While conventional extrinsic methods compute CVTs in the entire volume bounded by the input model, we restrict the computation to a 3D shell of user-controlled thickness. Taking voxels which contain surface samples as sites, we compute the exact Euclidean distance transform on the GPU. Our algorithm is parallel and memory-efficient,and can construct the shell space for resolutions up to 20483 at interactive speed. The 3D centroidal Voronoi tessellation and restricted Voronoi diagrams are also computed efficiently on the GPU. Since the shell space can bridge holes and gaps smaller than a certain tolerance, and tolerate non-manifold edges and degenerate triangles, our algorithm can handle models with such defects, which typically cause conventional remeshing methods to fail. Our method can process implicit surfaces, polyhedral surfaces, and point clouds in a unified framework. Computational results show that our GPU-based isotropic meshing algorithm produces results comparable to state-ofthe-art techniques, but is significantly faster than conventional CPU-based implementations.展开更多
One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model a...One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model approach was employed to unbiasedly predict genotypic values of 20 traits for eliminating the environmental effect. Six commonly used genetic distances(Euclidean,standardized Euclidean,Mahalanobis,city block,cosine and correlation distances) combining four commonly used hierarchical cluster methods(single distance,complete distance,unweighted pair-group average and Ward's methods) were used in the least distance stepwise sampling(LDSS) method for constructing different core subsets. The analyses of variance(ANOVA) of different evaluating parameters showed that the validities of cosine and correlation distances were inferior to those of Euclidean,standardized Euclidean,Mahalanobis and city block distances. Standardized Euclidean distance was slightly more effective than Euclidean,Mahalanobis and city block distances. The principal analysis validated standardized Euclidean distance in the course of constructing practical core subsets. The covariance matrix of accessions might be ill-conditioned when Mahalanobis distance was used to calculate genetic distance at low sampling percentages,which led to bias in small-sized core subset construction. The standardized Euclidean distance is recommended in core subset construction with LDSS method.展开更多
An approach of distane map based imageenhancement (DMIE) is proposed. It is applied toconventional interpolations to get sharp images. Edgedetection is performed after images are interpolatedby linear interpolations. ...An approach of distane map based imageenhancement (DMIE) is proposed. It is applied toconventional interpolations to get sharp images. Edgedetection is performed after images are interpolatedby linear interpolations. To meet the two conditionsset for DMIE, i. e., no abrupt changes and no over-boosting, different boosting rate should be used inadjusting pixel intensities. When the boosting rate isdetermined by using the distance from enhancedpixels to nearest edges, edge-oriented imageenhancement is obtained. By using Erosion technique,the range for pixel intensity adiustment is set.Over-enhancement is avoided by limiting the pixel iutensities in enhancement within the range. A unifled linear-time algoritiml for disance transform is adopted to deal with the calculation of Euelidean distance of the images.Its computation complexity is 0(N).After the preparation,i.e.,distance transforming and erosion,the images get more and more sharpened while no over.boosting.Occurs by repeating the enhancement procedure ,The simplicity of the enhancement operation makes DMIE suitable for enhancement rate adjusting展开更多
Exploring the indica-japonica differentiation in parents of hybridization can provide theoretical bases for utilizing inter-subspecific heterosis. In this study, 5 sterile lines and 18 self-bred restorer lines were us...Exploring the indica-japonica differentiation in parents of hybridization can provide theoretical bases for utilizing inter-subspecific heterosis. In this study, 5 sterile lines and 18 self-bred restorer lines were used as female parents and male parents respectively. Then 90 combinations were constructed by incomplete diallel cross followed by relationship analysis between parental Cheng’s index difference value and Euclidean distance and heterosis. The results showed a significant correlation between several phenotype values, super male parent heterosis and control heterosis and Euclidean distance or Cheng’s index difference value. However, it was no significant correlation for yield. Further analysis found a common interval, 3.41 - 3.46 for Euclidean distance and 3 - 4 for cheng’s index difference value of parents, which was significant or high significant positive correlated with phenotype value, super male parent and control heterosis of main yield traits. This illustrates that the larger the genetic difference of parents was, the stronger the heterosis combinations were, when the genetic differences of parents were in an appropriate range.展开更多
Ventilation system is significant in underground metal mine of alpine region.Reasonable evaluation of ventilation effectiveness will lead to a practical improvement for the maintenance and management of ventilation sy...Ventilation system is significant in underground metal mine of alpine region.Reasonable evaluation of ventilation effectiveness will lead to a practical improvement for the maintenance and management of ventilation system.However,it is difficult to make an effective evaluation of ventilation system due to the lack of classification criteria with respect to underground metal mine in alpine region.This paper proposes a novel evaluation method called the cloud model-clustering analysis(CMCA).Cloud model(CM)is utilized to process collected data of ventilation system,and they are converted into cloud descriptors by CM.Cloud similarity(CS)based Euclidean distance(ED)is proposed to make clustering analysis of assessed samples.Then the classification of assessed samples will be identified by clustering analysis results.A case study is developed based on CMCA.Evaluation results show that ventilation effectiveness can be well classified.Moreover,CM is used alone to make comparison of evaluation results obtained by CMCA.Then the availability and validity of CMCA is verified.Meanwhile,difference of CS based ED and classical ED is analyzed.Two new clustering analysis methods are introduced to make comparison with CMCA.Then the ability of proposed CMCA to meet evaluation requirements of ventilation system is verified.展开更多
Air pollution has seriously endangered human health and the natural ecosystem during the last decades.Air quality monitoring stations(AQMS)have played a critical role in providing valuable data sets for recording regi...Air pollution has seriously endangered human health and the natural ecosystem during the last decades.Air quality monitoring stations(AQMS)have played a critical role in providing valuable data sets for recording regional air pollutants.The spatial representativeness of AQMS is a critical parameter when choosing the location of stations and assessing effects on the population to long-term exposure to air pollution.In this paper,we proposed a methodological framework for assessing the spatial representativeness of the regional air quality monitoring network and applied it to ground-based PM_(2.5)observation in the mainland of China.Weighted multidimensional Euclidean distance between each pixel and the stations was used to determine the representativeness of the existing monitoring network.In addition,the K-means clustering method was adopted to improve the spatial representativeness of the existing AQMS.The results showed that there were obvious differences among the representative area of 1820 stations in the mainland of China.The monitoring stations could well represent the PM_(2.5)spatial distribution of the entire region,and the effectively represented area(i.e.the area where the Euclidean distance between the pixels and the stations was lower than the average value)accounted for 67.32%of the total area and covered 93.12%of the population.Forty additional stations were identified in the Northwest,North China,and Northeast regions,which could improve the spatial representativeness by 14.31%.展开更多
Anti-detection is becoming as an emerging challenge for anti-phishing.This paper solves the threats of anti-detection from the threshold setting condition.Enough webpages are considered to complicate threshold setting...Anti-detection is becoming as an emerging challenge for anti-phishing.This paper solves the threats of anti-detection from the threshold setting condition.Enough webpages are considered to complicate threshold setting condition when the threshold is settled.According to the common visual behavior which is easily attracted by the salient region of webpages,image retrieval methods based on texton correlation descriptor(TCD)are improved to obtain enough webpages which have similarity in the salient region for the images of webpages.There are two steps for improving TCD which has advantage of recognizing the salient region of images:(1)This paper proposed Weighted Euclidean Distance based on neighborhood location(NLW-Euclidean distance)and double cross windows,and combine them to solve the problems in TCD;(2)Space structure is introduced to map the image set to Euclid space so that similarity relation among images can be used to complicate threshold setting conditions.Experimental results show that the proposed method can improve the effectiveness of anti-phishing and make the system more stable,and significantly reduce the possibilities of being hacked to be used as mining systems for blockchain.展开更多
A reduced state Soft Input Soft Output (SISO) a posteriori probability algorithm for Seri-ally Concatenated Continuous Phase Modulation (SCCPM) is proposed in this paper. Based on the Reduced State Sequence Detection ...A reduced state Soft Input Soft Output (SISO) a posteriori probability algorithm for Seri-ally Concatenated Continuous Phase Modulation (SCCPM) is proposed in this paper. Based on the Reduced State Sequence Detection (RSSD),it has more general form compared with other reduced state SISO algorithms. The proposed algorithm can greatly reduce the state number,thus leads to the computation complexity reduction. It also minimizes the degradation in Euclidean distance with decision feedback in the reduced state trellis. Analysis and simulation results show that the perform-ance degradation is little with proper reduction scheme.展开更多
Usually image assessment methods could be classified into two categories: subjective as-sessments and objective ones. The latter are judged by the correlation coefficient with subjective quality measurement MOS (Mean ...Usually image assessment methods could be classified into two categories: subjective as-sessments and objective ones. The latter are judged by the correlation coefficient with subjective quality measurement MOS (Mean Opinion Score). This paper presents an objective quality assessment algorithm special for binary images. In the algorithm, noise energy is measured by Euclidean distance between noises and signals and the structural effects caused by noise are described by Euler number change. The assessment on image quality is calculated quantitatively in terms of PSNR (Peak Signal to Noise Ratio). Our experiments show that the results of the algorithm are highly correlative with subjective MOS and the algorithm is more simple and computational saving than traditional objective assessment methods.展开更多
Transition zone or ecotone is a unique community sandwiched between two communities/ecosystems/biomes.These ecotones in Himalaya remained unexplored for many ecological aspects like biodiversity,phyto-sociology,bounda...Transition zone or ecotone is a unique community sandwiched between two communities/ecosystems/biomes.These ecotones in Himalaya remained unexplored for many ecological aspects like biodiversity,phyto-sociology,boundary detection and even impact of change in land use pattern(anthropogenic activity).The most accepted and widespread technique called as Moving Split Window(MSW) technique is used for detection of vegetation and environmental boundaries at four different sites in the lesser stratum of north-west Himalaya.All the four sites were at different distances from the nearest human inhabited area.Anthropogenic activities like grazing,herb collection,wood collection etc.were common at proximal sites.Such activities have led to the change in land use pattern.In this study,we have tried to work out the impact of the change in land use pattern(human interference) on the vegetation and basic environmental parameters like soil pH,electrical conductivity and moisture on forestgrassland ecotone in north-west Himalaya.Data on mountain steepness was also collected and analyzed.The dissimilarity profile using the statistical tool Squared Euclidian Distance(SED) indicated that species turnover locations increase with the increase in distance of ecotones from human settlements.The ecotones at distant locations from human villages are characterized with blunt as well as sharp peaks for vegetation data,however,conditions are reverse in case of the proximal sites.The study also reveals that as the distance between the ecotone and human settlements increases,the complex conditions like multiple vegetation boundaries prevails on the transitions.In this regard,land use induced blurring of forest-grassland transition in north-west Himalaya is summed up in the study.展开更多
Mapping design criteria of bit-interleaved coded modulation with iterative decoding (BICM-ID) with square 16QAM are analyzed. Three of the existing criteria are analyzed and compared with each other. Through the compa...Mapping design criteria of bit-interleaved coded modulation with iterative decoding (BICM-ID) with square 16QAM are analyzed. Three of the existing criteria are analyzed and compared with each other. Through the comparison, two main characters of the mapping design criteria are found. They are the harmonic mean of the minimum squared Euclidean distance and the average of Hamming distances with the nearest Euclidean distance. Based on these two characters, a novel mapping design criterion is proposed and a label mapping named mixed mapping is searched according to it. Simulation results show that mixed mapping performs better than the other mappings in BICM-ID system.展开更多
The paper analyze and improve the SIFT optimized algorithm, and proposes an image matching method for SIFT algorithm based on quasi Euclidean distance and KD-tree. Experiments show that this algorithm has matching mor...The paper analyze and improve the SIFT optimized algorithm, and proposes an image matching method for SIFT algorithm based on quasi Euclidean distance and KD-tree. Experiments show that this algorithm has matching more points, high matching accuracy, no repealed points and higher advantage of matching efficiency based on keeping the basic characteristics of SIFT algorithm unchanged, and provides precise matching point to generate precise image stitching and other related fields of the follow-up product. At the same time, this method was applied to the layout optimization and achieved good results.展开更多
As a new three-dimensional(3-D)modulation,Polarization Quadrature Amplitude Modulation(PQAM) can be regarded as the combination of Pulse amplitude modulation(PAM) and Quadrature Amplitude Modulation(QAM) Modulation.It...As a new three-dimensional(3-D)modulation,Polarization Quadrature Amplitude Modulation(PQAM) can be regarded as the combination of Pulse amplitude modulation(PAM) and Quadrature Amplitude Modulation(QAM) Modulation.It can better improve the digital communication efficiency and reduce the Symbol error rate(SER) of the system than one-dimensional or two-dimensional modulation scheme.How to design a feasible constellation is the most concerned problem of PQAM currently.This paper first studies the relationship between the SER theoretical value of PQAM and the distribution of M and N,proposes a new M,N allocation scheme.Secondly,a new and straightforward design method of constructing higher-level 3-D signal constellations,which can be matched with the PQAM,and the constellation can divided into three different structures according to the ary for PQAM.Finally,the simulation results show that:in PQAM system,the modulation scheme and the constellation mapping scheme are proposed in this paper which can effectively reduce the system SER and improve the anti-noise performance of the system.展开更多
This paper presents a classifier named kernel-based nonlinear representor (KNR) for optimal representation of pattern features. Adopting the Gaussian kernel, with the kernel width adaptively estimated by a simple tech...This paper presents a classifier named kernel-based nonlinear representor (KNR) for optimal representation of pattern features. Adopting the Gaussian kernel, with the kernel width adaptively estimated by a simple technique, it is applied to eigenface classification. Experimental results on the ORL face database show that it improves performance by around 6 points, in classification rate, over the Euclidean distance classifier.展开更多
基金supported in part by the National High‐tech R&D Program of China (863 Program) under Grant No. 2013AA102301technological project of Henan province (162102210214)
文摘Recommendation system can greatly alleviate the "information overload" in the big data era. Existing recommendation methods, however, typically focus on predicting missing rating values via analyzing user-item dualistic relationship, which neglect an important fact that the latent interests of users can influence their rating behaviors. Moreover, traditional recommendation methods easily suffer from the high dimensional problem and cold-start problem. To address these challenges, in this paper, we propose a PBUED(PLSA-Based Uniform Euclidean Distance) scheme, which utilizes topic model and uniform Euclidean distance to recommend the suitable items for users. The solution first employs probabilistic latent semantic analysis(PLSA) to extract users' interests, users with different interests are divided into different subgroups. Then, the uniform Euclidean distance is adopted to compute the users' similarity in the same interest subset; finally, the missing rating values of data are predicted via aggregating similar neighbors' ratings. We evaluate PBUED on two datasets and experimental results show PBUED can lead to better predicting performance and ranking performance than other approaches.
文摘Classification is one of the data mining processes used to predict predetermined target classes with data learning accurately.This study discusses data classification using a fuzzy soft set method to predict target classes accurately.This study aims to form a data classification algorithm using the fuzzy soft set method.In this study,the fuzzy soft set was calculated based on the normalized Hamming distance.Each parameter in this method is mapped to a power set from a subset of the fuzzy set using a fuzzy approximation function.In the classification step,a generalized normalized Euclidean distance is used to determine the similarity between two sets of fuzzy soft sets.The experiments used the University of California(UCI)Machine Learning dataset to assess the accuracy of the proposed data classification method.The dataset samples were divided into training(75%of samples)and test(25%of samples)sets.Experiments were performed in MATLAB R2010a software.The experiments showed that:(1)The fastest sequence is matching function,distance measure,similarity,normalized Euclidean distance,(2)the proposed approach can improve accuracy and recall by up to 10.3436%and 6.9723%,respectively,compared with baseline techniques.Hence,the fuzzy soft set method is appropriate for classifying data.
基金supported by the National Research Foundation of Korea(NRF)Grant funded by the Korea government(MSIT)(No.RS-2023-00218176)Korea Institute for Advancement of Technology(KIAT)Grant funded by the Korea government(MOTIE)(P0012724)The Competency Development Program for Industry Specialist)and the Soonchunhyang University Research Fund.
文摘In this article,multiple attribute decision-making problems are solved using the vague normal set(VNS).It is possible to generalize the vague set(VS)and q-rung fuzzy set(FS)into the q-rung vague set(VS).A log q-rung normal vague weighted averaging(log q-rung NVWA),a log q-rung normal vague weighted geometric(log q-rung NVWG),a log generalized q-rung normal vague weighted averaging(log Gq-rung NVWA),and a log generalized q-rungnormal vagueweightedgeometric(logGq-rungNVWG)operator are discussed in this article.Adescription is provided of the scoring function,accuracy function and operational laws of the log q-rung VS.The algorithms underlying these functions are also described.A numerical example is provided to extend the Euclidean distance and the Humming distance.Additionally,idempotency,boundedness,commutativity,and monotonicity of the log q-rung VS are examined as they facilitate recognizing the optimal alternative more quickly and help clarify conceptualization.We chose five anemia patients with four types of symptoms including seizures,emotional shock or hysteria,brain cause,and high fever,who had either retrograde amnesia,anterograde amnesia,transient global amnesia,post-traumatic amnesia,or infantile amnesia.Natural numbers q are used to express the results of the models.To demonstrate the effectiveness and accuracy of the models we are investigating,we compare several existing models with those that have been developed.
基金supported by the foundation of Science and Technology Commission of Shanghai Municipality (Grant No.13521103902)
文摘Through improving the redundant data filtering of unreliable data filter for radio frequency identification(RFID) with sliding-window,a data filter which integrates self-adaptive sliding-window and Euclidean distance is proposed.The input data required being filtered have been shunt by considering a large number of redundant data existing in the unreliable data for RFID and the redundant data in RFID are the main filtering object with utilizing the filter based on Euclidean distance.The comparison between the results from the method proposed in this paper and previous research shows that it can improve the accuracy of the RFID for unreliable data filtering and largely reduce the redundant reading rate.
文摘When the coordinates of a set of points are known, the pairwise Euclidean distances among the points can be easily computed. Conversely, if the Euclidean distance matrix is given, a set of coordinates for those points can be computed through the well known classical Multi-Dimensional Scaling (MDS). In this paper, we consider the case where some of the distances are far from being accurate (containing large noises or even missing). In such a situation, the order of the known distances (i.e., some distances are larger than others) is valuable information that often yields far more accurate construction of the points than just using the magnitude of the known distances. The methods making use of the order information is collectively known as nonmetric MDS. A challenging computational issue among all existing nonmetric MDS methods is that there are often a large number of ordinal constraints. In this paper, we cast this problem as a matrix optimization problem with ordinal constraints. We then adapt an existing smoothing Newton method to our matrix problem. Extensive numerical results demonstrate the efficiency of the algorithm, which can potentially handle a very large number of ordinal constraints.
文摘Hierarchical clustering algorithms, such as Pearson's correlation, Euclidean distance, Euclidean distance harmonic, Spearman rank correlation, Kendall's tau, and City-block distance, were used to find the best way to establish theoretical MAPK/Erk signaling pathway on the basis of breast cancer line MCF-7 gene expressions. The algorithm constructs a hierarchy from top to bottom on the basis of a self-organizing tree. It dynamically finds the number of clusters at each level. It was found that only Euclidean distance harmonic is fit for the analysis of the cascade composed from a RAF1 (c-Raf), a MKNK1, a MAPKK (MEK1/2) to MAPK (Erk) in breast cancer line MCF-7. The result is consistent with the biological experimental MAP/Erk signaling pathway, and the theoretical MAPK/Erk signaling pathway on breast cancer line MCF-7 is set up.
基金partially supported by Ac RF RG40/12MOE2013-T2-2-011+2 种基金partially supported by National Natural Science Foundation of China (Nos. 61432003 and 61322206)the TNList Cross-discipline Foundationpartially supported by HKSAR Research Grants Council (RGC) General Research Fund (GRF), CUHK/14207414
文摘In this paper, we propose a simpleyet-effective method for isotropic meshing relying on Euclidean distance transformation based centroidal Voronoi tessellation(CVT). Our approach improves the performance and robustness of computing CVT on curved domains while simultaneously providing highquality output meshes. While conventional extrinsic methods compute CVTs in the entire volume bounded by the input model, we restrict the computation to a 3D shell of user-controlled thickness. Taking voxels which contain surface samples as sites, we compute the exact Euclidean distance transform on the GPU. Our algorithm is parallel and memory-efficient,and can construct the shell space for resolutions up to 20483 at interactive speed. The 3D centroidal Voronoi tessellation and restricted Voronoi diagrams are also computed efficiently on the GPU. Since the shell space can bridge holes and gaps smaller than a certain tolerance, and tolerate non-manifold edges and degenerate triangles, our algorithm can handle models with such defects, which typically cause conventional remeshing methods to fail. Our method can process implicit surfaces, polyhedral surfaces, and point clouds in a unified framework. Computational results show that our GPU-based isotropic meshing algorithm produces results comparable to state-ofthe-art techniques, but is significantly faster than conventional CPU-based implementations.
基金Project supported by the National Natural Science Foundation of China (No. 30270759)the Cooperation Project in Science and Technology between China and Poland Governments (No. 32-38)the Scientific Research Foundation for Doctors in Shandong Academy of Agricultural Sciences (No. [2007]20), China
文摘One hundred and sixty-eight genotypes of cotton from the same growing region were used as a germplasm group to study the validity of different genetic distances in constructing cotton core subset. Mixed linear model approach was employed to unbiasedly predict genotypic values of 20 traits for eliminating the environmental effect. Six commonly used genetic distances(Euclidean,standardized Euclidean,Mahalanobis,city block,cosine and correlation distances) combining four commonly used hierarchical cluster methods(single distance,complete distance,unweighted pair-group average and Ward's methods) were used in the least distance stepwise sampling(LDSS) method for constructing different core subsets. The analyses of variance(ANOVA) of different evaluating parameters showed that the validities of cosine and correlation distances were inferior to those of Euclidean,standardized Euclidean,Mahalanobis and city block distances. Standardized Euclidean distance was slightly more effective than Euclidean,Mahalanobis and city block distances. The principal analysis validated standardized Euclidean distance in the course of constructing practical core subsets. The covariance matrix of accessions might be ill-conditioned when Mahalanobis distance was used to calculate genetic distance at low sampling percentages,which led to bias in small-sized core subset construction. The standardized Euclidean distance is recommended in core subset construction with LDSS method.
文摘An approach of distane map based imageenhancement (DMIE) is proposed. It is applied toconventional interpolations to get sharp images. Edgedetection is performed after images are interpolatedby linear interpolations. To meet the two conditionsset for DMIE, i. e., no abrupt changes and no over-boosting, different boosting rate should be used inadjusting pixel intensities. When the boosting rate isdetermined by using the distance from enhancedpixels to nearest edges, edge-oriented imageenhancement is obtained. By using Erosion technique,the range for pixel intensity adiustment is set.Over-enhancement is avoided by limiting the pixel iutensities in enhancement within the range. A unifled linear-time algoritiml for disance transform is adopted to deal with the calculation of Euelidean distance of the images.Its computation complexity is 0(N).After the preparation,i.e.,distance transforming and erosion,the images get more and more sharpened while no over.boosting.Occurs by repeating the enhancement procedure ,The simplicity of the enhancement operation makes DMIE suitable for enhancement rate adjusting
文摘Exploring the indica-japonica differentiation in parents of hybridization can provide theoretical bases for utilizing inter-subspecific heterosis. In this study, 5 sterile lines and 18 self-bred restorer lines were used as female parents and male parents respectively. Then 90 combinations were constructed by incomplete diallel cross followed by relationship analysis between parental Cheng’s index difference value and Euclidean distance and heterosis. The results showed a significant correlation between several phenotype values, super male parent heterosis and control heterosis and Euclidean distance or Cheng’s index difference value. However, it was no significant correlation for yield. Further analysis found a common interval, 3.41 - 3.46 for Euclidean distance and 3 - 4 for cheng’s index difference value of parents, which was significant or high significant positive correlated with phenotype value, super male parent and control heterosis of main yield traits. This illustrates that the larger the genetic difference of parents was, the stronger the heterosis combinations were, when the genetic differences of parents were in an appropriate range.
基金Project(2018YFC0808404)supported by National Key Research and Development Program of China。
文摘Ventilation system is significant in underground metal mine of alpine region.Reasonable evaluation of ventilation effectiveness will lead to a practical improvement for the maintenance and management of ventilation system.However,it is difficult to make an effective evaluation of ventilation system due to the lack of classification criteria with respect to underground metal mine in alpine region.This paper proposes a novel evaluation method called the cloud model-clustering analysis(CMCA).Cloud model(CM)is utilized to process collected data of ventilation system,and they are converted into cloud descriptors by CM.Cloud similarity(CS)based Euclidean distance(ED)is proposed to make clustering analysis of assessed samples.Then the classification of assessed samples will be identified by clustering analysis results.A case study is developed based on CMCA.Evaluation results show that ventilation effectiveness can be well classified.Moreover,CM is used alone to make comparison of evaluation results obtained by CMCA.Then the availability and validity of CMCA is verified.Meanwhile,difference of CS based ED and classical ED is analyzed.Two new clustering analysis methods are introduced to make comparison with CMCA.Then the ability of proposed CMCA to meet evaluation requirements of ventilation system is verified.
基金funded by the National Natural Science Foundation of China (41977399)the National Key Research and Development Program (2017YFC0505800)
文摘Air pollution has seriously endangered human health and the natural ecosystem during the last decades.Air quality monitoring stations(AQMS)have played a critical role in providing valuable data sets for recording regional air pollutants.The spatial representativeness of AQMS is a critical parameter when choosing the location of stations and assessing effects on the population to long-term exposure to air pollution.In this paper,we proposed a methodological framework for assessing the spatial representativeness of the regional air quality monitoring network and applied it to ground-based PM_(2.5)observation in the mainland of China.Weighted multidimensional Euclidean distance between each pixel and the stations was used to determine the representativeness of the existing monitoring network.In addition,the K-means clustering method was adopted to improve the spatial representativeness of the existing AQMS.The results showed that there were obvious differences among the representative area of 1820 stations in the mainland of China.The monitoring stations could well represent the PM_(2.5)spatial distribution of the entire region,and the effectively represented area(i.e.the area where the Euclidean distance between the pixels and the stations was lower than the average value)accounted for 67.32%of the total area and covered 93.12%of the population.Forty additional stations were identified in the Northwest,North China,and Northeast regions,which could improve the spatial representativeness by 14.31%.
基金The work reported in this paper was supported by the Joint research project of Jiangsu Province under Grant No.BY2016026-04the Opening Project of State Key Laboratory for Novel Software Technology of Nanjing University under Grant No.KFKT2018B27+1 种基金the National Natural Science Foundation for Young Scientists of China under Grant No.61303263the Jiangsu Provincial Research Foundation for Basic Research(Natural Science Foundation)under Grant No.BK20150201.
文摘Anti-detection is becoming as an emerging challenge for anti-phishing.This paper solves the threats of anti-detection from the threshold setting condition.Enough webpages are considered to complicate threshold setting condition when the threshold is settled.According to the common visual behavior which is easily attracted by the salient region of webpages,image retrieval methods based on texton correlation descriptor(TCD)are improved to obtain enough webpages which have similarity in the salient region for the images of webpages.There are two steps for improving TCD which has advantage of recognizing the salient region of images:(1)This paper proposed Weighted Euclidean Distance based on neighborhood location(NLW-Euclidean distance)and double cross windows,and combine them to solve the problems in TCD;(2)Space structure is introduced to map the image set to Euclid space so that similarity relation among images can be used to complicate threshold setting conditions.Experimental results show that the proposed method can improve the effectiveness of anti-phishing and make the system more stable,and significantly reduce the possibilities of being hacked to be used as mining systems for blockchain.
基金Supported by NSFC & Microsoft Asia (60372048)China TRAPOYT, NSFC key project (60496316)+2 种基金863 Project (2005AA123910)RFDP (20050701007)MOE Key Project (104171).
文摘A reduced state Soft Input Soft Output (SISO) a posteriori probability algorithm for Seri-ally Concatenated Continuous Phase Modulation (SCCPM) is proposed in this paper. Based on the Reduced State Sequence Detection (RSSD),it has more general form compared with other reduced state SISO algorithms. The proposed algorithm can greatly reduce the state number,thus leads to the computation complexity reduction. It also minimizes the degradation in Euclidean distance with decision feedback in the reduced state trellis. Analysis and simulation results show that the perform-ance degradation is little with proper reduction scheme.
基金Supported by Innovation Fund for Small Technology Based Firms, China (No.04C26213301189)Science and Technology Foundation by Beijng Jiaotong University (No.2005SM009)the Key Laboratory of Advanced Information Science and Network Technology of Beijing (No.TDXX0509).
文摘Usually image assessment methods could be classified into two categories: subjective as-sessments and objective ones. The latter are judged by the correlation coefficient with subjective quality measurement MOS (Mean Opinion Score). This paper presents an objective quality assessment algorithm special for binary images. In the algorithm, noise energy is measured by Euclidean distance between noises and signals and the structural effects caused by noise are described by Euler number change. The assessment on image quality is calculated quantitatively in terms of PSNR (Peak Signal to Noise Ratio). Our experiments show that the results of the algorithm are highly correlative with subjective MOS and the algorithm is more simple and computational saving than traditional objective assessment methods.
文摘Transition zone or ecotone is a unique community sandwiched between two communities/ecosystems/biomes.These ecotones in Himalaya remained unexplored for many ecological aspects like biodiversity,phyto-sociology,boundary detection and even impact of change in land use pattern(anthropogenic activity).The most accepted and widespread technique called as Moving Split Window(MSW) technique is used for detection of vegetation and environmental boundaries at four different sites in the lesser stratum of north-west Himalaya.All the four sites were at different distances from the nearest human inhabited area.Anthropogenic activities like grazing,herb collection,wood collection etc.were common at proximal sites.Such activities have led to the change in land use pattern.In this study,we have tried to work out the impact of the change in land use pattern(human interference) on the vegetation and basic environmental parameters like soil pH,electrical conductivity and moisture on forestgrassland ecotone in north-west Himalaya.Data on mountain steepness was also collected and analyzed.The dissimilarity profile using the statistical tool Squared Euclidian Distance(SED) indicated that species turnover locations increase with the increase in distance of ecotones from human settlements.The ecotones at distant locations from human villages are characterized with blunt as well as sharp peaks for vegetation data,however,conditions are reverse in case of the proximal sites.The study also reveals that as the distance between the ecotone and human settlements increases,the complex conditions like multiple vegetation boundaries prevails on the transitions.In this regard,land use induced blurring of forest-grassland transition in north-west Himalaya is summed up in the study.
文摘Mapping design criteria of bit-interleaved coded modulation with iterative decoding (BICM-ID) with square 16QAM are analyzed. Three of the existing criteria are analyzed and compared with each other. Through the comparison, two main characters of the mapping design criteria are found. They are the harmonic mean of the minimum squared Euclidean distance and the average of Hamming distances with the nearest Euclidean distance. Based on these two characters, a novel mapping design criterion is proposed and a label mapping named mixed mapping is searched according to it. Simulation results show that mixed mapping performs better than the other mappings in BICM-ID system.
文摘The paper analyze and improve the SIFT optimized algorithm, and proposes an image matching method for SIFT algorithm based on quasi Euclidean distance and KD-tree. Experiments show that this algorithm has matching more points, high matching accuracy, no repealed points and higher advantage of matching efficiency based on keeping the basic characteristics of SIFT algorithm unchanged, and provides precise matching point to generate precise image stitching and other related fields of the follow-up product. At the same time, this method was applied to the layout optimization and achieved good results.
基金supported in part by the National Natural Science Foundation of China (61561039, 61271177, and 61461044)
文摘As a new three-dimensional(3-D)modulation,Polarization Quadrature Amplitude Modulation(PQAM) can be regarded as the combination of Pulse amplitude modulation(PAM) and Quadrature Amplitude Modulation(QAM) Modulation.It can better improve the digital communication efficiency and reduce the Symbol error rate(SER) of the system than one-dimensional or two-dimensional modulation scheme.How to design a feasible constellation is the most concerned problem of PQAM currently.This paper first studies the relationship between the SER theoretical value of PQAM and the distribution of M and N,proposes a new M,N allocation scheme.Secondly,a new and straightforward design method of constructing higher-level 3-D signal constellations,which can be matched with the PQAM,and the constellation can divided into three different structures according to the ary for PQAM.Finally,the simulation results show that:in PQAM system,the modulation scheme and the constellation mapping scheme are proposed in this paper which can effectively reduce the system SER and improve the anti-noise performance of the system.
文摘This paper presents a classifier named kernel-based nonlinear representor (KNR) for optimal representation of pattern features. Adopting the Gaussian kernel, with the kernel width adaptively estimated by a simple technique, it is applied to eigenface classification. Experimental results on the ORL face database show that it improves performance by around 6 points, in classification rate, over the Euclidean distance classifier.