Clustering analysis is one of the main concerns in data mining. A common approach to clustering is to bring together points that are close to each other and separate points that are far apart. Measuring the distance between sample points is therefore crucial to the effectiveness of clustering. Filtering features by label information and measuring the distance between samples with these features is a common supervised learning method for reconstructing the distance metric. However, in many application scenarios it is very expensive to obtain a large number of labeled samples. To solve the clustering problem in scenarios with few supervised samples and high data dimensionality, this paper proposes a novel semi-supervised clustering algorithm built on an improved prototype network. The network attempts to reconstruct the distance metric in the sample space from a small amount of pairwise supervised information, such as Must-Link and Cannot-Link constraints, and the data are then clustered in the new metric space. The core idea is to use an embedding mapping to pull similar samples closer together and push dissimilar samples further apart. Extensive experiments on both real-world and synthetic datasets show the effectiveness of the algorithm: average clustering metrics across the datasets improved by 8% compared with the comparison algorithm.
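
The idea of pulling Must-Link pairs together and pushing Cannot-Link pairs apart can be illustrated with a standard contrastive pairwise loss. This is a minimal sketch only: the abstract does not give the loss used by the improved prototype network, so the function name `pairwise_loss`, the `margin` parameter, and the squared-hinge form are assumptions chosen for illustration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def pairwise_loss(embeddings, must_link, cannot_link, margin=1.0):
    """Contrastive loss over index pairs (hypothetical helper, not the paper's loss).

    Must-Link pairs are penalised by their squared distance.
    Cannot-Link pairs are penalised only when closer than `margin`.
    """
    loss = 0.0
    for i, j in must_link:
        loss += euclidean(embeddings[i], embeddings[j]) ** 2
    for i, j in cannot_link:
        gap = margin - euclidean(embeddings[i], embeddings[j])
        loss += max(0.0, gap) ** 2
    # Average over all constraints; guard against an empty constraint set.
    return loss / max(1, len(must_link) + len(cannot_link))
```

An embedding that minimises such a loss places Must-Link partners near each other and keeps Cannot-Link partners at least `margin` apart, after which any ordinary clustering method can be run in the embedded space.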
Due to the widespread use of the Internet, customer information is vulnerable to attacks on computer systems, which creates an urgent need for intrusion detection technology. Network intrusion detection has recently become one of the most important technologies in network security. Existing methods have reached high accuracy, but their efficiency is very low, even for the most popular SOM neural network method. This paper proposes an efficient and fast network intrusion detection method. First, the fundamentals of the two underlying methods are introduced. Then a self-organizing feature map neural network based on K-means clustering (KSOM) is presented to improve the efficiency of network intrusion detection. Finally, the NSL-KDD data set is used as the network intrusion data set to demonstrate that KSOM significantly reduces the number of clustering iterations compared with the SOM method without substantially affecting the clustering results, and that its accuracy is much higher than that of the K-means method. The experimental results show that the method improves the accuracy of network intrusion detection and significantly reduces the number of clustering iterations.
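
One plausible reading of combining K-means with a SOM is that K-means centroids seed the SOM codebook, so the map starts near the cluster structure and needs fewer iterations. The sketch below illustrates that reading only. The abstract does not specify how KSOM couples the two methods, and the function names, the 1-D chain topology, the deterministic initialisation, and the linear decay schedules are all assumptions.

```python
import math


def nearest(point, centers):
    """Index of the center closest to `point` (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda k: sum((p - c) ** 2 for p, c in zip(point, centers[k])))


def kmeans(data, k, iters=10):
    """Plain K-means; centers start at the first k points (assumption)."""
    centers = [list(p) for p in data[:k]]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in data:
            groups[nearest(p, centers)].append(p)
        for idx, group in enumerate(groups):
            if group:  # keep the old center if a cluster is empty
                centers[idx] = [sum(col) / len(group) for col in zip(*group)]
    return centers


def train_som(data, weights, epochs=20, lr0=0.5, radius0=0.5):
    """Train a 1-D chain SOM starting from the given codebook `weights`."""
    w = [list(u) for u in weights]
    for t in range(epochs):
        frac = 1.0 - t / epochs          # linear decay of rate and radius
        lr, radius = lr0 * frac, max(radius0 * frac, 1e-9)
        for p in data:
            bmu = nearest(p, w)          # best-matching unit
            for k in range(len(w)):
                h = math.exp(-((k - bmu) ** 2) / (2 * radius ** 2))
                w[k] = [wi + lr * h * (pi - wi) for wi, pi in zip(w[k], p)]
    return w


data = [(0, 0), (10, 10), (0, 1), (10, 11), (1, 0), (11, 10)]
codebook = train_som(data, kmeans(data, k=2))
labels = [nearest(p, codebook) for p in data]
```

Here `labels` assigns each sample to its best-matching unit after training; seeding from `kmeans` replaces the random codebook initialisation a plain SOM would normally use.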
A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only exact numerical inputs but also inexact or fuzzy non-numerical inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to Kohonen's original algorithm in clustering performance and learning rate.
The traditional K-means clustering algorithm has difficulty determining the number of clusters, is sensitive to the initialization of the cluster centers, and easily falls into local optima. This paper proposes SOM&WPSO (Self-Organization Map and Weight Particle Swarm Optimization), a clustering algorithm based on a self-organizing mapping network and weight particle swarm optimization. First, the algorithm uses the competitive learning mechanism of the self-organizing mapping network to divide the data samples into coarse clusters and obtain the cluster centers. The obtained cluster centers are then used as the initialization parameters of the weight particle swarm optimization algorithm. In WPSO, the particle position, which is traditionally determined by the cluster center, is instead determined by the sample weight, and the cluster center serves as the "food" of the particle swarm. Each particle moves toward the nearest cluster center. Each iteration optimizes the particle positions and velocities and uses K-means and K-medoids to recalculate the cluster centers and cluster partitions until the algorithm converges. Extensive experiments on commonly used UCI data sets show that the algorithm overcomes the shortcomings of K-means, namely its dependence on the initial cluster centers, improves clustering accuracy, and avoids falling into local optima. The algorithm has good global convergence.
A comprehensive understanding of the spatial distribution and clustering patterns of gravels is of great significance for ecological restoration and monitoring. However, traditional methods for studying gravels are inefficient and error-prone. This study investigated the spatial distribution and cluster characteristics of gravels in the grasslands of the northern Tibetan Plateau using digital image processing technology combined with a self-organizing map (SOM) and multivariate statistical methods. The correlation of the morphological parameters of gravels between different cluster groups and the environmental factors affecting gravel distribution were also analyzed. The results showed that the morphological characteristics of gravels in the northern region (cluster C) and the southern region (cluster B) of the Tibetan Plateau were similar, with low gravel coverage, small gravel diameter, and elongated shape. These regions were mainly distributed in high mountainous areas with large topographic relief. The central region (cluster A) had high gravel coverage with larger diameters and was mainly distributed in high-altitude plains with smaller undulation. Principal component analysis (PCA) showed that the gravel distribution of cluster A may be mainly affected by vegetation, while that of clusters B and C may be mainly affected by topography, climate, and soil. The study confirmed that the combination of digital image processing technology and SOM can effectively analyze the spatial distribution characteristics of gravels, providing a new approach for gravel research.
To solve the mapping problem for mobile robots in unknown environments, a dynamic growing self-organizing map algorithm with automatic growing-threshold tuning (DGSOMGT), based on the self-organizing map, is proposed. It introduces a spread factor to describe how the growing threshold changes dynamically. The network structure grows through training as the mobile robot moves continuously through the unknown environment. The algorithm adjusts the growing-threshold value as the number of network neurons increases, which avoids repeated manual parameter tuning. Experimental results show that the proposed method detects complex environments quickly, effectively, and correctly, so that the robot can map its environment automatically. Compared with other methods, the proposed mapping strategy has better topological properties and time performance.
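
For context on how a spread factor relates to a growing threshold, the earlier growing-SOM (GSOM) literature defines the threshold as GT = -D * ln(SF), where D is the input dimension and SF lies strictly between 0 and 1. The sketch below shows only that baseline formula. The abstract does not state how DGSOMGT adjusts the threshold as neurons are added, so nothing here should be read as the proposed tuning rule, and the function name is hypothetical.

```python
import math


def growth_threshold(dim, spread_factor):
    """Baseline GSOM growing threshold GT = -D * ln(SF), with 0 < SF < 1.

    A node whose accumulated quantization error exceeds this value
    is allowed to spawn new neighbours.
    """
    if not 0.0 < spread_factor < 1.0:
        raise ValueError("spread_factor must lie strictly between 0 and 1")
    return -dim * math.log(spread_factor)
```

A spread factor close to 1 gives a small threshold and a map that grows readily, while a spread factor close to 0 gives a large threshold and a compact map.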
Designing a product platform can be an effective and efficient solution for manufacturing firms. Product platforms enable firms to offer increased product variety to the marketplace with as little variety between products as possible. Consumer products and modules already developed within a firm can be investigated further to assess the possibility of creating a product platform. A bottom-up method for module-based product platforms is proposed, based on mapping, clustering, and matching analysis. The framework and the parametric model of the method are presented and consist of three steps: (1) mapping parameters from existing product families to functional modules, (2) clustering the modules within existing module families based on their parameters to generate module clusters, and selecting the satisfactory module clusters based on commonality, and (3) matching the parameters of the module clusters to the functional modules in order to capture platform elements. In addition, a parameter matching criterion and a mismatching treatment are put forward to ensure the effectiveness of the platform process, and the standardization and serialization of the platform elements are presented. A design case of a belt conveyor is studied to demonstrate the feasibility of the proposed method.
Objective: To identify the incidence rate, relative risk, hotspot regions, and incidence trend of COVID-19 in Qom province, in the northwest part of Iran, in the first stage of the pandemic. Methods: The study included 1125 officially reported PCR-confirmed cases of COVID-19 from 20 February 2020 to 20 April 2020 in 90 regions of Qom city, Iran. A Bayesian hierarchical spatial model was used to model the relative risk of COVID-19 in Qom city, and a segmented regression model was used to estimate the trend of the COVID-19 incidence rate. A Poisson distribution was applied to the observed number of COVID-19 cases, and independent Gamma priors were used for inference on the log-relative risk parameters of the model. Results: The total incidence rate of COVID-19 was estimated at 89.5 per 100 000 persons in Qom city (95% CI: 84.3, 95.1). According to the Bayesian hierarchical spatial model and posterior probabilities, 43.33% of the regions in Qom city had a relative risk greater than 1; however, only 11.11% of them were significantly greater than 1. Based on Geographic Information Systems (GIS) spatial analysis, 10 spatial clusters were detected as active and emerging hotspot areas in the south and central parts of the city. A downward trend was estimated 10 days after the reporting of the first case (February 7, 2020), with the incidence rate decreasing by an average of 4.24% per day (95% CI: −10.7, −3.5). Conclusions: Spatial clusters with high incidence rates of COVID-19 in Qom city were in the south and central regions due to the high population density. GIS can depict the spatial hotspot clusters of COVID-19 for timely surveillance and decision-making as a way to contain the disease.
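
To make the notion of a Bayesian relative risk concrete, the sketch below uses the simplest textbook Poisson-Gamma model, in which a Gamma(shape, rate) prior is placed directly on each region's relative risk and the posterior is again a Gamma distribution. This is a simplification assumed for illustration: the study's hierarchical spatial model and its priors on log-relative risk parameters are not reproduced here, and the function name and default prior values are hypothetical.

```python
def posterior_relative_risk(observed, expected, prior_shape=1.0, prior_rate=1.0):
    """Conjugate Poisson-Gamma update for one region's relative risk.

    Model (assumed): observed ~ Poisson(expected * RR), RR ~ Gamma(shape, rate).
    Posterior: RR ~ Gamma(shape + observed, rate + expected).
    Returns (posterior mean, posterior shape, posterior rate).
    """
    shape = prior_shape + observed
    rate = prior_rate + expected
    return shape / rate, shape, rate


# A region with 10 observed cases where 5 were expected:
# the raw ratio is 10 / 5 = 2.0, and the posterior mean is shrunk toward the prior mean of 1.
mean, shape, rate = posterior_relative_risk(observed=10, expected=5)
```

The shrinkage toward the prior mean is strongest for regions with small expected counts, which is the usual motivation for Bayesian smoothing of regional risk estimates.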
Fault diagnosis and monitoring are very important for complex chemical processes. Numerous methods have been studied in this field, but effective visualization remains challenging. To obtain a better visualization effect, a novel fault diagnosis method that combines the self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA reduces the dimension of the data by maximizing the separability of the classes. After feature extraction by FDA, the SOM can clearly distinguish the different states on the output map and can also be employed to monitor abnormal states. The Tennessee Eastman (TE) process is used to illustrate the fault diagnosis and monitoring performance of the proposed method. The results show that the SOM integrated with FDA is efficient and capable of real-time monitoring and fault diagnosis in complex chemical processes.
To design microstructure and microhardness in the additive manufacturing (AM) of nickel (Ni)-based superalloys, the present work develops a novel data-driven approach that combines physics-based models, experimental measurements, and a data-mining method. The simulation is based on a computational thermal-fluid dynamics (CtFD) model, which can obtain the thermal behavior, solidification parameters such as the cooling rate, and the dilution of the solidified clad. Based on the computed thermal information, dendrite arm spacing and microhardness are estimated using well-tested mechanistic models. Experimental microstructure and microhardness are determined and compared with the simulated values for validation. To visualize process-structure-properties (PSP) linkages, the simulation and experimental datasets are input to a data-mining model, a self-organizing map (SOM). The design windows of the process parameters under multiple objectives can be obtained from the visualized maps. The proposed approach can be used in AM and other data-intensive processes. Data-driven linkages between process, structure, and properties have the potential to benefit online process monitoring and control in order to obtain an ideal microstructure and mechanical properties.
Patterns of South China Sea (SCS) circulation variability are extracted from merged satellite altimetry data from October 1992 through August 2004 using the self-organizing map (SOM). The annual cycle and the seasonal and interannual variations of the SCS surface circulation are identified through the evolution of the characteristic circulation patterns. The annual cycle of the SCS general circulation patterns is described as a change between two opposite basin-scale, SW-NE oriented gyres embedded with eddies: low sea surface height anomaly (SSHA) (cyclonic) in winter and high SSHA (anticyclonic) in the summer half year. The transition starts in July-August (January-February) with a high (low) SSHA tongue east of Vietnam around 12°-14°N, which develops into a large anticyclonic (cyclonic) gyre while moving eastward to the deep basin. During the transitions, a dipole structure, cyclonic (anticyclonic) in the north and anticyclonic (cyclonic) in the south, may form southeast of Vietnam with a strong zonal jet around 10°-12°N. The seasonal variation is modulated by interannual variations. Besides the strong 1997/1998 event in response to the peak Pacific El Niño in 1997, the overall SCS sea level rose significantly during 1999-2001; in summer 2004, however, the overall SCS sea level was lower and the basin-wide anticyclonic gyre was weaker than in other years.
Water resources are scarce in arid and semiarid areas, which not only limits economic development but also threatens human survival. The local communities around the Hangjinqi gasfield depend on groundwater for their water supply. A clear understanding of the hydrogeochemical characteristics of the groundwater, its quality, and its seasonal cycle is indispensable for groundwater protection and management. In this study, self-organizing maps were used in combination with the quantization and topographic errors and the K-means clustering method to investigate groundwater chemistry datasets. Piper and Gibbs diagrams and the saturation index were systematically applied to investigate the hydrogeochemical characteristics of groundwater in both the rainy and dry seasons. Further, entropy-weighted theory was used to characterize groundwater quality and assess its seasonal variability and suitability for drinking. The hydrochemical groundwater dataset, consisting of 10 parameters measured during both the dry and rainy seasons, was classified into 6 clusters, and the Piper diagram revealed three hydrochemical facies: Cl-Na type (clusters 1, 2 and 3), mixed type (clusters 4 and 5), and HCO3-Ca type (cluster 6). The Gibbs diagram and saturation index suggested that weathering of rock-forming minerals was the primary process controlling groundwater chemical composition and validated the credibility and practicality of the clustering results. Two-thirds of the 45 groundwater samples were categorized as excellent or good quality and were suitable as drinking water. Changes of cluster membership from the dry season to the rainy season were detected in approximately 78% of the collected samples. The main factors affecting groundwater quality were hydrogeochemical characteristics, and groundwater quality was better in the dry season than in the rainy season. These results can be used to investigate the seasonal variation of hydrogeochemical characteristics and to assess water quality accurately in other similar areas.
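
The entropy-weighting step mentioned above can be sketched with the standard entropy weight method, in which indicators that vary more across samples receive larger weights. This is a minimal illustration under stated assumptions: the abstract does not give the study's exact normalisation or index formula, the function name is hypothetical, and the input is assumed to be non-negative with a positive sum in every column and at least one column that is not constant.

```python
import math


def entropy_weights(matrix):
    """Entropy weight method (sketch).

    matrix: list of rows (samples), each a list of non-negative indicator values.
    Returns one weight per column; the weights sum to 1.
    """
    n = len(matrix)
    divergences = []
    for col in zip(*matrix):
        total = sum(col)                      # assumed > 0
        entropy = 0.0
        for x in col:
            p = x / total                     # share of this sample in the column
            if p > 0:
                entropy -= p * math.log(p)
        entropy /= math.log(n)                # scale to [0, 1]
        divergences.append(1.0 - entropy)     # more variation gives larger value
    norm = sum(divergences)                   # assumed > 0
    return [d / norm for d in divergences]


# Column 0 is constant across samples, so it carries no information.
weights = entropy_weights([[1, 1], [1, 3], [1, 8]])
```

In practice the indicators are usually rescaled (for example by min-max normalisation) before this step; the sketch skips that and works on the raw values.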
A new method to detect multiple outliers in multivariate data is proposed. It combines minimum subsets, resampling, and the self-organizing map (SOM) algorithm introduced by Kohonen, which provides a robust neural network approach. In this method, the number and organization of the neurons are selected according to the characteristics of the spectra; for example, spectral data often change linearly with the concentration of the components and are often measured repeatedly. The spatial distribution of the neurons can therefore be arranged according to these characteristics. With this method, all the outliers in the spectra can be detected, which the traditional method cannot achieve, and the computation is faster than with the traditional neural network method. The simulation and experimental results show that the method is simple, effective, and intuitive, and that all the outliers in the spectra can be detected in a short time. It is useful in combination with a regression model in near-infrared research.
A multivariate method for fault diagnosis and process monitoring is proposed. The technique is based on a statistical pattern (SP) framework integrated with a self-organizing map (SOM). An SP-based SOM is used as a classifier to distinguish various states on the output map, which makes it possible to monitor abnormal states visually. A case study of the Tennessee Eastman (TE) process is presented to demonstrate the fault diagnosis and process monitoring performance of the proposed method. The results show that the SP-based SOM method is a visual tool for real-time monitoring and fault diagnosis that can be used in complex chemical processes. Compared with other SOM-based methods, the proposed method can monitor and diagnose faults more efficiently.
Classifying high-dimensional hyperspectral images is a challenging task because of the spectral feature vectors. The high correlation between these features and the noise greatly affects classification performance. To overcome this, dimensionality reduction techniques are widely used. Numerous deep learning models have recently been proposed for traditional image processing applications, but their features are less explored in hyperspectral image classification. This work therefore presents a depth-wise convolutional neural network for efficient hyperspectral image classification. To handle the dimensionality issue in the classification process, a self-organized map model optimized with a water strider optimization algorithm is employed. The water strider optimization tunes the network parameters of the self-organized map, which reduces the dimensionality issues and enhances classification performance. Standard datasets such as Indian Pines and the University of Pavia (UP) are used for experimental analysis. Existing dimensionality reduction methods, including Enhanced Hybrid-Graph Discriminant Learning (EHGDL), local geometric structure Fisher analysis (LGSFA), Discriminant Hyper-Laplacian projection (DHLP), the Group-based tensor model (GBTM), and Lower rank tensor approximation (LRTA), are compared with the proposed optimized SOM model. The results confirm the superior performance of the proposed model, with 98.22% accuracy on the Indian Pines dataset and 98.21% accuracy on the University of Pavia dataset, over the existing maximum likelihood classifier and support vector machine (SVM).
Characterizing unknown groundwater contaminant sources in terms of location, magnitude, and duration of source activity is a complex problem. In this study, an alternative to previously proposed methodologies is developed to increase the efficiency and accuracy of source characterization. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO), uses the capabilities of the Self Organizing Map (SOM) algorithm to design surrogate models and adaptive surrogate models for source characterization. Its most important advantage is that it can be used directly for groundwater contaminant characterization without a linked simulation-optimization model. Validation of the SOM-based surrogate models and SOM-based adaptive surrogate models demonstrates that the quantity and quality of the initial samples, as well as the designed monitoring locations, play a crucial role in the accuracy of the solutions. The performance of the proposed methodology is evaluated using both error-free and erroneous concentration measurement data. The results demonstrate that the methodology can approximate groundwater flow and transport simulation models and substitute for the optimization model in characterizing unknown groundwater contaminant sources in terms of location, magnitude, and duration of source activity.
Considering that the growing hierarchical self-organizing map (GHSOM) ignores the influence of individual components in sample vector analysis and has a relatively low accuracy rate in detecting unknown network attacks, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method in accurately clustering the input data. Based on the different clusters, the complex relationships within the data can be revealed effectively.
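
Mutual information measures how much knowing one variable reduces uncertainty about another, which is one way to quantify the influence of an individual vector component. The sketch below computes it for two discrete sequences. How the improved GHSOM actually uses mutual information is not described in the abstract, so the suggestion that such scores could weight components is an assumption, and the function name is hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), count in pxy.items():
        # p(x,y) * log( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (count / n) * math.log(count * n / (px[x] * py[y]))
    return mi


# A feature that determines the label carries ln(2) nats for a balanced binary label;
# a feature independent of the label carries none.
informative = mutual_information([0, 0, 1, 1], ["normal", "normal", "attack", "attack"])
uninformative = mutual_information([0, 1, 0, 1], ["normal", "normal", "attack", "attack"])
```

Continuous features would need to be discretised before applying this estimator.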
DNS (domain name system) query log analysis has been a popular research topic in recent years. CLOPE, a representative transactional clustering algorithm, can readily be used for DNS query log mining, but it is inefficient when processing large-scale data. The MR-CLOPE algorithm is proposed as an extension and improvement of CLOPE based on MapReduce. Unlike previous parallel clustering methods, it uses a two-stage MapReduce implementation framework in which each stage is implemented by one kind of MapReduce task. In the first stage, the DNS query logs are divided into multiple splits and the CLOPE algorithm is executed on each split. The second stage usually iterates many times to merge the small clusters into larger, satisfactory ones. Across these two stages, a novel partition process is designed to randomly spread out the original sub-clusters, which are then moved and merged in the map phase of the second stage according to the defined merge criteria. In this way, the advantages of the original CLOPE algorithm are kept and its disadvantages are addressed, achieving better clustering performance. The experimental results show that MR-CLOPE is not only faster than CLOPE but also produces better clustering quality on DNS query logs.
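
The criterion that CLOPE optimises can be sketched as follows. For each cluster, S is the total number of item occurrences, W is the number of distinct items, and N is the number of transactions; the profit of a clustering is the N-weighted sum of S / W^r divided by the total number of transactions, with r the repulsion parameter. The sketch covers only this criterion as described in the original CLOPE literature. It does not show the MapReduce stages or the merge criteria of MR-CLOPE, and it assumes each transaction is a set of distinct items.

```python
from collections import Counter


def clope_profit(clusters, r=2.0):
    """CLOPE profit of a clustering.

    clusters: list of clusters, each a list of transactions (sets of items).
    r: repulsion; larger values favour tighter clusters with more item overlap.
    """
    numerator = 0.0
    total = 0
    for transactions in clusters:
        if not transactions:
            continue
        counts = Counter(item for t in transactions for item in t)
        size = sum(counts.values())   # S: total item occurrences
        width = len(counts)           # W: number of distinct items
        numerator += size / width ** r * len(transactions)
        total += len(transactions)
    return numerator / total if total else 0.0


# Two ways of grouping the same five transactions.
grouping_a = [[{"a", "b"}, {"a", "b", "c"}, {"a", "c", "d"}], [{"d", "e"}, {"d", "e", "f"}]]
grouping_b = [[{"a", "b"}, {"a", "b", "c"}], [{"a", "c", "d"}, {"d", "e"}, {"d", "e", "f"}]]
better = clope_profit(grouping_a) > clope_profit(grouping_b)
```

Here `grouping_a` scores higher because its clusters share more items per distinct item, which is the kind of comparison a merge step would need to make when deciding whether to combine sub-clusters.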
Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk to the safety of urban inhabitants. Widespread waterlogging disasters have occurred almost annually in the urban area of Beijing, the capital of China. Based on a self-organizing map (SOM) artificial neural network (ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as gross domestic product (GDP), population density, and traffic congestion, were used as input datasets. The results indicate that SOM-ANN is suitable for automatically and quantitatively assessing the risks associated with waterlogging. Its greatest advantage in waterlogging risk assessment is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors, producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points assigned risk grades of IV or V were located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.
The self-organizing map method is applied to satellite-derived sea-level anomaly fields for 1993-2012 to study variations of the Kuroshio intrusion northeast of Taiwan Island. Four major features are revealed, showing significant seasonal variability of the intrusion. In general, the intrusion increases (decreases) with a high (low) sea-level anomaly at the edge of the East China Sea shelf in winter (summer). Open-ocean mesoscale eddies play an additional role in modulating the seasonal variation of the intrusion. Further analyses are needed to study eddy-Kuroshio interaction dynamics.
Funding: Funded by the National Natural Science Foundation of China (41971226, 41871357), the Major Research and Development and Achievement Transformation Projects of Qinghai, China (2022-QY-224), and the Strategic Priority Research Program of the Chinese Academy of Sciences (XDA28110502, XDA19030303).
Abstract: A comprehensive understanding of the spatial distribution and clustering patterns of gravels is of great significance for ecological restoration and monitoring. However, traditional methods for studying gravels are inefficient and error-prone. This study investigated the spatial distribution and cluster characteristics of gravels in the grassland of the northern Tibetan Plateau using digital image processing technology combined with a self-organizing map (SOM) and multivariate statistical methods. Moreover, the correlation of morphological parameters of gravels between different cluster groups and the environmental factors affecting gravel distribution were analyzed. The results showed that the morphological characteristics of gravels in the northern region (cluster C) and the southern region (cluster B) of the Tibetan Plateau were similar, with low gravel coverage, small gravel diameter, and elongated shape. These regions were mainly distributed in high mountainous areas with large topographic relief. The central region (cluster A) had high coverage of gravels with a larger diameter, mainly distributed in high-altitude plains with smaller undulation. Principal component analysis (PCA) results showed that the gravel distribution of cluster A may be mainly affected by vegetation, while the distributions of clusters B and C could be mainly affected by topography, climate, and soil. The study confirmed that the combination of digital image processing technology and SOM could effectively analyze the spatial distribution characteristics of gravels, providing a new approach for gravel research.
Abstract: To solve the mapping problem for mobile robots in unknown environments, a dynamic growing self-organizing map algorithm with automatic growing-threshold tuning (DGSOMGT), based on the self-organizing map, is proposed. It introduces a spread factor to describe how the growing threshold changes dynamically. The network structure grows through training as the mobile robot moves continuously through the unknown environment. The algorithm adjusts the growing-threshold value as the number of network neurons increases, which avoids repeated manual parameter tuning. The experimental results show that the proposed method detects complex environments quickly, effectively, and correctly, and that the robot can map the environment automatically. Compared with other methods, the proposed mapping strategy has better topological properties and time performance.
Funding: Project (9140A18010210KG01) supported by the Departmental Pre-research Fund of China.
Abstract: Designing a product platform can be an effective and efficient solution for manufacturing firms. Product platforms enable firms to provide increased product variety for the marketplace with as little variety between products as possible. Consumer products and modules already developed within a firm can be investigated further to determine whether a product platform can be created. A bottom-up method is proposed for building a module-based product platform through mapping, clustering, and matching analysis. The framework and the parametric model of the method are presented. The method consists of three steps: (1) mapping parameters from existing product families to functional modules; (2) clustering the modules within existing module families based on their parameters so as to generate module clusters, and selecting the satisfactory module clusters based on commonality; and (3) matching the parameters of the module clusters to the functional modules in order to capture platform elements. In addition, a parameter matching criterion and a treatment for mismatches are put forward to ensure the effectiveness of the platform process, and standardization and serialization of the platform elements are presented. A design case of a belt conveyor is studied to demonstrate the feasibility of the proposed method.
Abstract: Objective: To identify the incidence rate, relative risk, hotspot regions, and incidence trend of COVID-19 in Qom province, northwest part of Iran, in the first stage of the pandemic. Methods: The study included 1125 officially reported PCR-confirmed cases of COVID-19 from 20 February 2020 to 20 April 2020 in 90 regions in Qom city, Iran. A Bayesian hierarchical spatial model was used to model the relative risk of COVID-19 in Qom city, and a segmented regression model was used to estimate the trend of the COVID-19 incidence rate. The Poisson distribution was applied to the observed number of COVID-19 cases, and an independent Gamma prior was used for inference on the log-relative risk parameters of the model. Results: The total incidence rate of COVID-19 was estimated at 89.5 per 100000 persons in Qom city (95% CI: 84.3, 95.1). According to the results of the Bayesian hierarchical spatial model and posterior probabilities, 43.33% of the regions in Qom city had a relative risk greater than 1; however, only 11.11% of them were significantly greater than 1. Based on Geographic Information Systems (GIS) spatial analysis, 10 spatial clusters were detected as active and emerging hotspot areas in the south and central parts of the city. The downward trend was estimated to begin 10 days after the reporting of the first case (February 7, 2020), and the incidence rate decreased by an average of 4.24% per day (95% CI: −10.7, −3.5). Conclusions: Spatial clusters with high incidence rates of COVID-19 in Qom city were in the south and central regions due to the high population density. GIS could depict the spatial hotspot clusters of COVID-19 for timely surveillance and decision-making as a way to contain the disease.
Funding: Supported by the National Basic Research Program of China (2013CB733600), the National Natural Science Foundation of China (21176073), the Doctoral Fund of Ministry of Education of China (20090074110005), the Program for New Century Excellent Talents in University (NCET-09-0346), the Shu Guang Project (09SG29), and the Fundamental Research Funds for the Central Universities.
Abstract: Fault diagnosis and monitoring are very important for complex chemical processes. Numerous methods have been studied in this field, but effective visualization remains challenging. To obtain a better visualization effect, a novel fault diagnosis method that combines a self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA reduces the dimension of the data by maximizing the separability of the classes. After feature extraction by FDA, the SOM can clearly distinguish the different states on the output map, and it can also be employed to monitor abnormal states. The Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method. The results show that the SOM integrated with FDA is efficient and capable of real-time monitoring and fault diagnosis in complex chemical processes.
Funding: Jian Cao, Gregory J. Wagner, and Wing K. Liu acknowledge support from the National Science Foundation (NSF) Cyber-Physical Systems (CPS) (CPS/CMMI-1646592). Hengyang Li acknowledges support from the Northwestern Data Science Initiative (DSI 171474500210043324). Jian Cao, Gregory J. Wagner, Wing K. Liu, Jennifer L. Bennett, and Sarah J. Wolff acknowledge support from the Digital Manufacturing and Design Innovation Institute (DMDII 15-07). Jian Cao, Wing K. Liu, Zhengtao Gan, and Jennifer L. Bennett acknowledge support from the Center for Hierarchical Materials Design (CHiMaD 70NANB14H012). This work made use of facilities at DMG MORI and Northwestern University. It also made use of the MatCI Facility, which receives support from the MRSEC Program (NSF DMR-1720139) of the Materials Research Center at Northwestern University.
Abstract: To design microstructure and microhardness in the additive manufacturing (AM) of nickel (Ni)-based superalloys, the present work develops a novel data-driven approach that combines physics-based models, experimental measurements, and a data-mining method. The simulation is based on a computational thermal-fluid dynamics (CtFD) model, which can obtain the thermal behavior, solidification parameters such as cooling rate, and the dilution of the solidified clad. Based on the computed thermal information, dendrite arm spacing and microhardness are estimated using well-tested mechanistic models. Experimental microstructure and microhardness are determined and compared with the simulated values for validation. To visualize process-structure-property (PSP) linkages, the simulation and experimental datasets are input to a data-mining model, a self-organizing map (SOM). The design windows of the process parameters under multiple objectives can be obtained from the visualized maps. The proposed approaches can be utilized in AM and other data-intensive processes. Data-driven linkages between process, structure, and properties have the potential to benefit online process monitoring and control in order to derive an ideal microstructure and mechanical properties.
Funding: National Basic Research Program of China under contract No. 2007CB816003; the Key International Co-operative Project of the National Natural Science Foundation of China under contract No. 40510073; the International Cooperative Project of the Ministry of Science and Technology of China under contract No. 2006DFB21630.
Abstract: Patterns of the South China Sea (SCS) circulation variability are extracted from merged satellite altimetry data from October 1992 through August 2004 by using the self-organizing map (SOM). The annual cycle and the seasonal and interannual variations of the SCS surface circulation are identified through the evolution of the characteristic circulation patterns. The annual cycle of the SCS general circulation patterns is described as a change between two opposite basin-scale SW-NE oriented gyres embedded with eddies: low sea surface height anomaly (SSHA) (cyclonic) in the winter half year and high SSHA (anticyclonic) in the summer half year. The transition starts in July-August (January-February) with a high (low) SSHA tongue east of Vietnam around 12°-14° N, which develops into a large anticyclonic (cyclonic) gyre while moving eastward to the deep basin. During the transitions, a dipole structure, cyclonic (anticyclonic) in the north and anticyclonic (cyclonic) in the south, may form southeast of Vietnam with a strong zonal jet around 10°-12° N. The seasonal variation is modulated by the interannual variations. Besides the strong 1997/1998 event in response to the peak Pacific El Niño in 1997, the overall SCS sea level is found to have risen significantly during 1999-2001; in summer 2004, however, the overall SCS sea level was lower and the basin-wide anticyclonic gyre was weaker than in other years.
Funding: The National Natural Science Foundation of China (Nos. 41972259 and 41572227) and the National Key Research and Development Program of China (No. 2018YFC0406404).
Abstract: Water resources are scarce in arid and semiarid areas, which not only limits economic development but also threatens the survival of mankind. The local communities around the Hangjinqi gasfield depend on groundwater for their water supply. A clear understanding of the hydrogeochemical characteristics of the groundwater, its quality, and its seasonal cycle is indispensable for groundwater protection and management. In this study, self-organizing maps were used in combination with the quantization and topographic errors and the K-means clustering method to investigate groundwater chemistry datasets. Piper and Gibbs diagrams and the saturation index were systematically applied to investigate the hydrogeochemical characteristics of groundwater in both the rainy and dry seasons. Further, the entropy-weighted theory was used to characterize groundwater quality and assess its seasonal variability and suitability for drinking purposes. The hydrochemical groundwater dataset, consisting of 10 parameters measured during both dry and rainy seasons, was classified into 6 clusters, and the Piper diagram revealed three hydrochemical facies: Cl-Na type (clusters 1, 2, and 3), mixed type (clusters 4 and 5), and HCO3-Ca type (cluster 6). The Gibbs diagram and saturation index suggested that weathering of rock-forming minerals was the primary process controlling groundwater chemical composition and validated the credibility and practicality of the clustering results. Two-thirds of the 45 groundwater samples were categorized as excellent or good quality and were suitable as drinking water. Changes in cluster assignment from the dry season to the rainy season were detected in approximately 78% of the collected samples. The main factors affecting groundwater quality were the hydrogeochemical characteristics, and dry-season groundwater quality was better than rainy-season groundwater quality. The results of this work can be used to investigate the seasonal variation of hydrogeochemical characteristics and to assess water quality accurately in other similar areas.
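The quantization and topographic errors mentioned above are the two usual quality measures for a trained SOM. The following is a minimal sketch of their common definitions, not the study's own code. The quantization error is the mean distance from each sample to its best matching unit (BMU). The topographic error is the fraction of samples whose best and second-best units are not neighbours on the grid. The grid layout (a dict from `(row, col)` to a weight vector), the 8-neighbourhood adjacency rule, and the toy values are assumptions made for illustration.

```python
import math


def bmu_pair(sample, weights):
    """Grid coordinates of the best and second-best matching units."""
    ranked = sorted(weights, key=lambda pos: math.dist(sample, weights[pos]))
    return ranked[0], ranked[1]


def quantization_error(data, weights):
    """Mean Euclidean distance between each sample and its BMU weight vector."""
    total = sum(math.dist(x, weights[bmu_pair(x, weights)[0]]) for x in data)
    return total / len(data)


def topographic_error(data, weights):
    """Share of samples whose two closest units are not adjacent on the grid."""
    errors = 0
    for x in data:
        (r1, c1), (r2, c2) = bmu_pair(x, weights)
        # Assumption: units count as adjacent within the 8-neighbourhood.
        if max(abs(r1 - r2), abs(c1 - c2)) > 1:
            errors += 1
    return errors / len(data)


# Toy 1x3 maps: "ordered" preserves topology, "folded" does not.
ordered = {(0, 0): [0.0], (0, 1): [1.0], (0, 2): [2.0]}
folded = {(0, 0): [0.0], (0, 1): [2.0], (0, 2): [1.0]}
samples = [[0.1], [1.9]]

qe = quantization_error(samples, ordered)
te_ordered = topographic_error(samples, ordered)
te_folded = topographic_error(samples, folded)
```

A low quantization error indicates that the map units represent the samples closely, and a low topographic error indicates that neighbouring units represent neighbouring regions of the data. Both are often compared across candidate map sizes before clustering the units with K-means.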
Abstract: A new method to detect multiple outliers in multivariate data is proposed. It combines minimum subsets, resampling, and the self-organizing map (SOM) algorithm introduced by Kohonen, which provides a robust neural-network approach. In this method, the number and organization of the neurons are selected according to the characteristics of the spectra; for example, spectral data often change linearly with the concentration of the components and are often measured repeatedly. The spatial distribution of the neurons can therefore be arranged according to these characteristics. With this method, all the outliers in the spectra can be detected, which the traditional method cannot achieve, and the computation is faster than that of the traditional neural network method. The results of the simulation and the experiment show that the method is simple, effective, and intuitive, and that all the outliers in the spectra can be detected in a short time. It is useful when combined with a regression model in near-infrared research.
Funding: Project (2013CB733605) supported by the National Basic Research Program of China; Project (21176073) supported by the National Natural Science Foundation of China; Project supported by the Fundamental Research Funds for the Central Universities, China.
Abstract: A multivariate method for fault diagnosis and process monitoring is proposed. This technique is based on a statistical pattern (SP) framework integrated with a self-organizing map (SOM). An SP-based SOM is used as a classifier to distinguish various states on the output map, which makes it possible to monitor abnormal states visually. A case study of the Tennessee Eastman (TE) process is presented to demonstrate the fault diagnosis and process monitoring performance of the proposed method. Results show that the SP-based SOM method is a visual tool for real-time monitoring and fault diagnosis that can be used in complex chemical processes. Compared with other SOM-based methods, the proposed method can monitor and diagnose faults more efficiently.
Abstract: High-dimensional hyperspectral image classification is a challenging task because of the spectral feature vectors. The high correlation between these features and the noise greatly affects classification performance. To overcome this, dimensionality reduction techniques are widely used. Numerous deep learning models have recently been proposed for traditional image processing applications. However, in hyperspectral image classification, the features of deep learning models are less explored. Thus, for efficient hyperspectral image classification, a depth-wise convolutional neural network is presented in this work. To handle the dimensionality issue in the classification process, an optimized self-organized map (SOM) model is employed using a water strider optimization algorithm. The network parameters of the self-organized map are optimized by water strider optimization, which reduces the dimensionality issues and enhances classification performance. Standard datasets such as Indian Pines and the University of Pavia (UP) are considered for experimental analysis. Existing dimensionality reduction methods such as Enhanced Hybrid-Graph Discriminant Learning (EHGDL), local geometric structure Fisher analysis (LGSFA), Discriminant Hyper-Laplacian projection (DHLP), the Group-based tensor model (GBTM), and Lower rank tensor approximation (LRTA) are compared with the proposed optimized SOM model. The results confirm the superior performance of the proposed model, with 98.22% accuracy on the Indian Pines dataset and 98.21% accuracy on the University of Pavia dataset, over the existing maximum likelihood classifier and support vector machine (SVM).
Abstract: Characterizing unknown groundwater contaminant sources in terms of location, magnitude, and duration of source activity is a complex problem. In this study, an alternative to previously proposed methodologies is developed to increase the efficiency and accuracy of source characterization. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO), uses the capabilities of the Self Organizing Map (SOM) algorithm to design the surrogate models and adaptive surrogate models for source characterization. Its most important advantage is that it can be used directly for groundwater contaminant characterization without a linked simulation-optimization model. The validation of the SOM-based surrogate models and SOM-based adaptive surrogate models demonstrates that the quantity and quality of the initial samples, as well as the designed monitoring locations, play a crucial role in the accuracy of the solutions. The performance of the proposed methodology is evaluated using both error-free and erroneous concentration measurement data. The results demonstrate that the developed methodology can approximate groundwater flow and transport simulation models and can substitute for the optimization model in characterizing unknown groundwater contaminant sources in terms of location, magnitude, and duration of source activity.
Funding: Supported by the Natural Science Foundation of Tianjin (No. 15JCQNJC00200).
Abstract: The growing hierarchical self-organizing map (GHSOM) ignores the influence of individual components in sample vector analysis, and its accuracy in detecting unknown network attacks is relatively low. To address this, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method in accurately clustering the input data. Based on the different clusters, the complex relationships within the data can be revealed effectively.
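The abstract does not describe how mutual information enters the improved GHSOM. As a rough illustration of the quantity itself, the sketch below estimates the mutual information between one discrete feature and a class label from co-occurrence counts. Such a score could then serve as a per-component weight in a distance measure. That use, the function name `mutual_information`, and the toy feature names are assumptions, not details from the paper.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Empirical mutual information I(X;Y) in bits for two discrete sequences."""
    n = len(xs)
    px, py = Counter(xs), Counter(ys)
    pxy = Counter(zip(xs, ys))
    # sum over observed pairs of p(x,y) * log2( p(x,y) / (p(x) p(y)) )
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


# Hypothetical connection records: one informative and one uninformative feature.
labels = ["normal", "normal", "attack", "attack"]
flag_feature = ["SF", "SF", "REJ", "REJ"]       # tracks the label exactly
protocol_feature = ["tcp", "udp", "tcp", "udp"]  # independent of the label

mi_flag = mutual_information(flag_feature, labels)
mi_protocol = mutual_information(protocol_feature, labels)
```

Here the informative feature scores 1 bit and the independent one scores 0, so a weighting based on these scores would let the first component dominate the comparison of sample vectors.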
Funding: Project (61103046) supported in part by the National Natural Science Foundation of China; Project (B201312) supported by the DHU Distinguished Young Professor Program, China; Project (LY14F020007) supported by the Zhejiang Provincial Natural Science Funds of China; Project (2014A610072) supported by the Natural Science Foundation of Ningbo City, China.
Abstract: DNS (domain name system) query log analysis has been a popular research topic in recent years. CLOPE, a representative transactional clustering algorithm, can be readily used for DNS query log mining. However, the algorithm is inefficient when processing large-scale data. The MR-CLOPE algorithm is proposed, which extends and improves CLOPE based on MapReduce. Unlike previous parallel clustering methods, a two-stage MapReduce implementation framework is proposed, in which each stage is implemented by one kind of MapReduce task. In the first stage, the DNS query logs are divided into multiple splits and the CLOPE algorithm is executed on each split. The second stage usually iterates many times to merge the small clusters into larger, satisfactory ones. Between these two stages, a novel partition process is designed to randomly spread out the original sub-clusters, which are then moved and merged in the map phase of the second stage according to the defined merge criteria. In this way, the advantages of the original CLOPE algorithm are kept and its disadvantages are addressed in the proposed framework, yielding better clustering performance. The experimental results show that MR-CLOPE is not only faster than CLOPE but also achieves better clustering quality on DNS query logs.
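For context, CLOPE as originally published scores a clustering of transactions with a "profit" criterion based on each cluster's size S (total item occurrences), width W (number of distinct items), and a repulsion parameter r. The sketch below computes that criterion on a small generic example. It shows the base CLOPE objective only, under the assumption that each transaction is a set of items, and is not the MR-CLOPE merge criterion, which the abstract does not specify. The function name `clope_profit` is illustrative.

```python
def clope_profit(clusters, r=2.0):
    """CLOPE profit: sum_i S(C_i) / W(C_i)**r * |C_i|, divided by sum_i |C_i|.

    clusters: list of clusters, each a list of transactions (sets of items).
    """
    weighted = 0.0
    n_transactions = 0
    for cluster in clusters:
        if not cluster:
            continue  # empty clusters contribute nothing
        size = sum(len(t) for t in cluster)    # S(C): total item occurrences
        width = len(set().union(*cluster))     # W(C): distinct items
        weighted += size / width ** r * len(cluster)
        n_transactions += len(cluster)
    return weighted / n_transactions


# Five small transactions grouped in two different ways.
t1, t2, t3 = {"a", "b"}, {"a", "b", "c"}, {"a", "c", "d"}
t4, t5 = {"d", "e"}, {"d", "e", "f"}

tight = [[t1, t2, t3], [t4, t5]]   # clusters share many items
loose = [[t1, t2], [t3, t4, t5]]   # second cluster is wide and sparse

profit_tight = clope_profit(tight)
profit_loose = clope_profit(loose)
```

With r = 2 the tighter grouping scores about 0.522 and the looser one about 0.414, so the criterion prefers clusters whose transactions overlap heavily. A larger r makes the algorithm less willing to merge transactions that share few items.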
Funding: Supported by the National Key R&D Program of China (Grant No. 2016YFC0401407) and the National Natural Science Foundation of China (Grant Nos. 51479003 and 51279006).
Abstract: Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk to the safety of urban inhabitants. Widespread waterlogging disasters have occurred almost annually in the urban area of Beijing, the capital of China. Based on a self-organizing map (SOM) artificial neural network (ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as gross domestic product (GDP), population density, and traffic congestion, were used as input datasets in this study. The results indicate that SOM-ANN is suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors, producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or V were located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.
Funding: Supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDA11010103), the National Natural Science Foundation of China (Nos. 41222037, 41221063), the Project of Global Change and Air-Sea Interaction (No. GASI-03-01-01-02), the National Basic Research Program of China (973 Program) (No. 2013CB956202), the 111 Project of the Ministry of Education of China (No. B07036), the Natural Science Foundation of Shandong (No. JQ201111), and the National Special Research Fund for Non-Profit Marine Sector (No. 201205018).
Abstract: The self-organizing map method is applied to satellite-derived sea-level anomaly fields of 1993-2012 to study variations of the Kuroshio intrusion northeast of Taiwan Island. Four major features are revealed, showing significant seasonal variability of the intrusion. In general, the intrusion increases (decreases) with a high (low) sea-level anomaly at the edge of the East China Sea shelf in winter (summer). Open-ocean mesoscale eddies play an additional role in modulating the seasonal variation of the intrusion. Further analyses are needed to study eddy-Kuroshio interaction dynamics.