A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. ...A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.展开更多
The feature space extracted from vibration signals with various faults is often nonlinear and of high dimension.Currently,nonlinear dimensionality reduction methods are available for extracting low-dimensional embeddi...The feature space extracted from vibration signals with various faults is often nonlinear and of high dimension.Currently,nonlinear dimensionality reduction methods are available for extracting low-dimensional embeddings,such as manifold learning.However,these methods are all based on manual intervention,which have some shortages in stability,and suppressing the disturbance noise.To extract features automatically,a manifold learning method with self-organization mapping is introduced for the first time.Under the non-uniform sample distribution reconstructed by the phase space,the expectation maximization(EM) iteration algorithm is used to divide the local neighborhoods adaptively without manual intervention.After that,the local tangent space alignment(LTSA) algorithm is adopted to compress the high-dimensional phase space into a more truthful low-dimensional representation.Finally,the signal is reconstructed by the kernel regression.Several typical states include the Lorenz system,engine fault with piston pin defect,and bearing fault with outer-race defect are analyzed.Compared with the LTSA and continuous wavelet transform,the results show that the background noise can be fully restrained and the entire periodic repetition of impact components is well separated and identified.A new way to automatically and precisely extract the impulsive components from mechanical signals is proposed.展开更多
Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annu...Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.展开更多
Due to rapid development in software industry, it was necessary to reduce time and efforts in the software development process. Software Reusability is an important measure that can be applied to improve software deve...Due to rapid development in software industry, it was necessary to reduce time and efforts in the software development process. Software Reusability is an important measure that can be applied to improve software development and software quality. Reusability reduces time, effort, errors, and hence the overall cost of the development process. Reusability prediction models are established in the early stage of the system development cycle to support an early reusability assessment. In Object-Oriented systems, Reusability of software components (classes) can be obtained by investigating its metrics values. Analyzing software metric values can help to avoid developing components from scratch. In this paper, we use Chidamber and Kemerer (CK) metrics suite in order to identify the reuse level of object-oriented classes. Self-Organizing Map (SOM) was used to cluster datasets of CK metrics values that were extracted from three different java-based systems. The goal was to find the relationship between CK metrics values and the reusability level of the class. The reusability level of the class was classified into three main categorizes (High Reusable, Medium Reusable and Low Reusable). The clustering was based on metrics threshold values that were used to achieve the experiments. The proposed methodology succeeds in classifying classes to their reusability level (High Reusable, Medium Reusable and Low Reusable). The experiments show how SOM can be applied on software CK metrics with different sizes of SOM grids to provide different levels of metrics details. The results show that Depth of Inheritance Tree (DIT) and Number of Children (NOC) metrics dominated the clustering process, so these two metrics were discarded from the experiments to achieve a successful clustering. The most efficient SOM topology [2 × 2] grid size is used to predict the reusability of classes.展开更多
Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodol...Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.展开更多
Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means t...Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.展开更多
Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is dis...Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is discussed to obtain a faster convergence rate and better solution.Therefore,a new improved self-organizing map(ISOM)algorithm is introduced and applied to four traveling salesman problem instances for experimental simulation,and then the result of ISOM is compared with those of four SOM algorithms:AVL,KL,KG and MSTSP.Using ISOM,the average error of four travelingsalesman problem instances is only 2.895 0%,which is greatly better than the other four algorithms:8.51%(AVL),6.147 5%(KL),6.555%(KG) and 3.420 9%(MSTSP).Finally,ISOM is applied to two practical problems:the Chinese 100 cities-TSP and102 counties-TSP in Shanxi Province,and the two optimal touring routes are provided to the tourists.展开更多
In this study, we visualize Pareto-optimum solutions derived from multiple-objective optimization using spherical self-organizing maps (SOMs) that lay out SOM data in three dimensions. There have been a wide range of ...In this study, we visualize Pareto-optimum solutions derived from multiple-objective optimization using spherical self-organizing maps (SOMs) that lay out SOM data in three dimensions. There have been a wide range of studies involving plane SOMs where Pareto-optimal solutions are mapped to a plane. However, plane SOMs have an issue that similar data differing in a few specific variables are often placed at far ends of the map, compromising intuitiveness of the visualization. We show in this study that spherical SOMs allow us to find similarities in data otherwise undetectable with plane SOMs. We also implement and evaluate the performance using parallel sphere processing with several GPU environments.展开更多
Pattern recognition of seismic and mor- phostructural nodes plays an important role in seismic hazard assessment. This is a known fact in seismology that tectonic nodes are prone areas to large earthquake and have thi...Pattern recognition of seismic and mor- phostructural nodes plays an important role in seismic hazard assessment. This is a known fact in seismology that tectonic nodes are prone areas to large earthquake and have this potential. They are identified by morphostructural analysis. In this study, the Alborz region has considered as studied case and locations of future events are forecast based on Kohonen Self-Organized Neural Network. It has been shown how it can predict the location of earthquake, and identifies seismogenic nodes which are prone to earthquake of M5.5+ at the West of Alborz in Iran by using International Institute Earthquake Engineering and Seismology earthquake catalogs data. First, the main faults and tectonic lineaments have been identified based on MZ (land zoning method) method. After that, by using pattern recognition, we generalized past recorded events to future in order to show the region of probable future earthquakes. In other word, hazardous nodes have determined among all nodes by new catalog generated Self-organizing feature maps (SOFM). Our input data are extracted from catalog, consists longitude and latitude of past event between 1980-2015 with magnitude larger or equal to 4.5. It has concluded node D1 is candidate for big earthquakes in comparison with other nodes and other nodes are in lower levels of this potential.展开更多
Most methods for classification of remote sensing data are based on the statistical parameter evaluation with the assumption that the samples obey the normal distribution. How-ever, more accurate classification result...Most methods for classification of remote sensing data are based on the statistical parameter evaluation with the assumption that the samples obey the normal distribution. How-ever, more accurate classification results can be obtained with the neural network method through getting knowledge from environments and adjusting the parameter (or weight) step by step by a specific measurement. This paper focuses on the double-layer structured Kohonen self-organizing feature map (SOFM), for which all neurons within the two layers are linked one another and those of the competition layers are linked as well along the sides. Therefore, the self-adapting learning ability is improved due to the effective competition and suppression in this method. The SOFM has become a hot topic in the research area of remote sensing data classi-fication. The Advanced Spaceborne Thermal Emission and Reflectance Radiometer (ASTER) is a new satellite-borne remote sensing instrument with three 15-m resolution bands and three 30-m resolution bands at the near infrared. The ASTER data of Dagang district, Tianjin Munici-pality is used as the test data in this study. At first, the wavelet fusion is carried out to make the spatial resolutions of the ASTER data identical; then, the SOFM method is applied to classifying the land cover types. The classification results are compared with those of the maximum likeli-hood method (MLH). As a consequence, the classification accuracy of SOFM increases about by 7% in general and, in particular, it is almost as twice as that of the MLH method in the town.展开更多
文摘A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.
基金supported by National Natural Science Foundation of China(Grant No.51075323)
文摘The feature space extracted from vibration signals with various faults is often nonlinear and of high dimension.Currently,nonlinear dimensionality reduction methods are available for extracting low-dimensional embeddings,such as manifold learning.However,these methods are all based on manual intervention,which have some shortages in stability,and suppressing the disturbance noise.To extract features automatically,a manifold learning method with self-organization mapping is introduced for the first time.Under the non-uniform sample distribution reconstructed by the phase space,the expectation maximization(EM) iteration algorithm is used to divide the local neighborhoods adaptively without manual intervention.After that,the local tangent space alignment(LTSA) algorithm is adopted to compress the high-dimensional phase space into a more truthful low-dimensional representation.Finally,the signal is reconstructed by the kernel regression.Several typical states include the Lorenz system,engine fault with piston pin defect,and bearing fault with outer-race defect are analyzed.Compared with the LTSA and continuous wavelet transform,the results show that the background noise can be fully restrained and the entire periodic repetition of impact components is well separated and identified.A new way to automatically and precisely extract the impulsive components from mechanical signals is proposed.
基金supported by the National Key R&D Program of China (GrantN o.2016YFC0401407)National Natural Science Foundation of China (Grant Nos. 51479003 and 51279006)
文摘Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.
文摘Due to rapid development in software industry, it was necessary to reduce time and efforts in the software development process. Software Reusability is an important measure that can be applied to improve software development and software quality. Reusability reduces time, effort, errors, and hence the overall cost of the development process. Reusability prediction models are established in the early stage of the system development cycle to support an early reusability assessment. In Object-Oriented systems, Reusability of software components (classes) can be obtained by investigating its metrics values. Analyzing software metric values can help to avoid developing components from scratch. In this paper, we use Chidamber and Kemerer (CK) metrics suite in order to identify the reuse level of object-oriented classes. Self-Organizing Map (SOM) was used to cluster datasets of CK metrics values that were extracted from three different java-based systems. The goal was to find the relationship between CK metrics values and the reusability level of the class. The reusability level of the class was classified into three main categorizes (High Reusable, Medium Reusable and Low Reusable). The clustering was based on metrics threshold values that were used to achieve the experiments. The proposed methodology succeeds in classifying classes to their reusability level (High Reusable, Medium Reusable and Low Reusable). The experiments show how SOM can be applied on software CK metrics with different sizes of SOM grids to provide different levels of metrics details. The results show that Depth of Inheritance Tree (DIT) and Number of Children (NOC) metrics dominated the clustering process, so these two metrics were discarded from the experiments to achieve a successful clustering. The most efficient SOM topology [2 × 2] grid size is used to predict the reusability of classes.
文摘Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.
文摘Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.
文摘Self-organizing map(SOM) proposed by Kohonen has obtained certain achievements in solving the traveling salesman problem(TSP).To improve Kohonen SOM,an effective initialization and parameter modification method is discussed to obtain a faster convergence rate and better solution.Therefore,a new improved self-organizing map(ISOM)algorithm is introduced and applied to four traveling salesman problem instances for experimental simulation,and then the result of ISOM is compared with those of four SOM algorithms:AVL,KL,KG and MSTSP.Using ISOM,the average error of four travelingsalesman problem instances is only 2.895 0%,which is greatly better than the other four algorithms:8.51%(AVL),6.147 5%(KL),6.555%(KG) and 3.420 9%(MSTSP).Finally,ISOM is applied to two practical problems:the Chinese 100 cities-TSP and102 counties-TSP in Shanxi Province,and the two optimal touring routes are provided to the tourists.
文摘In this study, we visualize Pareto-optimum solutions derived from multiple-objective optimization using spherical self-organizing maps (SOMs) that lay out SOM data in three dimensions. There have been a wide range of studies involving plane SOMs where Pareto-optimal solutions are mapped to a plane. However, plane SOMs have an issue that similar data differing in a few specific variables are often placed at far ends of the map, compromising intuitiveness of the visualization. We show in this study that spherical SOMs allow us to find similarities in data otherwise undetectable with plane SOMs. We also implement and evaluate the performance using parallel sphere processing with several GPU environments.
文摘Pattern recognition of seismic and mor- phostructural nodes plays an important role in seismic hazard assessment. This is a known fact in seismology that tectonic nodes are prone areas to large earthquake and have this potential. They are identified by morphostructural analysis. In this study, the Alborz region has considered as studied case and locations of future events are forecast based on Kohonen Self-Organized Neural Network. It has been shown how it can predict the location of earthquake, and identifies seismogenic nodes which are prone to earthquake of M5.5+ at the West of Alborz in Iran by using International Institute Earthquake Engineering and Seismology earthquake catalogs data. First, the main faults and tectonic lineaments have been identified based on MZ (land zoning method) method. After that, by using pattern recognition, we generalized past recorded events to future in order to show the region of probable future earthquakes. In other word, hazardous nodes have determined among all nodes by new catalog generated Self-organizing feature maps (SOFM). Our input data are extracted from catalog, consists longitude and latitude of past event between 1980-2015 with magnitude larger or equal to 4.5. It has concluded node D1 is candidate for big earthquakes in comparison with other nodes and other nodes are in lower levels of this potential.
文摘Most methods for classification of remote sensing data are based on the statistical parameter evaluation with the assumption that the samples obey the normal distribution. How-ever, more accurate classification results can be obtained with the neural network method through getting knowledge from environments and adjusting the parameter (or weight) step by step by a specific measurement. This paper focuses on the double-layer structured Kohonen self-organizing feature map (SOFM), for which all neurons within the two layers are linked one another and those of the competition layers are linked as well along the sides. Therefore, the self-adapting learning ability is improved due to the effective competition and suppression in this method. The SOFM has become a hot topic in the research area of remote sensing data classi-fication. The Advanced Spaceborne Thermal Emission and Reflectance Radiometer (ASTER) is a new satellite-borne remote sensing instrument with three 15-m resolution bands and three 30-m resolution bands at the near infrared. The ASTER data of Dagang district, Tianjin Munici-pality is used as the test data in this study. At first, the wavelet fusion is carried out to make the spatial resolutions of the ASTER data identical; then, the SOFM method is applied to classifying the land cover types. The classification results are compared with those of the maximum likeli-hood method (MLH). As a consequence, the classification accuracy of SOFM increases about by 7% in general and, in particular, it is almost as twice as that of the MLH method in the town.