Faced with increasing global soil degradation,spatially explicit data on cropland soil organic matter(SOM)provides crucial data for soil carbon pool accounting,cropland quality assessment and the formulation of effect...Faced with increasing global soil degradation,spatially explicit data on cropland soil organic matter(SOM)provides crucial data for soil carbon pool accounting,cropland quality assessment and the formulation of effective management policies.As a spatial information prediction technique,digital soil mapping(DSM)has been widely used to spatially map soil information at different scales.However,the accuracy of digital SOM maps for cropland is typically lower than for other land cover types due to the inherent difficulty in precisely quantifying human disturbance.To overcome this limitation,this study systematically assessed a framework of“information extractionfeature selection-model averaging”for improving model performance in mapping cropland SOM using 462 cropland soil samples collected in Guangzhou,China in 2021.The results showed that using the framework of dynamic information extraction,feature selection and model averaging could efficiently improve the accuracy of the final predictions(R^(2):0.48 to 0.53)without having obviously negative impacts on uncertainty.Quantifying the dynamic information of the environment was an efficient way to generate covariates that are linearly and nonlinearly related to SOM,which improved the R^(2)of random forest from 0.44 to 0.48 and the R^(2)of extreme gradient boosting from 0.37to 0.43.Forward recursive feature selection(FRFS)is recommended when there are relatively few environmental covariates(<200),whereas Boruta is recommended when there are many environmental covariates(>500).The Granger-Ramanathan model averaging approach could improve the prediction accuracy and average uncertainty.When the structures of initial prediction models are similar,increasing in the number of averaging models did not have significantly positive effects on the final predictions.Given the advantages of these selected strategies over information extraction,feature selection and model averaging have a great potential for high-accuracy soil mapping at any scales,so this approach can provide more reliable references for soil conservation policy-making.展开更多
One of the main disadvantages of fractal image data compression is a loss time in the process of image compression (encoding) and conversion into a system of iterated functions (IFS). In this paper, the idea of the in...One of the main disadvantages of fractal image data compression is a loss time in the process of image compression (encoding) and conversion into a system of iterated functions (IFS). In this paper, the idea of the inverse problem of fixed point is introduced. This inverse problem is based on collage theorem which is the cornerstone of the mathematical idea of fractal image compression. Then this idea is applied by iterated function system, iterative system functions and grayscale iterated function system down to general transformation. Mathematical formulation form is also provided on the digital image space, which deals with the computer. Next, this process has been revised to reduce the time required for image compression by excluding some parts of the image that have a specific milestone. The neural network algorithms have been applied on the process of compression (encryption). The experimental results are presented and the performance of the proposed algorithm is discussed. Finally, the comparison between filtered ranges method and self-organizing method is introduced.展开更多
Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means t...Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.展开更多
Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In ord...Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In order to get a better visualization effect, a novel fault diagnosis method which combines self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA can reduce the dimension of the data in terms of maximizing the separability of the classes. After feature extraction by FDA, SOM can distinguish the different states on the output map clearly and it can also be employed to monitor abnormal states. Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method. The result shows that the SOM integrated with FDA method is efficient and capable for real-time monitoring and fault diagnosis in complex chemical process.展开更多
Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodol...Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.展开更多
The self-organizing map method is applied to satellite-derived sea-level anomaly fields of1993-2012 to study variations of the Kuroshio intrusion northeast of Taiwan Island.Four major features are revealed,showing sig...The self-organizing map method is applied to satellite-derived sea-level anomaly fields of1993-2012 to study variations of the Kuroshio intrusion northeast of Taiwan Island.Four major features are revealed,showing significant seasonal variability of the intrusion.In general,the intrusion increases(decreases) with a high(low) sea-level anomaly at the edge of the East China Sea shelf in winter(summer).Open-ocean mesoscale eddies play an additional role in modulating the seasonal variation of the intrusion.Further analyses are needed to study eddy-Kuroshio interaction dynamics.展开更多
To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measur...To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measurements,and a data-mining method.The simulation is based on a computational thermal-fluid dynamics(CtFD)model,which can obtain thermal behavior,solidification parameters such as cooling rate,and the dilution of solidified clad.Based on the computed thermal information,dendrite arm spacing and microhardness are estimated using well-tested mechanistic models.Experimental microstructure and microhardness are determined and compared with the simulated values for validation.To visualize process-structure-properties(PSPs)linkages,the simulation and experimental datasets are input to a data-mining model-a self-organizing map(SOM).The design windows of the process parameters under multiple objectives can be obtained from the visualized maps.The proposed approaches can be utilized in AM and other data-intensive processes.Data-driven linkages between process,structure,and properties have the potential to benefit online process monitoring control in order to derive an ideal microstructure and mechanical properties.展开更多
A multivariate method for fault diagnosis and process monitoring is proposed. This technique is based on a statistical pattern(SP) framework integrated with a self-organizing map(SOM). An SP-based SOM is used as a cla...A multivariate method for fault diagnosis and process monitoring is proposed. This technique is based on a statistical pattern(SP) framework integrated with a self-organizing map(SOM). An SP-based SOM is used as a classifier to distinguish various states on the output map, which can visually monitor abnormal states. A case study of the Tennessee Eastman(TE) process is presented to demonstrate the fault diagnosis and process monitoring performance of the proposed method. Results show that the SP-based SOM method is a visual tool for real-time monitoring and fault diagnosis that can be used in complex chemical processes.Compared with other SOM-based methods, the proposed method can more efficiently monitor and diagnose faults.展开更多
Patterns of the South China Sea (SCS) circulation variability are extracted from merged satellite altimetry data from October 1992 through August 2004 by using the self-organizing map (SOM). The annual cycle, seasonal...Patterns of the South China Sea (SCS) circulation variability are extracted from merged satellite altimetry data from October 1992 through August 2004 by using the self-organizing map (SOM). The annual cycle, seasonal and inter-annual variations of the SCS surface circulation are identified through the evolution of the characteristic circulation patterns.The annual cycle of the SCS general circulation patterns is described as a change between two opposite basin-scale SW-NE oriented gyres embedded with eddies: low sea surface height anomaly (SSHA) (cyclonic) in winter and high SSHA (anticyclonic) in summer half year. The transition starts from July—August (January—February) with a high (low) SSHA tongue east of Vietnam around 12°~14° N, which develops into a big anticyclonic (cyclonic) gyre while moving eastward to the deep basin. During the transitions, a dipole structure, cyclonic (anticyclonic) in the north and anticyclonic (cyclonic) in the south, may be formed southeast off Vietnam with a strong zonal jet around 10°~12° N. The seasonal variation is modulated by the interannual variations. Besides the strong 1997/1998 event in response to the peak Pacific El Nio in 1997, the overall SCS sea level is found to have a significant rise during 1999~2001, however, in summer 2004 the overall SCS sea level is lower and the basin-wide anticyclonic gyre becomes weaker than the other years.展开更多
A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. ...A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.展开更多
A new method to detect multiple outliers in multivariate data is proposed. It is a combination of minimum subsets, resampling and self-organizing map (SOM) algorithm introduced by Kohonen,which provides a robust way w...A new method to detect multiple outliers in multivariate data is proposed. It is a combination of minimum subsets, resampling and self-organizing map (SOM) algorithm introduced by Kohonen,which provides a robust way with neural network. In this method, the number and organization of the neurons are selected by the characteristics of the spectra, e.g., the spectra data are often changed linearly with the concentration of the components and are often measured repeatedly, etc. So the spatial distribution of the neurons can be arranged by this characteristic. With this method, all the outliers in the spectra can be detected, which cannot be solved by the traditional method, and the speed of computation is higher than that of the traditional neural network method. The results of the simulation and the experiment show that this method is simple, effective, intuitionistic and all the outliers in the spectra can be detected in a short time. It is useful when associated with the regression model in the near infra-red research.展开更多
Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annu...Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.展开更多
Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source charac...Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source characterization an alternative methodology to the methodologies proposed earlier is developed. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO) uses the capabilities of Self Organizing Map (SOM) algorithm to design the surrogate models and adaptive surrogate models for source characterization. The most important advantage of this methodology is its direct utilization for groundwater contaminant characterization without the necessity of utilizing a linked simulation optimization model. The validation of the SOM based surrogate models and SOM based adaptive surrogate models demonstrates that the quantity and quality of initial sample sizes have crucial role on the accuracy of solutions as the designed monitoring locations. The performance evaluation results of the proposed methodology are obtained using error free and erroneous concentration measurement data. These results demonstrate that the developed methodology could approximate groundwater flow and transport simulation models, and substitute the optimization model for characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity.展开更多
Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower,...Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method by accurately clustering the input data. Based on different clusters, the complex relationship within the data can be revealed effectively.展开更多
Self-Organizing Map is an unsupervised learning algorithm.It has the ability of self-organization,self-learning and side associative thinking.Based on the principle it can identified the complex volcanic lithology.Acc...Self-Organizing Map is an unsupervised learning algorithm.It has the ability of self-organization,self-learning and side associative thinking.Based on the principle it can identified the complex volcanic lithology.According to the logging data of the volcanic rock samples,the SOM will be trained,The SOM training results were analyzed in order to choose optimally parameters of the network.Through identifying the logging data of volcanic formations,the result shows that the map can achieve good application effects.展开更多
To solve the mapping problem for the mobile robots in the unknown environment, a dynamic growing self-organizing map with growing-threshold tuning automatically algorithm (DGSOMGT) based on Self-organizing Map is prop...To solve the mapping problem for the mobile robots in the unknown environment, a dynamic growing self-organizing map with growing-threshold tuning automatically algorithm (DGSOMGT) based on Self-organizing Map is proposed. It introduces a value of spread factor to describe the changing process of the growing threshold dynamically. The method realizes the network structure growing by training through mobile robot movement constantly in the unknown environment. The proposed algorithm is based on self-organizing map and can adjust the growing-threshold value by the number of network neurons increasing. It avoids tuning the parameters repeatedly by human. The experimental results show that the proposed method detects the complex environment quickly, effectively and correctly. The robot can realize environment mapping automatically. Compared with the other methods the proposed mapping strategy has better topological properties and time property.展开更多
An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input...An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input layer depends on the dimensionality of input patterns.The number of neurons in the output layer equals the number of the desired classes.The number of neurons in the Kohonen layer may be a few to several thousands,which depends on the complexity of classification problems and the classification precision.Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector.When a training sample is input into the model,Kohonen's competitive learning rule is applied to selecting the winning neuron from the Kohonen layer and the weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector,and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector.If the number of training samples is sufficiently large and the learning epochs iterate enough times,the model will be able to serve as a supervised classifier.The model has been tentatively applied to the supervised classification of multispectral remotely sensed data.The author compared the performances of the extended SOM and BPN in remotely sensed data classification.The investigation manifests that the extended SOM is feasible for supervised classification.展开更多
Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. How...Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. However, these methods could not describe the process proceeding in the back-ground of stroke incidence. The purpose of this study was to provide a new approach based on Hidden Markov Models (HMMs) and self-organizing maps (SOM), interpreting the background from the viewpoint of weather variability. Based on meteorological data, SOM was performed to classify weather patterns. Using these classes by SOM as randomly changing “states”, our Hidden Markov Models were constructed with “observation data” that were extracted from the daily data of emergency transport at Nagoya City in Japan. We showed that SOM was an effective method to get weather patterns that would serve as “states” of Hidden Markov Models. Our Hidden Markov Models provided effective models to clarify background process for stroke incidence. The effectiveness of these Hidden Markov Models was estimated by stochastic test for root mean square errors (RMSE). “HMMs with states by SOM” would serve as a description of the background process of stroke incidence and were useful to show the influence of weather on stroke onset. This finding will contribute to an improvement of our understanding for links between weather variability and stroke incidence.展开更多
In order to classify the alertness status, 19 channels of electroencephalogram(EEG) signals from 5 subjects were acquired during daytime nap. Ten different types of features(including time domain features, frequency d...In order to classify the alertness status, 19 channels of electroencephalogram(EEG) signals from 5 subjects were acquired during daytime nap. Ten different types of features(including time domain features, frequency domain features and nonlinear features) were extracted from EEG signals, and an improved self-organizing map(ISOM) neuron network was proposed, which successfully identify three different brain status of the subjects: awareness, drowsiness and sleep. Compared with traditional SOM, the experiment results show that the ISOM generates much better classification accuracy, reaching as high as 89.59%.展开更多
基金the National Natural Science Foundation of China(U1901601)the National Key Research and Development Program of China(2022YFB3903503)。
文摘Faced with increasing global soil degradation,spatially explicit data on cropland soil organic matter(SOM)provides crucial data for soil carbon pool accounting,cropland quality assessment and the formulation of effective management policies.As a spatial information prediction technique,digital soil mapping(DSM)has been widely used to spatially map soil information at different scales.However,the accuracy of digital SOM maps for cropland is typically lower than for other land cover types due to the inherent difficulty in precisely quantifying human disturbance.To overcome this limitation,this study systematically assessed a framework of“information extractionfeature selection-model averaging”for improving model performance in mapping cropland SOM using 462 cropland soil samples collected in Guangzhou,China in 2021.The results showed that using the framework of dynamic information extraction,feature selection and model averaging could efficiently improve the accuracy of the final predictions(R^(2):0.48 to 0.53)without having obviously negative impacts on uncertainty.Quantifying the dynamic information of the environment was an efficient way to generate covariates that are linearly and nonlinearly related to SOM,which improved the R^(2)of random forest from 0.44 to 0.48 and the R^(2)of extreme gradient boosting from 0.37to 0.43.Forward recursive feature selection(FRFS)is recommended when there are relatively few environmental covariates(<200),whereas Boruta is recommended when there are many environmental covariates(>500).The Granger-Ramanathan model averaging approach could improve the prediction accuracy and average uncertainty.When the structures of initial prediction models are similar,increasing in the number of averaging models did not have significantly positive effects on the final predictions.Given the advantages of these selected strategies over information extraction,feature selection and model averaging have a great potential for high-accuracy soil mapping at any scales,so this approach can provide more reliable references for soil conservation policy-making.
文摘One of the main disadvantages of fractal image data compression is a loss time in the process of image compression (encoding) and conversion into a system of iterated functions (IFS). In this paper, the idea of the inverse problem of fixed point is introduced. This inverse problem is based on collage theorem which is the cornerstone of the mathematical idea of fractal image compression. Then this idea is applied by iterated function system, iterative system functions and grayscale iterated function system down to general transformation. Mathematical formulation form is also provided on the digital image space, which deals with the computer. Next, this process has been revised to reduce the time required for image compression by excluding some parts of the image that have a specific milestone. The neural network algorithms have been applied on the process of compression (encryption). The experimental results are presented and the performance of the proposed algorithm is discussed. Finally, the comparison between filtered ranges method and self-organizing method is introduced.
文摘Varieties of approaches and algorithms have been presented to identify the distribution of elements. Previous researches based on the type of problem, categorized their data in proper clusters or classes. This means that the process of solution could be supervised or unsupervised. In cases, where there is no idea about dependency of samples to specific groups, clustering methods (unsupervised) are applied. About geochemistry data, since various elements are involved, in addition to the complex nature of geochemical data, clustering algorithms would be useful for recognition of elements distribution. In this paper, Self-Organizing Map (SOM) algorithm, as an unsupervised method, is applied for clustering samples based on REEs contents. For this reason the Choghart Fe-REE deposit (Bafq district, central Iran), was selected as study area and dataset was a collection of 112 lithology samples that were assayed with laboratory tests such as ICP-MS and XRF analysis. In this study, input vectors include 19 features which are coordinates x, y, z and concentrations of REEs as well as the concentration of Phosphate (P<sub>2</sub>O<sub>5</sub>) since the apatite is the main source of REEs in this particular research. Four clusters were determined as an optimal number of clusters using silhouette criterion as well as k-means clustering method and SOM. Therefore, using self-organizing map, study area was subdivided in four zones. These four zones can be described as phosphate type, albitofyre type, metasomatic and phosphorus iron ore, and Iron Ore type. Phosphate type is the most prone to rare earth elements. Eventually, results were validated with laboratory analysis.
基金Supported by the National Basic Research Program of China (2013CB733600), the National Natural Science Foundation of China (21176073), the Doctoral Fund of Ministry of Education of China (20090074110005), the Program for New Century Excellent Talents in University (NCET-09-0346), Shu Guang Project (09SG29) and the Fundamental Research Funds for the Central Universities.
文摘Fault diagnosis and monitoring are very important for complex chemical process. There are numerous methods that have been studied in this field, in which the effective visualization method is still challenging. In order to get a better visualization effect, a novel fault diagnosis method which combines self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed. FDA can reduce the dimension of the data in terms of maximizing the separability of the classes. After feature extraction by FDA, SOM can distinguish the different states on the output map clearly and it can also be employed to monitor abnormal states. Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method. The result shows that the SOM integrated with FDA method is efficient and capable for real-time monitoring and fault diagnosis in complex chemical process.
文摘Determination of homogenous precipitation-based regions is a very important task in effective management of water resources. The present study tried to propose an effective precipitation-based regionalization methodology by conjugating both temporal pre-processing and spatial clustering approaches in a way to take advantage of multiscale properties of precipitation time series. Annual precipitation data of 51 years(1960-2010) for 31 rain gauges(RGs) were collected and used in proposed clustering approaches. Discreet wavelet transform(DWT) was used to capture the time-frequency attributes of the time series and multiscale regionalization was performed by using k-means and Self Organizing Maps(SOM) clustering techniques. Daubechies function(db) was selected as mother wavelet to decompose the precipitation time series. Also, proper boundary extensions and decomposition level were applied. Different combinations of the approximation(A) and detail(D) coefficients were used to determine the input dataset as a basis of spatial clustering. The proposed model's efficiency in spatial clustering stage was verified using three different indexes namely, Silhouette Coefficient(SC), Dunn index and Davis Bouldin index(DB). Results approved superior performance of k-means technique in comparison to SOM. It was also deduced that DWT-based regionalization methodology showed improvements in comparison to historical-based models. Cross mutual information was used to investigate the RGs of cluster 3's homogeneousness in DWT-k-means approach. Results of non-linear correlation approach verified homogeneity of cluster 3. Verifications based on mean annual precipitation values of rain gauges in each cluster also approved the capability of multiscale approach in precipitation regionalization.
基金Supported by the Strategic Priority Research Program of Chinese Academy of Sciences(No.XDA11010103)the National Natural Science Foundation of China(Nos.41222037,41221063)+4 种基金the Project of Global Change and Air-Sea Interaction(No.GASI-03-01-01-02)the National Basic Research Program of China(973 Program)(No.2013CB956202)the 111 Project of Ministry of Education of China(No.B07036)the Natural Science Foundation of Shandong(No.JQ201111)the National Special Research Fund for Non-Profit Marine Sector(No.201205018)
文摘The self-organizing map method is applied to satellite-derived sea-level anomaly fields of1993-2012 to study variations of the Kuroshio intrusion northeast of Taiwan Island.Four major features are revealed,showing significant seasonal variability of the intrusion.In general,the intrusion increases(decreases) with a high(low) sea-level anomaly at the edge of the East China Sea shelf in winter(summer).Open-ocean mesoscale eddies play an additional role in modulating the seasonal variation of the intrusion.Further analyses are needed to study eddy-Kuroshio interaction dynamics.
基金Jian Cao,Gregory J.Wagner,and Wing K.Liu acknowledge support from the National Science Foundation(NSF)Cyber-Physical Systems(CPS)(CPS/CMMI-1646592)Hengyang Li acknowledges support from the Northwestern Data Science Initiative(DSI+6 种基金171474500210043324)Jian Cao,Gregory J.Wagner,Wing K.Liu,Jennifer L.Bennett,and Sarah J.Wolff acknowledge support from the Digital Manufacturing and Design Innovation Institute(DMDII15-07)Jian Cao,Wing K.Liu,Zhengtao Gan,and Jennifer L.Bennett acknowledge support from the Center for Hierarchical Materials Design(CHiMaD70NANB14H012)This work made use of facilities at DMG MORI and Northwestern UniversityIt also made use of the MatCI Facility,which receives support from the MRSEC Program(NSF DMR-168 1720139)of the Materials Research Center at Northwestern University.
文摘To design microstructure and microhardness in the additive manufacturing(AM)of nickel(Ni)-based superalloys,the present work develops a novel data-driven approach that combines physics-based models,experimental measurements,and a data-mining method.The simulation is based on a computational thermal-fluid dynamics(CtFD)model,which can obtain thermal behavior,solidification parameters such as cooling rate,and the dilution of solidified clad.Based on the computed thermal information,dendrite arm spacing and microhardness are estimated using well-tested mechanistic models.Experimental microstructure and microhardness are determined and compared with the simulated values for validation.To visualize process-structure-properties(PSPs)linkages,the simulation and experimental datasets are input to a data-mining model-a self-organizing map(SOM).The design windows of the process parameters under multiple objectives can be obtained from the visualized maps.The proposed approaches can be utilized in AM and other data-intensive processes.Data-driven linkages between process,structure,and properties have the potential to benefit online process monitoring control in order to derive an ideal microstructure and mechanical properties.
基金Project(2013CB733605)supported by the National Basic Research Program of ChinaProject(21176073)supported by the National Natural Science Foundation of ChinaProject supported by the Fundamental Research Funds for the Central Universities,China
文摘A multivariate method for fault diagnosis and process monitoring is proposed. This technique is based on a statistical pattern(SP) framework integrated with a self-organizing map(SOM). An SP-based SOM is used as a classifier to distinguish various states on the output map, which can visually monitor abnormal states. A case study of the Tennessee Eastman(TE) process is presented to demonstrate the fault diagnosis and process monitoring performance of the proposed method. Results show that the SP-based SOM method is a visual tool for real-time monitoring and fault diagnosis that can be used in complex chemical processes.Compared with other SOM-based methods, the proposed method can more efficiently monitor and diagnose faults.
基金National Basic Research Program of China under contract No. 2007 CB816003the Key International Co-operative Proiect of the National Natural Science Foundation of China under contract No.40510073the International Cooperative Proiect of the Mini-stry of Science and Technology of China under contract No.2006DFB21630.
文摘Patterns of the South China Sea (SCS) circulation variability are extracted from merged satellite altimetry data from October 1992 through August 2004 by using the self-organizing map (SOM). The annual cycle, seasonal and inter-annual variations of the SCS surface circulation are identified through the evolution of the characteristic circulation patterns.The annual cycle of the SCS general circulation patterns is described as a change between two opposite basin-scale SW-NE oriented gyres embedded with eddies: low sea surface height anomaly (SSHA) (cyclonic) in winter and high SSHA (anticyclonic) in summer half year. The transition starts from July—August (January—February) with a high (low) SSHA tongue east of Vietnam around 12°~14° N, which develops into a big anticyclonic (cyclonic) gyre while moving eastward to the deep basin. During the transitions, a dipole structure, cyclonic (anticyclonic) in the north and anticyclonic (cyclonic) in the south, may be formed southeast off Vietnam with a strong zonal jet around 10°~12° N. The seasonal variation is modulated by the interannual variations. Besides the strong 1997/1998 event in response to the peak Pacific El Nio in 1997, the overall SCS sea level is found to have a significant rise during 1999~2001, however, in summer 2004 the overall SCS sea level is lower and the basin-wide anticyclonic gyre becomes weaker than the other years.
文摘A new clustering algorithm called fuzzy self-organizing feature maps is introduced. It can process not only the exact digital inputs, but also the inexact or fuzzy non-digital inputs, such as natural language inputs. Simulation results show that the new algorithm is superior to original Kohonen’s algorithm in clustering performance and learning rate.
文摘A new method to detect multiple outliers in multivariate data is proposed. It is a combination of minimum subsets, resampling and self-organizing map (SOM) algorithm introduced by Kohonen,which provides a robust way with neural network. In this method, the number and organization of the neurons are selected by the characteristics of the spectra, e.g., the spectra data are often changed linearly with the concentration of the components and are often measured repeatedly, etc. So the spatial distribution of the neurons can be arranged by this characteristic. With this method, all the outliers in the spectra can be detected, which cannot be solved by the traditional method, and the speed of computation is higher than that of the traditional neural network method. The results of the simulation and the experiment show that this method is simple, effective, intuitionistic and all the outliers in the spectra can be detected in a short time. It is useful when associated with the regression model in the near infra-red research.
基金supported by the National Key R&D Program of China (GrantN o.2016YFC0401407)National Natural Science Foundation of China (Grant Nos. 51479003 and 51279006)
文摘Due to rapid urbanization, waterlogging induced by torrential rainfall has become a global concern and a potential risk affecting urban habitant's safety. Widespread waterlogging disasters haveoccurred almost annuallyinthe urban area of Beijing, the capital of China. Based on a selforganizing map(SOM) artificial neural network(ANN), a graded waterlogging risk assessment was conducted on 56 low-lying points in Beijing, China. Social risk factors, such as Gross domestic product(GDP), population density, and traffic congestion, were utilized as input datasets in this study. The results indicate that SOM-ANNis suitable for automatically and quantitatively assessing risks associated with waterlogging. The greatest advantage of SOM-ANN in the assessment of waterlogging risk is that a priori knowledge about classification categories and assessment indicator weights is not needed. As a result, SOM-ANN can effectively overcome interference from subjective factors,producing classification results that are more objective and accurate. In this paper, the risk level of waterlogging in Beijing was divided into five grades. The points that were assigned risk grades of IV or Vwere located mainly in the districts of Chaoyang, Haidian, Xicheng, and Dongcheng.
文摘Characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity is a complex problem. In this study, to increase the efficiency and accuracy of source characterization an alternative methodology to the methodologies proposed earlier is developed. This methodology, Adaptive Surrogate Modeling Based Optimization (ASMBO) uses the capabilities of Self Organizing Map (SOM) algorithm to design the surrogate models and adaptive surrogate models for source characterization. The most important advantage of this methodology is its direct utilization for groundwater contaminant characterization without the necessity of utilizing a linked simulation optimization model. The validation of the SOM based surrogate models and SOM based adaptive surrogate models demonstrates that the quantity and quality of initial sample sizes have crucial role on the accuracy of solutions as the designed monitoring locations. The performance evaluation results of the proposed methodology are obtained using error free and erroneous concentration measurement data. These results demonstrate that the developed methodology could approximate groundwater flow and transport simulation models, and substitute the optimization model for characterization of unknown groundwater contaminant sources in terms of location, magnitude and duration of source activity.
基金Supported by the Natural Science Foundation of Tianjin(No.15JCQNJC00200)
文摘Considering that growing hierarchical self-organizing map(GHSOM) ignores the influence of individual component in sample vector analysis, and its accurate rate in detecting unknown network attacks is relatively lower, an improved GHSOM method combined with mutual information is proposed. After theoretical analysis, experiments are conducted to illustrate the effectiveness of the proposed method by accurately clustering the input data. Based on different clusters, the complex relationship within the data can be revealed effectively.
基金Supported by National Oil-gas Project:No XQ-2004-07
文摘Self-Organizing Map is an unsupervised learning algorithm.It has the ability of self-organization,self-learning and side associative thinking.Based on the principle it can identified the complex volcanic lithology.According to the logging data of the volcanic rock samples,the SOM will be trained,The SOM training results were analyzed in order to choose optimally parameters of the network.Through identifying the logging data of volcanic formations,the result shows that the map can achieve good application effects.
文摘To solve the mapping problem for the mobile robots in the unknown environment, a dynamic growing self-organizing map with growing-threshold tuning automatically algorithm (DGSOMGT) based on Self-organizing Map is proposed. It introduces a value of spread factor to describe the changing process of the growing threshold dynamically. The method realizes the network structure growing by training through mobile robot movement constantly in the unknown environment. The proposed algorithm is based on self-organizing map and can adjust the growing-threshold value by the number of network neurons increasing. It avoids tuning the parameters repeatedly by human. The experimental results show that the proposed method detects the complex environment quickly, effectively and correctly. The robot can realize environment mapping automatically. Compared with the other methods the proposed mapping strategy has better topological properties and time property.
基金Supported by National Natural Science Foundation of China (No. 40872193)
文摘An extended self-organizing map for supervised classification is proposed in this paper.Unlike other traditional SOMs,the model has an input layer,a Kohonen layer,and an output layer.The number of neurons in the input layer depends on the dimensionality of input patterns.The number of neurons in the output layer equals the number of the desired classes.The number of neurons in the Kohonen layer may be a few to several thousands,which depends on the complexity of classification problems and the classification precision.Each training sample is expressed by a pair of vectors: an input vector and a class codebook vector.When a training sample is input into the model,Kohonen's competitive learning rule is applied to selecting the winning neuron from the Kohonen layer and the weight coefficients connecting all the neurons in the input layer with both the winning neuron and its neighbors in the Kohonen layer are modified to be closer to the input vector,and those connecting all the neurons around the winning neuron within a certain diameter in the Kohonen layer with all the neurons in the output layer are adjusted to be closer to the class codebook vector.If the number of training samples is sufficiently large and the learning epochs iterate enough times,the model will be able to serve as a supervised classifier.The model has been tentatively applied to the supervised classification of multispectral remotely sensed data.The author compared the performances of the extended SOM and BPN in remotely sensed data classification.The investigation manifests that the extended SOM is feasible for supervised classification.
文摘Several studies were devoted to investigate the effects of meteorological factors on the occurrence of stroke. Regression models had been mostly used to assess the correlation between weather and stroke incidence. However, these methods could not describe the process proceeding in the back-ground of stroke incidence. The purpose of this study was to provide a new approach based on Hidden Markov Models (HMMs) and self-organizing maps (SOM), interpreting the background from the viewpoint of weather variability. Based on meteorological data, SOM was performed to classify weather patterns. Using these classes by SOM as randomly changing “states”, our Hidden Markov Models were constructed with “observation data” that were extracted from the daily data of emergency transport at Nagoya City in Japan. We showed that SOM was an effective method to get weather patterns that would serve as “states” of Hidden Markov Models. Our Hidden Markov Models provided effective models to clarify background process for stroke incidence. The effectiveness of these Hidden Markov Models was estimated by stochastic test for root mean square errors (RMSE). “HMMs with states by SOM” would serve as a description of the background process of stroke incidence and were useful to show the influence of weather on stroke onset. This finding will contribute to an improvement of our understanding for links between weather variability and stroke incidence.
基金Supported by National Natural Science Foundation of China(No.51007063)
文摘In order to classify the alertness status, 19 channels of electroencephalogram(EEG) signals from 5 subjects were acquired during daytime nap. Ten different types of features(including time domain features, frequency domain features and nonlinear features) were extracted from EEG signals, and an improved self-organizing map(ISOM) neuron network was proposed, which successfully identify three different brain status of the subjects: awareness, drowsiness and sleep. Compared with traditional SOM, the experiment results show that the ISOM generates much better classification accuracy, reaching as high as 89.59%.