Oil spills cause environmental pollution with a serious threat to local communities and sustainable development.Accidental oil spills can be modelled as a stochastic process where each oil spill event is described by ...Oil spills cause environmental pollution with a serious threat to local communities and sustainable development.Accidental oil spills can be modelled as a stochastic process where each oil spill event is described by its spatial locations and incidence-time and hence allow for space-time cluster analysis.Spacetime cluster analysis can detect space-time pattern distribution of oil spills which can be useful for implementing preventive measures and evidence-based decision making.This study aims to detect the space-time clusters of accidental oil spills in Rivers state,Nigeria through the Space-time Scan Statistic.The Space-time Scan Statistic was applied under the permutation model to the oil spill data(each for sabotage and operational oil spills)collected at Local Government Area(LGA)-level during the period from 2011 to 2019.The results show that the sabotage oil spill clusters have covered most of the LGAs in the southern part of the state at the start of the study period and then in 2018–2019,it moved to the west covering a single LGA.The operational oil spill clusters covered two neighboring LGAs in the south.In addition,the temporal cluster of sabotage oil spills was seen in 2019 and operational oil spills in 2011–2012.The sabotage oil spills show an increasing trend with the maximum in 2019 while the operational oil spills show a decreasing trend with the minimum in 2019.These findings assist in more effective decision-making for combating the environmental problems and controlling the future spill incidence in the cluster-regions.展开更多
Tuberculosis is one of the top killer diseases in the globe. The aim of this study was to explore the geographic distribution patterns and clustering characteristics of the disease incidence in terms of both space and...Tuberculosis is one of the top killer diseases in the globe. The aim of this study was to explore the geographic distribution patterns and clustering characteristics of the disease incidence in terms of both space and time with high relative risk locations for tuberculosis incidence in Beijing area. A retrospective space-time clustering analysis was conducted at the districts level in Beijing area based on reported cases of sputum smear-positive pulmonary tuberculosis (TB) from 2005 to 2014. Global and local Moran’s I, autocorrelation analysis along with Ord (Gi*) statistics was applied to detect spatial patterns and the hotspot of TB incidence. Furthermore, the Kuldorff’s scan statistics were used to analyze space-time clusters. A total of 40,878 TB cases were reported in Beijing from 2005 to 2014. The annual average incidence rate was 22.11 per 100,000 populations (ranged from 16.55 to 25.71). The seasonal incidence occurred from March to July until late autumn. A higher relative risk area for TB incidence was mainly detected in urban and some rural districts of Beijing. The significant most likely space-time clusters and secondary clusters of TB incidence were scattered diversely in Beijing districts in each study year. The risk population was mainly scattered in urban and dense populated districts, including in few rural districts.展开更多
Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent an...Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.展开更多
A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in vari...A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.展开更多
This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among ...This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.展开更多
Assimilating satellite radiances into Numerical Weather Prediction(NWP) models has become an important approach to increase the accuracy of numerical weather forecasting. In this study, the assimilation technique sche...Assimilating satellite radiances into Numerical Weather Prediction(NWP) models has become an important approach to increase the accuracy of numerical weather forecasting. In this study, the assimilation technique scheme was employed in NOAA's STMAS(Space-Time Multiscale Analysis System) to assimilate AMSU-A radiances data.Channel selection sensitivity experiments were conducted on assimilated satellite data in the first place. Then, real case analysis of AMSU-A data assimilation was performed. The analysis results showed that, following assimilating of AMSU-A channels 5-11 in STMAS, the objective function quickly converged, and the channel vertical response was consistent with the AMSU-A weighting function distribution, which suggests that the channels can be used in the assimilation of satellite data in STMAS. With the case of the Typhoon Morakot in Taiwan Island in August 2009 as an example, experiments on assimilated and unassimilated AMSU-A radiances data were designed to analyze the impact of the assimilation of satellite data on STMAS. The results demonstrated that assimilation of AMSU-A data provided more accurate prediction of the precipitation region and intensity, and especially, it improved the 0-6h precipitation forecast significantly.展开更多
This paper examines the temporal change and spatial variation of population pressure on the ecological environment in China.We have collected sufficient data from the statistical yearbooks of 31 provincial administrat...This paper examines the temporal change and spatial variation of population pressure on the ecological environment in China.We have collected sufficient data from the statistical yearbooks of 31 provincial administrative areas in 1990,1995,2000,2005,and 2010.Using a geographic information system(GIS) and relevant models,we analyzed the trend of the population pressure on ecological environment and the change of the gravity center of ecological environment quality.We conclude that:(1) generally,population pressure on the ecological environment in China was becoming higher during1990-2010,especially in some areas where the population and environment were in serious imbalance and the ecological environment experienced severe pollution;(2) during a certain period,population pressure on the ecological environment was becoming lower in some areas,but the ecological environment was getting worse;(3) the areas with super-high population pressure on the ecological environment were Beijing,Tianjin,and Shanghai;(4) the gravity center of population pressure on the ecological environment and the center of ecological environment quality move differently during the study time period,but the general trend was similar- both of them were moving from west to east.Based on the analysis,this paper also provides some policy suggestions on the control of ecological environment quality.展开更多
Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which ...Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which has limitations for non-traditional/non-clinical data sources due to its parametric model assumptions such as Poisson orGaussian counts.Addressing this problem,an Eigenspace-based method called Multi-EigenSpot has recently been proposed as a nonparametric solution.However,it is based on the population counts data which are not always available in the least developed countries.In addition,the population counts are difficult to approximate for some surveillance data such as emergency department visits and over-the-counter drug sales,where the catchment area for each hospital/pharmacy is undefined.We extend the population-based Multi-EigenSpot method to approximate the potential disease clusters from the observed/reported disease counts only with no need for the population counts.The proposed adaptation uses an estimator of expected disease count that does not depend on the population counts.The proposed method was evaluated on the real-world dataset and the results were compared with the population-based methods:Multi-EigenSpot and SaTScan.The result shows that the proposed adaptation is effective in approximating the important outputs of the population-based methods.展开更多
Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspon...Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspondence analysis to a large space-time data set for multiple environmental variables is shown. In particular, nitrogen dioxide and carbon monoxide hourly concentrations measured during January 1999 at several monitored stations in a district of Northern Italy are analyzed. The procedure consists in transforming the continuous variables into categorical ones by the means of appropriate indicator variables, generating special contingency tables and applying correspondence analysis. The use of this classical multivariate technique allows the identification of important relationships among pollution levels and monitoring stations and/or relationships among pollution levels and observation times.展开更多
Many cases of foot-and-mouth disease (FMD) are reported every year in Benin. In order to elucidate the epidemiology of this disease, a space-time analysis was carried out in all the 77 municipalities of the country ai...Many cases of foot-and-mouth disease (FMD) are reported every year in Benin. In order to elucidate the epidemiology of this disease, a space-time analysis was carried out in all the 77 municipalities of the country aiming to identify high risk areas as well as risk factors such as season and transhumance on the period of 2005 to 2014. Data were collected retrospectively from the Directory of Animal Production of Benin. The method of Kulldorff was used with the software SaTScanTM for the space-time analysis while a script was designed in the software R to generate new sizes with three different models of transhumance. From 2005 to 2014, 434 foci were recorded. Many outbreaks occurred in August, September and October. This period corresponds to the small rainy season in the South and the rainy season of the North. The municipality of Parakou was regarded as the source FMD outbreaks in Benin because it hosts one of the largest livestock markets in the country and many rivers. It was the municipality at the highest risk. The other municipalities at risk were Nikki, Pèrèrè and Kalaléas well as Karimama (hosting the national parkW), Kouandé and Toucountouna located nearby Pendjari’s national park. This study revealed that the space-time configuration is real and the main factors of persistence and dissemination of FMD virus were national parks, classified forests and the livestock market of Parakou all located in the North. The variation of the number of cattle due to their transhumance from the North to the South did not influence the zones at risk. Therefore, Northern Benin is probably at high risk of FMD.展开更多
[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance so...[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance sources to provide basis for the classification,identification,breeding and improved variety selection of P.multiflorum.[Method] Leaf shape characters of 31 copies of germplasm resources in the major distribution region of the whole country were determined,and the genetic variation of P.multiflorum leaves from different producing areas was analyzed.[Result] The leaf characters of single plant of the same experimental provenance source of P.multiflorum were relatively stable,the variation was mainly found on the single leaf area,1/2 leaf width,leaf width and other indicators;the variation of each leaf character among different provenance sources was obvious,and the variation was mainly found on the single leaf weight,leaf area,1/2 leaf width,leaf length and other indicators.The correlation analysis of each leaf character in P.multiflorum suggested that the single leaf area and single leaf weight showed extremely significant positive correlation with leaf length,1/2 leaf width,leaf width,leaf thickness and leaf stem length,while the single leaf area and single leaf weight showed significant negative correlation with WWR(leaf width/1/2 leaf width)and LWR(leaf length/1/2 leaf length),in addition,several macroscopic leaf characters such as leaf length,1/2 leaf width,leaf width,leaf stem length showed extremely positive correlation.The main component analysis result suggested that the contribution rate of accumulation variance of the front three main components was up to 97.4%,which could better reflect the comprehensive performance of leaf characters of different provenance sources of P.multiflorum.The cluster analysis showed that the experimental 31 copies of P.multiflorum provenance sources should be divided into three classes,the first class was distributed in the Middle,Western of Guizhou,northwestern of Guangxi and western areas with higher altitude;the second class was distributed in Hunan,Hubei,Sichuan,Guangdong and the most area of Guangxi;the third class was distributed in Anhui,Jiangsu and Henan and Shandong.[Conclusion] Cluster analysis of leaf characters indicated that the kinds of provenance sources which the geographical position was closer could be got together.The study had provided a certain basis for the classification of P.multiflorum.展开更多
Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections...Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections,cluster analysis and stepwise regression are integrated to predict the traffic volume of lanes at non-detector isolated controlled intersections.First cluster analysis is used to cluster the lanes of non-detector isolated signal-controlled intersections and the lanes of all signal-controlled intersections with detectors.Then, by the results of cluster analysis,the traffic volume samples are selected randomly and stepwise regression is used to predict the traffic volume of lanes at non-detector isolated signal-controlled intersections.The method is tested by the traffic volume data of lanes of the road network of Nanjing city.The problem of predicting the traffic volume of lanes at non-detector isolated signal-controlled intersections was resolved and can be widely used in urban traffic flow guidance and urban traffic control in cities without enough intersections equipped with detectors.展开更多
In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of ...In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of clusters, the two-step cluster method is applied to analyze actual speed data, which suggests that dividing speed data into two clusters can best reflect the intrinsic patterns of traffic flows. Such information is then taken as guidance in probability distribution function fitting. The normal, skew-normal and skew-t distribution functions are used to fit the probability distribution of each cluster respectively, which suggests that the skew-t distribution has the highest fitting accuracy; the second is skew-normal distribution; the worst is normal distribution. Model analysis results demonstrate that the proposed mixture model has a better fitting and generalization capability than the conventional single model. In addition, the new method is more flexible in terms of data fitting and can provide a more accurate model of speed distribution.展开更多
Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 sc...Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.展开更多
[ObJective] The research aimed to determine the geographic distribution map of system of Rana dybowskii. [Method] Four morphologic indices (body length, body weight, forelimb length, hindlimb length) of eight geogra...[ObJective] The research aimed to determine the geographic distribution map of system of Rana dybowskii. [Method] Four morphologic indices (body length, body weight, forelimb length, hindlimb length) of eight geographical populations of R.dybowskii which naturally distribute in Changhai Mountain and Xiaoxing'an Mountain were measured. Measure results were variance analyzed and cluster analyzed. [Result] Variance analysis showed: the genetic branching among the Dongfanghong male population( belongs to Wandashan) and Xiaoxing'an Mountain male population and Changbai Mountain male population were significantly different (P〈0.05) ; the genetic branching between the Hebei female population (belongs to Xiaoxing'an Mountain) and Changbai Mountain female population was significantly different (P〈0.05 ). Cluster analysis showed : male R.dybowskii can be divided into three groups : the first group included Quanyang, Tianbei, Chaoyang and Ddkouqin, the second group included Tieli and Anshan, the third group included Dongfanghong; and the female R. dybowskii can be divided into three groups : the first group included Quanyang and Chaoyang, the second group included Tianbei and Dakouqin, the third group included Hebei. [Condusion] The paper deduced that the Sanjiang Plain was the geographical origin center ofR. dybowskii which radiated to Changbai Mountain and Xiaoxing'an Mountain along the adverse current of Songhua River basin, therefore, the current distribution pattern of R. dybowskii was formed.展开更多
In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising...In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.展开更多
In order to reveal the genetic differences and agronomic traits of Fagopy-rum tataricum_ varieties (lines) intuitively, explore good resources and avoid the blindness of parent selection during the breeding process,...In order to reveal the genetic differences and agronomic traits of Fagopy-rum tataricum_ varieties (lines) intuitively, explore good resources and avoid the blindness of parent selection during the breeding process, six primary agronomic traits of 45 F. tataricum_ varieties (lines) that came from the eleven buckwheat breeding departments across the country were analyzed with principal component analysis and cluster analysis. The results of principal component analysis showed that the six agronomic traits could be simplified into three principal components, and the cumulative contribution rate reached 83%. The results of cluster analysis showed that the 45 F. tataricum varieties (lines) were classified into four groups:high stalk, medium yield and smal grain type, medium stalk, high yield and large grain type, medium stalk, low yield and smal grain type and high stalk, medium yield and medium grain type. Among them, performance of comprehensive trait of the second type was better than that of the other types. Thus, the F. tataricum_va-rieties (lines) that were classified into the second type could be considered as good varieties (lines) or breeding materials. The genetic differences among F. tataricum_varieties (lines) had no necessary correlations with origin and geographical distance. ln addition to complementary traits and geographical distance, genetic distances (dif-ferent populations) should be taken into consideration during parent selection in cross breeding.展开更多
[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to A...[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to Arabidopsis grain development and their homologous rape EST sequences. After electrophoresis, 18 pairs of ACGM primers were selected for the clustering analysis of 16 larger grained samples and four fine grained samples of rapeseed. [Result] PCR result showed that 2-6 specific bands were respectively amplified by each pair of primes, and all the bands were polymorphic and repeatable, suggesting that the optimized ACGM markers were useful for clustering analysis of B. napus species. Clustering analysis revealed that the 20 rapeseed samples were divided into three clusters A, B, and C at similarity coefficient 0.6. Then, the clusters A and B were further divided into five sub clusters A1, A2, A3, B1 and B2 at similarity coefficient 0.67. [Conclusion] This study will provide theoretical and practical values for rape breeding.展开更多
In order to compare the characteristics of different varieties of sweet cherry and to formulate corresponding pruning scheme, hierarchical cluster analysis was conducted for the 14 sweet cherry varieties that were mai...In order to compare the characteristics of different varieties of sweet cherry and to formulate corresponding pruning scheme, hierarchical cluster analysis was conducted for the 14 sweet cherry varieties that were mainly planted in Shanxi Province. The results showed that the 14 varieties of sweet cherry could be divided into two types, Hongmanao and Rainier. Fruit setting rate, branching rate, medium fruit shoot proportion, spur proportion and yield per plant were significantly different between these two types of sweet cherry. The key points of pruning management, to improve the yield of Rainier type, were to increase the fruit setting rate and spur proportion, and to control properly the long and medium fruit shoot proportion.展开更多
[Objective] This research aimed to study the FTIR spectra of corn germs and endosperms so as to provide a scientific way for identifying corn of different types. [Method] The corn germs and endosperms of three types w...[Objective] This research aimed to study the FTIR spectra of corn germs and endosperms so as to provide a scientific way for identifying corn of different types. [Method] The corn germs and endosperms of three types were studied by using Fourier transform infrared spectroscopy(FTIR) technology, combined with cluster analysis. [Result] The overall characteristics of original FTIR spectra were basically similar within the range of 700-1 800 cm^-1. The FTIR spectra were mainly composed by the absorption peaks of polysaccharides, proteins and lipids. Within the wavelength range of 700-1 800 cm^-1, there were only tiny differences in original FTIR spectra among the corn germs and endosperms of three different types. The spectra were then processed by using first derivative and second derivative. The second derivative spectra were used for hierarchical cluster analysis(HCA). The results showed that with the wavelength range of 700-1 800 cm^-1, the second derivative spectra of the 52 samples could be better clustered according to the tree types and corn germ and corn endosperm. The clustering correct rate reached 96.1%.[Conclusion] FTIR technology, combined with cluster analysis, can be used to identify different types of corn germs and endosperms, and it is characterized by convenience and rapidness.展开更多
基金a Yayasan Universiti Teknologi PETRONAS-Fundamental Research Grant(YUTP-FRG)with a cost center of 015LC0-013.
文摘Oil spills cause environmental pollution with a serious threat to local communities and sustainable development.Accidental oil spills can be modelled as a stochastic process where each oil spill event is described by its spatial locations and incidence-time and hence allow for space-time cluster analysis.Spacetime cluster analysis can detect space-time pattern distribution of oil spills which can be useful for implementing preventive measures and evidence-based decision making.This study aims to detect the space-time clusters of accidental oil spills in Rivers state,Nigeria through the Space-time Scan Statistic.The Space-time Scan Statistic was applied under the permutation model to the oil spill data(each for sabotage and operational oil spills)collected at Local Government Area(LGA)-level during the period from 2011 to 2019.The results show that the sabotage oil spill clusters have covered most of the LGAs in the southern part of the state at the start of the study period and then in 2018–2019,it moved to the west covering a single LGA.The operational oil spill clusters covered two neighboring LGAs in the south.In addition,the temporal cluster of sabotage oil spills was seen in 2019 and operational oil spills in 2011–2012.The sabotage oil spills show an increasing trend with the maximum in 2019 while the operational oil spills show a decreasing trend with the minimum in 2019.These findings assist in more effective decision-making for combating the environmental problems and controlling the future spill incidence in the cluster-regions.
文摘Tuberculosis is one of the top killer diseases in the globe. The aim of this study was to explore the geographic distribution patterns and clustering characteristics of the disease incidence in terms of both space and time with high relative risk locations for tuberculosis incidence in Beijing area. A retrospective space-time clustering analysis was conducted at the districts level in Beijing area based on reported cases of sputum smear-positive pulmonary tuberculosis (TB) from 2005 to 2014. Global and local Moran’s I, autocorrelation analysis along with Ord (Gi*) statistics was applied to detect spatial patterns and the hotspot of TB incidence. Furthermore, the Kuldorff’s scan statistics were used to analyze space-time clusters. A total of 40,878 TB cases were reported in Beijing from 2005 to 2014. The annual average incidence rate was 22.11 per 100,000 populations (ranged from 16.55 to 25.71). The seasonal incidence occurred from March to July until late autumn. A higher relative risk area for TB incidence was mainly detected in urban and some rural districts of Beijing. The significant most likely space-time clusters and secondary clusters of TB incidence were scattered diversely in Beijing districts in each study year. The risk population was mainly scattered in urban and dense populated districts, including in few rural districts.
文摘Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.
文摘A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.
文摘This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.
基金National Natural Science Foundation of China(41375027,41130960,41275114,41275039)Public Benefit Research Foundation of China Meteorological Administration(GYHY201406001,GYHY201106044)+1 种基金"863"Program(2012AA120903)National Key Research and Development Program of China(2016YFB0502501)
文摘Assimilating satellite radiances into Numerical Weather Prediction(NWP) models has become an important approach to increase the accuracy of numerical weather forecasting. In this study, the assimilation technique scheme was employed in NOAA's STMAS(Space-Time Multiscale Analysis System) to assimilate AMSU-A radiances data.Channel selection sensitivity experiments were conducted on assimilated satellite data in the first place. Then, real case analysis of AMSU-A data assimilation was performed. The analysis results showed that, following assimilating of AMSU-A channels 5-11 in STMAS, the objective function quickly converged, and the channel vertical response was consistent with the AMSU-A weighting function distribution, which suggests that the channels can be used in the assimilation of satellite data in STMAS. With the case of the Typhoon Morakot in Taiwan Island in August 2009 as an example, experiments on assimilated and unassimilated AMSU-A radiances data were designed to analyze the impact of the assimilation of satellite data on STMAS. The results demonstrated that assimilation of AMSU-A data provided more accurate prediction of the precipitation region and intensity, and especially, it improved the 0-6h precipitation forecast significantly.
基金supported by National Natural Science Foundation of China[Grant No.41171134]Peking University-Lincoln Institute Center for Urban Development and land Policy2013 Jiangsu Province Graduate Student Research Innovation project[Grant No.CXLX13_034]
文摘This paper examines the temporal change and spatial variation of population pressure on the ecological environment in China.We have collected sufficient data from the statistical yearbooks of 31 provincial administrative areas in 1990,1995,2000,2005,and 2010.Using a geographic information system(GIS) and relevant models,we analyzed the trend of the population pressure on ecological environment and the change of the gravity center of ecological environment quality.We conclude that:(1) generally,population pressure on the ecological environment in China was becoming higher during1990-2010,especially in some areas where the population and environment were in serious imbalance and the ecological environment experienced severe pollution;(2) during a certain period,population pressure on the ecological environment was becoming lower in some areas,but the ecological environment was getting worse;(3) the areas with super-high population pressure on the ecological environment were Beijing,Tianjin,and Shanghai;(4) the gravity center of population pressure on the ecological environment and the center of ecological environment quality move differently during the study time period,but the general trend was similar- both of them were moving from west to east.Based on the analysis,this paper also provides some policy suggestions on the control of ecological environment quality.
基金This article was funded by a Fundamental Research Grant Scheme(FRGS)from the Ministry of Education,Malaysia(Ref:FRGS/1/2018/STG06/UTP/02/1)a Yayasan Universiti Teknologi PETRONAS-Fundamental Research Grant(cost center of 015LC0-013)received by Hanita Daud,URLs:https://www.mohe.gov.my/en/initiatives-2/187-program-utama/penyelidikan/548-research-grants-informationhttps://www.utp.edu.my/yayasan/Pages/default.aspx.
文摘Space-time disease cluster detection assists in conducting disease surveillance and implementing control strategies.The state-of-the-art method for this kind of problem is the Space-time Scan Statistics(SaTScan)which has limitations for non-traditional/non-clinical data sources due to its parametric model assumptions such as Poisson orGaussian counts.Addressing this problem,an Eigenspace-based method called Multi-EigenSpot has recently been proposed as a nonparametric solution.However,it is based on the population counts data which are not always available in the least developed countries.In addition,the population counts are difficult to approximate for some surveillance data such as emergency department visits and over-the-counter drug sales,where the catchment area for each hospital/pharmacy is undefined.We extend the population-based Multi-EigenSpot method to approximate the potential disease clusters from the observed/reported disease counts only with no need for the population counts.The proposed adaptation uses an estimator of expected disease count that does not depend on the population counts.The proposed method was evaluated on the real-world dataset and the results were compared with the population-based methods:Multi-EigenSpot and SaTScan.The result shows that the proposed adaptation is effective in approximating the important outputs of the population-based methods.
文摘Applications of the multivariate technique called correspondence analysis for environmental studies are relatively new and are limited to spatial multivariate data set. In this paper, a procedure of applying correspondence analysis to a large space-time data set for multiple environmental variables is shown. In particular, nitrogen dioxide and carbon monoxide hourly concentrations measured during January 1999 at several monitored stations in a district of Northern Italy are analyzed. The procedure consists in transforming the continuous variables into categorical ones by the means of appropriate indicator variables, generating special contingency tables and applying correspondence analysis. The use of this classical multivariate technique allows the identification of important relationships among pollution levels and monitoring stations and/or relationships among pollution levels and observation times.
文摘Many cases of foot-and-mouth disease (FMD) are reported every year in Benin. In order to elucidate the epidemiology of this disease, a space-time analysis was carried out in all the 77 municipalities of the country aiming to identify high risk areas as well as risk factors such as season and transhumance on the period of 2005 to 2014. Data were collected retrospectively from the Directory of Animal Production of Benin. The method of Kulldorff was used with the software SaTScanTM for the space-time analysis while a script was designed in the software R to generate new sizes with three different models of transhumance. From 2005 to 2014, 434 foci were recorded. Many outbreaks occurred in August, September and October. This period corresponds to the small rainy season in the South and the rainy season of the North. The municipality of Parakou was regarded as the source FMD outbreaks in Benin because it hosts one of the largest livestock markets in the country and many rivers. It was the municipality at the highest risk. The other municipalities at risk were Nikki, Pèrèrè and Kalaléas well as Karimama (hosting the national parkW), Kouandé and Toucountouna located nearby Pendjari’s national park. This study revealed that the space-time configuration is real and the main factors of persistence and dissemination of FMD virus were national parks, classified forests and the livestock market of Parakou all located in the North. The variation of the number of cattle due to their transhumance from the North to the South did not influence the zones at risk. Therefore, Northern Benin is probably at high risk of FMD.
基金Supported by High-tech Research Project of Jiangsu Province(BG2004314)~~
文摘[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance sources to provide basis for the classification,identification,breeding and improved variety selection of P.multiflorum.[Method] Leaf shape characters of 31 copies of germplasm resources in the major distribution region of the whole country were determined,and the genetic variation of P.multiflorum leaves from different producing areas was analyzed.[Result] The leaf characters of single plant of the same experimental provenance source of P.multiflorum were relatively stable,the variation was mainly found on the single leaf area,1/2 leaf width,leaf width and other indicators;the variation of each leaf character among different provenance sources was obvious,and the variation was mainly found on the single leaf weight,leaf area,1/2 leaf width,leaf length and other indicators.The correlation analysis of each leaf character in P.multiflorum suggested that the single leaf area and single leaf weight showed extremely significant positive correlation with leaf length,1/2 leaf width,leaf width,leaf thickness and leaf stem length,while the single leaf area and single leaf weight showed significant negative correlation with WWR(leaf width/1/2 leaf width)and LWR(leaf length/1/2 leaf length),in addition,several macroscopic leaf characters such as leaf length,1/2 leaf width,leaf width,leaf stem length showed extremely positive correlation.The main component analysis result suggested that the contribution rate of accumulation variance of the front three main components was up to 97.4%,which could better reflect the comprehensive performance of leaf characters of different provenance sources of P.multiflorum.The cluster analysis showed that the experimental 31 copies of P.multiflorum provenance sources should be divided into three classes,the first class was distributed in the Middle,Western of Guizhou,northwestern of Guangxi and western areas with higher altitude;the second class was distributed in Hunan,Hubei,Sichuan,Guangdong and the most area of Guangxi;the third class was distributed in Anhui,Jiangsu and Henan and Shandong.[Conclusion] Cluster analysis of leaf characters indicated that the kinds of provenance sources which the geographical position was closer could be got together.The study had provided a certain basis for the classification of P.multiflorum.
基金The National Natural Science Foundation of China(No.50378016).
文摘Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections,cluster analysis and stepwise regression are integrated to predict the traffic volume of lanes at non-detector isolated controlled intersections.First cluster analysis is used to cluster the lanes of non-detector isolated signal-controlled intersections and the lanes of all signal-controlled intersections with detectors.Then, by the results of cluster analysis,the traffic volume samples are selected randomly and stepwise regression is used to predict the traffic volume of lanes at non-detector isolated signal-controlled intersections.The method is tested by the traffic volume data of lanes of the road network of Nanjing city.The problem of predicting the traffic volume of lanes at non-detector isolated signal-controlled intersections was resolved and can be widely used in urban traffic flow guidance and urban traffic control in cities without enough intersections equipped with detectors.
基金The National Science Foundation by Changjiang Scholarship of Ministry of Education of China(No.BCS-0527508)the Joint Research Fund for Overseas Natural Science of China(No.51250110075)+1 种基金the Natural Science Foundation of Jiangsu Province(No.BK200910046)the Postdoctoral Science Foundation of Jiangsu Province(No.0901005C)
文摘In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of clusters, the two-step cluster method is applied to analyze actual speed data, which suggests that dividing speed data into two clusters can best reflect the intrinsic patterns of traffic flows. Such information is then taken as guidance in probability distribution function fitting. The normal, skew-normal and skew-t distribution functions are used to fit the probability distribution of each cluster respectively, which suggests that the skew-t distribution has the highest fitting accuracy; the second is skew-normal distribution; the worst is normal distribution. Model analysis results demonstrate that the proposed mixture model has a better fitting and generalization capability than the conventional single model. In addition, the new method is more flexible in terms of data fitting and can provide a more accurate model of speed distribution.
基金Supported by Key Project of New Product Development in Yunnan Province(2009BB006)~~
文摘Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.
文摘[ObJective] The research aimed to determine the geographic distribution map of system of Rana dybowskii. [Method] Four morphologic indices (body length, body weight, forelimb length, hindlimb length) of eight geographical populations of R.dybowskii which naturally distribute in Changhai Mountain and Xiaoxing'an Mountain were measured. Measure results were variance analyzed and cluster analyzed. [Result] Variance analysis showed: the genetic branching among the Dongfanghong male population( belongs to Wandashan) and Xiaoxing'an Mountain male population and Changbai Mountain male population were significantly different (P〈0.05) ; the genetic branching between the Hebei female population (belongs to Xiaoxing'an Mountain) and Changbai Mountain female population was significantly different (P〈0.05 ). Cluster analysis showed : male R.dybowskii can be divided into three groups : the first group included Quanyang, Tianbei, Chaoyang and Ddkouqin, the second group included Tieli and Anshan, the third group included Dongfanghong; and the female R. dybowskii can be divided into three groups : the first group included Quanyang and Chaoyang, the second group included Tianbei and Dakouqin, the third group included Hebei. [Condusion] The paper deduced that the Sanjiang Plain was the geographical origin center ofR. dybowskii which radiated to Changbai Mountain and Xiaoxing'an Mountain along the adverse current of Songhua River basin, therefore, the current distribution pattern of R. dybowskii was formed.
基金The National Natural Science Foundation of China(No.50674086)Specialized Research Fund for the Doctoral Program of Higher Education(No.20060290508)the Postdoctoral Scientific Program of Jiangsu Province(No.0701045B)
文摘In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.
基金Supported by National Oat and Buckwheat Industrial Technology System(CARS-08-A-1-3)Breeding Project of Shanxi Academy of Agricultural Sciences during the Thirteenth Five-Year Plan Period(16yzgc035)~~
文摘In order to reveal the genetic differences and agronomic traits of Fagopy-rum tataricum_ varieties (lines) intuitively, explore good resources and avoid the blindness of parent selection during the breeding process, six primary agronomic traits of 45 F. tataricum_ varieties (lines) that came from the eleven buckwheat breeding departments across the country were analyzed with principal component analysis and cluster analysis. The results of principal component analysis showed that the six agronomic traits could be simplified into three principal components, and the cumulative contribution rate reached 83%. The results of cluster analysis showed that the 45 F. tataricum varieties (lines) were classified into four groups:high stalk, medium yield and smal grain type, medium stalk, high yield and large grain type, medium stalk, low yield and smal grain type and high stalk, medium yield and medium grain type. Among them, performance of comprehensive trait of the second type was better than that of the other types. Thus, the F. tataricum_va-rieties (lines) that were classified into the second type could be considered as good varieties (lines) or breeding materials. The genetic differences among F. tataricum_varieties (lines) had no necessary correlations with origin and geographical distance. ln addition to complementary traits and geographical distance, genetic distances (dif-ferent populations) should be taken into consideration during parent selection in cross breeding.
基金Supported by the National Natural Science Foundation of China(30860147)Open Funds of National Key Laboratory of Crop Genetic Improvement(ZK200902)Natural Science Foundation of Yunnan Province(2011FB117)~~
文摘[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to Arabidopsis grain development and their homologous rape EST sequences. After electrophoresis, 18 pairs of ACGM primers were selected for the clustering analysis of 16 larger grained samples and four fine grained samples of rapeseed. [Result] PCR result showed that 2-6 specific bands were respectively amplified by each pair of primes, and all the bands were polymorphic and repeatable, suggesting that the optimized ACGM markers were useful for clustering analysis of B. napus species. Clustering analysis revealed that the 20 rapeseed samples were divided into three clusters A, B, and C at similarity coefficient 0.6. Then, the clusters A and B were further divided into five sub clusters A1, A2, A3, B1 and B2 at similarity coefficient 0.67. [Conclusion] This study will provide theoretical and practical values for rape breeding.
基金Supported by Spark Program of Science and Technology Department of Shanxi Province(20130511021)~~
文摘In order to compare the characteristics of different varieties of sweet cherry and to formulate corresponding pruning scheme, hierarchical cluster analysis was conducted for the 14 sweet cherry varieties that were mainly planted in Shanxi Province. The results showed that the 14 varieties of sweet cherry could be divided into two types, Hongmanao and Rainier. Fruit setting rate, branching rate, medium fruit shoot proportion, spur proportion and yield per plant were significantly different between these two types of sweet cherry. The key points of pruning management, to improve the yield of Rainier type, were to increase the fruit setting rate and spur proportion, and to control properly the long and medium fruit shoot proportion.
基金Supported by National Natural Science Foundation of China(30960179)Natural Science Foundation of Yunnan Province(2007A048M)~~
文摘[Objective] This research aimed to study the FTIR spectra of corn germs and endosperms so as to provide a scientific way for identifying corn of different types. [Method] The corn germs and endosperms of three types were studied by using Fourier transform infrared spectroscopy(FTIR) technology, combined with cluster analysis. [Result] The overall characteristics of original FTIR spectra were basically similar within the range of 700-1 800 cm^-1. The FTIR spectra were mainly composed by the absorption peaks of polysaccharides, proteins and lipids. Within the wavelength range of 700-1 800 cm^-1, there were only tiny differences in original FTIR spectra among the corn germs and endosperms of three different types. The spectra were then processed by using first derivative and second derivative. The second derivative spectra were used for hierarchical cluster analysis(HCA). The results showed that with the wavelength range of 700-1 800 cm^-1, the second derivative spectra of the 52 samples could be better clustered according to the tree types and corn germ and corn endosperm. The clustering correct rate reached 96.1%.[Conclusion] FTIR technology, combined with cluster analysis, can be used to identify different types of corn germs and endosperms, and it is characterized by convenience and rapidness.