Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent an...Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.展开更多
A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in vari...A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.展开更多
This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among ...This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.展开更多
The study of temporal and spatial variations of nitrate in groundwater under different soil nitrogen environments is helpful to the security of groundwater resources in agricultural areas.In this paper,based on 320 gr...The study of temporal and spatial variations of nitrate in groundwater under different soil nitrogen environments is helpful to the security of groundwater resources in agricultural areas.In this paper,based on 320 groups of soil and groundwater samples collected at the same time,geostatistical analysis and multiple regression analysis were comprehensively used to conduct the evaluation of nitrogen contents in both groundwater and soil.From May to August,as the nitrification of groundwater is dominant,the average concentration of nitrate nitrogen is 34.80 mg/L;The variation of soil ammonia nitrogen and nitrate nitrogen is moderate from May to July,and the variation coefficient decreased sharply and then increased in August.There is a high correlation between the nitrate nitrogen in groundwater and soil in July,and there is a high correlation between the nitrate nitrogen in groundwater and ammonium nitrogen in soil in August and nitrate nitrogen in soil in July.From May to August,the area of low groundwater nitrate nitrogen in 0-5 mg/L and 5-10 mg/L decreased from 10.97%to 0,and the proportion of high-value area(greater than 70 mg/L)increased from 21.19%to 27.29%.Nitrate nitrogen is the main factor affecting the quality of groundwater.The correlation analysis of nitrate nitrogen in groundwater,nitrate nitrogen in soil and ammonium nitrogen shows that they have a certain period of delay.The areas with high concentration of nitrate in groundwater are mainly concentrated in the western part of the study area,which has a high consistency with the high value areas of soil nitrate distribution from July to August,and a high difference with the spatial position of soil ammonia nitrogen distribution in August.展开更多
The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every in...The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every individual. In this context, it is essential to find a balance between the protection of privacy and the safeguarding of public health, using tools that guarantee transparency and consent to the processing of data by the population. This work, starting from a pilot investigation conducted in the Polyclinic of Bari as part of the Horizon Europe Seeds project entitled “Multidisciplinary analysis of technological tracing models of contagion: the protection of rights in the management of health data”, has the objective of promoting greater patient awareness regarding the processing of their health data and the protection of privacy. The methodology used the PHICAT (Personal Health Information Competence Assessment Tool) as a tool and, through the administration of a questionnaire, the aim was to evaluate the patients’ ability to express their consent to the release and processing of health data. The results that emerged were analyzed in relation to the 4 domains in which the process is divided which allows evaluating the patients’ ability to express a conscious choice and, also, in relation to the socio-demographic and clinical characteristics of the patients themselves. This study can contribute to understanding patients’ ability to give their consent and improve information regarding the management of health data by increasing confidence in granting the use of their data for research and clinical management.展开更多
In this paper, CiteSpace, a bibliometrics software, was adopted to collect research papers published on the Web of Science, which are relevant to biological model and effluent quality prediction in activated sludge pr...In this paper, CiteSpace, a bibliometrics software, was adopted to collect research papers published on the Web of Science, which are relevant to biological model and effluent quality prediction in activated sludge process in the wastewater treatment. By the way of trend map, keyword knowledge map, and co-cited knowledge map, specific visualization analysis and identification of the authors, institutions and regions were concluded. Furthermore, the topics and hotspots of water quality prediction in activated sludge process through the literature-co-citation-based cluster analysis and literature citation burst analysis were also determined, which not only reflected the historical evolution progress to a certain extent, but also provided the direction and insight of the knowledge structure of water quality prediction and activated sludge process for future research.展开更多
As critical conduits for the dissemination of online public opinion,social media platforms offer a timely and effective means for managing emergencies during major disasters,such as earthquakes.This study focuses on t...As critical conduits for the dissemination of online public opinion,social media platforms offer a timely and effective means for managing emergencies during major disasters,such as earthquakes.This study focuses on the analysis of online public opinions following the Maduo M7.4 earthquake in Qinghai Province and the Yangbi M6.4 earthquake in Yunnan Province.By collecting,cleaning,and organizing post-earthquake Sina Weibo(short for Weibo)data,we employed the Latent Dirichlet Allocation(LDA)model to extract information pertinent to public opinion on these earthquakes.This analysis included a comparison of the nature and temporal evolution of online public opinions related to both events.An emotion analysis,utilizing an emotion dictionary,categorized the emotional content of post-earthquake Weibo posts,facilitating a comparative study of the characteristics and temporal trends of online public emotions following the earthquakes.The findings were visualized using Geographic Information System(GIS)techniques.The analysis revealed certain commonalities in online public opinion following both earthquakes.Notably,the peak of online engagement occurred within the first 24 hours post-earthquake,with a rapid decline observed between 24 to 48 hours thereafter.The variation in popularity of online public opinion was linked to aftershock occurrences.Adjusted for population factors,online engagement in areas surrounding the earthquake sites and in Sichuan Province was significantly high.Initially dominated by feelings of“fear”and“surprise”,the public sentiment shifted towards a more positive outlook with the onset of rescue operations.However,distinctions in the online public response to each earthquake were also noted.Following the Yangbi earthquake,Yunnan Province reported the highest number of Weibo posts nationwide;in contrast,Qinghai Province ranked third post-Maduo earthquake,attributable to its smaller population size and extensive damage to communication infrastructure.This research offers a methodological approach for the analysis of online public opinion related to earthquakes,providing insights for the enhancement of post-disaster emergency management and public mental health support.展开更多
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level....In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level. Solving the problem of employment for the people is an important prerequisite for their peaceful living and work, as well as a prerequisite and foundation for building a harmonious society. The employment situation of private enterprises has always been of great concern to the outside world, and these two major jobs have always occupied an important position in the employment field of China that cannot be ignored. With the establishment of the market economy system, individual and private enterprises have become important components of the socialist economy, making significant contributions to economic development and social progress. The rapid development of China’s economy, on the one hand, is the embodiment of the superiority of China’s socialist market economic system, and on the other hand, it is the role of the tertiary industry and private enterprises in promoting the national economy. Since the 1990s, China’s private enterprises have become a new economic growth point for local and even national countries, and are one of the important ways to arrange employment and achieve social stability. This paper studies the employment of private enterprises and individuals from the perspective of statistics, extracts relevant data from China statistical Yearbook, uses the relevant knowledge of statistics to process the data, obtains the conclusion and puts forward relevant constructive suggestions.展开更多
The scientific and fair positioning of monitoring locations for surface displacement on slopes is a prerequisite for early warning and forecasting.However,there is no specific provision on how to effectively determine...The scientific and fair positioning of monitoring locations for surface displacement on slopes is a prerequisite for early warning and forecasting.However,there is no specific provision on how to effectively determine the number and location of monitoring points according to the actual deformation characteristics of the slope.There are still some defects in the layout of monitoring points.To this end,based on displacement data series and spatial location information of surface displacement monitoring points,by combining displacement series correlation and spatial distance influence factors,a spatial deformation correlation calculation model of slope based on clustering analysis was proposed to calculate the correlation between different monitoring points,based on which the deformation area of the slope was divided.The redundant monitoring points in each partition were eliminated based on the partition's outcome,and the overall optimal arrangement of slope monitoring points was then achieved.This method scientifically addresses the issues of slope deformation zoning and data gathering overlap.It not only eliminates human subjectivity from slope deformation zoning but also increases the efficiency and accuracy of slope monitoring.In order to verify the effectiveness of the method,a sand-mudstone interbedded CounterTilt excavation slope in the Chongqing city of China was used as the research object.Twenty-four monitoring points deployed on this slope were monitored for surface displacement for 13 months.The spatial location of the monitoring points was discussed.The results show that the proposed method of slope deformation zoning and the optimized placement of monitoring points are feasible.展开更多
The classification of the springtime water mass has an important influence on the hydrography,regional climate change and fishery in the Taiwan Strait.Based on 58 stations of CTD profiling data collected in the wester...The classification of the springtime water mass has an important influence on the hydrography,regional climate change and fishery in the Taiwan Strait.Based on 58 stations of CTD profiling data collected in the western and southwestern Taiwan Strait during the spring cruise of 2019,we analyze the spatial distributions of temperature(T)and salinity(S)in the investigation area.Then by using the fuzzy cluster method combined with the T-S similarity number,we classify the investigation area into 5 water masses:the Minzhe Coastal Water(MZCW),the Taiwan Strait Mixed Water(TSMW),the South China Sea Surface Water(SCSSW),the South China Sea Subsurface Water(SCSUW)and the Kuroshio Branch Water(KBW).The MZCW appears in the near surface layer along the western coast of Taiwan Strait,showing low-salinity(<32.0)tongues near the Minjiang River Estuary and the Xiamen Bay mouth.The TSMW covers most upper layer of the investigation area.The SCSSW is mainly distributed in the upper layer of the southwestern Taiwan Strait,beneath which is the SCSUW.The KBW is a high temperature(core value of 26.36℃)and high salinity(core value of 34.62)water mass located southeast of the Taiwan Bank and partially in the central Taiwan Strait.展开更多
In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tig...In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.展开更多
Background:To solve the cluster analysis better,we propose a new method based on the chaotic particle swarm optimization(CPSO)algorithm.Methods:In order to enhance the performance in clustering,we propose a novel meth...Background:To solve the cluster analysis better,we propose a new method based on the chaotic particle swarm optimization(CPSO)algorithm.Methods:In order to enhance the performance in clustering,we propose a novel method based on CPSO.We first evaluate the clustering performance of this model using the variance ratio criterion(VRC)as the evaluation metric.The effectiveness of the CPSO algorithm is compared with that of the traditional particle swarm optimization(PSO)algorithm.The CPSO aims to improve the VRC value while avoiding local optimal solutions.The simulated dataset is set at three levels of overlapping:non-overlapping,partial overlapping,and severe overlapping.Finally,we compare CPSO with two other methods.Results:By observing the comparative results,our proposed CPSO method performs outstandingly.In the conditions of non-overlapping,partial overlapping,and severe overlapping,our method has the best VRC values of 1683.2,620.5,and 275.6,respectively.The mean VRC values in these three cases are 1683.2,617.8,and 222.6.Conclusion:The CPSO performed better than other methods for cluster analysis problems.CPSO is effective for cluster analysis.展开更多
The influence of anthropogenic activities,especially artificial dykes,on the coastal wetland landscape is now considered as a serious problem to the coastal ecosystem.It is important and necessary to analyze changes o...The influence of anthropogenic activities,especially artificial dykes,on the coastal wetland landscape is now considered as a serious problem to the coastal ecosystem.It is important and necessary to analyze changes of coastal landscape pattern under the influence of artificial dykes for the protection and management of coastal wetland.Our study aimed to reveal the quantitative characteristics of the coastal wetland landscape and its spatial-temporal dynamics under the influence of artificial dykes in the Yellow River delta(YRD).It was analyzed by the methods of the statistical analysis of landscape structure,five selected landscape indices and the changes of spatial centroids of three typical wetland types,including reed marshes,tidal fiats and aquaculture-salt fields.The results showed that:(1)Reduction of wetland area,especially the degradation of natural wetlands,had been the principal problem since the dykes were constructed in the YRD.The dykes created conditions for the development of artificial wetlands.However,the new born artificial wetlands were still less than the vanished natural wetlands.(2)Compared with the open area,the building of artificial dykes significantly speeded up the changes of landscape patterns and the aggravation of the landscape fragmentation in the closed area.(3)The changes of area-weighted centroids of three typical wetland landscapes were greatly affected by dykes,and the movement of the centroid of the aquaculture-salt field was very sensitive to the dykes constructed in the corresponding period.展开更多
Spatial-temporal analysis of emotions in society has become popular in many studies integrating geography with the humanities,and has shown its influence on social sensing and geo-computation for social sciences.Emoti...Spatial-temporal analysis of emotions in society has become popular in many studies integrating geography with the humanities,and has shown its influence on social sensing and geo-computation for social sciences.Emotions in society are often volatile,irrational,and vulnerable to the social environment.A critical challenge is to analyze changes in long-term and large-scale emotions in society.In this paper,we propose exploiting this challenge by using spatial-temporal analysis.After extracting emotional,temporal,and spatial information,a spatial standardization approach based on adataset of administrative district changes addresses the problem of Chinese toponym changes.Finally,over 1.7 million news data from the People’s Daily from 1956 to 2014 were collected to explore the changes,spatial distribution,and driving factors of emotions in society using spatial-temporal analysis.The experimental results found that the spatial-temporal analysis of emotions in society in the news is consistent with the results of related sociological research.展开更多
Exploring the spatial and temporal evolution characteristics of the border land use multifunctionality(LUMF)provides insights for taking advantage of border land use and optimizing border land use policies.Based on th...Exploring the spatial and temporal evolution characteristics of the border land use multifunctionality(LUMF)provides insights for taking advantage of border land use and optimizing border land use policies.Based on the improved Technique for Order Preference by Similarity to an Ideal Solution(TOPSIS)mode,this study identifies and evaluates the LUMFs in the China-Vietnam border area between 2000 and 2018 from the perspectives of agricultural production,social security,ecological service,landscape recreation,and national security.The results show that:1)The comprehensive land use functions in most counties and cities continued to be improved.2)The comprehensive land use function exhibits remarkable spatial divergence and aggregation characteristics.The high-value area of the agricultural production function and social security function evolves from the east to the west.In addition,the spatial evolution of ecological service function is complicated,without an obvious spatial divergence and aggregation pattern.The landscape recreation function shows different spatial differentiation characteristics in the early and middle stage,and forms a large cluster in the later stage.Finally,the spatial evolution pattern of the national security function is significant.3)Designing differentiated border land policies,improving border land use security,and establishing a long-term mechanism for ecological protection and ecological compensation can aid in optimizing the LUMF level in the border area.展开更多
[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance so...[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance sources to provide basis for the classification,identification,breeding and improved variety selection of P.multiflorum.[Method] Leaf shape characters of 31 copies of germplasm resources in the major distribution region of the whole country were determined,and the genetic variation of P.multiflorum leaves from different producing areas was analyzed.[Result] The leaf characters of single plant of the same experimental provenance source of P.multiflorum were relatively stable,the variation was mainly found on the single leaf area,1/2 leaf width,leaf width and other indicators;the variation of each leaf character among different provenance sources was obvious,and the variation was mainly found on the single leaf weight,leaf area,1/2 leaf width,leaf length and other indicators.The correlation analysis of each leaf character in P.multiflorum suggested that the single leaf area and single leaf weight showed extremely significant positive correlation with leaf length,1/2 leaf width,leaf width,leaf thickness and leaf stem length,while the single leaf area and single leaf weight showed significant negative correlation with WWR(leaf width/1/2 leaf width)and LWR(leaf length/1/2 leaf length),in addition,several macroscopic leaf characters such as leaf length,1/2 leaf width,leaf width,leaf stem length showed extremely positive correlation.The main component analysis result suggested that the contribution rate of accumulation variance of the front three main components was up to 97.4%,which could better reflect the comprehensive performance of leaf characters of different provenance sources of P.multiflorum.The cluster analysis showed that the experimental 31 copies of P.multiflorum provenance sources should be divided into three classes,the first class was distributed in the Middle,Western of Guizhou,northwestern of Guangxi and western areas with higher altitude;the second class was distributed in Hunan,Hubei,Sichuan,Guangdong and the most area of Guangxi;the third class was distributed in Anhui,Jiangsu and Henan and Shandong.[Conclusion] Cluster analysis of leaf characters indicated that the kinds of provenance sources which the geographical position was closer could be got together.The study had provided a certain basis for the classification of P.multiflorum.展开更多
Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections...Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections,cluster analysis and stepwise regression are integrated to predict the traffic volume of lanes at non-detector isolated controlled intersections.First cluster analysis is used to cluster the lanes of non-detector isolated signal-controlled intersections and the lanes of all signal-controlled intersections with detectors.Then, by the results of cluster analysis,the traffic volume samples are selected randomly and stepwise regression is used to predict the traffic volume of lanes at non-detector isolated signal-controlled intersections.The method is tested by the traffic volume data of lanes of the road network of Nanjing city.The problem of predicting the traffic volume of lanes at non-detector isolated signal-controlled intersections was resolved and can be widely used in urban traffic flow guidance and urban traffic control in cities without enough intersections equipped with detectors.展开更多
In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of ...In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of clusters, the two-step cluster method is applied to analyze actual speed data, which suggests that dividing speed data into two clusters can best reflect the intrinsic patterns of traffic flows. Such information is then taken as guidance in probability distribution function fitting. The normal, skew-normal and skew-t distribution functions are used to fit the probability distribution of each cluster respectively, which suggests that the skew-t distribution has the highest fitting accuracy; the second is skew-normal distribution; the worst is normal distribution. Model analysis results demonstrate that the proposed mixture model has a better fitting and generalization capability than the conventional single model. In addition, the new method is more flexible in terms of data fitting and can provide a more accurate model of speed distribution.展开更多
Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 sc...Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.展开更多
文摘Efficient iterative unsupervised machine learning involving probabilistic clustering analysis with the expectation-maximization(EM)clustering algorithm is applied to categorize reservoir facies by exploiting latent and observable well-log variables from a clastic reservoir in the Majnoon oilfield,southern Iraq.The observable well-log variables consist of conventional open-hole,well-log data and the computer-processed interpretation of gamma rays,bulk density,neutron porosity,compressional sonic,deep resistivity,shale volume,total porosity,and water saturation,from three wells located in the Nahr Umr reservoir.The latent variables include shale volume and water saturation.The EM algorithm efficiently characterizes electrofacies through iterative machine learning to identify the local maximum likelihood estimates(MLE)of the observable and latent variables in the studied dataset.The optimized EM model developed successfully predicts the core-derived facies classification in two of the studied wells.The EM model clusters the data into three distinctive reservoir electrofacies(F1,F2,and F3).F1 represents a gas-bearing electrofacies with low shale volume(Vsh)and water saturation(Sw)and high porosity and permeability values identifying it as an attractive reservoir target.The results of the EM model are validated using nuclear magnetic resonance(NMR)data from the third studied well for which no cores were recovered.The NMR results confirm the effectiveness and accuracy of the EM model in predicting electrofacies.The utilization of the EM algorithm for electrofacies classification/cluster analysis is innovative.Specifically,the clusters it establishes are less rigidly constrained than those derived from the more commonly used K-means clustering method.The EM methodology developed generates dependable electrofacies estimates in the studied reservoir intervals where core samples are not available.Therefore,once calibrated with core data in some wells,the model is suitable for application to other wells that lack core data.
文摘A significant portion of Landslide Early Warning Systems (LEWS) relies on the definition of operational thresholds and the monitoring of cumulative rainfall for alert issuance. These thresholds can be obtained in various ways, but most often they are based on previous landslide data. This approach introduces several limitations. For instance, there is a requirement for the location to have been previously monitored in some way to have this type of information recorded. Another significant limitation is the need for information regarding the location and timing of incidents. Despite the current ease of obtaining location information (GPS, drone images, etc.), the timing of the event remains challenging to ascertain for a considerable portion of landslide data. Concerning rainfall monitoring, there are multiple ways to consider it, for instance, examining accumulations over various intervals (1 h, 6 h, 24 h, 72 h), as well as in the calculation of effective rainfall, which represents the precipitation that actually infiltrates the soil. However, in the vast majority of cases, both the thresholds and the rain monitoring approach are defined manually and subjectively, relying on the operators’ experience. This makes the process labor-intensive and time-consuming, hindering the establishment of a truly standardized and rapidly scalable methodology on a large scale. In this work, we propose a Landslides Early Warning System (LEWS) based on the concept of rainfall half-life and the determination of thresholds using Cluster Analysis and data inversion. The system is designed to be applied in extensive monitoring networks, such as the one utilized by Cemaden, Brazil’s National Center for Monitoring and Early Warning of Natural Disasters.
文摘This paper investigates the design essence of Chinese classical private gardens,integrating their design elements and fundamental principles.It systematically analyzes the unique characteristics and differences among classical private gardens in the Northern,Jiangnan,and Lingnan regions.The study examines nine classical private gardens from Northern China,Jiangnan,and Lingnan by utilizing the advanced tool of principal component cluster analysis.Based on literature analysis and field research,273 variables were selected for principal component analysis,from which four components with higher contribution rates were chosen for further study.Subsequently,we employed clustering analysis techniques to compare the differences among the three types of gardens.The results reveal that the first principal component effectively highlights the differences between Jiangnan and Lingnan private gardens.The second principal component serves as the key to defining the types of Northern private gardens and distinguishing them from the other two types,and the third principal component indicates that Lingnan private gardens can be categorized into two distinct types as well.
基金Youth Fund of National Natural Science Foundation of China (42101353)the Ministry of Housing and Urban-Rural Development Science Plan Project (2022-R-063)Liaoning Social Science Planning Fund Project (L21BGL046)。
文摘The study of temporal and spatial variations of nitrate in groundwater under different soil nitrogen environments is helpful to the security of groundwater resources in agricultural areas.In this paper,based on 320 groups of soil and groundwater samples collected at the same time,geostatistical analysis and multiple regression analysis were comprehensively used to conduct the evaluation of nitrogen contents in both groundwater and soil.From May to August,as the nitrification of groundwater is dominant,the average concentration of nitrate nitrogen is 34.80 mg/L;The variation of soil ammonia nitrogen and nitrate nitrogen is moderate from May to July,and the variation coefficient decreased sharply and then increased in August.There is a high correlation between the nitrate nitrogen in groundwater and soil in July,and there is a high correlation between the nitrate nitrogen in groundwater and ammonium nitrogen in soil in August and nitrate nitrogen in soil in July.From May to August,the area of low groundwater nitrate nitrogen in 0-5 mg/L and 5-10 mg/L decreased from 10.97%to 0,and the proportion of high-value area(greater than 70 mg/L)increased from 21.19%to 27.29%.Nitrate nitrogen is the main factor affecting the quality of groundwater.The correlation analysis of nitrate nitrogen in groundwater,nitrate nitrogen in soil and ammonium nitrogen shows that they have a certain period of delay.The areas with high concentration of nitrate in groundwater are mainly concentrated in the western part of the study area,which has a high consistency with the high value areas of soil nitrate distribution from July to August,and a high difference with the spatial position of soil ammonia nitrogen distribution in August.
文摘The recent pandemic crisis has highlighted the importance of the availability and management of health data to respond quickly and effectively to health emergencies, while respecting the fundamental rights of every individual. In this context, it is essential to find a balance between the protection of privacy and the safeguarding of public health, using tools that guarantee transparency and consent to the processing of data by the population. This work, starting from a pilot investigation conducted in the Polyclinic of Bari as part of the Horizon Europe Seeds project entitled “Multidisciplinary analysis of technological tracing models of contagion: the protection of rights in the management of health data”, has the objective of promoting greater patient awareness regarding the processing of their health data and the protection of privacy. The methodology used the PHICAT (Personal Health Information Competence Assessment Tool) as a tool and, through the administration of a questionnaire, the aim was to evaluate the patients’ ability to express their consent to the release and processing of health data. The results that emerged were analyzed in relation to the 4 domains in which the process is divided which allows evaluating the patients’ ability to express a conscious choice and, also, in relation to the socio-demographic and clinical characteristics of the patients themselves. This study can contribute to understanding patients’ ability to give their consent and improve information regarding the management of health data by increasing confidence in granting the use of their data for research and clinical management.
文摘In this paper, CiteSpace, a bibliometrics software, was adopted to collect research papers published on the Web of Science, which are relevant to biological model and effluent quality prediction in activated sludge process in the wastewater treatment. By the way of trend map, keyword knowledge map, and co-cited knowledge map, specific visualization analysis and identification of the authors, institutions and regions were concluded. Furthermore, the topics and hotspots of water quality prediction in activated sludge process through the literature-co-citation-based cluster analysis and literature citation burst analysis were also determined, which not only reflected the historical evolution progress to a certain extent, but also provided the direction and insight of the knowledge structure of water quality prediction and activated sludge process for future research.
基金funded by the Science Research Project of Hebei Education Department(No.BJK2023088).
文摘As critical conduits for the dissemination of online public opinion,social media platforms offer a timely and effective means for managing emergencies during major disasters,such as earthquakes.This study focuses on the analysis of online public opinions following the Maduo M7.4 earthquake in Qinghai Province and the Yangbi M6.4 earthquake in Yunnan Province.By collecting,cleaning,and organizing post-earthquake Sina Weibo(short for Weibo)data,we employed the Latent Dirichlet Allocation(LDA)model to extract information pertinent to public opinion on these earthquakes.This analysis included a comparison of the nature and temporal evolution of online public opinions related to both events.An emotion analysis,utilizing an emotion dictionary,categorized the emotional content of post-earthquake Weibo posts,facilitating a comparative study of the characteristics and temporal trends of online public emotions following the earthquakes.The findings were visualized using Geographic Information System(GIS)techniques.The analysis revealed certain commonalities in online public opinion following both earthquakes.Notably,the peak of online engagement occurred within the first 24 hours post-earthquake,with a rapid decline observed between 24 to 48 hours thereafter.The variation in popularity of online public opinion was linked to aftershock occurrences.Adjusted for population factors,online engagement in areas surrounding the earthquake sites and in Sichuan Province was significantly high.Initially dominated by feelings of“fear”and“surprise”,the public sentiment shifted towards a more positive outlook with the onset of rescue operations.However,distinctions in the online public response to each earthquake were also noted.Following the Yangbi earthquake,Yunnan Province reported the highest number of Weibo posts nationwide;in contrast,Qinghai Province ranked third post-Maduo earthquake,attributable to its smaller population size and extensive damage to communication infrastructure.This research offers a methodological approach for the analysis of online public opinion related to earthquakes,providing insights for the enhancement of post-disaster emergency management and public mental health support.
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.
文摘In the past 30 years, Chinese enterprises have been a hot topic of discussion and concern among the general public in terms of economic and social status, ownership structure, business mechanism, and management level. Solving the problem of employment for the people is an important prerequisite for their peaceful living and work, as well as a prerequisite and foundation for building a harmonious society. The employment situation of private enterprises has always been of great concern to the outside world, and these two major jobs have always occupied an important position in the employment field of China that cannot be ignored. With the establishment of the market economy system, individual and private enterprises have become important components of the socialist economy, making significant contributions to economic development and social progress. The rapid development of China’s economy, on the one hand, is the embodiment of the superiority of China’s socialist market economic system, and on the other hand, it is the role of the tertiary industry and private enterprises in promoting the national economy. Since the 1990s, China’s private enterprises have become a new economic growth point for local and even national countries, and are one of the important ways to arrange employment and achieve social stability. This paper studies the employment of private enterprises and individuals from the perspective of statistics, extracts relevant data from China statistical Yearbook, uses the relevant knowledge of statistics to process the data, obtains the conclusion and puts forward relevant constructive suggestions.
基金funding from the National Natural Science Foundation of China(No.41572308)。
文摘The scientific and fair positioning of monitoring locations for surface displacement on slopes is a prerequisite for early warning and forecasting.However,there is no specific provision on how to effectively determine the number and location of monitoring points according to the actual deformation characteristics of the slope.There are still some defects in the layout of monitoring points.To this end,based on displacement data series and spatial location information of surface displacement monitoring points,by combining displacement series correlation and spatial distance influence factors,a spatial deformation correlation calculation model of slope based on clustering analysis was proposed to calculate the correlation between different monitoring points,based on which the deformation area of the slope was divided.The redundant monitoring points in each partition were eliminated based on the partition's outcome,and the overall optimal arrangement of slope monitoring points was then achieved.This method scientifically addresses the issues of slope deformation zoning and data gathering overlap.It not only eliminates human subjectivity from slope deformation zoning but also increases the efficiency and accuracy of slope monitoring.In order to verify the effectiveness of the method,a sand-mudstone interbedded CounterTilt excavation slope in the Chongqing city of China was used as the research object.Twenty-four monitoring points deployed on this slope were monitored for surface displacement for 13 months.The spatial location of the monitoring points was discussed.The results show that the proposed method of slope deformation zoning and the optimized placement of monitoring points are feasible.
基金The National Natural Science Foundation of China under contract Nos 42106005,91958203,41676131,41876155.
文摘The classification of the springtime water mass has an important influence on the hydrography,regional climate change and fishery in the Taiwan Strait.Based on 58 stations of CTD profiling data collected in the western and southwestern Taiwan Strait during the spring cruise of 2019,we analyze the spatial distributions of temperature(T)and salinity(S)in the investigation area.Then by using the fuzzy cluster method combined with the T-S similarity number,we classify the investigation area into 5 water masses:the Minzhe Coastal Water(MZCW),the Taiwan Strait Mixed Water(TSMW),the South China Sea Surface Water(SCSSW),the South China Sea Subsurface Water(SCSUW)and the Kuroshio Branch Water(KBW).The MZCW appears in the near surface layer along the western coast of Taiwan Strait,showing low-salinity(<32.0)tongues near the Minjiang River Estuary and the Xiamen Bay mouth.The TSMW covers most upper layer of the investigation area.The SCSSW is mainly distributed in the upper layer of the southwestern Taiwan Strait,beneath which is the SCSUW.The KBW is a high temperature(core value of 26.36℃)and high salinity(core value of 34.62)water mass located southeast of the Taiwan Bank and partially in the central Taiwan Strait.
基金funded by the National Natural Science Foundation of China(42174131)the Strategic Cooperation Technology Projects of CNPC and CUPB(ZLZX2020-03).
文摘In this research,an integrated classification method based on principal component analysis-simulated annealing genetic algorithm-fuzzy cluster means(PCA-SAGA-FCM)was proposed for the unsupervised classification of tight sandstone reservoirs which lack the prior information and core experiments.A variety of evaluation parameters were selected,including lithology characteristic parameters,poro-permeability quality characteristic parameters,engineering quality characteristic parameters,and pore structure characteristic parameters.The PCA was used to reduce the dimension of the evaluation pa-rameters,and the low-dimensional data was used as input.The unsupervised reservoir classification of tight sandstone reservoir was carried out by the SAGA-FCM,the characteristics of reservoir at different categories were analyzed and compared with the lithological profiles.The analysis results of numerical simulation and actual logging data show that:1)compared with FCM algorithm,SAGA-FCM has stronger stability and higher accuracy;2)the proposed method can cluster the reservoir flexibly and effectively according to the degree of membership;3)the results of reservoir integrated classification match well with the lithologic profle,which demonstrates the reliability of the classification method.
文摘Background:To solve the cluster analysis better,we propose a new method based on the chaotic particle swarm optimization(CPSO)algorithm.Methods:In order to enhance the performance in clustering,we propose a novel method based on CPSO.We first evaluate the clustering performance of this model using the variance ratio criterion(VRC)as the evaluation metric.The effectiveness of the CPSO algorithm is compared with that of the traditional particle swarm optimization(PSO)algorithm.The CPSO aims to improve the VRC value while avoiding local optimal solutions.The simulated dataset is set at three levels of overlapping:non-overlapping,partial overlapping,and severe overlapping.Finally,we compare CPSO with two other methods.Results:By observing the comparative results,our proposed CPSO method performs outstandingly.In the conditions of non-overlapping,partial overlapping,and severe overlapping,our method has the best VRC values of 1683.2,620.5,and 275.6,respectively.The mean VRC values in these three cases are 1683.2,617.8,and 222.6.Conclusion:The CPSO performed better than other methods for cluster analysis problems.CPSO is effective for cluster analysis.
基金supported by the Open Fund for Field Stations of Institute of Geographic Sciences and Natural Resources Research,CAS and the Ocean Public Welfare Scientific Research Project(Grant No.201105020)
文摘The influence of anthropogenic activities,especially artificial dykes,on the coastal wetland landscape is now considered as a serious problem to the coastal ecosystem.It is important and necessary to analyze changes of coastal landscape pattern under the influence of artificial dykes for the protection and management of coastal wetland.Our study aimed to reveal the quantitative characteristics of the coastal wetland landscape and its spatial-temporal dynamics under the influence of artificial dykes in the Yellow River delta(YRD).It was analyzed by the methods of the statistical analysis of landscape structure,five selected landscape indices and the changes of spatial centroids of three typical wetland types,including reed marshes,tidal fiats and aquaculture-salt fields.The results showed that:(1)Reduction of wetland area,especially the degradation of natural wetlands,had been the principal problem since the dykes were constructed in the YRD.The dykes created conditions for the development of artificial wetlands.However,the new born artificial wetlands were still less than the vanished natural wetlands.(2)Compared with the open area,the building of artificial dykes significantly speeded up the changes of landscape patterns and the aggravation of the landscape fragmentation in the closed area.(3)The changes of area-weighted centroids of three typical wetland landscapes were greatly affected by dykes,and the movement of the centroid of the aquaculture-salt field was very sensitive to the dykes constructed in the corresponding period.
基金National Natural Science Foundation of China(No.41971337)。
文摘Spatial-temporal analysis of emotions in society has become popular in many studies integrating geography with the humanities,and has shown its influence on social sensing and geo-computation for social sciences.Emotions in society are often volatile,irrational,and vulnerable to the social environment.A critical challenge is to analyze changes in long-term and large-scale emotions in society.In this paper,we propose exploiting this challenge by using spatial-temporal analysis.After extracting emotional,temporal,and spatial information,a spatial standardization approach based on adataset of administrative district changes addresses the problem of Chinese toponym changes.Finally,over 1.7 million news data from the People’s Daily from 1956 to 2014 were collected to explore the changes,spatial distribution,and driving factors of emotions in society using spatial-temporal analysis.The experimental results found that the spatial-temporal analysis of emotions in society in the news is consistent with the results of related sociological research.
基金Under the auspices of National Natural Science Project(No.42161046)National Social Science Project(No.21CJY075)+2 种基金Guangxi Natural Science Project(No.2021JJB150070)Guangxi Philosophy and Social Science Project(No.20FJY027)Guangxi First-class Discipline Applied Economics Construction Project Fund(Guangxi Education and Scientific Research(No.[2022]No.1))。
文摘Exploring the spatial and temporal evolution characteristics of the border land use multifunctionality(LUMF)provides insights for taking advantage of border land use and optimizing border land use policies.Based on the improved Technique for Order Preference by Similarity to an Ideal Solution(TOPSIS)mode,this study identifies and evaluates the LUMFs in the China-Vietnam border area between 2000 and 2018 from the perspectives of agricultural production,social security,ecological service,landscape recreation,and national security.The results show that:1)The comprehensive land use functions in most counties and cities continued to be improved.2)The comprehensive land use function exhibits remarkable spatial divergence and aggregation characteristics.The high-value area of the agricultural production function and social security function evolves from the east to the west.In addition,the spatial evolution of ecological service function is complicated,without an obvious spatial divergence and aggregation pattern.The landscape recreation function shows different spatial differentiation characteristics in the early and middle stage,and forms a large cluster in the later stage.Finally,the spatial evolution pattern of the national security function is significant.3)Designing differentiated border land policies,improving border land use security,and establishing a long-term mechanism for ecological protection and ecological compensation can aid in optimizing the LUMF level in the border area.
基金Supported by High-tech Research Project of Jiangsu Province(BG2004314)~~
文摘[Objective] The aim was to study the variation of leaf characters from different provenance sources of Polygonum multiflorum Thunb,as well as to carry out cluster analysis on P.multiflorum from different provenance sources to provide basis for the classification,identification,breeding and improved variety selection of P.multiflorum.[Method] Leaf shape characters of 31 copies of germplasm resources in the major distribution region of the whole country were determined,and the genetic variation of P.multiflorum leaves from different producing areas was analyzed.[Result] The leaf characters of single plant of the same experimental provenance source of P.multiflorum were relatively stable,the variation was mainly found on the single leaf area,1/2 leaf width,leaf width and other indicators;the variation of each leaf character among different provenance sources was obvious,and the variation was mainly found on the single leaf weight,leaf area,1/2 leaf width,leaf length and other indicators.The correlation analysis of each leaf character in P.multiflorum suggested that the single leaf area and single leaf weight showed extremely significant positive correlation with leaf length,1/2 leaf width,leaf width,leaf thickness and leaf stem length,while the single leaf area and single leaf weight showed significant negative correlation with WWR(leaf width/1/2 leaf width)and LWR(leaf length/1/2 leaf length),in addition,several macroscopic leaf characters such as leaf length,1/2 leaf width,leaf width,leaf stem length showed extremely positive correlation.The main component analysis result suggested that the contribution rate of accumulation variance of the front three main components was up to 97.4%,which could better reflect the comprehensive performance of leaf characters of different provenance sources of P.multiflorum.The cluster analysis showed that the experimental 31 copies of P.multiflorum provenance sources should be divided into three classes,the first class was distributed in the Middle,Western of Guizhou,northwestern of Guangxi and western areas with higher altitude;the second class was distributed in Hunan,Hubei,Sichuan,Guangdong and the most area of Guangxi;the third class was distributed in Anhui,Jiangsu and Henan and Shandong.[Conclusion] Cluster analysis of leaf characters indicated that the kinds of provenance sources which the geographical position was closer could be got together.The study had provided a certain basis for the classification of P.multiflorum.
基金The National Natural Science Foundation of China(No.50378016).
文摘Because of the difficulty to obtain the traffic flow information of lanes at non-detector intersections in most metropolises of the world,based on the relationships between the lanes of signal-controlled intersections,cluster analysis and stepwise regression are integrated to predict the traffic volume of lanes at non-detector isolated controlled intersections.First cluster analysis is used to cluster the lanes of non-detector isolated signal-controlled intersections and the lanes of all signal-controlled intersections with detectors.Then, by the results of cluster analysis,the traffic volume samples are selected randomly and stepwise regression is used to predict the traffic volume of lanes at non-detector isolated signal-controlled intersections.The method is tested by the traffic volume data of lanes of the road network of Nanjing city.The problem of predicting the traffic volume of lanes at non-detector isolated signal-controlled intersections was resolved and can be widely used in urban traffic flow guidance and urban traffic control in cities without enough intersections equipped with detectors.
基金The National Science Foundation by Changjiang Scholarship of Ministry of Education of China(No.BCS-0527508)the Joint Research Fund for Overseas Natural Science of China(No.51250110075)+1 种基金the Natural Science Foundation of Jiangsu Province(No.BK200910046)the Postdoctoral Science Foundation of Jiangsu Province(No.0901005C)
文摘In order to analyze the heterogeneity in vehicular traffic speed, a new method that integrates cluster analysis and probability distribution function fitting is presented. First, for identifying the optimal number of clusters, the two-step cluster method is applied to analyze actual speed data, which suggests that dividing speed data into two clusters can best reflect the intrinsic patterns of traffic flows. Such information is then taken as guidance in probability distribution function fitting. The normal, skew-normal and skew-t distribution functions are used to fit the probability distribution of each cluster respectively, which suggests that the skew-t distribution has the highest fitting accuracy; the second is skew-normal distribution; the worst is normal distribution. Model analysis results demonstrate that the proposed mixture model has a better fitting and generalization capability than the conventional single model. In addition, the new method is more flexible in terms of data fitting and can provide a more accurate model of speed distribution.
基金Supported by Key Project of New Product Development in Yunnan Province(2009BB006)~~
文摘Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.