Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 sc...Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.展开更多
In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising...In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.展开更多
[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to A...[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to Arabidopsis grain development and their homologous rape EST sequences. After electrophoresis, 18 pairs of ACGM primers were selected for the clustering analysis of 16 larger grained samples and four fine grained samples of rapeseed. [Result] PCR result showed that 2-6 specific bands were respectively amplified by each pair of primes, and all the bands were polymorphic and repeatable, suggesting that the optimized ACGM markers were useful for clustering analysis of B. napus species. Clustering analysis revealed that the 20 rapeseed samples were divided into three clusters A, B, and C at similarity coefficient 0.6. Then, the clusters A and B were further divided into five sub clusters A1, A2, A3, B1 and B2 at similarity coefficient 0.67. [Conclusion] This study will provide theoretical and practical values for rape breeding.展开更多
[Objective] This study aimed to investigate the trace elements in Rehman- nia glutinosa Libosch. by using principal component analysis and clustering analysis. [Method] Principal component analysis and clustering anal...[Objective] This study aimed to investigate the trace elements in Rehman- nia glutinosa Libosch. by using principal component analysis and clustering analysis. [Method] Principal component analysis and clustering analysis of R. glutinosa medicinal materials from different sources were conducted with contents of six trace elements as indices. [Result] The principal component analysis could comprehen- sively evaluate the quality of R. glutinosa samples with objective results which was consistent with the results of clustering analysis. [Conclusion] Principal component analysis and clustering analysis methods can be used for the quality evaluation of Chinese medicinal materials with multiple indices.展开更多
The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and ...The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and grain distribution tests of soils taken from three different types of foundation pits: raft foundations, partial raft foundations and strip foundations. k-means algorithm with clustering analysis was applied to determine the most appropriate foundation type given the un- confined compression strengths and other parameters of the different soils.展开更多
Effective storage,processing and analyzing of power device condition monitoring data faces enormous challenges.A framework is proposed that can support both MapReduce and Graph for massive monitoring data analysis at ...Effective storage,processing and analyzing of power device condition monitoring data faces enormous challenges.A framework is proposed that can support both MapReduce and Graph for massive monitoring data analysis at the same time based on Aliyun DTplus platform.First,power device condition monitoring data storage based on MaxCompute table and parallel permutation entropy feature extraction based on MaxCompute MapReduce are designed and implemented on DTplus platform.Then,Graph based k-means algorithm is implemented and used for massive condition monitoring data clustering analysis.Finally,performance tests are performed to compare the execution time between serial program and parallel program.Performance is analyzed from CPU cores consumption,memory utilization and parallel granularity.Experimental results show that the designed framework and parallel algorithms can efficiently process massive power device condition monitoring data.展开更多
Five factors expressing greenbelt quality and one factor expressing quantity were adopted for evaluation of the residential greenbelt, and the AHP (Analytical Hierarchy Process) method was used to determine the valu...Five factors expressing greenbelt quality and one factor expressing quantity were adopted for evaluation of the residential greenbelt, and the AHP (Analytical Hierarchy Process) method was used to determine the value of factors. Thirty residential areas were selected as the samples. Two principal components were extracted and their expression was constructed by method of factor anlysis, therefore, quality evaluation of residential greenbelt was obtained. The accuracy of the function and implement quality classification toward the residential greenbelts in Xinxiang City were validated by clustering analysis method. The results showed that the greenbelt quality of fourteen residential areas was higher than the average level, of which eleven were newly-built residential areas. The 30 residential areas were classified into three types according to their greenbelt features and their formation by clustering analysis method. Finally rational proposal basing on aforesaid evaluating results was proposed for construction and renewal of residential greenbelt, upon which directive basis was provided for construction and renewal of residential greenbelt.展开更多
A novel Support Vector Machine(SVM) ensemble approach using clustering analysis is proposed. Firstly,the positive and negative training examples are clustered through subtractive clus-tering algorithm respectively. Th...A novel Support Vector Machine(SVM) ensemble approach using clustering analysis is proposed. Firstly,the positive and negative training examples are clustered through subtractive clus-tering algorithm respectively. Then some representative examples are chosen from each of them to construct SVM components. At last,the outputs of the individual classifiers are fused through ma-jority voting method to obtain the final decision. Comparisons of performance between the proposed method and other popular ensemble approaches,such as Bagging,Adaboost and k.-fold cross valida-tion,are carried out on synthetic and UCI datasets. The experimental results show that our method has higher classification accuracy since the example distribution information is considered during en-semble through clustering analysis. It further indicates that our method needs a much smaller size of training subsets than Bagging and Adaboost to obtain satisfactory classification accuracy.展开更多
A novel multivariate similarity clustering analysis (MSCA) approach was used to estimate a biogeographical division scheme for the global terrestrial fauna and was compared against other widely used clustering algorit...A novel multivariate similarity clustering analysis (MSCA) approach was used to estimate a biogeographical division scheme for the global terrestrial fauna and was compared against other widely used clustering algorithms. The faunal dataset included almost all terrestrial and freshwater fauna, a total of 4631 families, 141,814 genera, and 1,334,834 species. Our findings demonstrated that suitable results were only obtained with the MSCA method, which was associated with distinct hierarchies, reasonable structuring, and furthermore, conformed to biogeographical criteria. A total of seven kingdoms and 20 sub-kingdoms were identified. We discovered that the clustering results for the higher and lower animals did not differ significantly, leading us to consider that the analysis result is convincing as the first zoogeographical division scheme for global all terrestrial animals.展开更多
Affected by many involved factors, different dimensions, data with large difference, incomplete information and so on, the most optimal selection of regional outburst prevention measures for outburst mine has become a...Affected by many involved factors, different dimensions, data with large difference, incomplete information and so on, the most optimal selection of regional outburst prevention measures for outburst mine has become a complicated system project. The traditional way of outburst prevention measure selection belongs to qualitative method, which may cause high-cost of gas control, huge quantities of drilling work, long construction time and even secondary disaster. To solve the above-mentioned problems, in light of occurrence status of coal seam gas in No. 21 mining area of Jinzhushan Tuzhu Mine, through grey fixed weight clustering theory and a combination method of qualitative and quantitative analysis, the judging model with multi-objective classification for optimization of outburst prevention measures was established. The three weight coefficients of outburst prevention technology scheme are sorted, in order to determine the advantages and disadvantages of each outburst prevention technology scheme under the comprehensive evaluation of multi-target. Finally, the problem of quantitative selection for regional outburst prevention technology scheme is solved under the situation of multi-factor mode and incomplete information, which provides reasonable and effective technical measures for prevention of coal and gas outburst disaster.展开更多
The main task of provenance analysis is to determine the source of sediments and the position of parent rocks.Provenance analysis may find out the relationship between erosion districts and sediment zone,between the u...The main task of provenance analysis is to determine the source of sediments and the position of parent rocks.Provenance analysis may find out the relationship between erosion districts and sediment zone,between the uplift and the depression in the process of basin development.The authors use the method of heavy mineral clustering analysis and estimate the provenance direction of Huanghua Depression in the Paleogene Kong 2 Member.Research shows that there were five provenance areas of Kong 2 Member in Kongnan area.They are western(Shenusi),northwestern(Cangzhou),eastern(Ganhuatun),northeastern and southeastern.The main provenance areas were northwestern and western,while the southern provenance could not be ruled out.And these areas are consistent with the known provenance areas.展开更多
In this study,the world’s land(except Antarctica)is divided into 67 basic geographical units according to ecological types.Using our newly proposed MSCA(Multivariate Similarity Clustering Analysis)method,7,591 specie...In this study,the world’s land(except Antarctica)is divided into 67 basic geographical units according to ecological types.Using our newly proposed MSCA(Multivariate Similarity Clustering Analysis)method,7,591 species of modern terrestrial mammals belonging to 1,374 genera in 162 families and 2,378 species of mammals in the Wallace era before 1876 are quantitatively analyzed,and almost the same clustering results are obtained,with clear levels and reasonable clustering,which conform to the principles of geography,statistics,ecology and biology.It not only affirms and supports the reasonable kernel of Wallace’s scheme,but also puts forward suggestions that should be revised and improved.The large or small differences between the clustering results and the mammalian geographical zoning schemes of contemporary scholars are caused by different analysis methods,and they are highly consistent with the analysis results of chordates,angiosperms and insects in the world analyzed by the same method.Once again,it confirms the homogeneity of the global biological distribution pattern of major groups,and the possibility of building a unified biogeographic zoning system in the world.展开更多
Objective: To analyze hot research areas and the present research status of nursing safety management in PubMed. Methods: PubMed was searched using "safety management" for the literature on nursing safety manageme...Objective: To analyze hot research areas and the present research status of nursing safety management in PubMed. Methods: PubMed was searched using "safety management" for the literature on nursing safety management. BICOMB 2.0 and SPSS 20.0 software were used to analyze high-frequency keywords and conduct co-word clustering analysis. Results: We searched for totally 2353 articles related to our topic and extracted 19 high-frequency keywords (27.50%). Five research focuses were concluded, including: study on nursing safety culture; team work to promote nursing safety; practice of nursing safety management; workplace violence against nursing staffs; nursing safety and quality evaluation standard. Conclusion: Analysis of the hotspots of nursing safety management in the past 10 years will contribute to understanding the research emphases and trend of development, and provide reference for the study and practice of nursing safety management.展开更多
Background:Brucellosis is a major public health issue in China,while its temporal and spatial distribution have not been studied in depth.This study aims to better understand the epidemiology of brucellosis in the mai...Background:Brucellosis is a major public health issue in China,while its temporal and spatial distribution have not been studied in depth.This study aims to better understand the epidemiology of brucellosis in the mainland of China,by investigating the human,temporal and spatial distribution and clustering characteristics of the disease.Methods:Human brucellosis data from the mainland of China between 2012 and 2016 were obtained from the China Information System for Disease Control and Prevention.The spatial autocorrelation analysis of ArcGIS10.6 and the spatial-temporal scanning analysis of SaTScan software were used to identify potential changes in the spatial and temporal distribution of human brucellosis in the mainland of China during the study period.Results:A total of 244348 human brucellosis cases were reported during the study period of 2012-2016.The average incidence of human brucellosis was higher in the 40-65 age group.The temporal clustering analysis showed that the high incidence of brucellosis occurred between March and July.The spatial clustering analysis showed that the location of brucellosis clustering in the mainland of China remained relatively fixed,mainly concentrated in most parts of northern China.The results of the spatial-temporal clustering analysis showed that Heilongjiang represents a primary clustering area,and the Tibet,Shanxi and Hubei provinces represent three secondary clustering areas.Conclusions:Human brucellosis remains a widespread challenge,particularly in northern China.The clustering analysis highlights potential high-risk human groups,time frames and areas,which may require special plans and resources to monitor and control the disease.展开更多
The current scheme of building climate zones in China generally assumes that building climate zones of island cities are identical to adjacent land stations.Consequently,building design strategies for island buildings...The current scheme of building climate zones in China generally assumes that building climate zones of island cities are identical to adjacent land stations.Consequently,building design strategies for island buildings usually refer to those developed for inland cities.This approach has to some extent hindered the energy-saving design and green development of island buildings in China.This research takes a first step on this issue by defining the building climate zones of 36 marine islands over China marine area using two-stage zoning methodology adopted by current building climate zoning standard(GB50178-1993).The meteorological data used for analysis was obtained from the National Climate Center of China over the 30-year period from 1985 to 2014.As comparison,40 coastal stations which are adjacent to the inves-tigated marine islands were also included in this study.Subsequently a more obiective techni-que-cluster analysis was operated as an effective supplement to discover the climate characteristics among different observations.The results of both methodologies consistentlyshow that among the 36 islands investigated,the majority of islands located in northern and eastern marine area belong to the same climate zones as their adjacent coastal cities.Howev-er,island cities in southern marine area cannot be assigned to any current climate zone,which was demonstrated by its distinctive climate features different from any other sites investi-gated through cluster analysis as well as different energy use patterns.Thus a new zone was defined to supplement the current building climate zoning scheme to cover marine area of China.展开更多
With rapid developments in platforms and sensors technology in terms of digital cameras and video recordings,crowd monitoring has taken a considerable attentions in many disciplines such as psychology,sociology,engine...With rapid developments in platforms and sensors technology in terms of digital cameras and video recordings,crowd monitoring has taken a considerable attentions in many disciplines such as psychology,sociology,engineering,and computer vision.This is due to the fact that,monitoring of the crowd is necessary to enhance safety and controllable movements to minimize the risk particularly in highly crowded incidents(e.g.sports).One of the platforms that have been extensively employed in crowd monitoring is unmanned aerial vehicles(UAVs),because UAVs have the capability to acquiring fast,low costs,high-resolution and real-time images over crowd areas.In addition,geo-referenced images can also be provided through integration of on-board positioning sensors(e.g.GPS/IMU)with vision sensors(digital cameras and laser scanner).In this paper,a new testing procedure based on feature from accelerated segment test(FAST)algorithms is introduced to detect the crowd features from UAV images taken from different camera orientations and positions.The proposed test started with converting a circle of 16 pixels surrounding the center pixel into a vector and sorting it in ascending/descending order.A single pixel which takes the ranking number 9(for FAST-9)or 12(for FAST-12)was then compared with the center pixel.Accuracy assessment in terms of completeness and correctness was used to assess the performance of the new testing procedure before and after filtering the crowd features.The results show that the proposed algorithms are able to extract crowd features from different UAV images.Overall,the values of Completeness range from 55 to 70%whereas the range of correctness values was 91 to 94%.展开更多
An evaluation index is a prerequisite for the scientific evaluation of a public meteorological service.This paper aims to explore a technical method for determining and screening evaluation indicators.Based on public ...An evaluation index is a prerequisite for the scientific evaluation of a public meteorological service.This paper aims to explore a technical method for determining and screening evaluation indicators.Based on public satisfaction survey data obtained in Wafangdian,China in 2010,this study investigates the suitability of fuzzy clustering analysis method in establishing an evaluation index.Through quantitative analysis of multilayer fuzzy clustering of various evaluation indicators,correlation analysis indicates that if the results of clustering were identical for two evaluation indicators in the same sub-evaluation layer,then one indicator could be removed,or the two indicators merged.For evaluation indicators in different sub-evaluation layers,although clustering reveals attribute correlations,these indicators may not be substituted for one another.Analysis of the applicability of the fuzzy clustering method shows that it plays a certain role in the establishment and correction of an evaluation index.展开更多
A total of 10 indices of regional economic development in Guangxi are selected.According to the relevant economic data,regional economic development in Guangxi is analyzed by using System Clustering Method and Princip...A total of 10 indices of regional economic development in Guangxi are selected.According to the relevant economic data,regional economic development in Guangxi is analyzed by using System Clustering Method and Principal Component Analysis Method.Result shows that System Clustering Method and Principal Component Analysis Method have revealed similar results analysis of economic development level.Overall economic strength of Guangxi is weak and Nanning has relatively high scores of factors due to its advantage of the political,economic and cultural center.Comprehensive scores of other regions are all lower than 1,which has big gap with the development of Nanning.Overall development strategy points out that Guangxi should accelerate the construction of the Ring Northern Bay Economic Zone,create a strong logistics system having strategic significance to national development,use the unique location advantage and rely on the modern transportation system to establish a logistics center and business center connecting the hinterland and the Asean Market.Based on the problems of unbalanced regional economic development in Guangxi,we should speed up the development of service industry in Nanning,construct the circular economy system of industrial city,and accelerate the industrialization process of tourism city in order to realize balanced development of regional economy in Guangxi,China.展开更多
In this paper,we report upon our recent work aimed at improving and adapting machine learning algorithms to automatically classify nanoscience images acquired by the Scanning Electron Microscope(SEM).This is done by c...In this paper,we report upon our recent work aimed at improving and adapting machine learning algorithms to automatically classify nanoscience images acquired by the Scanning Electron Microscope(SEM).This is done by coupling supervised and unsupervised learning approaches.We first investigate supervised learning on a ten-category data set of images and compare the performance of the different models in terms of training accuracy.Then,we reduce the dimensionality of the features through autoencoders to perform unsupervised learning on a subset of images in a selected range of scales(from 1μm to 2μm).Finally,we compare different clustering methods to uncover intrinsic structures in the images.展开更多
In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluste...In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.展开更多
基金Supported by Key Project of New Product Development in Yunnan Province(2009BB006)~~
文摘Inter-simple sequence repeat(ISSR) molecular markers were applied to analyze the genetic diversity and clustering of 48 introduced and bred cultivars of Olea euyopaea L. Totally 106 DNA bands were amplified by 11 screened primers, including 99 polymorphic bands; the percentage of polymorphic loci was 93.40%, indicating a rich genetic diversity in Olea euyopaea L. germplasm resources. Based on Nei's genetic distances between various cultivars, a dendrogram of 48 cultivars of Olea euyopaea L. was constructed using unweighted pair-group(UPMGA)method,which showed that 48 cultivars were clustered into four main categories; 84.6% of native cultivars were clustered into two categories; most of introduced cultivars were clustered based on their sources and main usages but not on their geographic origins. This study will provide references for the utilization and further genetic improvement of Olea euyopaea L. germplasm resources.
基金The National Natural Science Foundation of China(No.50674086)Specialized Research Fund for the Doctoral Program of Higher Education(No.20060290508)the Postdoctoral Scientific Program of Jiangsu Province(No.0701045B)
文摘In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.
基金Supported by the National Natural Science Foundation of China(30860147)Open Funds of National Key Laboratory of Crop Genetic Improvement(ZK200902)Natural Science Foundation of Yunnan Province(2011FB117)~~
文摘[Objective] This study aimed to develop ACGM markers for the clustering analysis of large grained Brassica napus materials. [Method] A total of 44 pairs of ACGM primers were designed according to 18 genes related to Arabidopsis grain development and their homologous rape EST sequences. After electrophoresis, 18 pairs of ACGM primers were selected for the clustering analysis of 16 larger grained samples and four fine grained samples of rapeseed. [Result] PCR result showed that 2-6 specific bands were respectively amplified by each pair of primes, and all the bands were polymorphic and repeatable, suggesting that the optimized ACGM markers were useful for clustering analysis of B. napus species. Clustering analysis revealed that the 20 rapeseed samples were divided into three clusters A, B, and C at similarity coefficient 0.6. Then, the clusters A and B were further divided into five sub clusters A1, A2, A3, B1 and B2 at similarity coefficient 0.67. [Conclusion] This study will provide theoretical and practical values for rape breeding.
基金Supported by Fund of Sichuan Provincial Administration of traditional Chinese Medicine(2008-12)~~
文摘[Objective] This study aimed to investigate the trace elements in Rehman- nia glutinosa Libosch. by using principal component analysis and clustering analysis. [Method] Principal component analysis and clustering analysis of R. glutinosa medicinal materials from different sources were conducted with contents of six trace elements as indices. [Result] The principal component analysis could comprehen- sively evaluate the quality of R. glutinosa samples with objective results which was consistent with the results of clustering analysis. [Conclusion] Principal component analysis and clustering analysis methods can be used for the quality evaluation of Chinese medicinal materials with multiple indices.
文摘The goal of this study was to optimize the constitutive parameters of foundation soils using a k-means algorithm with clustering analysis. A database was collected from unconfined compression tests, Proctor tests and grain distribution tests of soils taken from three different types of foundation pits: raft foundations, partial raft foundations and strip foundations. k-means algorithm with clustering analysis was applied to determine the most appropriate foundation type given the un- confined compression strengths and other parameters of the different soils.
基金This work has been supported by.Central University Research Fund(No.2016MS116,No.2016MS117,No.2018MS074)the National Natural Science Foundation(51677072).
文摘Effective storage,processing and analyzing of power device condition monitoring data faces enormous challenges.A framework is proposed that can support both MapReduce and Graph for massive monitoring data analysis at the same time based on Aliyun DTplus platform.First,power device condition monitoring data storage based on MaxCompute table and parallel permutation entropy feature extraction based on MaxCompute MapReduce are designed and implemented on DTplus platform.Then,Graph based k-means algorithm is implemented and used for massive condition monitoring data clustering analysis.Finally,performance tests are performed to compare the execution time between serial program and parallel program.Performance is analyzed from CPU cores consumption,memory utilization and parallel granularity.Experimental results show that the designed framework and parallel algorithms can efficiently process massive power device condition monitoring data.
基金supported by the Science and Technology Project of Henan Provincial Science and Technology Department (No.0424490012 )Major Program of Henan Institute of Science and Technology (No.040132)
文摘Five factors expressing greenbelt quality and one factor expressing quantity were adopted for evaluation of the residential greenbelt, and the AHP (Analytical Hierarchy Process) method was used to determine the value of factors. Thirty residential areas were selected as the samples. Two principal components were extracted and their expression was constructed by method of factor anlysis, therefore, quality evaluation of residential greenbelt was obtained. The accuracy of the function and implement quality classification toward the residential greenbelts in Xinxiang City were validated by clustering analysis method. The results showed that the greenbelt quality of fourteen residential areas was higher than the average level, of which eleven were newly-built residential areas. The 30 residential areas were classified into three types according to their greenbelt features and their formation by clustering analysis method. Finally rational proposal basing on aforesaid evaluating results was proposed for construction and renewal of residential greenbelt, upon which directive basis was provided for construction and renewal of residential greenbelt.
基金the National Natural Science Foundation of China (No.60472072)the Specialized Research Foundation for the Doctoral Program of Higher Educa-tion of China (No.20040699034).
文摘A novel Support Vector Machine(SVM) ensemble approach using clustering analysis is proposed. Firstly,the positive and negative training examples are clustered through subtractive clus-tering algorithm respectively. Then some representative examples are chosen from each of them to construct SVM components. At last,the outputs of the individual classifiers are fused through ma-jority voting method to obtain the final decision. Comparisons of performance between the proposed method and other popular ensemble approaches,such as Bagging,Adaboost and k.-fold cross valida-tion,are carried out on synthetic and UCI datasets. The experimental results show that our method has higher classification accuracy since the example distribution information is considered during en-semble through clustering analysis. It further indicates that our method needs a much smaller size of training subsets than Bagging and Adaboost to obtain satisfactory classification accuracy.
文摘A novel multivariate similarity clustering analysis (MSCA) approach was used to estimate a biogeographical division scheme for the global terrestrial fauna and was compared against other widely used clustering algorithms. The faunal dataset included almost all terrestrial and freshwater fauna, a total of 4631 families, 141,814 genera, and 1,334,834 species. Our findings demonstrated that suitable results were only obtained with the MSCA method, which was associated with distinct hierarchies, reasonable structuring, and furthermore, conformed to biogeographical criteria. A total of seven kingdoms and 20 sub-kingdoms were identified. We discovered that the clustering results for the higher and lower animals did not differ significantly, leading us to consider that the analysis result is convincing as the first zoogeographical division scheme for global all terrestrial animals.
文摘Affected by many involved factors, different dimensions, data with large difference, incomplete information and so on, the most optimal selection of regional outburst prevention measures for outburst mine has become a complicated system project. The traditional way of outburst prevention measure selection belongs to qualitative method, which may cause high-cost of gas control, huge quantities of drilling work, long construction time and even secondary disaster. To solve the above-mentioned problems, in light of occurrence status of coal seam gas in No. 21 mining area of Jinzhushan Tuzhu Mine, through grey fixed weight clustering theory and a combination method of qualitative and quantitative analysis, the judging model with multi-objective classification for optimization of outburst prevention measures was established. The three weight coefficients of outburst prevention technology scheme are sorted, in order to determine the advantages and disadvantages of each outburst prevention technology scheme under the comprehensive evaluation of multi-target. Finally, the problem of quantitative selection for regional outburst prevention technology scheme is solved under the situation of multi-factor mode and incomplete information, which provides reasonable and effective technical measures for prevention of coal and gas outburst disaster.
基金Supported by Project of Dagang Branch of Petroleum Group Company Ltd,CNPC No TJDG-JZHT-2005-JSFW-0000-00339
文摘The main task of provenance analysis is to determine the source of sediments and the position of parent rocks.Provenance analysis may find out the relationship between erosion districts and sediment zone,between the uplift and the depression in the process of basin development.The authors use the method of heavy mineral clustering analysis and estimate the provenance direction of Huanghua Depression in the Paleogene Kong 2 Member.Research shows that there were five provenance areas of Kong 2 Member in Kongnan area.They are western(Shenusi),northwestern(Cangzhou),eastern(Ganhuatun),northeastern and southeastern.The main provenance areas were northwestern and western,while the southern provenance could not be ruled out.And these areas are consistent with the known provenance areas.
基金supported by the key laboratory foundation of Henna(112300413221).
文摘In this study,the world’s land(except Antarctica)is divided into 67 basic geographical units according to ecological types.Using our newly proposed MSCA(Multivariate Similarity Clustering Analysis)method,7,591 species of modern terrestrial mammals belonging to 1,374 genera in 162 families and 2,378 species of mammals in the Wallace era before 1876 are quantitatively analyzed,and almost the same clustering results are obtained,with clear levels and reasonable clustering,which conform to the principles of geography,statistics,ecology and biology.It not only affirms and supports the reasonable kernel of Wallace’s scheme,but also puts forward suggestions that should be revised and improved.The large or small differences between the clustering results and the mammalian geographical zoning schemes of contemporary scholars are caused by different analysis methods,and they are highly consistent with the analysis results of chordates,angiosperms and insects in the world analyzed by the same method.Once again,it confirms the homogeneity of the global biological distribution pattern of major groups,and the possibility of building a unified biogeographic zoning system in the world.
文摘Objective: To analyze hot research areas and the present research status of nursing safety management in PubMed. Methods: PubMed was searched using "safety management" for the literature on nursing safety management. BICOMB 2.0 and SPSS 20.0 software were used to analyze high-frequency keywords and conduct co-word clustering analysis. Results: We searched for totally 2353 articles related to our topic and extracted 19 high-frequency keywords (27.50%). Five research focuses were concluded, including: study on nursing safety culture; team work to promote nursing safety; practice of nursing safety management; workplace violence against nursing staffs; nursing safety and quality evaluation standard. Conclusion: Analysis of the hotspots of nursing safety management in the past 10 years will contribute to understanding the research emphases and trend of development, and provide reference for the study and practice of nursing safety management.
文摘Background:Brucellosis is a major public health issue in China,while its temporal and spatial distribution have not been studied in depth.This study aims to better understand the epidemiology of brucellosis in the mainland of China,by investigating the human,temporal and spatial distribution and clustering characteristics of the disease.Methods:Human brucellosis data from the mainland of China between 2012 and 2016 were obtained from the China Information System for Disease Control and Prevention.The spatial autocorrelation analysis of ArcGIS10.6 and the spatial-temporal scanning analysis of SaTScan software were used to identify potential changes in the spatial and temporal distribution of human brucellosis in the mainland of China during the study period.Results:A total of 244348 human brucellosis cases were reported during the study period of 2012-2016.The average incidence of human brucellosis was higher in the 40-65 age group.The temporal clustering analysis showed that the high incidence of brucellosis occurred between March and July.The spatial clustering analysis showed that the location of brucellosis clustering in the mainland of China remained relatively fixed,mainly concentrated in most parts of northern China.The results of the spatial-temporal clustering analysis showed that Heilongjiang represents a primary clustering area,and the Tibet,Shanxi and Hubei provinces represent three secondary clustering areas.Conclusions:Human brucellosis remains a widespread challenge,particularly in northern China.The clustering analysis highlights potential high-risk human groups,time frames and areas,which may require special plans and resources to monitor and control the disease.
基金This work was supported by Key Program of National Natural Science Foundation of China(No.51838011)National Key Research and Development Program of China(Project No.2018YFC0704505)the Rixin Talent Program granted by Beijing University of Technology.
文摘The current scheme of building climate zones in China generally assumes that building climate zones of island cities are identical to adjacent land stations.Consequently,building design strategies for island buildings usually refer to those developed for inland cities.This approach has to some extent hindered the energy-saving design and green development of island buildings in China.This research takes a first step on this issue by defining the building climate zones of 36 marine islands over China marine area using two-stage zoning methodology adopted by current building climate zoning standard(GB50178-1993).The meteorological data used for analysis was obtained from the National Climate Center of China over the 30-year period from 1985 to 2014.As comparison,40 coastal stations which are adjacent to the inves-tigated marine islands were also included in this study.Subsequently a more obiective techni-que-cluster analysis was operated as an effective supplement to discover the climate characteristics among different observations.The results of both methodologies consistentlyshow that among the 36 islands investigated,the majority of islands located in northern and eastern marine area belong to the same climate zones as their adjacent coastal cities.Howev-er,island cities in southern marine area cannot be assigned to any current climate zone,which was demonstrated by its distinctive climate features different from any other sites investi-gated through cluster analysis as well as different energy use patterns.Thus a new zone was defined to supplement the current building climate zoning scheme to cover marine area of China.
文摘With rapid developments in platforms and sensors technology in terms of digital cameras and video recordings,crowd monitoring has taken a considerable attentions in many disciplines such as psychology,sociology,engineering,and computer vision.This is due to the fact that,monitoring of the crowd is necessary to enhance safety and controllable movements to minimize the risk particularly in highly crowded incidents(e.g.sports).One of the platforms that have been extensively employed in crowd monitoring is unmanned aerial vehicles(UAVs),because UAVs have the capability to acquiring fast,low costs,high-resolution and real-time images over crowd areas.In addition,geo-referenced images can also be provided through integration of on-board positioning sensors(e.g.GPS/IMU)with vision sensors(digital cameras and laser scanner).In this paper,a new testing procedure based on feature from accelerated segment test(FAST)algorithms is introduced to detect the crowd features from UAV images taken from different camera orientations and positions.The proposed test started with converting a circle of 16 pixels surrounding the center pixel into a vector and sorting it in ascending/descending order.A single pixel which takes the ranking number 9(for FAST-9)or 12(for FAST-12)was then compared with the center pixel.Accuracy assessment in terms of completeness and correctness was used to assess the performance of the new testing procedure before and after filtering the crowd features.The results show that the proposed algorithms are able to extract crowd features from different UAV images.Overall,the values of Completeness range from 55 to 70%whereas the range of correctness values was 91 to 94%.
基金National Science Foundation of China(91637105,41775048 and 41475041)National Key R&D Program of China(2018YFC1507800)Research on Tourism Traffic Meteorological Service Products in Heilongjiang Province(HQZD2017004)
文摘An evaluation index is a prerequisite for the scientific evaluation of a public meteorological service.This paper aims to explore a technical method for determining and screening evaluation indicators.Based on public satisfaction survey data obtained in Wafangdian,China in 2010,this study investigates the suitability of fuzzy clustering analysis method in establishing an evaluation index.Through quantitative analysis of multilayer fuzzy clustering of various evaluation indicators,correlation analysis indicates that if the results of clustering were identical for two evaluation indicators in the same sub-evaluation layer,then one indicator could be removed,or the two indicators merged.For evaluation indicators in different sub-evaluation layers,although clustering reveals attribute correlations,these indicators may not be substituted for one another.Analysis of the applicability of the fuzzy clustering method shows that it plays a certain role in the establishment and correction of an evaluation index.
文摘A total of 10 indices of regional economic development in Guangxi are selected.According to the relevant economic data,regional economic development in Guangxi is analyzed by using System Clustering Method and Principal Component Analysis Method.Result shows that System Clustering Method and Principal Component Analysis Method have revealed similar results analysis of economic development level.Overall economic strength of Guangxi is weak and Nanning has relatively high scores of factors due to its advantage of the political,economic and cultural center.Comprehensive scores of other regions are all lower than 1,which has big gap with the development of Nanning.Overall development strategy points out that Guangxi should accelerate the construction of the Ring Northern Bay Economic Zone,create a strong logistics system having strategic significance to national development,use the unique location advantage and rely on the modern transportation system to establish a logistics center and business center connecting the hinterland and the Asean Market.Based on the problems of unbalanced regional economic development in Guangxi,we should speed up the development of service industry in Nanning,construct the circular economy system of industrial city,and accelerate the industrialization process of tourism city in order to realize balanced development of regional economy in Guangxi,China.
基金This work has been done within the NFFA-EUROPE project and has received funding from the European Union’s Horizon 2020 Research and Innovation Program under grant agreement No.654360 NFFA-EUROPE.
文摘In this paper,we report upon our recent work aimed at improving and adapting machine learning algorithms to automatically classify nanoscience images acquired by the Scanning Electron Microscope(SEM).This is done by coupling supervised and unsupervised learning approaches.We first investigate supervised learning on a ten-category data set of images and compare the performance of the different models in terms of training accuracy.Then,we reduce the dimensionality of the features through autoencoders to perform unsupervised learning on a subset of images in a selected range of scales(from 1μm to 2μm).Finally,we compare different clustering methods to uncover intrinsic structures in the images.
文摘In view of the composition analysis and identification of ancient glass products, L1 regularization, K-Means cluster analysis, elbow rule and other methods were comprehensively used to build logical regression, cluster analysis, hyper-parameter test and other models, and SPSS, Python and other tools were used to obtain the classification rules of glass products under different fluxes, sub classification under different chemical compositions, hyper-parameter K value test and rationality analysis. Research can provide theoretical support for the protection and restoration of ancient glass relics.