In order to make a scientific pavement maintenance decision, a grey-theory-based prediction methodological framework is proposed to predict pavement performance. Based on the field pavement rutting data,analysis of va...In order to make a scientific pavement maintenance decision, a grey-theory-based prediction methodological framework is proposed to predict pavement performance. Based on the field pavement rutting data,analysis of variance (ANOVA)was first used to study the influence of different factors on pavement rutting. Cluster analysis was then employed to investigate the rutting development trend.Based on the clustering results,the grey theory was applied to build pavement rutting models for each cluster, which can effectively reduce the complexity of the predictive model.The results show that axial load and asphalt binder type play important roles in rutting development.The prediction model is capable of capturing the uncertainty in the pavement performance prediction process and can meet the requirements of highway pavement maintenance,and,therefore,has a wide application prospects.展开更多
In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising...In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.展开更多
The features of DNA sequence fragments were extracted from the distribution density of the condons in the individual cases of DNA sequence fragments. Based on the polarity of side chain radicals of amino acids molecul...The features of DNA sequence fragments were extracted from the distribution density of the condons in the individual cases of DNA sequence fragments. Based on the polarity of side chain radicals of amino acids molecules, the amino acids were classified into five categories, and the frequencies of these five categories were calculated. This kind of feature extraction based on the biological meanings not only took the content of basic groups into consideration, but also considered the marshal ing sequence of the basic groups. The hierarchical clustering analysis and BP neural network were used to classify the DNA sequence fragments. The results showed that the classification results of these two kinds of algo-rithms not only had high accuracy, but also had high consistence, indicating that this kind of feature extraction was superior over the traditional feature extraction which only took the features of basic groups into consideration.展开更多
In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared dista...In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation.展开更多
This study was undertaken to construct a preliminary spatial analysis method for building an urban-suburban-rural category in the specific sample area of central California and providing distribution characteristics i...This study was undertaken to construct a preliminary spatial analysis method for building an urban-suburban-rural category in the specific sample area of central California and providing distribution characteristics in each category, based on which, some further studies such as regional manners of residential wood burning emission (PM2.5, the term used for a mixture of solid particles and liquid droplets found in the air, refers to particulate matter that is 2.5 mu m or smaller in size) could be carried out for the project of residential wood combustion. Demographic and infrastructure data with spatial characteristics were processed by integrating both Geographic Information System (GIS) and statistics method (Cluster Analysis), and then output to a category map as the result. It approached the quantitative and multi-variables description on the major characteristics variations among the urban, suburban and rural; and perfected the TIGER's urban-rural classification scheme by adding suburban category. Based on the free public GIS data, the spatial analysis method provides an easy and ideal tool for geographic researchers, environmental planners, urban/regional planners and administrators to delineate different categories of regional function on the specific locations and dig out spatial distribution information they wanted. Furthermore, it allows for future adjustment on some parameters as the spatial analysis method is implemented in the different regions or various eco-social models.展开更多
In this study, we used the multi-resolution graph-based clustering (MRGC) method for determining the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields loc...In this study, we used the multi-resolution graph-based clustering (MRGC) method for determining the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields located in the Amu Darya Basin. The MRGC could automatically determine the optimal number of clusters without prior knowledge about the structure or cluster numbers of the analyzed data set and allowed the users to control the level of detail actually needed to define the EF. Based on the LF identification and successful EF calibration using core data, an MRGC EF partition model including five clusters and a quantitative LF interpretation chart were constructed. The EF clusters 1 to 5 were interpreted as lagoon, anhydrite flat, interbank, low-energy bank, and high-energy bank, and the coincidence rate in the cored interval could reach 85%. We concluded that the MRGC could be accurately applied to predict the LF in non-cored but logged wells. Therefore, continuous EF clusters were partitioned and corresponding LF were characteristics &different LF were analyzed interpreted, and the distribution and petrophysical in the framework of sequence stratigraphy.展开更多
Based on the safety coefficient method,which assigns rock failure criteria to calculate the rock mass unit,the safety coefficient contour of surrounding rock is plotted to judge the distribution form of the fractured ...Based on the safety coefficient method,which assigns rock failure criteria to calculate the rock mass unit,the safety coefficient contour of surrounding rock is plotted to judge the distribution form of the fractured zone in the roadway.This will provide the basis numerical simulation to calculate the surrounding rock fractured zone in a roadway.Using the single factor and multi-factor orthogonal test method,the evolution law of roadway surrounding rock displacements,plastic zone and stress distribution under different conditions is studied.It reveals the roadway surrounding rock burst evolution process,and obtains five kinds of failure modes in deep soft rock roadway.Using the fuzzy mathematics clustering analysis method,the deep soft surrounding rock failure model in Zhujixi mine can be classified and patterns recognized.Compared to the identification results and the results detected by geological radar of surrounding rock loose circle,the reliability of the results of the pattern recognition is verified and lays the foundations for the support design of deep soft rock roadways.展开更多
DNS(domain name system) query log analysis has been a popular research topic in recent years. CLOPE, the represented transactional clustering algorithm, could be readily used for DNS query log mining. However, the alg...DNS(domain name system) query log analysis has been a popular research topic in recent years. CLOPE, the represented transactional clustering algorithm, could be readily used for DNS query log mining. However, the algorithm is inefficient when processing large scale data. The MR-CLOPE algorithm is proposed, which is an extension and improvement on CLOPE based on Map Reduce. Different from the previous parallel clustering method, a two-stage Map Reduce implementation framework is proposed. Each of the stage is implemented by one kind Map Reduce task. In the first stage, the DNS query logs are divided into multiple splits and the CLOPE algorithm is executed on each split. The second stage usually tends to iterate many times to merge the small clusters into bigger satisfactory ones. In these two stages, a novel partition process is designed to randomly spread out original sub clusters, which will be moved and merged in the map phrase of the second phase according to the defined merge criteria. In such way, the advantage of the original CLOPE algorithm is kept and its disadvantages are dealt with in the proposed framework to achieve more excellent clustering performance. The experiment results show that MR-CLOPE is not only faster but also has better clustering quality on DNS query logs compared with CLOPE.展开更多
Based on the historical observed data and the modeling results,this paper investigated the seasonal variations in the Taiwan Warm Current Water(TWCW)using a cluster analysis method and examined the contributions of th...Based on the historical observed data and the modeling results,this paper investigated the seasonal variations in the Taiwan Warm Current Water(TWCW)using a cluster analysis method and examined the contributions of the Kuroshio onshore intrusion and the Taiwan Strait Warm Current(TSWC)to the TWCW on seasonal time scales.The TWCW has obviously seasonal variation in its horizontal distribution,T-S characteristics and volume.The volume of TWCW is maximum(13746 km^3)in winter and minimum(11397 km^3)in autumn.As to the contributions to the TWCW,the TSWC is greatest in summer and smallest in winter,while the Kuroshio onshore intrusion northeast of Taiwan Island is strongest in winter and weakest in summer.By comparison,the Kuroshio onshore intrusion make greater contributions to the Taiwan Warm Current Surface Water(TWCSW)than the TSWC for most of the year,except for in the summertime(from June to August),while the Kuroshio Subsurface Water(KSSW)dominate the Taiwan Warm Current Deep Water(TWCDW).The analysis results demonstrate that the local monsoon winds is the dominant factor controlling the seasonal variation in the TWCW volume via Ekman dynamics,while the surface heat fl ux can play a secondary role via the joint ef fect of baroclinicity and relief.展开更多
Macrobenthic communities in the surrounding waters of Changli were investigated during spring and summer in2016.Differences in species quantity,abundance and biomass,the dominant species and species diversity index of...Macrobenthic communities in the surrounding waters of Changli were investigated during spring and summer in2016.Differences in species quantity,abundance and biomass,the dominant species and species diversity index of macrobenthos were analyzed.The results showed that58macrobenthos species were identified in spring,and92macrobenthos species were identified in summer.The composition of dominant species seasonally varied;however,most of them were species belonging to Polychaeta.The abundance of macrobenthos in summer was slightly higher than that in spring,while the biomass in summer was significantly smaller than that in spring.Bray-Curtis cluster analysis and multi-dimentional scaling(MDS)analysis indicated that macrobenthic communities were divided into three communities in spring,and two in summer.The abundance-biomass comparison(ABC)curve method was used to monitor the disturbance of environmental pollution for macrobenthic community.The results showed that the macrobenthos in this area received serious disturbance from environmental pollution.展开更多
Point-wise negative selection algorithms,which generate their detector sets based on point of self data,have lower training efficiency and detection rate.To solve this problem,a self region based real-valued negative ...Point-wise negative selection algorithms,which generate their detector sets based on point of self data,have lower training efficiency and detection rate.To solve this problem,a self region based real-valued negative selection algorithm is presented.In this new approach,the continuous self region is defined by the collection of self data,the partial training takes place at the training stage according to both the radius of self region and the cosine distance between gravity of the self region and detector candidate,and variable detectors in the self region are deployed.The algorithm is tested using the triangle shape of self region in the 2-D complement space and KDD CUP 1999 data set.Results show that,more information can be provided when the training self points are used together as a whole,and compared with the point-wise negative selection algorithm,the new approach can improve the training efficiency of system and the detection rate significantly.展开更多
We propose a new clustering algorithm that assists the researchers to quickly and accurately analyze data. We call this algorithm Combined Density-based and Constraint-based Algorithm (CDC). CDC consists of two phases...We propose a new clustering algorithm that assists the researchers to quickly and accurately analyze data. We call this algorithm Combined Density-based and Constraint-based Algorithm (CDC). CDC consists of two phases. In the first phase, CDC employs the idea of density-based clustering algorithm to split the original data into a number of fragmented clusters. At the same time, CDC cuts off the noises and outliers. In the second phase, CDC employs the concept of K-means clustering algorithm to select a greater cluster to be the center. Then, the greater cluster merges some smaller clusters which satisfy some constraint rules. Due to the merged clusters around the center cluster, the clustering results show high accuracy. Moreover, CDC reduces the calculations and speeds up the clustering process. In this paper, the accuracy of CDC is evaluated and compared with those of K-means, hierarchical clustering, and the genetic clustering algorithm (GCA) proposed in 2004. Experimental results show that CDC has better performance.展开更多
AIM: To investigate the effcacy of effuent biomarkers for peritoneal deterioration with functional decline in peritoneal dialysis (PD).METHODS: From January 2005 to March 2013, the subjects included 218 PD patient...AIM: To investigate the effcacy of effuent biomarkers for peritoneal deterioration with functional decline in peritoneal dialysis (PD).METHODS: From January 2005 to March 2013, the subjects included 218 PD patients with end-stage renal disease at 18 centers. Matrix metalloproteinase-2 (MMP-2), interleukin-6 (IL-6), hyaluronan, and cancer antigen 125 (CA125) in peritoneal effluent were quantified with enzyme-linked immunosorbent assay. Peritoneal solute transport rate was assessed by peritoneal equilibration test (PET) to estimate peritoneal deterioration.RESULTS: The ratio of the effuent level of creatinine (Cr) obtained 4 h after injection (D) to that of plasma was correlated with the effluent levels of MMP-2 (ρ = 0.74, P 〈 0.001), IL-6 (ρ = 0.46, P 〈 0.001), and hyaluronan (ρ = 0.27, P 〈 0.001), but not CA125 (ρ = 0.13, P = 0.051). The area under receiver operating characteristic curve for the effluent levels of MMP-2, IL-6, and hyaluronan against high PET category were 0.90, 0.78, 0.62, and 0.51, respectively. No patient developed new-onset encapsulating peritoneal sclerosis for at least 1.5 years after peritoneal effuent sampling.CONCLUSION: The effuent MMP-2 level most closely reflected peritoneal solute transport rate. MMP-2 can be a reliable indicator of peritoneal deterioration with functional decline.展开更多
From the perspective of tourism competitiveness,the paper takes 12 island counties of China as the research object,and applies the method of factor analysis to study their competitiveness.The result shows that Putuo a...From the perspective of tourism competitiveness,the paper takes 12 island counties of China as the research object,and applies the method of factor analysis to study their competitiveness.The result shows that Putuo and Dinghai are more competitive while Pingtan and Nan'ao are less competitive.Finally,the 12 island counties are divided into four styles:first-class competitive county (Putuo),seond-class competitive counties (Dinghai,Yuhuan),third-class competitive counties (Chongming,Daishan,Changdao,Changhai and Shengsi),fourth-class competitive counties (Dongshan,Dongtou,Pingtan and Nan'ao) by cluster analysis.The classification of island counties is to clear their relative position,then to promote their development.展开更多
Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with g...Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with good performance and less computation.2) Class-based method always loses the prediction ability to adapt the text in different domains.In order to solve above problems,a definition of word similarity by utilizing mutual information was presented.Based on word similarity,the definition of word set similarity was given.Experiments show that word clustering algorithm based on similarity is better than conventional greedy clustering method in speed and performance,and the perplexity is reduced from 283 to 218.At the same time,an absolute weighted difference method was presented and was used to construct vari-gram language model which has good prediction ability.The perplexity of vari-gram model is reduced from 234.65 to 219.14 on Chinese corpora,and is reduced from 195.56 to 184.25 on English corpora compared with category-based model.展开更多
This paper introduces data mining technology in enterprise competitive intelligence system; and then introduced theoretical foundation and main clustering method of cluster analysis. The article emphasis on the FCM al...This paper introduces data mining technology in enterprise competitive intelligence system; and then introduced theoretical foundation and main clustering method of cluster analysis. The article emphasis on the FCM algorithm and principle and described implementation steps, and proposed the improvement FCM algorithm based on K mean particle size; finally, realize the design and implementation of enterprise competitive intelligence analysis and mining service system, and the improved FCM algorithm is applied in the system.展开更多
This works examine the responses of housing prices to the monetary policies in various Chinese cities. Thirty-five large and medium sized Chinese cities are classified into six clusters applying the minimum variance c...This works examine the responses of housing prices to the monetary policies in various Chinese cities. Thirty-five large and medium sized Chinese cities are classified into six clusters applying the minimum variance clustering method according to the calculated correlation coefficients between the housing price indices of every two cities.Time difference correlation analysis is then employed to quantify the relations between the housing price indices of the six clusters and the monetary policies.It is suggested that the housing prices of various cities evolved at different paces and their responses to the monetary policies are heterogeneous,and local economic features are more important than geographic distances in determining the housing price trends.展开更多
Chosen-message pair Simple Power Analysis (SPA) attacks were proposed by Boer, Yen and Homma, and are attack methods based on searches for collisions of modular multiplication. However, searching for collisions is dif...Chosen-message pair Simple Power Analysis (SPA) attacks were proposed by Boer, Yen and Homma, and are attack methods based on searches for collisions of modular multiplication. However, searching for collisions is difficult in real environments. To circumvent this problem, we propose the Simple Power Clustering Attack (SPCA), which can automatically identify the modular multiplication collision. The insignificant effects of collision attacks were validated in an Application Specific Integrated Circuit (ASIC) environment. After treatment with SPCA, the automatic secret key recognition rate increased to 99%.展开更多
基金The Major Scientific and Technological Special Project of Jiangsu Provincial Communications Department(No.2011Y/02-G1)
文摘In order to make a scientific pavement maintenance decision, a grey-theory-based prediction methodological framework is proposed to predict pavement performance. Based on the field pavement rutting data,analysis of variance (ANOVA)was first used to study the influence of different factors on pavement rutting. Cluster analysis was then employed to investigate the rutting development trend.Based on the clustering results,the grey theory was applied to build pavement rutting models for each cluster, which can effectively reduce the complexity of the predictive model.The results show that axial load and asphalt binder type play important roles in rutting development.The prediction model is capable of capturing the uncertainty in the pavement performance prediction process and can meet the requirements of highway pavement maintenance,and,therefore,has a wide application prospects.
基金The National Natural Science Foundation of China(No.50674086)Specialized Research Fund for the Doctoral Program of Higher Education(No.20060290508)the Postdoctoral Scientific Program of Jiangsu Province(No.0701045B)
文摘In order to mine production and security information from security supervising data and to ensure security and safety involved in production and decision-making,a clustering analysis algorithm for security supervising data based on a semantic description in coal mines is studied.First,the semantic and numerical-based hybrid description method of security supervising data in coal mines is described.Secondly,the similarity measurement method of semantic and numerical data are separately given and a weight-based hybrid similarity measurement method for the security supervising data based on a semantic description in coal mines is presented.Thirdly,taking the hybrid similarity measurement method as the distance criteria and using a grid methodology for reference,an improved CURE clustering algorithm based on the grid is presented.Finally,the simulation results of a security supervising data set in coal mines validate the efficiency of the algorithm.
基金Supported by the Natural Science Foundation of Zhejiang Province(LY13A010007)~~
文摘The features of DNA sequence fragments were extracted from the distribution density of the condons in the individual cases of DNA sequence fragments. Based on the polarity of side chain radicals of amino acids molecules, the amino acids were classified into five categories, and the frequencies of these five categories were calculated. This kind of feature extraction based on the biological meanings not only took the content of basic groups into consideration, but also considered the marshal ing sequence of the basic groups. The hierarchical clustering analysis and BP neural network were used to classify the DNA sequence fragments. The results showed that the classification results of these two kinds of algo-rithms not only had high accuracy, but also had high consistence, indicating that this kind of feature extraction was superior over the traditional feature extraction which only took the features of basic groups into consideration.
文摘In k-means clustering, we are given a set of n data points in d-dimensional space R^d and an integer k and the problem is to determine a set of k points in R^d, called centers, so as to minimize the mean squared distance from each data point to its nearest center. In this paper, we present a simple and efficient clustering algorithm based on the k-means algorithm, which we call enhanced k-means algorithm. This algorithm is easy to implement, requiring a simple data structure to keep some information in each iteration to be used in the next iteration. Our experimental results demonstrated that our scheme can improve the computational speed of the k-means algorithm by the magnitude in the total number of distance calculations and the overall time of computation.
文摘This study was undertaken to construct a preliminary spatial analysis method for building an urban-suburban-rural category in the specific sample area of central California and providing distribution characteristics in each category, based on which, some further studies such as regional manners of residential wood burning emission (PM2.5, the term used for a mixture of solid particles and liquid droplets found in the air, refers to particulate matter that is 2.5 mu m or smaller in size) could be carried out for the project of residential wood combustion. Demographic and infrastructure data with spatial characteristics were processed by integrating both Geographic Information System (GIS) and statistics method (Cluster Analysis), and then output to a category map as the result. It approached the quantitative and multi-variables description on the major characteristics variations among the urban, suburban and rural; and perfected the TIGER's urban-rural classification scheme by adding suburban category. Based on the free public GIS data, the spatial analysis method provides an easy and ideal tool for geographic researchers, environmental planners, urban/regional planners and administrators to delineate different categories of regional function on the specific locations and dig out spatial distribution information they wanted. Furthermore, it allows for future adjustment on some parameters as the spatial analysis method is implemented in the different regions or various eco-social models.
基金supported by the National Science and Technology Major Project of China(No.2011ZX05029-003)CNPC Science Research and Technology Development Project,China(No.2013D-0904)
文摘In this study, we used the multi-resolution graph-based clustering (MRGC) method for determining the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields located in the Amu Darya Basin. The MRGC could automatically determine the optimal number of clusters without prior knowledge about the structure or cluster numbers of the analyzed data set and allowed the users to control the level of detail actually needed to define the EF. Based on the LF identification and successful EF calibration using core data, an MRGC EF partition model including five clusters and a quantitative LF interpretation chart were constructed. The EF clusters 1 to 5 were interpreted as lagoon, anhydrite flat, interbank, low-energy bank, and high-energy bank, and the coincidence rate in the cored interval could reach 85%. We concluded that the MRGC could be accurately applied to predict the LF in non-cored but logged wells. Therefore, continuous EF clusters were partitioned and corresponding LF were characteristics &different LF were analyzed interpreted, and the distribution and petrophysical in the framework of sequence stratigraphy.
基金provided by the National Natural Science Foundation of China(Nos.51322401,51309222,51323004,51579239 and 51574223)the Opening Project Fund of Shandong Provincial Key Laboratory of Civil Engineering Disaster Prevention and Mitigation(No.CDPM2014KF03)+2 种基金the State Key Laboratory for GeoMechanics Opening Project Fund of Shandong Provincial Key Laboratory of Civil Engineering Disaster Prevention and MitigationDeep Underground Engineering,China University of Mining&Technology(No.SKLGDUEK1305)China Postdoctoral Science Foundation(Nos.2014M551700and 2013M531424)
文摘Based on the safety coefficient method,which assigns rock failure criteria to calculate the rock mass unit,the safety coefficient contour of surrounding rock is plotted to judge the distribution form of the fractured zone in the roadway.This will provide the basis numerical simulation to calculate the surrounding rock fractured zone in a roadway.Using the single factor and multi-factor orthogonal test method,the evolution law of roadway surrounding rock displacements,plastic zone and stress distribution under different conditions is studied.It reveals the roadway surrounding rock burst evolution process,and obtains five kinds of failure modes in deep soft rock roadway.Using the fuzzy mathematics clustering analysis method,the deep soft surrounding rock failure model in Zhujixi mine can be classified and patterns recognized.Compared to the identification results and the results detected by geological radar of surrounding rock loose circle,the reliability of the results of the pattern recognition is verified and lays the foundations for the support design of deep soft rock roadways.
基金Project(61103046) supported in part by the National Natural Science Foundation of ChinaProject(B201312) supported by DHU Distinguished Young Professor Program,China+1 种基金Project(LY14F020007) supported by Zhejiang Provincial Natural Science Funds of ChinaProject(2014A610072) supported by the Natural Science Foundation of Ningbo City,China
文摘DNS(domain name system) query log analysis has been a popular research topic in recent years. CLOPE, the represented transactional clustering algorithm, could be readily used for DNS query log mining. However, the algorithm is inefficient when processing large scale data. The MR-CLOPE algorithm is proposed, which is an extension and improvement on CLOPE based on Map Reduce. Different from the previous parallel clustering method, a two-stage Map Reduce implementation framework is proposed. Each of the stage is implemented by one kind Map Reduce task. In the first stage, the DNS query logs are divided into multiple splits and the CLOPE algorithm is executed on each split. The second stage usually tends to iterate many times to merge the small clusters into bigger satisfactory ones. In these two stages, a novel partition process is designed to randomly spread out original sub clusters, which will be moved and merged in the map phrase of the second phase according to the defined merge criteria. In such way, the advantage of the original CLOPE algorithm is kept and its disadvantages are dealt with in the proposed framework to achieve more excellent clustering performance. The experiment results show that MR-CLOPE is not only faster but also has better clustering quality on DNS query logs compared with CLOPE.
基金Supported by the National Natural Science Foundation of China(Nos.41506020,41476019,41528601)the CAS Strategy Pioneering Program(No.XDA110020104)+2 种基金the Foundation for Innovative Research Groups of the National Natural Science Foundation of China(No.41421005)the NSFC-Shandong Joint Fund for Marine Science Research Centers(No.U1406401)the Global Change and Air-Sea Interaction(No.GASI-03-01-01-02)
文摘Based on the historical observed data and the modeling results,this paper investigated the seasonal variations in the Taiwan Warm Current Water(TWCW)using a cluster analysis method and examined the contributions of the Kuroshio onshore intrusion and the Taiwan Strait Warm Current(TSWC)to the TWCW on seasonal time scales.The TWCW has obviously seasonal variation in its horizontal distribution,T-S characteristics and volume.The volume of TWCW is maximum(13746 km^3)in winter and minimum(11397 km^3)in autumn.As to the contributions to the TWCW,the TSWC is greatest in summer and smallest in winter,while the Kuroshio onshore intrusion northeast of Taiwan Island is strongest in winter and weakest in summer.By comparison,the Kuroshio onshore intrusion make greater contributions to the Taiwan Warm Current Surface Water(TWCSW)than the TSWC for most of the year,except for in the summertime(from June to August),while the Kuroshio Subsurface Water(KSSW)dominate the Taiwan Warm Current Deep Water(TWCDW).The analysis results demonstrate that the local monsoon winds is the dominant factor controlling the seasonal variation in the TWCW volume via Ekman dynamics,while the surface heat fl ux can play a secondary role via the joint ef fect of baroclinicity and relief.
基金supported by Ecological Restoration Technique in Typical Bay (No. TKS160226)Tianjin Municipal Science and Technology Planning Project (15ZCZDSF00620)
文摘Macrobenthic communities in the surrounding waters of Changli were investigated during spring and summer in2016.Differences in species quantity,abundance and biomass,the dominant species and species diversity index of macrobenthos were analyzed.The results showed that58macrobenthos species were identified in spring,and92macrobenthos species were identified in summer.The composition of dominant species seasonally varied;however,most of them were species belonging to Polychaeta.The abundance of macrobenthos in summer was slightly higher than that in spring,while the biomass in summer was significantly smaller than that in spring.Bray-Curtis cluster analysis and multi-dimentional scaling(MDS)analysis indicated that macrobenthic communities were divided into three communities in spring,and two in summer.The abundance-biomass comparison(ABC)curve method was used to monitor the disturbance of environmental pollution for macrobenthic community.The results showed that the macrobenthos in this area received serious disturbance from environmental pollution.
基金Sponsored by the National Natural Science Foundation of China (Grant No. 60671049)the Subject Chief Foundation of Harbin (Grant No.2003AFXXJ013)+1 种基金the Education Department Research Foundation of Heilongjiang Province(Grant No. 10541044, 1151G012)the Postdoctor Foundation of Heilongjiang Province(Grant No.LBH-Z05092)
文摘Point-wise negative selection algorithms,which generate their detector sets based on point of self data,have lower training efficiency and detection rate.To solve this problem,a self region based real-valued negative selection algorithm is presented.In this new approach,the continuous self region is defined by the collection of self data,the partial training takes place at the training stage according to both the radius of self region and the cosine distance between gravity of the self region and detector candidate,and variable detectors in the self region are deployed.The algorithm is tested using the triangle shape of self region in the 2-D complement space and KDD CUP 1999 data set.Results show that,more information can be provided when the training self points are used together as a whole,and compared with the point-wise negative selection algorithm,the new approach can improve the training efficiency of system and the detection rate significantly.
文摘We propose a new clustering algorithm that assists the researchers to quickly and accurately analyze data. We call this algorithm Combined Density-based and Constraint-based Algorithm (CDC). CDC consists of two phases. In the first phase, CDC employs the idea of density-based clustering algorithm to split the original data into a number of fragmented clusters. At the same time, CDC cuts off the noises and outliers. In the second phase, CDC employs the concept of K-means clustering algorithm to select a greater cluster to be the center. Then, the greater cluster merges some smaller clusters which satisfy some constraint rules. Due to the merged clusters around the center cluster, the clustering results show high accuracy. Moreover, CDC reduces the calculations and speeds up the clustering process. In this paper, the accuracy of CDC is evaluated and compared with those of K-means, hierarchical clustering, and the genetic clustering algorithm (GCA) proposed in 2004. Experimental results show that CDC has better performance.
基金supported by Terumo Core Technology Center (Kanagawa, Japan)
文摘AIM: To investigate the effcacy of effuent biomarkers for peritoneal deterioration with functional decline in peritoneal dialysis (PD).METHODS: From January 2005 to March 2013, the subjects included 218 PD patients with end-stage renal disease at 18 centers. Matrix metalloproteinase-2 (MMP-2), interleukin-6 (IL-6), hyaluronan, and cancer antigen 125 (CA125) in peritoneal effluent were quantified with enzyme-linked immunosorbent assay. Peritoneal solute transport rate was assessed by peritoneal equilibration test (PET) to estimate peritoneal deterioration.RESULTS: The ratio of the effuent level of creatinine (Cr) obtained 4 h after injection (D) to that of plasma was correlated with the effluent levels of MMP-2 (ρ = 0.74, P 〈 0.001), IL-6 (ρ = 0.46, P 〈 0.001), and hyaluronan (ρ = 0.27, P 〈 0.001), but not CA125 (ρ = 0.13, P = 0.051). The area under receiver operating characteristic curve for the effluent levels of MMP-2, IL-6, and hyaluronan against high PET category were 0.90, 0.78, 0.62, and 0.51, respectively. No patient developed new-onset encapsulating peritoneal sclerosis for at least 1.5 years after peritoneal effuent sampling.CONCLUSION: The effuent MMP-2 level most closely reflected peritoneal solute transport rate. MMP-2 can be a reliable indicator of peritoneal deterioration with functional decline.
基金supported by a grant from Shandong Social Science Planning Project in 2010 (Grant No:10CJGJ22)Ocean University of China Young Teachers Special Fund Project in 2010 (Grant No:201013070)
文摘From the perspective of tourism competitiveness,the paper takes 12 island counties of China as the research object,and applies the method of factor analysis to study their competitiveness.The result shows that Putuo and Dinghai are more competitive while Pingtan and Nan'ao are less competitive.Finally,the 12 island counties are divided into four styles:first-class competitive county (Putuo),seond-class competitive counties (Dinghai,Yuhuan),third-class competitive counties (Chongming,Daishan,Changdao,Changhai and Shengsi),fourth-class competitive counties (Dongshan,Dongtou,Pingtan and Nan'ao) by cluster analysis.The classification of island counties is to clear their relative position,then to promote their development.
基金Project(60763001) supported by the National Natural Science Foundation of ChinaProject(2010GZS0072) supported by the Natural Science Foundation of Jiangxi Province,ChinaProject(GJJ12271) supported by the Science and Technology Foundation of Provincial Education Department of Jiangxi Province,China
文摘Category-based statistic language model is an important method to solve the problem of sparse data.But there are two bottlenecks:1) The problem of word clustering.It is hard to find a suitable clustering method with good performance and less computation.2) Class-based method always loses the prediction ability to adapt the text in different domains.In order to solve above problems,a definition of word similarity by utilizing mutual information was presented.Based on word similarity,the definition of word set similarity was given.Experiments show that word clustering algorithm based on similarity is better than conventional greedy clustering method in speed and performance,and the perplexity is reduced from 283 to 218.At the same time,an absolute weighted difference method was presented and was used to construct vari-gram language model which has good prediction ability.The perplexity of vari-gram model is reduced from 234.65 to 219.14 on Chinese corpora,and is reduced from 195.56 to 184.25 on English corpora compared with category-based model.
文摘This paper introduces data mining technology in enterprise competitive intelligence system; and then introduced theoretical foundation and main clustering method of cluster analysis. The article emphasis on the FCM algorithm and principle and described implementation steps, and proposed the improvement FCM algorithm based on K mean particle size; finally, realize the design and implementation of enterprise competitive intelligence analysis and mining service system, and the improved FCM algorithm is applied in the system.
基金Supported by the Hundred Talent Program of the Chinese Academy of Sciences,the National Natural Science Foundation of China under Grant Nos.71103179 and 71102129Program for Young Innovative Research Team in China University of Political Science and Law, 2010 Fund Project under the Ministry of Education of China for Youth Who are Devoted to Humanities and Social Sciences Research 10YJC630425
文摘This works examine the responses of housing prices to the monetary policies in various Chinese cities. Thirty-five large and medium sized Chinese cities are classified into six clusters applying the minimum variance clustering method according to the calculated correlation coefficients between the housing price indices of every two cities.Time difference correlation analysis is then employed to quantify the relations between the housing price indices of the six clusters and the monetary policies.It is suggested that the housing prices of various cities evolved at different paces and their responses to the monetary policies are heterogeneous,and local economic features are more important than geographic distances in determining the housing price trends.
基金supported in part by the National Natural Science Foundation of China under Grant No. 60873216Scientific and Technological Research Priority Projects of Sichuan Province under Grant No. 2012GZ0017Basic Research of Application Fund Project of Sichuan Province under Grant No. 2011JY0100
文摘Chosen-message pair Simple Power Analysis (SPA) attacks were proposed by Boer, Yen and Homma, and are attack methods based on searches for collisions of modular multiplication. However, searching for collisions is difficult in real environments. To circumvent this problem, we propose the Simple Power Clustering Attack (SPCA), which can automatically identify the modular multiplication collision. The insignificant effects of collision attacks were validated in an Application Specific Integrated Circuit (ASIC) environment. After treatment with SPCA, the automatic secret key recognition rate increased to 99%.