Abstract: An information system is a form of knowledge representation, and attribute reduction is crucial in big data, machine learning, data mining, and intelligent systems. There are several ways of solving attribute reduction problems, but they all require a common categorization. Selecting features is a challenge for researchers in most scientific studies. When working with huge datasets, using all available attributes is not an option because it frequently complicates the study and decreases performance. On the other hand, neglecting some attributes might jeopardize data accuracy. Rough set theory provides a useful approach for identifying superfluous attributes that can be ignored without sacrificing any significant information; nonetheless, investigating all available combinations of attributes leads to some problems. Furthermore, because attribute reduction is primarily a mathematical issue, technical progress in reduction depends on the advancement of mathematical models. Because this study focuses on the mathematical side of attribute reduction, we propose several methods for reducing information systems based on classical rough set theory, the strength of rules, and the similarity matrix. We apply the proposed methods to several examples and calculate the reduction for each case. These methods expand the attribute reduction options available to researchers.
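As an illustration of the classical rough set step mentioned in this abstract, the following minimal sketch checks which condition attributes of a small decision table are dispensable, that is, can be dropped one at a time without changing the positive region of the decision attribute. The table, the attribute names and the helper functions are hypothetical and are not taken from the paper; the strength-of-rules and similarity-matrix methods are not covered.

```python
from collections import defaultdict

def partition(rows, attrs):
    """Group row indices into indiscernibility classes over `attrs`."""
    blocks = defaultdict(list)
    for i, row in enumerate(rows):
        blocks[tuple(row[a] for a in attrs)].append(i)
    return list(blocks.values())

def positive_region(rows, cond, dec):
    """Indices of rows whose indiscernibility class has one decision value."""
    pos = set()
    for block in partition(rows, cond):
        if len({rows[i][dec] for i in block}) == 1:
            pos.update(block)
    return pos

def dispensable(rows, cond, dec):
    """Condition attributes whose individual removal keeps the positive region."""
    full = positive_region(rows, cond, dec)
    return [a for a in cond
            if positive_region(rows, [b for b in cond if b != a], dec) == full]

# Hypothetical decision table (illustrative values only).
table = [
    {"headache": "yes", "temp": "high",   "cough": "yes", "flu": "yes"},
    {"headache": "yes", "temp": "normal", "cough": "yes", "flu": "no"},
    {"headache": "no",  "temp": "high",   "cough": "no",  "flu": "yes"},
    {"headache": "no",  "temp": "normal", "cough": "no",  "flu": "no"},
]
conditions = ["headache", "temp", "cough"]

print(dispensable(table, conditions, "flu"))  # ['headache', 'cough']
```

Being dispensable individually does not guarantee that several attributes can be dropped together; in this toy table, however, `temp` alone already preserves the full positive region, so it forms a reduct.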
Funding: Sponsored by the Huo Ying-Dong Education Foundation of China (91101)
Abstract: A web page clustering algorithm called PageCluster and an improved algorithm, ImPageCluster, which handles overlapping, are proposed. These methods not only take the web structure and page hyperlinks into account but also consider the importance of each page, described as in-weight and out-weight. Experiments show that, compared with traditional clustering methods, the proposed algorithms run faster and achieve better accuracy.
Abstract: The K-means algorithm is one of the most widely used algorithms in clustering analysis. To deal with the problem caused by the random selection of initial center points in the traditional algorithm, this paper proposes an improved K-means algorithm based on the similarity matrix. The improved algorithm avoids the random selection of initial center points, so it provides effective initial points for the clustering process and reduces the fluctuation in clustering results caused by the selection of initial points; thus, better clustering quality can be obtained. The experimental results also show that the F-measure of the improved K-means algorithm is greatly improved and that the clustering results are more stable.
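The idea of replacing random initialization with a deterministic choice driven by a similarity matrix can be sketched as follows. This is not the paper's algorithm, whose selection rule the abstract does not give: the similarity function `1 / (1 + distance)`, the rule "start from the most central point, then repeatedly add the point least similar to the centers chosen so far", and all names are assumptions made for illustration.

```python
import math

def similarity_matrix(points):
    """Pairwise similarity 1 / (1 + Euclidean distance), in (0, 1]."""
    return [[1.0 / (1.0 + math.dist(p, q)) for q in points] for p in points]

def pick_initial_centers(S, k):
    """Deterministically choose k row indices of S as initial centers."""
    n = len(S)
    # Start from the point with the largest total similarity to all points.
    chosen = [max(range(n), key=lambda i: sum(S[i]))]
    while len(chosen) < k:
        rest = [i for i in range(n) if i not in chosen]
        # Add the point whose closest chosen center is least similar to it.
        chosen.append(min(rest, key=lambda i: max(S[i][c] for c in chosen)))
    return chosen

# Hypothetical data: two well-separated groups of three points each.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
centers = pick_initial_centers(similarity_matrix(points), k=2)
print([points[i] for i in centers])
```

Because the choice depends only on the similarity matrix, repeated runs give the same initial centers, which is the source of the stability the abstract refers to.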
Funding: Science Research Project of Gansu Provincial Transportation Department (No. 2017-012)
Abstract: The random forest model is general-purpose and easy to understand, and it is often used for classification and prediction. However, it uses non-selective integration and the majority rule to determine the final result, so the differences between the decision trees in the model are ignored and the prediction accuracy of the model is reduced. To address these defects, an improved random forest model based on the confusion matrix (CM-RF) is proposed. The decision tree cluster is selectively constructed using a similarity measure during model construction, and the result is output using a dynamic weighted voting fusion method in the final voting stage. Experiments show that the proposed CM-RF reduces the impact of low-performance decision trees on the output, thus improving the accuracy and generalization ability of the random forest model.
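The contrast between plain majority voting and weighted voting described in this abstract can be illustrated with a small sketch. It is not the CM-RF method: the weights here are simply hypothetical per-tree validation accuracies, whereas the paper derives its tree selection and dynamic weights from confusion matrices in a way the abstract does not specify.

```python
from collections import Counter, defaultdict

def majority_vote(predictions):
    """Label predicted by the most trees (ties go to the label seen first)."""
    return Counter(predictions).most_common(1)[0][0]

def weighted_vote(predictions, weights):
    """Label with the largest total weight across trees."""
    totals = defaultdict(float)
    for label, weight in zip(predictions, weights):
        totals[label] += weight
    return max(totals, key=totals.get)

# Hypothetical predictions of five trees for one sample, with assumed
# validation accuracies used as voting weights.
preds = ["A", "A", "A", "B", "B"]
weights = [0.52, 0.55, 0.51, 0.93, 0.95]

print(majority_vote(preds))           # A
print(weighted_vote(preds, weights))  # B
```

Three weak trees outvote two strong ones under the majority rule, while the weighted vote lets the stronger trees decide.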
Abstract: A formula for computing the similarity between two audio feature vectors is proposed, which maps an arbitrary pair of vectors of equal dimension to [0,1). For the task of audio segmentation, a self-similarity matrix is computed to reveal the inner structure of the audio clip to be segmented. Because the final result must be consistent with subjective evaluation and adaptable to special applications, a set of weights is adopted, which can be modified through relevance feedback techniques. Experiments show that the proposed algorithm achieves satisfactory results.
Abstract: Similarity of spatial directions plays an important role in GIS. In this paper, the conventional approaches are analyzed. Based on areal objects in raster data, the authors propose two new methods for measuring similarity among spatial directions. The first measures similarity based on the features of raster data and the changes in distances between spatial objects; the second measures it according to the variation of each raster cell's centroid angle. The two methods overcome the complexity of measuring similarity among spatial directions with the direction matrix model and overcome the limitation of small changes in direction. The two methods are simple and have broader applicability.
Abstract: The large finite element global stiffness matrix is an algebraic, discrete, even-order differential operator with zero row sums. Direct application of the practically convenient and readily applied Gershgorin eigenvalue bounding theorem to this matrix inherently fails to foresee its positive definiteness, predictably and routinely failing to produce a nontrivial lower bound on the least eigenvalue of this matrix, which is theoretically assured to be positive definite. Considered here are practical methods for producing an optimal similarity transformation for the finite element global stiffness matrix, after which nontrivial, realistic lower bounds on the least eigenvalue can be located and then further improved. The technique is restricted here to the common case of a global stiffness matrix having only non-positive off-diagonal entries. For such a matrix, the Gershgorin bounding method may be carried out by a mere matrix-vector multiplication.
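The closing claim of this abstract can be made concrete. For a matrix K with non-positive off-diagonal entries and a positive vector x, applying Gershgorin's theorem to D^-1 K D with D = diag(x) gives the lower bound min_i (Kx)_i / x_i on the least eigenvalue, which costs one matrix-vector product. The sketch below assumes a hypothetical 1D fixed-fixed stiffness matrix tridiag(-1, 2, -1) and a hand-picked parabolic scaling vector; the choice of x is for illustration only and is not the paper's procedure for finding an optimal transformation.

```python
import math

def matvec(K, x):
    """Dense matrix-vector product."""
    return [sum(k_ij * x_j for k_ij, x_j in zip(row, x)) for row in K]

def gershgorin_lower_bound(K, x):
    """Gershgorin lower bound for D^-1 K D with D = diag(x), x > 0.

    Valid in this form only when every off-diagonal entry of K is <= 0,
    in which case row i contributes K_ii - radius_i = (K x)_i / x_i.
    """
    return min(kx_i / x_i for kx_i, x_i in zip(matvec(K, x), x))

def stiffness_1d(n):
    """Hypothetical test matrix: tridiag(-1, 2, -1) of order n."""
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        K[i][i] = 2.0
        if i > 0:
            K[i][i - 1] = -1.0
        if i < n - 1:
            K[i][i + 1] = -1.0
    return K

n = 5
K = stiffness_1d(n)
plain = gershgorin_lower_bound(K, [1.0] * n)        # unscaled theorem
x = [float((i + 1) * (n - i)) for i in range(n)]    # assumed parabolic scaling
scaled = gershgorin_lower_bound(K, x)
exact = 2.0 - 2.0 * math.cos(math.pi / (n + 1))     # known least eigenvalue

print(plain, scaled, exact)
```

With x equal to all ones the interior rows sum to zero, so the bound is the trivial 0. The parabolic x gives 2/9, which is positive and lies below the exact least eigenvalue of this test matrix, about 0.268.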
Abstract: For the problem of intuitionistic fuzzy cluster analysis, this paper proposes a new similarity degree formula that incorporates the attribute weight of each index. We conduct a fuzzy cluster analysis based on the new intuitionistic fuzzy similarity matrix, which is constructed via this weighted similarity degree method and can be transformed into a fuzzy similarity matrix. An example is given to demonstrate the feasibility and validity of the method.
Funding: This work was partly supported by the National Natural Science Foundation of China under Grant 61672293.
Abstract: The nonlocal property is an important feature of natural images: the patch matrix formed by similar image patches is low-rank. Meanwhile, learning good image priors is of great importance for image denoising. In this paper, we combine image self-similarity with the EPLL (expected patch log likelihood) method and propose an EPLL denoising model based on internal and external image similarity to improve the preservation of image details. The experimental results demonstrate the validity of our method both visually and numerically.
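Abstract: Objective: To evaluate the neurological specialty capability of a tertiary grade-A hospital characterized by its neuroscience cluster development, and to provide a reference for the development of distinctive specialties in hospitals in China. Methods: An evaluation index system for neurosurgery and neurology specialty capability was established along four dimensions: service capability, technical capability, quality and safety, and service efficiency. The technique for order preference by similarity to ideal solution (TOPSIS) was used to longitudinally compare the development trend of the neurological specialties at Beijing Tiantan Hospital, Capital Medical University, from 2019 to 2023, and the Boston matrix was used to analyze in depth the development of each neurological subspecialty. Results: From 2019 to 2023, the TOPSIS composite score index of the neurological specialties at Beijing Tiantan Hospital, Capital Medical University, showed an upward trend. In neurosurgery, Tumor Specialty 1, which focuses on the diagnosis and treatment of glioma, had the highest score indices for technical capability and for quality and safety; in neurology, the subspecialty focusing on cerebrovascular disease had relatively high score indices for technical capability and service efficiency. Both subspecialties fell in the advantage-consolidation zone of the Boston matrix. Conclusions: From 2019 to 2023, the diagnostic and treatment capability of the neurological specialties at Beijing Tiantan Hospital, Capital Medical University, improved continuously. Among the neurosurgery subspecialties, Tumor Specialty 1 ranked highest in specialty capability, with high technical difficulty of diagnosis and treatment and a low incidence of adverse medical quality events. Among the neurology subspecialties, the cerebrovascular specialty combined high technical difficulty of diagnosis and treatment with efficient service.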
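TOPSIS, the ranking method named in this abstract, can be sketched in a few lines. The indicator values, weights and benefit/cost flags below are invented for illustration and are not the hospital's data; vector normalization is assumed, since the abstract does not state which normalization the study uses.

```python
import math

def topsis(matrix, weights, benefit):
    """Closeness of each row to the ideal solution, in [0, 1].

    matrix:  rows are alternatives, columns are indicators
    weights: one weight per indicator
    benefit: True if larger is better for that indicator, False if smaller
    Assumes no all-zero column and at least two distinct rows.
    """
    n_cols = len(weights)
    # Vector-normalize each column, then apply the indicator weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_cols)]
    v = [[weights[j] * row[j] / norms[j] for j in range(n_cols)] for row in matrix]
    cols = list(zip(*v))
    ideal = [max(c) if b else min(c) for c, b in zip(cols, benefit)]
    anti = [min(c) if b else max(c) for c, b in zip(cols, benefit)]
    scores = []
    for row in v:
        d_pos = math.dist(row, ideal)   # distance to the ideal solution
        d_neg = math.dist(row, anti)    # distance to the anti-ideal solution
        scores.append(d_neg / (d_pos + d_neg))
    return scores

# Hypothetical yearly indicators: service volume, technical index,
# adverse event rate (the last one is a cost indicator).
data = [
    [100, 1.10, 0.50],  # Y1
    [120, 1.20, 0.40],  # Y2
    [150, 1.35, 0.30],  # Y3
]
scores = topsis(data, weights=[0.4, 0.4, 0.2], benefit=[True, True, False])
print(scores)
```

In this toy data Y3 is best on every indicator and Y1 worst, so their scores are 1 and 0 and Y2 falls in between; a rising score across years corresponds to the kind of upward trend the abstract reports.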