Optimal clustering of web documents is known to be a complicated combinatorial optimization problem, and it is hard to develop a generally applicable optimal algorithm. An accelerated simulated annealing algorithm is developed for automatic web document classification. The web document classification problem is addressed as the problem of best describing a match between a web query and a hypothesized web object. The normalized term frequency and inverse document frequency coefficient is used as a measure of the match. Test beds are generated online during the search by transforming model web sites. As a result, web sites can be clustered optimally in terms of the keyword vectors of the corresponding web documents.
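As a rough illustration of the idea of clustering documents by annealing over keyword vectors, the sketch below builds normalized TF-IDF vectors and searches for a cluster assignment with plain simulated annealing. It is not the accelerated algorithm from the abstract: the cosine similarity, the within-cluster squared-distance cost, the geometric cooling schedule, and all function and parameter names are assumptions chosen for the example.

```python
import math
import random
from collections import Counter


def tfidf_vectors(docs):
    """Return one L2-normalized TF-IDF vector (a dict) per tokenized document."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vec = {t: (c / len(doc)) * math.log(n / df[t]) for t, c in counts.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine(u, v):
    """Dot product of two sparse vectors (cosine similarity for unit vectors)."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())


def cost(sim, labels, k):
    """Within-cluster sum of squared distances (assumed objective; lower is better)."""
    total = 0.0
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        for a, i in enumerate(members):
            for j in members[a + 1:]:
                # For unit vectors, squared distance = 2 - 2 * cosine similarity.
                total += (2.0 - 2.0 * sim[i][j]) / len(members)
    return total


def anneal_clusters(vectors, k, steps=2000, t0=1.0, cooling=0.995, seed=0):
    """Plain simulated annealing over cluster labels; returns the best labels seen."""
    rng = random.Random(seed)
    sim = [[cosine(u, v) for v in vectors] for u in vectors]
    labels = [rng.randrange(k) for _ in vectors]
    current = cost(sim, labels, k)
    best, best_cost = labels[:], current
    temp = t0
    for _ in range(steps):
        i = rng.randrange(len(labels))
        old = labels[i]
        labels[i] = rng.randrange(k)  # propose moving one document
        candidate = cost(sim, labels, k)
        delta = candidate - current
        # Always accept improvements; accept worse moves with Boltzmann probability.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current = candidate
            if current < best_cost:
                best, best_cost = labels[:], current
        else:
            labels[i] = old  # reject the move
        temp *= cooling  # geometric cooling (an assumption)
    return best


if __name__ == "__main__":
    docs = [d.split() for d in [
        "football match goal team", "team goal score football",
        "match team player goal", "recipe oven bake flour",
        "flour bake cake recipe", "oven cake recipe sugar",
    ]]
    print(anneal_clusters(tfidf_vectors(docs), k=2))
```

The number of clusters `k` is fixed in advance here; the abstract does not say how the original method chooses it.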
Independent component analysis (ICA) is a widely used method for blind source separation (BSS). The mature ICA model requires that the number of sources equal the number of sensors used to collect the data, a restriction that is hard to meet in most practical cases. In this paper, an overdetermined ICA method is proposed and successfully applied to the analysis of human colonic pressure signals. Using principal component analysis (PCA), the method first estimates the number of sources and reduces the dimension of the observed signals to match it; FastICA is then used to estimate all the sources. From 26 groups of colonic pressure recordings, several colonic motor patterns are extracted, which not only demonstrates the effectiveness of the method but also greatly facilitates further medical research.
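The PCA stage described above can be sketched as follows: compute the sensor covariance, take its eigenvalues, estimate the number of sources as the number of leading components needed to explain most of the variance, and project the observations onto those components. This is only a minimal sketch under stated assumptions: the 99% variance threshold, the power-iteration eigen-solver, and all names are illustrative and do not come from the paper, and the subsequent FastICA step is not shown.

```python
import math
import random


def covariance(channels):
    """Center each sensor channel and return (centered data, covariance matrix)."""
    n = len(channels[0])
    centered = []
    for ch in channels:
        mean = sum(ch) / n
        centered.append([x - mean for x in ch])
    m = len(centered)
    cov = [[sum(a * b for a, b in zip(centered[i], centered[j])) / (n - 1)
            for j in range(m)] for i in range(m)]
    return centered, cov


def eigenpairs(matrix, iters=500):
    """Eigenpairs of a symmetric PSD matrix by power iteration with deflation."""
    m = len(matrix)
    a = [row[:] for row in matrix]
    rng = random.Random(0)
    pairs = []
    for _ in range(m):
        v = [rng.random() + 0.1 for _ in range(m)]
        value = 0.0
        for _ in range(iters):
            w = [sum(a[i][j] * v[j] for j in range(m)) for i in range(m)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:  # nothing left in the deflated matrix
                value = 0.0
                break
            v = [x / norm for x in w]
            value = norm
        pairs.append((value, v))
        # Deflate: remove the component just found.
        for i in range(m):
            for j in range(m):
                a[i][j] -= value * v[i] * v[j]
    return pairs


def estimate_and_reduce(channels, energy=0.99):
    """Estimate the source count and project the data onto that many components."""
    centered, cov = covariance(channels)
    total = sum(cov[i][i] for i in range(len(cov)))  # total variance = trace
    kept, acc = [], 0.0
    for value, vec in eigenpairs(cov):
        kept.append(vec)
        acc += value
        if acc >= energy * total:  # threshold is an assumption
            break
    n = len(centered[0])
    reduced = [[sum(vec[i] * centered[i][t] for i in range(len(vec)))
                for t in range(n)] for vec in kept]
    return len(kept), reduced


if __name__ == "__main__":
    # Two synthetic sources observed through four sensors (overdetermined case).
    s1 = [math.sin(0.1 * t) for t in range(200)]
    s2 = [math.cos(0.37 * t) for t in range(200)]
    mixing = [(2, 0), (0, 1), (2, 1), (2, -1)]
    sensors = [[a * x + b * y for x, y in zip(s1, s2)] for a, b in mixing]
    k, reduced = estimate_and_reduce(sensors)
    print(k, len(reduced))
```

The reduced signals have as many rows as estimated sources, so a square ICA algorithm such as FastICA could then be applied to them.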
Funding: Supported by the National Natural Science Foundation (No. 60875061).