Funding: The US National Science Foundation (No. CMMI-0408390, CMMI-0644552); the American Chemical Society Petroleum Research Foundation (No. PRF-44468-G9); the Research Fellowship for International Young Scientists (No. 51050110143); the Fok Ying-Tong Education Foundation (No. 114024); the Natural Science Foundation of Jiangsu Province (No. BK2009015); the Postdoctoral Science Foundation of Jiangsu Province (No. 0901005C)
Abstract: Based on Gaussian mixture models (GMM), speed, flow and occupancy are used together in the cluster analysis of traffic flow data. Compared with other clustering and sorting techniques, the GMM is a structural model and is suitable for various kinds of traffic flow parameters. Gap statistics and domain knowledge of traffic flow are used to determine a proper number of clusters. The expectation-maximization (E-M) algorithm is used to estimate the parameters of the GMM. The clustered traffic flow patterns are then analyzed statistically and used to design maximum likelihood classifiers that group real-time traffic flow data as new observations become available. Clustering analysis and pattern recognition can also be used to cluster and classify dynamic traffic flow patterns for freeway on-ramp and off-ramp weaving sections, as well as for other facilities involving the concept of level of service, such as airports, parking lots, intersections, and interrupted-flow pedestrian facilities.
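A minimal sketch of the E-M fitting and maximum likelihood classification steps named in the abstract above, offered for illustration only. It simplifies to one variable (speed) rather than the speed, flow and occupancy vector the abstract uses; the function names, the speed values, and the choice of two clusters are all assumptions, not the authors' data or settings.

```python
import math

def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution at x."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def fit_gmm_1d(data, initial_means, n_iter=100, min_var=1e-6):
    """Fit a 1-D Gaussian mixture with the E-M algorithm.

    Returns (weights, means, variances), one entry per component.
    """
    n, k = len(data), len(initial_means)
    means = list(initial_means)
    grand_mean = sum(data) / n
    grand_var = sum((x - grand_mean) ** 2 for x in data) / n
    variances = [max(grand_var, min_var)] * k  # start from the pooled variance
    weights = [1.0 / k] * k                    # start from equal mixing weights
    for _ in range(n_iter):
        # E-step: responsibility of each component for each observation.
        resp = []
        for x in data:
            joint = [w * normal_pdf(x, m, v)
                     for w, m, v in zip(weights, means, variances)]
            total = sum(joint)
            if total == 0.0:
                resp.append([1.0 / k] * k)  # guard against numerical underflow
            else:
                resp.append([p / total for p in joint])
        # M-step: re-estimate weights, means and variances from responsibilities.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj == 0.0:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            var_j = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(var_j, min_var)
    return weights, means, variances

def classify(x, means, variances):
    """Maximum likelihood rule: index of the cluster whose density at x is largest."""
    densities = [normal_pdf(x, m, v) for m, v in zip(means, variances)]
    return densities.index(max(densities))

# Hypothetical speed observations (km/h): a congested group and a free-flow group.
speeds = [18, 22, 25, 27, 30, 21, 24, 92, 98, 101, 105, 96, 99, 103]
weights, means, variances = fit_gmm_1d(speeds, initial_means=[min(speeds), max(speeds)])
new_label = classify(100, means, variances)  # cluster index for a new observation
```

In this sketch the number of clusters is fixed by hand; the abstract instead selects it with gap statistics and traffic flow domain knowledge, which is not reproduced here.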
Funding: the National Defense 973 (Grant No. 513180303), National Defense Basic Scientific Research (Grant No. A2220061080), and the National Defense Foundation (Grant No. 5142040205BQ0154).
Abstract: The high-temperature dielectric properties of quartz fiber-reinforced silicon dioxide ceramic (SiO2/SiO2) composites were studied both theoretically and experimentally. A multi-scale theoretical model was developed based on the theory of dielectrics. The model was able to predict dielectric properties at higher temperatures (> 1200 °C), with the correlative coefficients in the model obtained by mining experimental data. The results show that the dielectric properties of SiO2/SiO2 calculated with the theoretical model were in agreement with the experimentally measured values.
Abstract: The rapid development of the Internet of Things imposes new requirements on data mining systems, because traditional distributed network data mining has weak capability. To meet the needs of the Internet of Things, this paper proposes a novel distributed data-mining model that realizes seamless access between cloud computing and distributed data mining. The model is based on the cloud computing architecture, which belongs to the type of incredible nodes.
Funding: Project (70671039) supported by the National Natural Science Foundation of China
Abstract: Given the chaotic and non-linear character of power load data, the time series matrix is established using the theory of phase-space reconstruction, and Lyapunov exponents of the chaotic time series are then computed to determine the time delay and the embedding dimension. Because the data have different features, a data mining algorithm is used to classify the data into different groups. Redundant information is eliminated by means of data mining, and the system searches for historical loads whose features are highly similar to those of the forecasting day. As a result, the training data can be reduced and the computing speed improved when constructing the support vector machine (SVM) model. The SVM algorithm is then used to predict the power load with the parameters obtained in pretreatment. To demonstrate the effectiveness of the new model, the results of the data mining SVM (DSVM) algorithm are compared with those of a single SVM and a back propagation (BP) network. The new DSVM algorithm improves the forecast accuracy by 0.75%, 1.10% and 1.73% compared with the SVM for two random dimensions (11-dimension and 14-dimension) and the BP network, respectively. This indicates that the DSVM improves short-term power load forecasting.
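The phase-space reconstruction step can be sketched as a time-delay embedding, in which each row of the time series matrix is a vector of delayed samples. The snippet below is only an illustration: the function name, the load values, and the chosen delay and dimension are assumptions, and the abstract's own procedure for selecting the delay and embedding dimension is not reproduced.

```python
def delay_embed(series, dim, delay):
    """Build delay vectors [x(t), x(t + delay), ..., x(t + (dim - 1) * delay)]."""
    if dim < 1 or delay < 1:
        raise ValueError("dim and delay must be positive integers")
    n_vectors = len(series) - (dim - 1) * delay
    if n_vectors <= 0:
        raise ValueError("series is too short for this dim and delay")
    return [[series[t + j * delay] for j in range(dim)] for t in range(n_vectors)]

# Hypothetical hourly load readings (MW); not taken from the paper.
load = [310, 295, 288, 301, 342, 390, 415, 402]

# Each row is one reconstructed state; rows like these could serve as SVM inputs.
matrix = delay_embed(load, dim=3, delay=2)
```

With eight samples, dimension 3 and delay 2, the matrix has four rows, since each row spans (3 - 1) * 2 = 4 steps of the series.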
Abstract: China's capital market differs from that of the US in economic, political, and socio-cultural ways. China's dynamic and fast-growing economy over the past decade has entailed some structural changes and weaknesses and, as a consequence, some business failures. We propose bankruptcy prediction models using Chinese firm data via several data mining tools and traditional logit analysis. We used Chinese firm data from one year prior to bankruptcy, and our results suggest that the financial variables developed by Altman (1968) and Ohlson (1980) perform reasonably well in determining business failures of Chinese firms, but the overall prediction rate is low compared with those in studies of the US or other countries. The reasons for this low prediction rate may be structural weaknesses resulting from China's fast growth and immature capital market.
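For reference, the general form of a logit model of the kind mentioned above can be written as follows. This is the standard textbook form, not the paper's fitted model; the notation is an assumption: y_i = 1 marks a bankrupt firm, x_{ik} are financial variables of firm i (for example ratios of the Altman or Ohlson type), and the beta coefficients would be estimated from the data.

```latex
\[
P(y_i = 1 \mid \mathbf{x}_i)
  = \frac{1}{1 + \exp\!\left(-\left(\beta_0 + \sum_{k=1}^{K} \beta_k x_{ik}\right)\right)}
\]
```

A firm would then be classified as likely to fail when this probability exceeds a chosen cutoff, which the abstract does not specify.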
Funding: Supported by Research on Pattern Differentiation of AIDS based on Graph Theory of the National Natural Science Foundation of China (No. 81202858); Research on Intervention Evaluation of TCM Health Differentiation of the National Key Technology Support Program (No. 2012BAI25B02); Research and Development in Digital Information System of Traditional Chinese Medicine of the National 863 Program of China (No. 2012AA02A609); Acupuncture Efficacy of Gastrointestinal Dysfunction (No. ZZ05003) and Acupuncture-point Specialty Analysis based on Image Processing Technology (No. ZZ03090) of the Self-selected Subject of the China Academy of Chinese Medical Sciences; Semantic Recognition of Tongue and Pulse based on Image Content of the Beijing Key Laboratory of Advanced Information Science and Network Technology (No. XDXX1306)
Abstract: OBJECTIVE: To help researchers select appropriate data mining models that provide better evidence for the clinical practice of Traditional Chinese Medicine (TCM) diagnosis and therapy. METHODS: Clinical issues addressed with data mining models were comprehensively summarized in terms of four significant elements of clinical studies: symptoms, symptom patterns, herbs, and efficacy. Existing problems were further generalized to determine the factors relevant to the performance of data mining models, e.g. data type, samples, parameters, and variable labels. Combining these factors, the features of TCM clinical data were compared with regard to statistical characteristics and informatics properties. Data models were compared at the same time in terms of their conditions of application and suitable scopes. RESULTS: The main application problems were data types inconsistent with the data mining models used and small samples, which led to inappropriate or even erroneous results. The features of the models, i.e. advantages, disadvantages, suitable data types, data mining tasks, and the TCM issues addressed, were summarized and compared. CONCLUSION: By considering the special features of different data mining models, clinical doctors can select suitable data mining models to resolve TCM problems.
Abstract: Although the weakly interacting massive particle (WIMP) scenario is very well motivated, it is not guaranteed to be the truth. It is important to keep an open mind and consider other well-motivated scenarios. In this paper, we briefly review several possible non-WIMP dark matter (DM) candidates. First, we discuss asymmetric DM models in detail, in which the baryon asymmetry in the standard model sector is related to the asymmetry in the DM sector. We discuss how the DM relic abundance is determined in such models. We also cover the possible interesting experimental signatures induced by its asymmetric nature. Then we consider ultralight DM candidates, i.e., the axion and the dark photon. In such scenarios, DM should be treated as a coherently oscillating background rather than as individual particles. Search strategies for such DM candidates are very different from those in conventional DM models. We discuss several interesting experiments looking for these ultralight particles. We also cover interesting subtleties encountered in those experiments.
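As an illustration of what "coherently oscillating background" means for an ultralight scalar such as the axion, the local field is commonly approximated as below. This is a standard approximation and not an equation quoted from the paper; natural units (hbar = c = 1) are assumed, and the symbols a_0 (amplitude), m_a (mass), rho_DM (local DM energy density) and varphi (phase) are notation introduced here.

```latex
\[
a(t) \simeq a_0 \cos(m_a t + \varphi),
\qquad
\rho_{\mathrm{DM}} = \tfrac{1}{2}\, m_a^2 a_0^2
\;\;\Longrightarrow\;\;
a_0 = \frac{\sqrt{2\rho_{\mathrm{DM}}}}{m_a}
\]
```

Under this approximation the oscillation frequency is set by the particle mass, which is one reason such experiments look for a narrow-band signal instead of individual scattering events.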