Funding: Supported by a grant from the Hubei Key Laboratory of Diabetes and Angiopathy Program of Hubei University of Science and Technology (2020XZ10) and the Project of the Education Commission of Hubei Province (B2022192).
Abstract: Background: Erzhu Erchen decoction (EZECD), which is based on Erchen decoction and enhanced with Atractylodes lancea and Atractylodes macrocephala, is widely used to treat damp-heat internalized disease (whose clinical manifestations in Western medicine include thirst, inability to drink more, diarrhea, yellow urine, red tongue, etc.). Nevertheless, the mechanism of EZECD in damp-heat internalized type 2 diabetes (T2D) remains unknown. We employed data mining, pharmacology databases, and experimental verification to study how EZECD treats damp-heat internalized T2D. Methods: The main compounds and genes of EZECD and damp-heat internalized T2D were obtained from pharmacology databases. Subsequently, the overlapping targets of EZECD and damp-heat internalized T2D were analyzed by Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analysis, and a compound-disease targets-pathway network was constructed to obtain the hub compound. Moreover, the hub genes and core related pathways were mined with weighted gene co-expression network analysis based on the Gene Expression Omnibus database, and the binding capability of the hub compound and genes was validated in AutoDock 1.5.7. Furthermore, violin plots and gene set enrichment analysis were used to explore the role of the hub genes in damp-heat internalized T2D. Finally, the interactions of the hub compound and genes were explored using the Comparative Toxicogenomics Database and quantitative polymerase chain reaction. Results: First, the herb-compounds-genes-disease network indicated that the hub compound of EZECD for damp-heat internalized T2D could be quercetin. Consistently, the hub genes were CASP8, CCL2, and AHR according to weighted gene co-expression network analysis. Molecular docking showed that quercetin could bind to the hub genes. Further, gene set enrichment analysis and Gene Ontology indicated that CASP8 or CCL2 is negatively involved in insulin secretion in response to TNF or the lipopolysaccharide process, and that AHR or CCL2 positively regulates lipid and atherosclerosis, including the NOD-like receptor signaling pathway and the TNF signaling pathway. Ultimately, quantitative polymerase chain reaction and western blotting analysis showed that quercetin could down-regulate the mRNA and protein expression of CASP8, CCL2, and AHR, consistent with the results in the Comparative Toxicogenomics Database. Conclusion: These results demonstrated that quercetin could inhibit the expression of CASP8, CCL2, and AHR in damp-heat internalized T2D, which improves insulin secretion and inhibits lipid and atherosclerosis as well as the NOD-like receptor and TNF signaling pathways, suggesting that EZECD may be more effective in treating damp-heat internalized T2D.
Funding: Supported by the National Key Research and Development Program of China (2018YFA0900100), the Natural Science Foundation of Tianjin, China (19JCJQJC63300), and Tianjin University.
Abstract: DNA molecules are green materials with great potential for high-density and long-term data storage. However, the current data-writing process of DNA data storage via DNA synthesis suffers from high costs and the production of hazards, limiting its practical applications. Here, we developed a DNA movable-type storage system that can utilize DNA fragments pre-produced by cell factories for data writing. In this system, these pre-generated DNA fragments, referred to herein as "DNA movable types," are used repeatedly as basic writing units. Data writing is achieved by the rapid assembly of these DNA movable types, thereby avoiding the costly and environmentally hazardous process of de novo DNA synthesis. With this system, we successfully encoded 24 bytes of digital information in DNA and read it back accurately by means of high-throughput sequencing and decoding, thereby demonstrating the feasibility of this system. Through its repetitive usage and biological assembly of DNA movable-type fragments, this system exhibits excellent potential for reducing writing costs, opening up a novel route toward an economical and sustainable digital data-storage technology.
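The idea of writing data by reusing a fixed set of pre-made fragments, rather than synthesizing each strand from scratch, can be illustrated with a toy sketch. Everything below is a hypothetical illustration and not the authors' scheme: the library of one 4-nucleotide "type" per byte value, the explicit position index, and the function names `build_library`, `write`, and `read` are all assumptions.

```python
from typing import Dict, List, Tuple


def build_library() -> Dict[int, str]:
    """Hypothetical library: one reusable 4-nt 'movable type' per byte value."""
    bases = "ACGT"
    library = {}
    for value in range(256):
        # Split the byte into four 2-bit digits, most significant first.
        digits = [(value >> shift) & 0b11 for shift in (6, 4, 2, 0)]
        library[value] = "".join(bases[d] for d in digits)
    return library


def write(data: bytes, library: Dict[int, str]) -> List[Tuple[int, str]]:
    """'Assemble' the message: pick an existing type for each byte position."""
    return [(position, library[byte]) for position, byte in enumerate(data)]


def read(units: List[Tuple[int, str]], library: Dict[int, str]) -> bytes:
    """Decode units in any order by sorting on position and reversing the lookup."""
    reverse = {sequence: value for value, sequence in library.items()}
    return bytes(reverse[sequence] for _, sequence in sorted(units))


library = build_library()
message = b"DNA movable-type storage"  # 24 bytes, matching the size reported above
units = write(message, library)
recovered = read(list(reversed(units)), library)  # read order does not matter
```

The point of the sketch is that `write` never creates a new sequence; it only selects from the fixed library, which is the property the abstract credits for the cost reduction.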
Funding: Supported by projects of the National Natural Science Foundation of China (Nos. 42171407, 42077242), the Natural Science Foundation of Jilin Province (No. 20210101098JC), the Open Fund of Key Laboratory of Urban Land Resources Monitoring and Simulation, MNR (No. KF-2020-05-024), and the National Key R&D Program of China (No. 2021YFD1500100).
Abstract: Highly accurate vegetation type distribution information is of great significance for forestry resource monitoring and management. To improve the classification accuracy of forest types, Sentinel-1 and Sentinel-2 data of the Changbai Mountain protection development zone were selected and combined with a DEM to construct a multi-feature random forest classification model for forest types that fuses intensity, texture, spectral, vegetation index, and topographic information and uses the random forest Gini index (GI) for feature optimization. The overall classification accuracy was 94.60% and the Kappa coefficient was 0.933. A comparison of the classification results before and after feature optimization shows that feature optimization has a considerable impact on classification accuracy. A comparison of random forest, the maximum likelihood method, and the CART decision tree under the same conditions shows that random forest performs better and can be applied to forestry research work such as forest resource survey and monitoring.
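For readers unfamiliar with the two figures reported above, the following minimal sketch shows how overall accuracy and Cohen's Kappa coefficient are conventionally computed from a confusion matrix. The 2x2 matrix is made-up illustrative data, not the study's results, and the function names are assumptions.

```python
from typing import List


def overall_accuracy(cm: List[List[int]]) -> float:
    """Fraction of samples on the diagonal (rows = reference, columns = predicted)."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(cm)))
    return correct / total


def cohen_kappa(cm: List[List[int]]) -> float:
    """Agreement corrected for chance: (p_o - p_e) / (1 - p_e)."""
    total = sum(sum(row) for row in cm)
    p_observed = overall_accuracy(cm)
    row_sums = [sum(row) for row in cm]
    col_sums = [sum(cm[i][j] for i in range(len(cm))) for j in range(len(cm))]
    p_expected = sum(r * c for r, c in zip(row_sums, col_sums)) / (total * total)
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical two-class confusion matrix, for illustration only.
toy_cm = [[45, 5],
          [10, 40]]
toy_accuracy = overall_accuracy(toy_cm)
toy_kappa = cohen_kappa(toy_cm)
```

Kappa is lower than overall accuracy because it discounts the agreement that would be expected by chance given the class frequencies.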
Funding: Supported by China's National Key R&D Program, No. 2019YFC1709801.
Abstract: Background: To systematically summarize and categorize the Chinese herbal medicines in the domestic traditional Chinese medicine (TCM) literature on type 2 diabetes mellitus (T2DM), we mine TCM data for relationships and provide a reference for future practitioners and researchers. Methods: Taking randomized controlled trials on the treatment of T2DM in TCM as the research theme, we searched for full-text literature published between 1990 and 2020 in three major clinical databases: CNKI, Wan Fang, and VIP. We then conducted frequency statistics, cluster analysis, association rule extraction, and principal component analysis based on a corpus of medical academic words extracted from 1116 research articles. Results: The most frequently used herb was Astragali Radix, and the most commonly used two-herb combination in T2DM treatment consisted of Coptidis Rhizoma and Moutan Cortex. Moutan Cortex, Alismatis Rhizoma, and Dioscoreae Rhizoma formed the most frequently used three-herb combination. We found a "lung", "liver", and "kidney" model and confirmed the value of classical meridian tropism theory and pattern identification. The treatment mainly aims to fill deficiency and clear heat, while also considering water infiltration, dampness, blood circulation, and stasis. Conclusion: This study provides an in-depth perspective on the TCM medication rules for T2DM and offers practitioners and researchers valuable information about the current status and frontier trends of TCM research on T2DM in terms of diagnosis and treatment.
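The frequency statistics and association rule extraction mentioned in the Methods can be sketched as follows. This is a minimal illustration of the standard support and confidence measures, not the authors' pipeline; the four toy prescriptions are invented, and only the herb names are taken from the abstract.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, List, Tuple


def pair_support(prescriptions: List[List[str]]) -> Dict[Tuple[str, str], float]:
    """Support of each two-herb combination: share of prescriptions containing both."""
    counts: Counter = Counter()
    for herbs in prescriptions:
        for pair in combinations(sorted(set(herbs)), 2):
            counts[pair] += 1
    return {pair: count / len(prescriptions) for pair, count in counts.items()}


def confidence(prescriptions: List[List[str]], antecedent: str, consequent: str) -> float:
    """Confidence of the rule antecedent -> consequent."""
    with_antecedent = [herbs for herbs in prescriptions if antecedent in herbs]
    if not with_antecedent:
        return 0.0
    return sum(consequent in herbs for herbs in with_antecedent) / len(with_antecedent)


# Hypothetical prescriptions, for illustration only.
toy_prescriptions = [
    ["Astragali Radix", "Coptidis Rhizoma", "Moutan Cortex"],
    ["Coptidis Rhizoma", "Moutan Cortex"],
    ["Astragali Radix", "Alismatis Rhizoma"],
    ["Astragali Radix", "Coptidis Rhizoma"],
]
support = pair_support(toy_prescriptions)
rule_confidence = confidence(toy_prescriptions, "Coptidis Rhizoma", "Moutan Cortex")
```

A combination is usually reported as "most commonly used" when its support is highest among all combinations of that size; confidence additionally indicates how reliably one herb accompanies another.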
Abstract: The Exponentiated Generalized Weibull distribution is a probability distribution that generalizes the Weibull distribution by introducing two additional shape parameters to better fit non-monotonic shapes. The parameters of the new probability distribution function are estimated by the maximum likelihood method under progressive type II censored data via the expectation maximization algorithm.
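For context, the likelihood that is maximized under progressive type II censoring has a standard general form, sketched below. The notation is an assumption and not taken from the paper: $n$ units are put on test, $m$ failures $x_{1:m:n} < \dots < x_{m:m:n}$ are observed, $R_i$ surviving units are withdrawn at the $i$-th failure, and $f$ and $F$ denote the density and distribution function of the model with parameter vector $\theta$.

```latex
\[
  L(\theta) \;=\; C \prod_{i=1}^{m} f\!\left(x_{i:m:n};\theta\right)
  \left[\,1 - F\!\left(x_{i:m:n};\theta\right)\right]^{R_i},
  \qquad n = m + \sum_{i=1}^{m} R_i ,
\]
\[
  C \;=\; n\,(n - R_1 - 1)\,(n - R_1 - R_2 - 2)\cdots
  \Bigl(n - \sum_{i=1}^{m-1} R_i - m + 1\Bigr).
\]
```

Each withdrawn unit contributes only a survival factor $1 - F$, which is why the expectation maximization algorithm is a natural fit: the unobserved lifetimes of withdrawn units are treated as missing data.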
Funding: This research was supported by grants from the Polish National Science Centre (project numbers 2015/19/B/ST10/02158 and 2017/27/B/ST10/00297). The computations were partly performed in the Poznań Supercomputing and Networking Center (Grant No. 331). We would like to thank the Polish Institute of Meteorology and Water Management-National Research Institute for providing the radar-derived products.
Abstract: Hazardous events related to atmospheric precipitation depend not only on the intensity of surface precipitation, but also on its type. Uncertainty in determining the precipitation type (PT) leads to financial losses in many areas of human activity, such as the power industry, agriculture, and transportation. In this study, we use machine learning (ML) algorithms with a data fusion approach to determine surface PT more accurately. Based on surface synoptic observations, ERA5 reanalysis, and radar data, we distinguish between liquid, mixed, and solid precipitation types. The study domain covers the entire area of Poland and the period from 2015 to 2017. The purpose of this work is to address the question: "How can ML techniques applied to observational and NWP data help to improve the recognition of the surface PT?" Although 33 parameters were tested, it was found that a combination of the near-surface air temperature and the depth of the warm layer in the 0-1000 m above ground level (AGL) layer contains most of the signal needed to determine surface PT. The probability of detection for liquid, solid, and mixed PTs according to the developed Random Forest model is 98.0%, 98.8%, and 67.3%, respectively. The application of the ML technique and the data fusion approach allows a significant improvement in the robustness of PT prediction compared to commonly used baseline models and provides promising results for operational forecasters.
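The probability of detection (POD) quoted above is conventionally the share of observed events of a class that the model also predicts as that class. A minimal sketch under that assumption follows; the counts are invented for illustration and are not the study's data, and the function name is hypothetical.

```python
from typing import Dict


def probability_of_detection(confusion: Dict[str, Dict[str, int]]) -> Dict[str, float]:
    """POD per class: hits / (hits + misses), with confusion[observed][predicted]."""
    pod = {}
    for observed, predictions in confusion.items():
        n_observed = sum(predictions.values())
        pod[observed] = predictions.get(observed, 0) / n_observed
    return pod


# Hypothetical counts: outer key = observed type, inner key = predicted type.
toy_confusion = {
    "liquid": {"liquid": 98, "mixed": 2, "solid": 0},
    "mixed": {"liquid": 20, "mixed": 60, "solid": 20},
    "solid": {"liquid": 0, "mixed": 1, "solid": 99},
}
toy_pod = probability_of_detection(toy_confusion)
```

As in the reported results, the mixed class tends to have the lowest POD because its cases sit between the liquid and solid regimes and are easily assigned to either neighbor.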
Funding: Supported by the National Natural Science Foundation of China under Grant Nos 11175093, 11222545, 11435006 and 11375092, and the K.C. Wong Magna Fund of Ningbo University.
Abstract: We use the latest baryon acoustic oscillation and Union 2.1 type Ia supernova data to test the cosmic opacity between different redshift regions without assuming any cosmological models. It is found that the universe may be opaque between the redshift regions 0.35-0.44, 0.44-0.57 and 0.6-0.73 since the best fit values of cosmic opacity in these regions are positive, while a transparent universe is favored in the redshift region 0.57-0.63. However, in general, a transparent universe is still consistent with observations at the 1σ confidence level.
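As background, cosmic opacity tests of this kind usually rest on the relation below, given here as a sketch in assumed notation rather than the paper's own parameterization. If photons from a source at redshift $z$ are attenuated with optical depth $\tau(z)$, the received flux is reduced by $e^{-\tau(z)}$, so the luminosity distance inferred from supernovae is biased high while distances from baryon acoustic oscillations are unaffected:

```latex
\[
  D_{L,\mathrm{obs}}(z) \;=\; D_{L,\mathrm{true}}(z)\, e^{\tau(z)/2},
  \qquad
  \mu_{\mathrm{obs}}(z) \;=\; \mu_{\mathrm{true}}(z) + 2.5\,(\log_{10} e)\,\tau(z),
\]
\[
  \Delta\tau_{ij} \;=\; \tau(z_j) - \tau(z_i), \qquad z_i < z_j .
\]
```

Under this convention, $\Delta\tau_{ij} > 0$ indicates absorption between the two redshifts (an opaque region), while $\Delta\tau_{ij} = 0$ corresponds to a transparent universe.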
Abstract: To improve or maintain the high quality of an academic program, periodic evaluation of the quality of each course is often an integral practice in reputed institutions; however, there has been little effort regarding humanities courses. This research article analyzes evaluation data on Likert-type items collected for humanities courses from a College of Commerce & Economics, Mumbai, Maharashtra, India. The appropriateness of one parametric measure and three non-parametric measures is discussed, and the measures are applied, which could provide useful clues for educational policy planners. According to the analytical results from these four measures, regardless of the threshold for student satisfaction, the overall performance of almost every subject has been unsatisfactory. A focused approach is needed to bring every course to a high level of performance. The inconsistency noticed under every threshold further revealed that, for such globally poorly performing subjects, one needs to analyze only the global-level item. Only once the global-level analysis reveals high performance of a course does item-specific analysis need to be carried out to find the items requiring further improvement.
Funding: Project supported by the National Natural Science Foundation of China.
Abstract: In this paper, we discuss some characteristic properties of the partial abstract data type (PADT) and show the difference between PADT and the abstract data type (ADT) in the specification of programming languages. Finally, we clarify that PADT is necessary in programming language description.
Funding: Clinical study on coronary artery intervention in patients with coronary heart disease complicated with type 2 diabetes mellitus by tonifying qi, nourishing Yin, eliminating phlegm, and clearing collaterals circulation (No. 2012ZY16).
Abstract: Objective: To explore the clinical medication rules of Shao Zhengbin in the treatment of coronary heart disease with type 2 diabetes mellitus after PCI by using data mining technology. Methods: Prescriptions issued by Shao Zhengbin from January 2016 to May 2019 in the outpatient department of the First Affiliated Hospital of Anhui University of Traditional Chinese Medicine for patients with coronary heart disease combined with type 2 diabetes mellitus after PCI were collected. The database was established with Microsoft Excel 2016, SPSS Statistics 24.0, and SPSS Modeler 18.0, and drug frequency analysis, high-frequency drug association rule analysis, cluster analysis, and factor analysis were carried out. Results: The 133 prescriptions included in the study involved 86 Chinese herbs, and the top 10 drugs were Dangshen, Huangjing, Danshen, Gualou, Chuanxiong, fried Atractylodes rhizome, Poria cocos, Fushen, and Chenpi, respectively; 12 drug associations were generated by association analysis, including Huangjing, Huangjing, Chuanxiong, dandelion, Dangshen, and Banpi; 12 drug associations were obtained by cluster analysis, including Huangjing, Huangxiong, Gualou, and Huangqi. There are 7 clustering formulas, such as Xia, Danshen, Fuling, Gualou, etc. Conclusion: Shao Zhengbin is skilled in the treatment of coronary heart disease combined with type 2 diabetes mellitus after PCI.
Abstract: The Type-I censoring mechanism arises when the number of units experiencing the event is random but the total duration of the study is fixed. A number of mathematical approaches have been developed to handle this type of data. The purpose of this research was to estimate the three parameters of the Frechet distribution via the frequentist maximum likelihood and Bayesian estimators. The maximum likelihood estimates (MLE) of the three parameters are not available in closed form; therefore, they were obtained by numerical methods. Similarly, the Bayesian estimators are implemented using Jeffreys and gamma priors with two loss functions: the squared error loss function and the Linear Exponential Loss Function (LINEX). The Bayesian estimates of the Frechet parameters cannot be obtained analytically, and therefore Markov Chain Monte Carlo is used, where the full conditional distributions of the three parameters are sampled via the Metropolis-Hastings algorithm. The estimators are compared using Mean Square Errors (MSE) to determine the best estimator of the three parameters of the Frechet distribution. The results show that Bayesian estimation under the Linear Exponential Loss Function based on Type-I censored data gives the better estimator for all the parameters when the value of the loss parameter is positive.
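Once posterior samples of a parameter are available from Markov Chain Monte Carlo, the Bayes estimates under the two loss functions follow from standard formulas: the posterior mean for squared error loss, and $-\frac{1}{a}\ln E[e^{-a\theta}]$ for LINEX loss with loss parameter $a \neq 0$. The sketch below assumes the samples are already drawn; the sample values and function names are illustrative, not from the paper.

```python
import math
from typing import Sequence


def bayes_squared_error(samples: Sequence[float]) -> float:
    """Bayes estimate under squared error loss: the posterior mean."""
    return sum(samples) / len(samples)


def bayes_linex(samples: Sequence[float], a: float) -> float:
    """Bayes estimate under LINEX loss: -(1/a) * ln E[exp(-a * theta)]."""
    if a == 0:
        raise ValueError("the LINEX loss parameter a must be non-zero")
    mean_exp = sum(math.exp(-a * theta) for theta in samples) / len(samples)
    return -math.log(mean_exp) / a


# Hypothetical posterior draws of one parameter, for illustration only.
posterior_samples = [1.0, 2.0, 3.0]
estimate_sel = bayes_squared_error(posterior_samples)
estimate_linex_pos = bayes_linex(posterior_samples, a=1.0)
estimate_linex_neg = bayes_linex(posterior_samples, a=-1.0)
```

With a positive loss parameter, overestimation is penalized more heavily than underestimation, so the LINEX estimate falls below the posterior mean; a negative value reverses the asymmetry.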
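Abstract: In the era of big data, streaming data emerge in large quantities. Concept drift, the most typical and difficult problem in data stream mining, has received increasingly wide attention. Ensemble learning is a common approach to handling concept drift in streaming data; however, after a drift occurs, learning models often cannot respond promptly to changes in the data distribution and cannot effectively handle different types of concept drift, which degrades generalization performance. To address this problem, a two-stage adaptive ensemble learning method for different types of concept drift (TAEL) is proposed. The method first determines the concept drift type by detecting the drift span and then, according to the drift type, uses a "filtering-expansion" two-stage sample processing mechanism to dynamically select a suitable sample processing strategy. Specifically, in the filtering stage, different non-key-sample filters are created for different drift types to extract the key samples from historical sample blocks, so that the historical data distribution is closer to the latest data distribution and the base learners become more effective. In the expansion stage, a block-wise priority sampling method is proposed: an appropriate sampling size is set for each drift type, the sampling priority is set according to the proportion that the class of each historical key sample occupies in the current sample block, and the sampling probability is then determined from the sampling priority. A subset of key samples is drawn from the historical key sample blocks according to the sampling probability to expand the current sample block, which alleviates class imbalance after sample expansion and resolves underfitting of the current base learner while enhancing its stability. Experimental results show that the proposed method responds promptly to different types of concept drift, accelerates the convergence of the online ensemble model after a drift occurs, and improves the overall generalization performance of the model.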
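The expansion stage can be sketched roughly as follows. The abstract does not give the exact priority formula, so this sketch assumes one plausible rule: a historical key sample gets priority 1 minus the share of its class in the current block, so that under-represented classes are drawn more often. That rule, the sampling with replacement, and the names `sampling_probabilities` and `expand_block` are all assumptions, not the TAEL implementation.

```python
import random
from collections import Counter
from typing import List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (features, class label)


def sampling_probabilities(history_labels: List[int], current_labels: List[int]) -> List[float]:
    """Turn per-sample priorities into probabilities that sum to one."""
    share = Counter(current_labels)
    n_current = len(current_labels)
    # Assumed rule: the rarer a class is in the current block, the higher the priority.
    priorities = [1.0 - share.get(label, 0) / n_current for label in history_labels]
    total = sum(priorities)
    if total == 0:
        # Every historical sample belongs to the only class in the current block.
        return [1.0 / len(history_labels)] * len(history_labels)
    return [priority / total for priority in priorities]


def expand_block(current: List[Sample], history: List[Sample], size: int, seed: int = 0) -> List[Sample]:
    """Append `size` historical key samples, drawn with replacement, to the current block."""
    rng = random.Random(seed)
    probabilities = sampling_probabilities(
        [label for _, label in history], [label for _, label in current]
    )
    drawn = rng.choices(history, weights=probabilities, k=size)
    return current + drawn


# Hypothetical blocks: class 1 is the minority in the current block.
current_block = [([0.1], 0), ([0.2], 0), ([0.3], 0), ([0.9], 1)]
history_block = [([0.15], 0), ([0.85], 1)]
probabilities = sampling_probabilities([0, 1], [0, 0, 0, 1])
expanded = expand_block(current_block, history_block, size=4)
```

In the method described above, `size` would depend on the detected drift type; here it is simply passed in by the caller.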